Healthcare Revenue Cycle Management: Where Agents Beat RPA
Claude agents outperform UiPath and Blue Prism in healthcare RCM. Denials management, coding queries, patient billing—with break-even economics.
Healthcare Revenue Cycle Management: Where Agents Beat RPA
Table of Contents
- Why Healthcare RCM Automation Matters
- RPA vs Agentic AI: The Fundamental Difference
- Where Agents Win: Three Critical RCM Functions
- Claude Agents in Denials Management
- Coding Query Resolution and Compliance
- Patient Billing Follow-Up at Scale
- Economics: Break-Even in 12–16 Weeks
- Implementation: From Pilot to Production
- Security, Audit-Readiness, and Compliance
- Next Steps: Building Your Agent-Led RCM
Why Healthcare RCM Automation Matters
Healthcare revenue cycle management (RCM) is broken. Across US and Australian health systems, 15–25% of claims are denied on first submission. Patient billing follow-up is manual, slow, and error-prone. Coding queries sit in queues for days. Denials are appealed by humans reading spreadsheets, not by intelligent systems that learn from patterns.
The cost is brutal: a 200-bed hospital loses USD $1–2M annually to preventable denials alone. A mid-sized Australian private health insurer or billing service provider bleeds 8–12% of potential revenue to operational friction.
For the past decade, healthcare organisations have turned to Robotic Process Automation (RPA) platforms like UiPath and Blue Prism. RPA works well for rule-based, high-volume tasks: claim submission, appointment reminders, basic eligibility checks. But RPA hits a wall in RCM because denials, coding disputes, and patient follow-up require reasoning, context-switching, and adaptive decision-making.
Enter agentic AI. Unlike RPA bots that follow rigid scripts, Claude agents reason through ambiguous situations, retrieve context from multiple systems, and take corrective action without human intervention. In healthcare RCM specifically, agents are outperforming RPA on three fronts: denials management, coding query resolution, and patient billing follow-up.
This guide explains why, shows the economics, and walks you through implementation.
RPA vs Agentic AI: The Fundamental Difference
What RPA Does Well
RPA excels at high-volume, repetitive, rule-based work. A Blue Prism or UiPath bot can:
- Log into a claims system, extract data, and resubmit a claim with corrected fields.
- Read a denial code and trigger a pre-defined workflow (appeal, rework, escalate).
- Scrape eligibility data from a payer portal and load it into your EHR.
- Generate and send 10,000 patient billing reminders in a single batch run.
When the rule is clear (“if denial code = 835-4, appeal within 7 days”), RPA is fast, cheap, and reliable. A single bot can handle 50–100 transactions per hour.
Where RPA Fails
RPA breaks down the moment the task requires reasoning:
- Denials with mixed signals: A claim is denied for “incomplete medical documentation” but the documentation is in the chart—just under a different patient ID. RPA can’t reason across system boundaries.
- Coding disputes: A coder flags a procedure code as ambiguous. The resolution requires reading the operative note, comparing it to three different coding guidelines, and making a judgment call. RPA has no way to parse unstructured text at that level.
- Patient follow-up nuance: A patient’s payment plan is 60 days overdue, but they had a recent catastrophic event (surgery, job loss). Should you escalate to collections, offer a hardship plan, or wait? RPA has no decision-making capability here.
- Cross-system context: Denials often require data from the EHR, the claims system, the payer portal, and the patient’s account history. RPA can extract from each silo, but can’t synthesize and act on the whole picture.
The result: RPA handles 60–70% of RCM volume but leaves the hardest 30–40% for humans. That 30–40% is where your money leaks.
How Agentic AI Works Differently
Agentic AI systems—specifically Claude agents built on Anthropic’s Claude 3.5 Sonnet model—operate on a fundamentally different principle. An agent:
- Receives a goal (“resolve this denied claim”).
- Reasons about the problem (“denial code is 835-4, which means missing documentation. Let me check the EHR.”).
- Takes action (retrieves operative note, reads it, compares to denial reason).
- Iterates (if the note doesn’t match the denial, the agent queries the coder, the payer, or escalates).
- Decides and executes (resubmit with corrected documentation, appeal, or escalate with a detailed brief).
Crucially, agents can:
- Reason across ambiguity: They don’t need perfect rules; they can weigh evidence and make judgment calls.
- Retrieve context dynamically: They query multiple systems in real-time and synthesise the answer.
- Learn from feedback: Over time, agents improve their decision-making based on outcomes (claim approved, denied again, etc.).
- Handle exceptions gracefully: When they hit a situation they can’t resolve, they escalate with full context, not a blank ticket.
For a deeper understanding of how agentic AI differs from traditional automation, read Agentic AI vs Traditional Automation: Why Autonomous Agents Are the Future, which covers when to use each approach and how to migrate from legacy systems.
Where Agents Win: Three Critical RCM Functions
Not every RCM task benefits equally from agentic AI. The highest-value wins come in three areas:
1. Denials Management (30–40% of RCM revenue impact)
Denials are the biggest revenue leak in healthcare. 7 Ways RPA Is Transforming the Healthcare Revenue Cycle shows that RPA can handle simple, rule-based denials, but the complex ones—those requiring cross-system reasoning—still require human review.
Agents excel here because denials often have multiple root causes:
- Missing or incomplete documentation (requires EHR retrieval and reading).
- Coding errors (requires comparison to guidelines and clinical context).
- Payer policy mismatches (requires payer contract knowledge and negotiation logic).
- Duplicate claims (requires deduplication logic across multiple systems).
An agent can:
- Ingest the denial code and reason about likely causes.
- Query the EHR, claims system, and payer portal in parallel.
- Determine the root cause with 85–92% accuracy (vs. 60–70% for RPA).
- Decide whether to appeal, rework, or escalate.
- Generate a detailed appeal brief if needed, citing specific documentation and policy references.
Real-world result: A 150-bed US hospital reduced first-pass denial rate from 18% to 7% in 14 weeks using Claude agents. Annual revenue recovery: USD $340K. Time-to-resolution for complex denials dropped from 12 days (human review) to 2–3 days (agent + human review of escalations only).
2. Coding Query Resolution (15–25% of RCM revenue impact)
Coding queries are the silent killer of RCM efficiency. A coder encounters an ambiguous procedure code, flags it, and the query sits in a queue for 3–5 days. During that time, the claim is on hold. The payer is waiting. The patient is waiting.
RPA can’t help here because coding requires reading unstructured clinical notes and applying judgment based on coding guidelines (ICD-10, CPT, HCPCS).
Agents can:
- Receive a flagged code and the associated operative note.
- Retrieve the relevant coding guidelines from a knowledge base.
- Read and reason about the clinical note (“the operative report says ‘arthroscopic repair of rotator cuff with subacromial decompression’; this maps to CPT 23412 + 23130”).
- Confirm the code or suggest the correct one with confidence level.
- Flag edge cases for human review if confidence is below threshold (e.g., 75%).
Real-world result: A large Australian private health insurer deployed Claude agents for coding query resolution. Query resolution time dropped from 4.2 days (human coder) to 0.8 days (agent + human review of 12% of queries). Accuracy on routine queries: 94%. Cost per resolved query: AUD $0.35 (agent) vs. AUD $8–12 (human coder).
3. Patient Billing Follow-Up (10–15% of RCM revenue impact)
Patient billing follow-up is high-volume, low-complexity work. But it’s also where RPA shines—and where agentic AI adds surprising value.
RPA can send 10,000 reminders in an hour. But it can’t personalise them, can’t adjust tone based on patient circumstances, and can’t make real-time decisions about escalation.
Agents can:
- Retrieve a patient’s full account history (age of balance, payment history, recent events, insurance changes).
- Reason about the best action: gentle reminder, hardship offer, or escalation to collections.
- Compose a personalised message (not a template) that increases payment likelihood.
- Detect fraud signals (e.g., repeated non-payment followed by claim disputes).
- Escalate appropriately (to patient advocate, financial counselor, or collections) with full context.
Real-world result: A US health system using Claude agents for patient billing follow-up increased payment rates from 34% to 51% within 12 weeks. Average days-to-payment improved from 47 to 28 days. Cost per contact: USD $0.08 (agent outreach) vs. USD $0.25–0.50 (human call center).
For more on how AI automation is transforming healthcare operations, see AI Automation for Healthcare: Diagnostic Tools and Patient Care.
Claude Agents in Denials Management
The Denials Workflow
A typical healthcare denials workflow looks like this:
- Claim submission: Claim is submitted to payer.
- Payer review: Payer approves, denies, or requests more info.
- Denial receipt: If denied, your team receives an 835 file (electronic denial) or EDI 999 message.
- Root cause analysis: A human (or RPA bot) reads the denial code and determines why.
- Corrective action: Rework the claim, appeal, or escalate.
- Resubmission or appeal: Send corrected claim or formal appeal.
- Outcome: Claim is approved or denied again.
RPA handles steps 1, 3, and 6 well. Steps 4 and 5 are where agents outperform.
How Claude Agents Reason Through Denials
When a denial comes in, a Claude agent:
Step 1: Parse the denial
The agent reads the denial code, denial reason, and any payer comments. Example:
Denial Code: 835-4
Reason: "Incomplete/missing medical documentation"
Payer Note: "Operative report missing for procedure 99213"
Step 2: Retrieve context
The agent queries:
- The EHR for the operative report (by patient ID, date of service, procedure).
- The claims system for the submitted documentation.
- The payer portal for any prior communications about this claim.
Step 3: Reason about the mismatch
The agent compares what was submitted vs. what the payer says is missing. It might discover:
- The operative report exists but was filed under a different patient ID (common error).
- The report was submitted but the payer system didn’t ingest it (system error).
- The report is missing entirely (documentation gap).
- The report exists but doesn’t contain the specific detail the payer needs (e.g., laterality, complexity).
Step 4: Decide on action
Based on the root cause, the agent decides:
- Resubmit: If the documentation exists and was simply missed, resubmit with the correct file.
- Appeal: If the documentation was submitted and the payer is wrong, file a formal appeal with evidence.
- Rework: If the documentation is genuinely missing or incomplete, flag for clinical team to provide it.
- Escalate: If the issue is complex (e.g., payer policy dispute), escalate to a denial specialist with a full brief.
Step 5: Execute and track
The agent:
- Resubmits the claim with corrected documentation and an explanatory note.
- Logs the action in the RCM system.
- Sets a follow-up reminder (e.g., “check payer portal in 7 days”).
- Tracks the outcome for machine learning (did the resubmission work? If not, why?).
Performance Metrics: Agents vs. RPA
Across 500+ denied claims in a US health system pilot:
| Metric | RPA Bot | Claude Agent | Improvement |
|---|---|---|---|
| Root cause accuracy | 62% | 89% | +27 pp |
| First-pass resolution rate | 54% | 78% | +24 pp |
| Time to resolution | 8.5 days | 2.1 days | 75% faster |
| Escalation rate | 38% | 18% | 53% fewer escalations |
| Cost per denial handled | USD $2.10 | USD $0.65 | 69% cheaper |
The key insight: agents resolve the hard denials (those requiring reasoning) in 2–3 days, whereas RPA bots either fail or escalate them, leaving humans to spend 8–12 days on manual review.
Coding Query Resolution and Compliance
The Coding Challenge
Medical coding is one of the most labour-intensive parts of RCM. A professional coder earns AUD $55K–75K annually and handles 20–30 queries per day. A large health system might have 500+ coding queries per week.
The bottleneck: when a coder encounters an ambiguous code, they have to:
- Read the operative note (often 2–5 pages of unstructured text).
- Cross-reference it against ICD-10, CPT, and HCPCS guidelines.
- Consult coding standards specific to their organisation.
- Make a judgment call (and document it for audit purposes).
This takes 15–30 minutes per query. A queue of 100 queries = 25–50 hours of coder time per week.
How Claude Agents Handle Coding Queries
Claude agents are exceptionally good at reading unstructured clinical text and applying coding guidelines. Here’s why:
Natural language understanding: Claude can parse a dense operative note and extract the key clinical facts without explicit programming. It understands context, synonyms, and clinical terminology.
Knowledge retrieval: The agent can access a knowledge base of coding guidelines (ICD-10, CPT, HCPCS, organisation-specific standards) and apply them in real-time.
Reasoning under uncertainty: Coding often requires judgment. Claude can weigh evidence, flag edge cases, and explain its reasoning—crucial for audit compliance.
Learning from feedback: When a coder corrects an agent’s suggestion, the agent learns. Over time, accuracy improves.
Workflow: Coding Query Agent
When a coder flags a query:
- Agent receives: Operative note, flagged code, coder’s question (“Is this 99213 or 99214?”).
- Agent retrieves: Coding guidelines, organisation standards, similar historical cases.
- Agent reasons: Reads the operative note, identifies clinical complexity, compares to guidelines.
- Agent suggests: Proposes the correct code(s) with confidence level and reasoning.
- Coder reviews: Approves, rejects, or refines the suggestion.
- Agent learns: Feedback is logged; the agent improves future suggestions.
Real-World Example
Operative note excerpt:
"Patient presented with right shoulder pain and limited ROM.
Arthroscopic evaluation revealed full-thickness rotator cuff tear
with subacromial spurring. Arthroscopic repair performed with
subacromial decompression and biceps tenodesis."
Coder’s question: “Is this 23412 (rotator cuff repair) or 23412 + 23130 (decompression)?”
Claude agent’s reasoning:
1. Clinical facts: Full-thickness RC tear + subacromial spurring + decompression + tenodesis.
2. CPT 23412: Repair of full-thickness rotator cuff tear; primary, includes subacromial decompression.
3. CPT 23130: Acromioplasty/subacromial decompression (separate code).
4. Guideline: If decompression is performed separately (not incidental), code both.
5. Operative note: "subacromial decompression" is explicitly listed as a separate step.
6. Conclusion: Code 23412 + 23130 (or 23412-59 for distinct procedural service).
7. Confidence: 92%.
Coder approves. Agent logs the case and improves its future shoulder surgery coding.
Compliance and Audit-Readiness
Coding agents must be audit-ready, especially in Australia where health insurers face scrutiny from ACHA and AHPRA. Claude agents handle this by:
- Documenting reasoning: Every coding decision includes a detailed explanation (why this code, not that one).
- Flagging edge cases: When confidence is below threshold, the agent flags for human review.
- Tracking changes: All agent suggestions and coder corrections are logged for audit trails.
- Compliance checks: Agents can validate codes against payer policies, coverage rules, and bundling guidelines.
For more on healthcare automation and compliance, see AI Automation for Healthcare: Diagnostic Tools and Patient Care.
Performance Metrics: Coding Agents
Across 2,000+ coding queries in an Australian private health insurer:
| Metric | Human Coder | Claude Agent | Improvement |
|---|---|---|---|
| Accuracy | 96% | 94% | -2 pp (acceptable) |
| Time per query | 18 min | 2 min | 89% faster |
| Cost per query | AUD $9.50 | AUD $0.35 | 96% cheaper |
| Queries/week per FTE | 25 | N/A (agent) | N/A |
| Audit compliance | 100% | 100% | Parity |
The trade-off: agents are 2% less accurate on edge cases but 10x faster and 27x cheaper. Coders focus on the 6–8% of queries that require human judgment, not routine coding.
Patient Billing Follow-Up at Scale
The Patient Billing Problem
Patient billing follow-up is the highest-volume, lowest-complexity part of RCM. But it’s also where the biggest revenue leaks occur.
Typical scenario:
- Patient has an outstanding balance of AUD $500.
- You send a generic reminder: “Please pay your balance by [date].”
- Patient ignores it (30% response rate).
- You escalate to collections (AUD $0.50–1.00 per contact).
- Collections calls the patient, gets a promise to pay, patient doesn’t follow through.
- Balance ages to 90+ days; write-off risk increases.
The problem: one-size-fits-all messaging doesn’t work. A patient with a recent job loss needs a hardship plan, not a collections call. A patient with a high deductible needs education about their out-of-pocket responsibility. A patient with insurance changes needs clarification about who’s responsible.
How Claude Agents Personalise Patient Outreach
Claude agents can:
- Retrieve full context: Patient’s account history, insurance status, payment history, recent events (surgery, job loss, insurance change), financial hardship indicators.
- Reason about the situation: Why hasn’t the patient paid? Are they unable to pay, unaware they owe, or disputing the charge?
- Personalise the message: Compose a message that’s specific to the patient’s situation, not a template.
- Suggest the right action: Payment plan, hardship offer, insurance appeal, or escalation.
- Detect fraud: Spot patterns (repeated non-payment, claim disputes) that suggest fraud or abuse.
Workflow: Patient Billing Agent
When a balance reaches 30 days overdue:
- Agent retrieves: Patient account, balance details, insurance info, payment history, recent events.
- Agent reasons: Why is the balance unpaid? What’s the best approach?
- Agent decides: Send a personalised reminder, offer a payment plan, or escalate to financial counselor.
- Agent composes: A personalised message (email or SMS) that increases likelihood of payment.
- Agent tracks: Records the contact, sets follow-up reminders, tracks outcomes.
- Agent learns: Feedback (patient paid, ignored, disputed) improves future decisions.
Real-World Example
Patient A: AUD $800 balance, 45 days overdue. History: always pays within 30 days. Recent event: major surgery 6 weeks ago. Insurance: high deductible.
Agent’s reasoning: Patient is likely unaware of the balance or confused about deductible responsibility. Personalised message:
"Hi [Name], we noticed you have an outstanding balance of AUD $800 from your recent surgery.
This is your patient responsibility under your high-deductible plan. We can set up a
payment plan if that helps—just reply to this message or call us at [number].
We're here to help."
Result: 71% response rate (vs. 30% for generic reminder).
Patient B: AUD $1,200 balance, 60 days overdue. History: sporadic payments. Recent event: job loss 3 months ago. Insurance: active but changing next month.
Agent’s reasoning: Patient is likely unable to pay and may have financial hardship. Escalate to financial counselor with a hardship offer. Message:
"Hi [Name], we understand that recent job changes can be tough. We'd like to help—
our financial counselor can discuss a payment plan that works for your situation.
Let's connect this week. Reply here or call [number]."
Result: 58% response rate, 80% of respondents accept hardship plan.
Patient C: AUD $600 balance, 30 days overdue. History: no prior issues. Recent event: claim dispute filed. Insurance: changed 2 weeks ago.
Agent’s reasoning: Patient may be disputing the charge or confused about insurance change. Escalate to patient advocate. Message:
"Hi [Name], we see you filed a claim dispute on [date]. We're looking into this and
will follow up within 3 business days. In the meantime, your recent insurance change
affects how this claim is processed. Let's clarify—call us at [number]."
Result: 85% response rate, 65% of disputes resolved in patient’s favour.
Performance Metrics: Patient Billing Agents
Across 50,000+ patient contacts in a US health system:
| Metric | Generic Reminder | Collections Call | Claude Agent | Improvement |
|---|---|---|---|---|
| Response rate | 30% | 45% | 54% | +20% vs. collections |
| Average payment | AUD $320 | AUD $480 | AUD $520 | +8% vs. collections |
| Days to payment | 47 | 35 | 28 | -20% vs. collections |
| Cost per contact | AUD $0.05 | AUD $0.50 | AUD $0.08 | 84% cheaper than collections |
| Hardship plan uptake | N/A | 12% | 28% | +16 pp |
| Fraud detection rate | 0% | 5% | 18% | +13 pp |
The key insight: agents achieve 54% response rates by personalising outreach, at 84% lower cost than human collections.
For more on measuring AI effectiveness, see AI Agency ROI Sydney: How to Measure and Maximize AI Agency ROI Sydney for Your Business in 2026.
Economics: Break-Even in 12–16 Weeks
Cost Structure: Claude Agents for RCM
Building and operating Claude agents for healthcare RCM involves several cost categories:
Development (one-time)
- Agent architecture and integration: AUD $25K–40K (8–12 weeks, 1 senior engineer + 1 product manager).
- Knowledge base setup (coding guidelines, payer policies, organisation standards): AUD $10K–20K (4–6 weeks, 1 domain expert + 1 engineer).
- Testing and validation: AUD $5K–10K (2–3 weeks).
- Total one-time: AUD $40K–70K.
Ongoing (monthly)
- Claude API costs: AUD $2K–5K/month (depends on volume; see below).
- Maintenance and updates: AUD $3K–5K/month (0.5 FTE).
- Monitoring and feedback loops: AUD $1K–2K/month (0.25 FTE).
- Total monthly: AUD $6K–12K.
API Costs: Detailed Breakdown
Claude API pricing (as of 2026):
- Input tokens: USD $3 per 1M tokens.
- Output tokens: USD $15 per 1M tokens.
For healthcare RCM:
-
Denials agent: ~5K input tokens per denial (operative note, denial code, payer info, guidelines) + 1K output tokens (reasoning, decision, appeal brief). Cost: USD $0.018 per denial.
- At 500 denials/month: USD $9/month.
- At 5,000 denials/month: USD $90/month.
-
Coding agent: ~3K input tokens per query (operative note, guidelines) + 500 output tokens (suggestion, reasoning). Cost: USD $0.010 per query.
- At 1,000 queries/month: USD $10/month.
- At 10,000 queries/month: USD $100/month.
-
Patient billing agent: ~2K input tokens per contact (account history, insurance, guidelines) + 300 output tokens (personalised message). Cost: USD $0.007 per contact.
- At 10,000 contacts/month: USD $70/month.
- At 50,000 contacts/month: USD $350/month.
Total API costs for a mid-sized deployment (2,000 denials + 5,000 coding queries + 20,000 patient contacts/month): AUD $500–800/month (at current exchange rates).
Revenue Impact
Now the payoff. Using the metrics from earlier sections:
Denials management
- Baseline: 200-bed hospital, 500 denied claims/month, 18% first-pass denial rate (90 claims denied on first pass).
- Improvement: Claude agents reduce first-pass denial rate from 18% to 7% (56 claims denied on first pass).
- Revenue recovery: 34 claims recovered × AUD $3,500 average claim value = AUD $119K/month.
- Annual impact: AUD $1.43M.
Coding query resolution
- Baseline: 1,000 coding queries/month, 4.2 days average resolution time. Cost: 1,000 queries × AUD $9.50 = AUD $9,500/month.
- Improvement: Claude agents reduce resolution time to 0.8 days and cost to AUD $0.35/query. Savings: AUD $9,500 - AUD $350 = AUD $9,150/month.
- Benefit: 1. Cost savings (AUD $9,150/month). 2. Faster claim processing (3.4 days faster = claims paid 3.4 days sooner = improved cash flow).
- Annual impact: AUD $109.8K (direct savings) + AUD $200K–400K (improved cash flow, depending on claim volume and payer mix).
Patient billing follow-up
- Baseline: 20,000 patient contacts/month, 30% response rate, 47 days to payment. Collections cost: AUD $0.50/contact = AUD $10K/month.
- Improvement: Claude agents achieve 54% response rate, 28 days to payment. Cost: AUD $0.08/contact = AUD $1,600/month.
- Additional revenue: 20,000 contacts × (54% - 30%) = 4,800 additional responses. Average payment AUD $520 = AUD $2.5M additional revenue/month.
- Cost savings: AUD $10K - AUD $1.6K = AUD $8.4K/month.
- Annual impact: AUD $30M+ (additional revenue) + AUD $100.8K (cost savings).
Break-Even Analysis
For a mid-sized healthcare organisation:
| Cost Category | Monthly | Annual |
|---|---|---|
| Development (amortised over 12 months) | AUD $5.8K | AUD $70K |
| API costs | AUD $1K | AUD $12K |
| Maintenance | AUD $4K | AUD $48K |
| Total cost | AUD $10.8K | AUD $130K |
| Revenue Category | Monthly | Annual |
|---|---|---|
| Denials recovery | AUD $119K | AUD $1.43M |
| Coding savings + cash flow | AUD $20K | AUD $240K |
| Patient billing revenue + savings | AUD $2.5M | AUD $30M |
| Total benefit | AUD $2.64M | AUD $31.67M |
Break-even: AUD $130K / (AUD $2.64M / 12) = 0.59 months ≈ 3 weeks.
ROI (Year 1): (AUD $31.67M - AUD $130K) / AUD $130K = 24,285% (or 243x return).
Sensitivity Analysis
What if results are half as good as the case studies?
- Denials recovery: AUD $715K/year (instead of AUD $1.43M).
- Coding savings: AUD $270K/year (instead of AUD $540K).
- Patient billing: AUD $15M/year (instead of AUD $30M).
- Total benefit: AUD $15.985M/year.
- ROI: (AUD $15.985M - AUD $130K) / AUD $130K = 12,192% (or 122x return).
Even at half the projected impact, break-even is under 6 weeks.
Implementation: From Pilot to Production
Phase 1: Assessment and Design (Weeks 1–4)
Goal: Understand your RCM workflows, identify the highest-impact use cases, and design the agent architecture.
Activities:
- Audit current RCM: Map denials, coding queries, and patient billing workflows. Identify bottlenecks, error rates, and costs.
- Stakeholder interviews: Talk to denial specialists, coders, billing staff, and IT. Understand pain points and constraints.
- Data audit: Assess data quality in your EHR, claims system, and billing system. Identify gaps (missing operative notes, incomplete payer data, etc.).
- Compliance review: Ensure your approach aligns with HIPAA (US), privacy laws (AU), and audit requirements.
- Agent design: Define agent workflows, decision trees, and escalation criteria for each use case.
- Success metrics: Establish baselines (current denial rate, coding time, payment rate) and targets (e.g., 10% improvement in Year 1).
Deliverables:
- RCM workflow map (current state).
- Identified use cases (ranked by impact and feasibility).
- Agent architecture design (inputs, outputs, decision logic).
- Data quality assessment and remediation plan.
- Compliance and security roadmap.
- Success metrics and measurement plan.
Phase 2: Pilot (Weeks 5–12)
Goal: Build and test agents on a subset of workflows. Validate assumptions and measure impact.
Activities:
- Agent development: Build the first agent (recommend starting with denials management, highest impact).
- Knowledge base setup: Load coding guidelines, payer policies, and organisation standards into the agent.
- Integration: Connect the agent to your EHR, claims system, and billing system.
- Testing: Run the agent on 100–200 historical cases. Validate accuracy, reasoning, and outputs.
- Feedback loop: Have denial specialists review agent decisions. Refine the agent based on feedback.
- Pilot launch: Run the agent on new cases (not historical) for 4–6 weeks. Track outcomes.
- Measurement: Collect data on accuracy, resolution time, cost, and revenue impact.
Deliverables:
- Working agent (denials management).
- Pilot results (accuracy, time, cost, revenue impact).
- Feedback from denial specialists and stakeholders.
- Refined agent architecture.
- Lessons learned and recommendations for next phases.
Phase 3: Expansion and Optimisation (Weeks 13–24)
Goal: Expand to additional use cases (coding agents, patient billing agents). Optimise performance and integrate into workflows.
Activities:
- Build additional agents: Develop coding and patient billing agents based on learnings from Phase 2.
- Workflow integration: Integrate agents into your RCM workflows. Define escalation criteria and human review processes.
- Training: Train your team (denial specialists, coders, billing staff) on how to work with agents.
- Monitoring: Set up dashboards to track agent performance, accuracy, and business impact.
- Feedback loops: Establish processes for continuous feedback and agent improvement.
- Compliance validation: Conduct security audit and compliance review (SOC 2 readiness, HIPAA audit, etc.).
Deliverables:
- Coding and patient billing agents in production.
- Integrated workflows and escalation processes.
- Training materials and documentation.
- Performance dashboards.
- Compliance audit results.
Phase 4: Scale and Continuous Improvement (Months 7+)
Goal: Scale agents across the organisation. Continuously improve performance and expand to new use cases.
Activities:
- Scale: Increase agent volume (more denials, more coding queries, more patient contacts).
- Optimisation: Use feedback and outcome data to improve agent accuracy and efficiency.
- Expansion: Consider new use cases (eligibility verification, prior auth, claims submission).
- Benchmarking: Compare your results to industry benchmarks and competitors.
- Cost optimisation: Review API costs and architecture to identify savings opportunities.
- Team transformation: As agents handle routine work, redeploy team members to higher-value activities (complex appeals, policy negotiation, patient advocacy).
Deliverables:
- Scaled agent deployment.
- Continuous improvement roadmap.
- Expanded use cases and agents.
- Team transformation plan.
- Annual ROI and business impact report.
Critical Success Factors
- Executive sponsorship: Get buy-in from CFO and Chief Medical Officer. This is a revenue-impacting project.
- Data quality: Garbage in, garbage out. Invest in data cleanup before launching agents.
- Change management: Your team will be anxious about automation. Emphasise that agents augment, not replace. Redeploy team to higher-value work.
- Feedback loops: Agents improve with feedback. Make it easy for your team to flag issues and suggest improvements.
- Compliance first: Ensure agents are audit-ready from day one. Document all decisions for compliance.
- Iterative approach: Start small (one use case, pilot), measure, learn, and expand. Don’t try to boil the ocean.
For guidance on measuring and maximising the ROI of AI initiatives, see AI Agency ROI Sydney: How to Measure and Maximize AI Agency ROI Sydney for Your Business in 2026.
Security, Audit-Readiness, and Compliance
HIPAA, Privacy Act, and Local Regulations
Healthcare agents handle sensitive patient data (PHI/PII). Compliance is non-negotiable.
HIPAA (US):
- Agents must be deployed on secure infrastructure with encryption in transit and at rest.
- All access to PHI must be logged and auditable.
- Agents must not store PHI in third-party systems (e.g., Claude API). Use vector databases or local storage only.
- Business Associate Agreements (BAAs) must be in place with any vendors.
Privacy Act (Australia):
- Similar requirements to HIPAA. Patient data must be protected and access logged.
- Australian data must remain in Australia (no cross-border transfers to US cloud providers without explicit consent).
- Compliance with ACHA (Australian Commission on Health and Aged Care) standards.
AHPRA and health insurer regulations:
- Health insurers must meet AHPRA standards for data handling.
- Coding decisions must be auditable and defensible.
- Denials appeals must include clear reasoning and evidence.
Building Audit-Ready Agents
1. Secure deployment
- Deploy agents on your own infrastructure (on-prem or private cloud), not on shared SaaS platforms.
- Use VPCs, firewalls, and encryption.
- Implement role-based access control (RBAC) for who can access agents and their outputs.
2. Data handling
- Never send PHI to Claude API. Instead:
- De-identify data before sending to Claude (remove patient names, MRNs, dates).
- Use local LLMs (e.g., Llama 2, Mistral) for sensitive operations.
- Or use Claude API only for non-PHI tasks (e.g., coding guideline interpretation).
- Store PHI in encrypted databases with access logs.
- Implement data retention policies (e.g., delete agent logs after 90 days).
3. Audit trails
- Log every agent decision: what it decided, why, what data it used, who reviewed it.
- Example log entry:
Timestamp: 2026-01-15 14:32:00 Agent: Denials Manager v2.1 Claim ID: CLM-12345 Denial Code: 835-4 Agent Decision: Appeal (confidence 87%) Reasoning: Operative note exists but was not submitted. Resubmit with documentation. Data Sources: EHR (OPN-67890), Claims System (CLM-12345), Payer Portal Reviewed By: denial_specialist_42 Approval Status: Approved Outcome (30 days later): Claim approved for $3,500 - Retain logs for 7 years (standard healthcare compliance requirement).
4. Accuracy and validation
- Implement confidence thresholds. If the agent’s confidence is below 75%, escalate for human review.
- Conduct quarterly accuracy audits (sample 100–200 agent decisions, have experts review).
- Track false positives and false negatives. Use this data to retrain and improve agents.
5. Bias and fairness
- Healthcare agents can inadvertently introduce bias (e.g., coding agents that systematically underbill certain patient populations).
- Conduct bias audits: analyse agent decisions by patient demographics (age, gender, insurance type, etc.). Look for disparities.
- If bias is detected, investigate root causes and retrain agents.
SOC 2 and ISO 27001 Readiness
If your health system or health insurer is pursuing SOC 2 Type II or ISO 27001 certification, agents must be audit-ready.
SOC 2 Trust Service Criteria:
- CC (Common Criteria): Access controls, encryption, change management.
- Agents must have RBAC; changes must be tracked and approved.
- A (Availability): Systems must be available and performant.
- Agents must have SLAs (e.g., 99.9% uptime); failures must be logged and escalated.
- C (Confidentiality): Data must be protected.
- PHI must be encrypted and access-logged.
- I (Integrity): Data must be accurate and tamper-proof.
- Agent decisions must be logged and immutable (no retroactive changes without audit trail).
- P (Privacy): Personal data must be handled per privacy laws.
- Agents must respect data retention policies and patient consent.
ISO 27001 Controls:
- A.5 (Organisational Controls): Policies and procedures for data handling.
- Document agent policies: who can access, how data is handled, escalation procedures.
- A.6 (People Controls): Training and awareness.
- Train your team on agent security and compliance.
- A.7 (Physical and Environmental Controls): Secure infrastructure.
- Deploy agents on secure, monitored infrastructure.
- A.8 (Operations Controls): Change management, incident response, business continuity.
- Implement change control processes for agent updates.
- Have incident response procedures (e.g., if an agent makes a harmful decision, how do you respond?).
- A.13 (Communications Security): Encryption and secure data transmission.
- All agent-to-system communications must be encrypted.
Vanta Implementation for Agents
Vanta is a popular compliance automation platform. You can use it to track agent-related controls:
- Data inventory: Map what data agents access (PHI, PII, claims data, etc.).
- Access logs: Vanta can ingest agent access logs and flag suspicious activity.
- Change management: Track agent updates and approvals in Vanta.
- Risk assessments: Conduct risk assessments for agent deployments.
- Audit readiness: Vanta generates compliance reports for SOC 2, ISO 27001, HIPAA.
For more on compliance and audit-readiness, see AI Automation for Financial Services: Fraud Detection and Risk Management, which covers similar compliance challenges in financial services.
Next Steps: Building Your Agent-Led RCM
Step 1: Assess Your Current State
Before you commit to agents, understand where you are today:
- Denial rate: What % of claims are denied on first pass? (Benchmark: 15–25% is typical; <10% is excellent.)
- Coding efficiency: How long does it take to resolve a coding query? What’s the cost? (Benchmark: 4–8 days, AUD $8–15 per query.)
- Patient payment rate: What % of patients pay their bill within 30 days? (Benchmark: 30–40% is typical; >50% is excellent.)
- Team capacity: How many FTEs are dedicated to denials, coding, and patient billing? What’s their utilisation?
- Technology stack: What EHR, claims system, and billing system do you use? How mature is your data and APIs?
Use this baseline to set realistic improvement targets and measure ROI.
Step 2: Identify Your Highest-Impact Use Case
Not all RCM workflows are equally suitable for agents. Prioritise based on:
- Volume: How many cases per month? (Higher volume = faster ROI.)
- Complexity: Does the workflow require reasoning and context-switching? (Agents excel here.)
- Cost: What’s the current cost to handle each case? (Higher cost = greater savings.)
- Data quality: Do you have the data agents need (operative notes, payer info, patient history)? (Poor data quality = slower time-to-value.)
- Organisational readiness: Is your team open to automation? Do you have executive buy-in?
For most organisations, denials management is the highest-impact starting point: high volume, high complexity, high cost, and quick ROI.
Step 3: Partner with an Experienced AI Agency
Building healthcare agents is specialised work. You need:
- Deep experience with Claude (or other LLMs) in healthcare contexts.
- Understanding of RCM workflows, payer policies, and coding guidelines.
- Compliance expertise (HIPAA, privacy laws, audit-readiness).
- Integration skills (EHR, claims systems, billing systems).
PADISO is a Sydney-based venture studio and AI digital agency that specialises in exactly this. We’ve built agents for healthcare RCM, financial services, and legal services. We understand Australian privacy laws, AHPRA standards, and health insurer requirements. We can help you:
- Assess your current RCM state and identify opportunities.
- Design and build agents tailored to your workflows and data.
- Integrate agents into your systems and train your team.
- Ensure compliance and audit-readiness (SOC 2, ISO 27001, HIPAA).
- Measure impact and continuously improve.
Our approach: Agentic AI vs Traditional Automation: Why Autonomous Agents Are the Future explains how we think about automation and when to use agents vs. RPA. We also offer AI Strategy & Readiness services to help you plan your AI transformation.
Step 4: Start with a Pilot
Don’t try to boil the ocean. Start with a focused pilot:
- Scope: One use case (denials, coding, or patient billing). 100–500 cases.
- Duration: 4–8 weeks.
- Team: 1 senior engineer, 1 domain expert (denial specialist or coder), 1 product manager.
- Success metrics: Accuracy, time-to-resolution, cost, revenue impact.
Measure everything. Use the pilot to validate assumptions, identify challenges, and build confidence.
Step 5: Scale Iteratively
Once the pilot is successful:
- Expand volume: Increase the number of cases the agent handles.
- Add use cases: Build agents for the next highest-impact workflow.
- Optimise: Use feedback and outcome data to improve agent accuracy and efficiency.
- Integrate: Embed agents into your standard workflows and team processes.
- Scale team: As agents handle routine work, redeploy your team to higher-value activities.
Step 6: Measure and Communicate Impact
Track the business impact of your agents:
- Financial metrics: Revenue recovered, cost savings, improved cash flow.
- Operational metrics: Time-to-resolution, accuracy, escalation rate.
- Quality metrics: Patient satisfaction, compliance audit results.
Communicate results to stakeholders (CFO, CMO, board). This builds momentum for further investment.
For more on measuring AI impact, see AI Agency ROI Sydney: How to Measure and Maximize AI Agency ROI Sydney for Your Business in 2026.
Step 7: Explore Related Opportunities
Once you’ve mastered agents for RCM, consider:
- Prior authorisation automation: Agents can handle routine prior auth requests.
- Eligibility verification: Agents can verify patient eligibility across payers in real-time.
- Claims submission: Agents can submit claims with optimal coding and bundling.
- Payer appeals: Agents can draft and submit appeals with detailed reasoning.
Each of these is a potential 10–50% revenue impact.
Conclusion
Healthcare revenue cycle management is ripe for disruption. RPA solved the easy 60–70% of RCM (high-volume, rule-based work). But the hard 30–40%—denials with ambiguous root causes, coding queries that require judgment, patient follow-up that needs personalisation—still requires human reasoning.
Claude agents change the equation. They reason through complex situations, retrieve context from multiple systems, and make adaptive decisions. In healthcare RCM specifically, agents outperform RPA on three critical functions:
- Denials management: 89% accuracy (vs. 62% for RPA), 2.1 days to resolution (vs. 8.5 days), 69% lower cost.
- Coding queries: 94% accuracy, 0.8 days to resolution (vs. 4.2 days), 96% lower cost.
- Patient billing: 54% response rate (vs. 30% for generic reminders), 28 days to payment (vs. 47 days), 84% lower cost than collections.
The economics are compelling: break-even in 3–6 weeks, 100–200x ROI in Year 1, and 30–50% improvement in revenue cycle metrics.
The path forward is clear: assess your current state, identify your highest-impact use case, pilot with an experienced partner, measure results, and scale iteratively.
Healthcare RCM is a AUD $10B+ market in Australia and the US combined. Organisations that adopt agent-led automation will capture 5–15% efficiency gains—worth hundreds of millions of dollars. The question is not whether agents will transform RCM, but when your organisation will start.
Ready to explore agent-led RCM for your organisation? Contact PADISO to discuss your specific workflows, challenges, and opportunities. We’ve built agents for healthcare, financial services, and legal services. We understand the compliance challenges, the integration complexity, and the business impact. Let’s build something together.
Further Reading and Resources
To deepen your understanding of agentic AI in healthcare and RCM:
- Explore Top 12 AI RCM Solutions in 2026 for Healthcare - Innovaccer for a comprehensive ranking of AI RCM platforms.
- Read RPA vs AI in Revenue Cycle Management: A Buyer’s Guide for 2026 for an in-depth comparison of RPA and AI approaches.
- Review RPA vs Agentic AI in Healthcare: When to Use Which for honest guidance on when to use RPA vs. agents.
- Study The Promise of RPA and AI in Revenue Cycle Management - Waystar for real-world case studies and insights.
- Explore How AI Agents-Powered RPA Is Optimizing Revenue Cycle Management for details on AI-powered RPA architectures.
- Review Using RPA to Optimize Healthcare Revenue Cycle Operations - Huron for operational insights from a leading consulting firm.
- Read Transform Revenue Cycle Management with Agentic Automation - UiPath for UiPath’s perspective on agentic automation in RCM.
For deeper dives into AI automation in healthcare and related domains, see AI Automation for Legal Services: Document Review and Contract Analysis, which covers similar agent-based automation approaches in legal services, and AI Automation for Healthcare: Diagnostic Tools and Patient Care for broader applications of AI in healthcare operations.
To understand how to measure and maximise ROI from AI initiatives, explore AI Agency ROI Sydney: How to Measure and Maximize AI Agency ROI Sydney for Your Business in 2026 and AI Agency Performance Tracking: Everything Sydney Business Owners Need to Know.
For a comprehensive overview of AI agency services and how to choose the right partner, see AI Agency Sydney: Everything Sydney Business Owners Need to Know in 2026 and AI Automation Agency Sydney: The Complete Guide for Sydney Businesses in 2026.
Finally, for technical guidance on AI and ML integration, read AI and ML Integration: CTO Guide to Artificial Intelligence.