
Adding AI-powered customer support to an existing product in 2026 typically costs $0 to $5,000 in build cost if you wire up an off-the-shelf agent like Intercom Fin or Decagon, or $15,000 to $90,000 if you build a retrieval-augmented assistant on your own knowledge base. After that, the real number to track is monthly run-cost per deflected ticket, which ranges from about $0.30 (DIY at scale) to $2.00 (Zendesk PAYG) per resolved conversation.
This post is specifically about customer-support automation: deflecting tickets, escalating cleanly to humans, integrating with helpdesks like Zendesk, Intercom, Front, and Pylon. If you want general AI chatbot economics, read our breakdown of the cost to build an AI chatbot and the companion piece on adding a chatbot to your existing app. The numbers here are different because support agents need ticket-system writebacks, escalation logic, CSAT tracking, and outcome measurement that a marketing chatbot does not.
Before we list vendor prices, agree on the unit economics. A SaaS support ticket costs $18 to $35 to resolve with a human (Tier 1), and $30 to $60 in B2B (Tier 2 or Tier 3). If your AI agent deflects a ticket for $1.00, you save roughly $17 to $34 per resolution. If it costs $5.00 to deflect, you still save money, but the case for building gets thinner.
Three numbers anchor the math:
If you cannot produce these three numbers with a straight face, your savings model is fiction. We have seen founders sign $300k Decagon contracts based on a vendor's "70% deflection" demo, then hit 28% in production because the demo was on FAQs and prod traffic was account-specific.
The phrase covers four layers, each with its own cost:
A "chatbot" is mostly the first two. A real customer-support agent does all four. The action and escalation layers are where the build cost balloons.
| Approach | Build cost | Run cost | Timeline | Pros | Cons |
|---|---|---|---|---|---|
| Intercom Fin (inside Intercom) | $0 build | $0.99 / resolution + $29-$139/seat/mo | 1-3 days | Fast setup, pay only for resolved tickets, native Intercom integration | Locked into Intercom helpdesk; gray areas in what counts as resolved |
| Intercom Fin (standalone w/ Zendesk, Salesforce, HubSpot) | $0 build | $0.99 / resolution, $49.50/mo minimum | 3-7 days | Works with your existing helpdesk | Still per-resolution opacity; less control over fallback |
| Zendesk Advanced AI + Automated Resolutions | $0 build | $50/agent/mo + $1.50 per committed resolution ($2.00 PAYG) | 1-2 weeks | Native if you already run Zendesk; outcome-based billing | More expensive per resolution than Fin; tier minimums on resolutions per agent |
| Decagon (enterprise) | ~$25k-$75k implementation | ~$0.50-$0.99 / conversation + ~$50k platform fee; median contract $400k/yr | 4-8 weeks | Best for high-volume, complex B2C; deep workflow customization | Annual commit, no list price, billing disputes on "what counts" |
| Lorikeet | Implementation included | Pay-per-successful-resolution, "mark-bad" credit policy | 4-6 weeks | Aligned incentives, strong on multi-step complex tickets | Unpredictable monthly bill, premium pricing, custom quote |
| Pylon AI (B2B SaaS) | $0 build | $59-$139/seat/mo + $100/mo AI Agents add-on (scales with volume) | 1-2 weeks | Built for B2B SaaS support over Slack/email/Teams; clear seat pricing | Less mature autonomous-agent layer than Fin or Decagon |
| DIY RAG on your knowledge base | $15,000-$90,000 (10-30 engineer weeks) | $200-$3,000/mo (LLM + vector DB + infra) | 4-12 weeks | Full control, cheapest at high volume, no per-resolution tax | You own the eval, fallback, drift, and 3am pages |
| Cadence senior engineer (DIY-RAG built on-demand) | $1,500/wk x 4-8 weeks = $6,000-$12,000 | $200-$3,000/mo infra | 48-hour trial, 4-8 weeks to ship | AI-native by default, weekly billing, replace any week | You still own the eval and on-call |
The DIY row understates how much engineering judgment matters. A senior who has shipped RAG before will do this in 4 weeks. A mid who hasn't will take 8 to 12 weeks and ship a worse fallback. The cost gap between the right hire and the wrong one is bigger than the gap between vendors.
Where vendors win or lose depends on volume. We ran the numbers at 1k, 10k, and 100k tickets per month, assuming 50% deflection.
| Volume / Approach | 1,000 tickets/mo (500 deflected) | 10,000 tickets/mo (5,000 deflected) | 100,000 tickets/mo (50,000 deflected) |
|---|---|---|---|
| Intercom Fin (in Intercom) | $495 + 1 seat ($29-$139) | $4,950 + seats | $49,500 + seats |
| Zendesk Auto Resolutions (PAYG) | $1,000 ($2 x 500) | $10,000 | $100,000 |
| Decagon (est. $0.99 conv) | $50k platform + $990 = ~$5,150/mo | ~$54,150/mo | ~$103,000/mo |
| DIY RAG (Claude Haiku 4.5 + pgvector) | ~$60 LLM + $200 infra = $260 | ~$600 + $400 = $1,000 | ~$6,000 + $1,000 = $7,000 |
Cost per deflected ticket at each tier:
The crossover is real. Below ~5k tickets/mo, off-the-shelf wins on total cost of ownership because your engineering hours are more valuable than the vendor markup. Above ~50k tickets/mo with stable workflows, DIY pays back the build cost in 3 to 6 months.
If you go DIY, here is where the engineer-weeks go. Every estimate assumes a Cadence senior at $1,500/week.
| Component | Engineer-weeks | Real cost |
|---|---|---|
| Knowledge ingestion (crawl + chunk + embed) | 1-2 weeks | $1,500-$3,000 |
| Vector store + retrieval pipeline (pgvector or Pinecone) | 0.5-1 week | $750-$1,500 |
| LLM agent loop (Claude or GPT, tool use, citations) | 1-2 weeks | $1,500-$3,000 |
| Helpdesk integration (Zendesk/Intercom API, ticket writeback) | 1-2 weeks | $1,500-$3,000 |
| Action layer (refund, plan change, lookups with auth + audit) | 2-4 weeks | $3,000-$6,000 |
| Escalation logic + human handoff with context | 1-2 weeks | $1,500-$3,000 |
| Eval harness (golden test set, regression on every prompt change) | 1-2 weeks | $1,500-$3,000 |
| Admin UI for support team to review + correct | 1-3 weeks | $1,500-$4,500 |
| Observability (per-conversation cost, latency, deflection dashboard) | 0.5-1 week | $750-$1,500 |
Total: 9 to 19 weeks, $13,500 to $28,500. The wide range is the action layer. If you only need read-only Q&A, you skip 4 to 6 weeks of work. If you need write actions across 5+ internal systems, add 4 to 8 weeks.
Run-cost components for a Claude Haiku 4.5 RAG stack at 10k resolutions/mo:
That is $400 to $1,350/mo all-in for 10k resolutions, or $0.04 to $0.14 per deflected ticket in raw run cost. The break-even calculus changes if you need GPT-4-class models for complex routing, which can 5x the LLM bill.
Three line items will surprise you on every approach:
1. Eval and tuning labor. Whether vendor or DIY, someone has to grade conversations, write golden test sets, retrain embeddings on missed questions, and re-prompt when behaviors drift. Plan for 0.25 to 0.5 FTE forever if you want deflection to stay above 50%. At a fully loaded $150k US salary, that is $37k to $75k/yr in ongoing labor. Most "Fin is so cheap" pitches ignore this.
2. Knowledge base cleanup. Your AI agent is only as good as the docs it reads. Most teams discover their help center has 3-year-old contradictory articles. Budget 2 to 4 weeks of a senior support lead's time to audit, merge, and version content before launch. This is not an engineering cost, but it kills timelines.
3. Escalation tax. Every conversation the AI fails on costs more than a normal ticket because the human has to read the AI's transcript first. Anecdotally we have seen a 1.3x to 1.5x time multiplier on escalated tickets. If your deflection is 50% and your escalation tax is 1.5x, your effective cost reduction is closer to 25%, not 50%.
Five moves that consistently work:
If you are starting from scratch:
For step 3, every engineer on Cadence is AI-native by default, vetted on Cursor, Claude Code, and Copilot fluency before they unlock bookings. A senior at $1,500/week typically scopes a support-RAG MVP in the 48-hour free trial and ships v1 in 4 to 6 weeks. The same senior who wrote your last RAG pipeline will not be available on a job-board next week, but you can book one in 2 minutes on Cadence and replace any week if it is not working.
If you are debating build vs buy and just want a sanity check, the /tools/decide Build/Buy/Book tool runs the math against your specific volume and answers in under a minute.
The chatbot economy and the support-agent economy overlap. If you want the broader picture:
Try it without committing. If you want to scope a DIY support-RAG build with someone who has shipped one before, book a senior engineer on Cadence for a 48-hour free trial. Weekly billing, replace any week, no notice. If the vendor route turns out cheaper for your volume, the engineer will tell you that directly and you part ways at the end of week one.
Fin is $0.99 per successful resolution with a 50-resolution monthly minimum (~$49.50/mo) when used standalone with Zendesk, Salesforce, or HubSpot. Inside Intercom there is no resolution minimum, but you need at least one paid Intercom seat ($29 to $139/mo). At 1,000 resolved conversations per month, expect about $1,000 to $1,150 all-in.
Decagon makes sense above ~50k tickets/mo with complex workflows that need deep customization. Median contract values land around $400k/year per Vendr data, with a $50k floor. Below 25k tickets/mo, Fin or Zendesk Automated Resolutions almost always wins on cost per resolution.
Yes, but it takes a senior engineer 4 to 8 weeks for a real v1, not 4 to 8 days. The hard parts are the action layer (auth, audit, rollback), the escalation logic (when to give up), and the eval harness (so you can iterate without breaking prod). Plan for 9 to 19 engineer-weeks for a full system with write actions.
Realistic year-one deflection in 2026 is 35% to 55% for general SaaS, 60% to 75% for narrow-domain B2C use cases, and 20% to 40% for complex B2B. Vendor demos quoting 80%+ are usually run on a curated FAQ subset; production traffic always pulls the number down.
Track three numbers monthly: baseline cost per human-resolved ticket, AI cost per deflected ticket (vendor fees plus your eval labor), and escalation tax (time multiplier on tickets the AI failed on). True savings = (deflected tickets x baseline cost) minus (vendor fees + eval labor + escalation tax). Most teams overstate savings by 30 to 50% because they ignore escalation tax and tuning labor.
The per-resolution price floor has dropped from ~$2 in 2024 to ~$1 in 2026. It will keep falling. If you are <5k tickets/mo, the savings from waiting 12 months are smaller than the savings from deflecting 50% of tickets for those 12 months. Build or buy now, just keep the contract short (Fin and Pylon are month-to-month; avoid 3-year Decagon commits unless you are clearly enterprise scale).