AI in customer service: the 2026 buyer's guide
Most vendor AIs run the same foundation models under different wrappers. What actually decides the contract is how they bill it and what they let it read first.
Deskwoot Team·May 2, 2026·6 min readAI in customer service in 2026 is a commoditized layer where every vendor runs the same underlying models, so the buying decision comes down to two things: how the vendor bills the AI, and how well it can read your own content before it answers. It's almost never special. They're all running Claude, GPT, or Gemini under different wrappers, and the wrappers cost roughly the same to build. What decides the contract is two things: how the vendor bills the AI, and what it lets the model read before it answers. Get either wrong and you'll spend the next two years fighting your invoice.
How vendors bill AI, and why it matters
There are two patterns in 2026. One bills you every time the AI marks something solved. Zendesk does this at $1.50 to $2.00 per resolution. The other bills you per conversation, whatever the outcome. Deskwoot does this at $0.03 to $0.07 per conversation. The numbers behind the two models are far enough apart that they sit in different budget categories.
The hidden problem with the first model is what happens during a spike. A viral thread, a holiday rush, a podcast mention, your conversation volume doubles, and so does your bill. Every resolution is the meter ticking again. With per-conversation billing, the worst case is bounded by your traffic, which finance can already predict.
| Vendor | How AI is billed | 5,000 conversations/month |
|---|---|---|
| Deskwoot Bot | Per conversation, $0.03 to $0.07 | $50 to $150 |
| Zendesk Advanced AI | Per resolution, $1.50 to $2.00 | $7,500 to $10,000 |
| Intercom Fin | Per resolution, $0.99 | ~$4,950 |
| Freshdesk Freddy | Per session, $0.10 | ~$500 |
Grounding matters more than the model
Every vendor at this point ships a Copilot for agents and a bot for customers. The model under the hood is almost always one of three foundation models. The differentiator is what the vendor lets it read before it answers.
A vendor that pipes a customer's question straight to Claude with nothing else attached will give you confident answers that get the details wrong. Your prices, your refund policy, your edge cases. None of that is in the model's training data. The reply reads like marketing.
Vendors who do this properly use RAG. They feed your help center, your training docs, your past tickets, sometimes your product wiki, into the model's context window before it generates anything. The reply that comes back reads like your team would have written it, because the model is literally working from what your team has written before.
If you're evaluating, the test is simple. Pick five real customer questions you've answered in the past month. Run them through each vendor's Copilot. Read what each one drafts. The grounded ones produce text you can ship with one or two edits. The ungrounded ones produce paragraphs you'd rewrite from scratch.
The questions to actually ask
Most evaluations focus on the demo flow, which tells you almost nothing because every demo is built to succeed. These questions surface real differences.
What is the AI billed by? Get it in writing before you sign anything. The phrasing matters. "Pricing depends on usage" can mean per resolution, per session, per conversation, or per token, and the difference is two orders of magnitude.
What foundation model is underneath? If the answer is "our proprietary model," they are reselling Claude, GPT, or Gemini in a wrapper. That is fine, but you should know which one. Some industries forbid sending data to certain providers.
What does the grounding actually pull from? Just the help center? Past tickets too? Your product wiki? The bigger the corpus, the better the answers, but also the more your team has to maintain.
What happens when the AI doesn't know? Does it hand off to a human cleanly, or does the customer get stuck repeating themselves? Test the failure path. Most demos show the success path because it makes the bot look smart.
What is the prompt injection posture? If the answer is "we'll get back to you," that's an answer.
How to run the comparison
A week is enough. Two vendors, the same hundred real customer conversations from your last month, both products running the same volume on the same data. Track five things: how fast each Copilot drafted a reply, how much your agents had to edit before sending, what share of conversations the bot handled without help, the customer satisfaction score on those AI threads, and the total AI cost each vendor would have charged.
Whoever wins three of those five is your answer. Don't overthink it.
Watch on YouTube
Enjoying this?
Get the Deskwoot newsletter
One email a month. Practical guides on AI customer support, no marketing fluff.
One thing to avoid
Don't sign a multi-year deal priced per resolution until you've measured your actual resolution count on real traffic. Vendors quote based on usage estimates that often turn out to be three or four times lower than reality. By month six you're looking at a bill you didn't budget for. The escape hatch is per-conversation billing, where the worst case is your total volume, which you can already forecast.
FAQ
What does grounding actually mean?
Feeding your own content into the AI's context window before it generates a reply. Without it, you get generic answers. With it, you get answers that sound like your product, your policy, and your tone.
Can I run my own AI provider keys?
Some vendors let you. Deskwoot does. That usually means unlimited usage at provider rates, and your prompts go straight from your tenant to the provider you picked, with nothing shared in between.
How long should the comparison run?
A week is enough to compare two vendors on the same conversation set. Two weeks is better if your shift coverage varies day to day.
Run the comparison on Deskwoot
The free trial includes the full AI stack. AI Copilot is in every paid plan, the bot Fynn is grounded in your help center plus the Training Hub, and AI is billed per conversation at $0.03 to $0.07. Run the comparison on your own data before you decide.
How is AI used in customer service?
AI in customer service is used across 5 jobs in 2026: answering routine customer questions directly (deflection), drafting replies for human agents (Copilot), summarizing long conversation threads (handoff prep), classifying incoming messages by intent (routing), and translating between languages on the fly (multilingual support). Each job uses the same underlying language model but with different prompts and guardrails.
The majority of teams start with the Copilot pattern because it touches every conversation without customer risk: the human still approves every reply. Once the team trusts the Copilot's drafts, the next step is enabling the customer-facing bot on a single channel (typically live chat) where the cost of a miss is low and the AI can hand off to a human cleanly.
What is an example of AI customer service in practice?
A practical example: a customer types "my order #1234 has not arrived" into live chat. The AI bot reads the message, looks up order 1234 in the connected Shopify or commerce platform, sees the carrier tracking shows delivery delayed, and replies in seconds with the updated ETA plus a link to the tracking page. The same conversation would have taken a human agent 3 to 5 minutes of manual lookup.
If the AI sees something it cannot handle (refund over the auto-approval threshold, complaint, or a question outside its grounding content), it escalates to a human with a summary of what the customer already explained. Deskwoot's Fynn ships with this escalation pattern built in and is grounded in your help center, so its order-status answers cite your real shipping policy, not invented rules.
Frequently asked questions
Quick answers on the topics covered above.
Which AI model do customer service tools use?
How is AI customer service billed?
What is AI grounding in customer service?
Is AI customer service safe for sensitive data?
How do I evaluate AI customer service vendors?
Ready to improve your customer support?
Try Deskwoot free for 7 days. Cancel anytime.
Get started for free