Vapi AI Pricing: True Cost Breakdown in 2026

Doğa Kaplan
April 16, 2026
12
 min read
Contents

If you've been exploring Vapi AI for voice agent infrastructure, you've probably noticed that the pricing page raises more questions than it answers. The platform runs on a pay-as-you-go model, which sounds simple until you realize that the number on your invoice comes from at least four different cost layers stacked on top of each other. 

This article breaks down Vapi AI pricing in full: what each cost layer covers, how they interact, and what you'll realistically pay. We also look at Zeeg as an alternative if you need voice AI that's directly tied to appointment scheduling.

Skip the build. Start booking calls today.

Zeeg's AI agent answers calls, qualifies leads, and books meetings — no providers to configure, no code to write. Try any paid plan free for 14 days.

Sign up for free

What is Vapi AI?

Before digging into the numbers, here’s a quick orientation for you. Vapi is a developer-focused platform that lets engineering teams build, deploy, and scale voice AI agents. It handles the infrastructure layer (real-time audio processing, latency management, and the orchestration between different AI model providers) so developers don't have to build that themselves.

It's mainly used for things like inbound customer service agents, outbound sales dialers, appointment reminder bots, and similar phone-based automation. The platform connects to external providers for speech-to-text (STT), text-to-speech (TTS), large language models (LLM), and telephony. And here's the thing, each of those providers charges its own fee. Vapi's cost is the sum of all of them, plus its own hosting fee on top.

That layered structure is precisely why Vapi AI cost can feel hard to pin down. So let's go layer by layer.

The four cost layers in Vapi AI pricing

There's no single "Vapi price" as what you pay is the combined total of four distinct components. Most pricing pages for tools like this present a tidy number, but Vapi is unusually transparent about what goes into it. That's both refreshing and, admittedly, a bit daunting at first.

Here's what you're actually paying for:

1. Vapi hosting cost — this is the platform fee Vapi charges for running your agents. As of 2026, it's $0.05 per minute for calls. With no subscription or seat fees.

2. Large Language Model (LLM) costs — this is what you pay to the AI model processing the conversation. You can use OpenAI, Anthropic, Groq, or others. These are charged at cost from the provider, Vapi passes through whatever the LLM provider charges, without markup.

3. Text-to-Speech (TTS) costs — the voice synthesis layer. ElevenLabs, Deepgram, OpenAI TTS, and others are supported. Again, at-cost pricing from the provider.

4. Speech-to-Text (STT) costs — transcription of the caller's voice. Deepgram is the default, though others are available. Same deal: at-cost passthrough.

Telephony (the actual phone call transport) is a fifth consideration, though it's billed separately through your own telephony provider like Twilio, Vonage, Telnyx, or similar. Vapi doesn't resell phone numbers or minutes directly; you bring your own provider.

Breaking down Vapi hosting cost

The $0.05/minute Vapi hosting fee is the fixed part of the equation. It covers the infrastructure for running your agent: real-time audio processing, the WebSocket connections, the orchestration between providers, and the platform itself.

For context: 1,000 calls at an average of 5 minutes each equals 5,000 minutes, which is $250 in Vapi hosting alone before adding any model provider costs. That number scales linearly with usage, which is predictable but worth factoring in from the start.

Vapi also charges for call concurrency. The default plan includes 10 concurrent lines, with additional lines available at $10 per line per month. If you're running a high-volume outbound campaign where dozens of calls happen simultaneously, that cost adds up.

For SMS and chat interactions, pricing is $0.005 per message which is a lot less than voice, though it depends on your use case.

LLM provider costs: The wildcard

This is where the numbers start to diverge significantly depending on your setup. LLM costs are billed at cost from whichever provider you use, and the per-minute equivalent varies a lot.

A few reference points:

  • GPT-4o (OpenAI): roughly $0.005 per 1K input tokens and $0.015 per 1K output tokens
  • GPT-4o mini: $0.00015 input / $0.0006 output per 1K tokens — much cheaper, reasonable for simpler interactions
  • Claude Haiku (Anthropic): competitive with GPT-4o mini for lightweight tasks
  • Groq-hosted Llama models: among the cheapest options, particularly for latency-sensitive voice use cases

An average one-minute phone call might involve somewhere between 500 and 2,000 tokens depending on conversational complexity, the length of your system prompt, and how verbose the caller is. That puts LLM cost roughly between $0.002 and $0.10 per minute at the model layer, which depends entirely on the model chosen and conversation depth.

If you bring your own API key for the LLM provider, Vapi still charges its $0.05/min hosting fee, but the LLM costs go directly to your provider account rather than through Vapi. That means you get your provider's native pricing without any additional margin.

TTS and STT costs

These two layers are often smaller than the LLM component, but they still add up over volume.

Speech-to-Text (STT): Deepgram Nova-2 is the most commonly used option on Vapi. It's priced at around $0.0043 per minute for streaming transcription. Assemblyai and others are also supported at similar price points.

Text-to-Speech (TTS): This one varies more. ElevenLabs, which many teams use for more natural-sounding voices, runs approximately $0.18 per 1,000 characters. For a one-minute conversation with average verbosity, that might mean 200–400 characters of TTS output, putting the cost around $0.036–$0.072 per minute. OpenAI TTS is generally cheaper. Deepgram Aura is another lower-cost option if voice quality requirements are flexible.

The voice you choose has a real cost implication, in other words. A "premium" synthetic voice can meaningfully change your per-minute total.

Telephony costs: The layer Vapi doesn't cover

Here's something that catches people off guard: Vapi does not handle telephony directly. You need a separate telephony provider (Twilio, Vonage, Telnyx, Bandwidth, or similar) to actually place or receive calls. Vapi connects to your provider via SIP or PSTN, but the phone number rental and call minutes are billed by that provider independently.

Twilio, for example, charges:

  • Around $0.0085/minute for inbound calls
  • Around $0.013/minute for outbound calls
  • $1.15–$2/month per phone number (local US numbers)

Telnyx tends to be cheaper, often around $0.004–$0.009/minute for inbound and slightly more for outbound. Vonage and Bandwidth have their own rate structures. The point is that this cost layer is real, it's not small at scale, and it's entirely separate from anything Vapi invoices you for.

What does a real call actually cost?

Let's put it all together with a realistic example. Assume:

  • A 3-minute inbound customer support call
  • GPT-4o mini as the LLM
  • Deepgram for STT
  • OpenAI TTS for voice
  • Twilio for telephony

Here's a rough cost estimate per call:

Cost layer Rate Estimated cost (3-min call)
Vapi hosting $0.05/min $0.15
LLM (GPT-4o mini) ~$0.003–0.01/min $0.01–$0.03
STT (Deepgram) $0.0043/min $0.013
TTS (OpenAI) ~$0.015/min equiv. $0.045
Telephony (Twilio) $0.0085/min $0.026
Total estimate ~$0.23–$0.26

So for 1,000 calls at 3 minutes each, you're looking at roughly $230–$260 per month in total per-call costs, and that's using a cheaper LLM and standard voice. Swap GPT-4o mini for GPT-4o and add ElevenLabs, and you could be looking at $0.40–$0.60 per call instead.

At 10,000 calls a month, those differences really start to matter.

Concurrency, data retention, and add-on costs

Beyond the per-minute charges, there are a few other cost factors worth knowing.

Concurrency is probably the most overlooked one. By default, Vapi gives you 10 concurrent lines. For a low-volume setup, that's fine. But if you're running outbound campaigns or handling peak inbound volumes, you may need more; at $10 per additional concurrent line per month. For a contact center running 50 simultaneous calls, that alone is an extra $400/month.

Data retention is also something to think about. On the default plan, call history is kept for 14 days and chat history for 30 days. If your use case requires longer records (for compliance, for example) that's a paid add-on. HIPAA compliance with zero data retention is available as a $1,000/month add-on, which tells you something about the target market at that tier.

Enterprise features like SSO, RBAC, SOC2 compliance, named support engineers, and SLAs are available on enterprise plans, but Vapi doesn't publish those prices publicly. You'll need to contact their sales team, which is standard practice for enterprise tiers but worth flagging if you're planning a larger deployment.

Is Vapi AI pricing actually competitive?

Relative to building the same infrastructure yourself, yes; Vapi is genuinely cost-efficient. Handling real-time audio orchestration, managing the latency between multiple AI providers, and keeping everything stable at scale is not trivial engineering work. The $0.05/min hosting fee is a reasonable charge for avoiding that.

Compared to other voice AI platforms, Vapi sits on the lower end of the pricing spectrum for similar capabilities. Platforms like Retell AI, Bland AI, and others in the space tend to price in the $0.07–$0.15/min range for an all-in-one hosted solution. The trade-off is that those platforms abstract away the provider complexity as you pay a single fee and don't have to manage STT, TTS, and LLM configurations yourself.

Vapi's model gives you more control and potentially lower costs at scale if you optimize your provider stack. But that flexibility requires more engineering investment. If you're a solo operator or a small team without dedicated developers, the overhead of managing four separate provider accounts and bills might not be worth the savings.

Who is Vapi actually built for?

Honestly, Vapi is a developer's tool. The documentation, the architecture, and the pricing model all point to a primary audience of engineering teams building voice AI products at scale. Companies using Vapi typically include:

  • SaaS companies building voice features into their own products
  • Agencies developing AI phone agents for clients
  • Larger businesses with dedicated dev resources deploying call center automation

It's not really well-suited for small businesses or non-technical teams looking for a plug-and-play AI phone agent. There's no visual no-code builder, no out-of-the-box booking integration, and no CRM layer. You get the infrastructure, what you build on top of it is up to you.

That's a meaningful distinction if you're evaluating tools for appointment booking, lead qualification, or customer scheduling specifically.

Zeeg: A different angle on voice AI for scheduling

Speaking of which, if your main goal is AI-powered phone scheduling rather than custom voice infrastructure, it's worth looking at Zeeg before committing to a Vapi build.

Zeeg is a scheduling CRM that has a natively built voice AI agent, meaning the calling, booking, and CRM layers are all in one platform. There are no separate accounts to manage across Twilio, OpenAI, ElevenLabs, and Vapi — it's all handled under one roof.

The difference in setup experience is significant. With Vapi, getting a working voice agent requires selecting and configuring multiple providers, writing prompts, setting up telephony, and building your own booking logic on top. With Zeeg, you write a prompt, pick a phone number, set routing rules, and your agent is live in a matter of minutes with no code required.

From a cost standpoint, Zeeg uses a credit-based model for call minutes. Inbound calls run approximately €0.07/minute, and outbound calls around €0.19/minute. Minute bundles are available to reduce the per-minute rate at higher volumes. Every call logs its exact cost in the CRM, so there are no surprise invoices at the end of the month.

Here's how the two platforms compare for a scheduling-focused use case:

Vapi AI Zeeg
Primary audience Engineering teams Any team size
Setup complexity High (multi-provider config) Low (no-code, minutes)
Booking integration Build it yourself Native, built-in
CRM included No Yes, native CRM
Telephony Separate provider required Included (buy in-app or SIP import)
Per-minute cost $0.05 + providers (total ~$0.07–0.15+) ~€0.07 inbound / €0.19 outbound (all-in)
Call records & transcripts Available (14-day retention default) Built into CRM, per-call cost visible
Base platform plan Pay-as-you-go (no subscription) From $10/user/month (Professional)
Ideal for Custom voice AI products AI-powered scheduling & lead booking

The two tools aren't really direct competitors as Vapi is infrastructure, while Zeeg is an end-to-end scheduling platform. But if your use case is getting an AI agent to answer calls and book meetings, Zeeg is worth evaluating seriously before choosing to build on Vapi from scratch.

Zeeg's voice AI agent handles inbound and outbound calls, routes based on caller intent using plain-language rules, captures lead details, and books appointments, all logged automatically in the CRM. You can import an existing SIP number or purchase a local number directly in the platform. The Zeeg Professional plan starts at $10/user/month billed annually, with voice AI available from that tier.

One platform. Calling, booking, and CRM — all built in.

No Twilio account. No LLM setup. No separate booking tool. Zeeg handles it all, with transparent per-call costs logged directly in your CRM.

Try Zeeg free

Wrapping up on Vapi AI’s 2026 costs

Vapi AI pricing is genuinely pay-as-you-go, which is good for flexibility. But the true Vapi AI cost is the sum of at least four billing layers (platform hosting, LLM, TTS, STT) plus a separate telephony provider you manage yourself. For a simple 3-minute inbound call with a budget model stack, you're looking at roughly $0.23–$0.26 per call. With premium voice and a more capable LLM, that number can exceed $0.50. 

It's a competitive model for engineering teams that want full control over their voice AI stack and are comfortable managing multiple provider accounts. For teams who just want AI-powered phone scheduling without the build overhead, a more integrated platform like Zeeg offers a comparable per-minute cost with a fraction of the setup complexity and the booking, CRM, and call records all come built in.

FAQ: Vapi AI pricing

Does Vapi have a free plan?

Vapi offers $10 in free credits to get started, which is enough to test and experiment with the platform. There's no ongoing free tier — beyond the initial credits, all usage is billed.

Can I get a fixed monthly price instead of pay-as-you-go?

Not on the standard plan. Vapi's model is fully usage-based. If you need predictable costs, you'd need to negotiate an enterprise arrangement or build your own cost caps on top of the API.

Does Vapi include telephony?

No. Vapi handles the AI orchestration layer, but actual phone calls require a separate telephony provider like Twilio, Telnyx, or Vonage. You manage that account and its billing independently.

If I use my own OpenAI API key, do I still pay Vapi?

Yes. Vapi's $0.05/minute hosting fee applies regardless of whether you bring your own LLM keys. The difference is that using your own keys means LLM costs go directly to your OpenAI (or other provider) account rather than through Vapi.

What is call concurrency and why does it matter?

Concurrency is the number of calls your agents can handle at the same time. The default is 10 simultaneous calls. If you need more — for example, during an outbound campaign — you pay $10 per additional concurrent line per month.

How does HIPAA compliance work on Vapi?

HIPAA compliance with zero data retention is available as a paid add-on at $1,000/month. For most standard use cases without healthcare data requirements, this isn't necessary.

Can I get volume discounts on Vapi?

Volume discounts are available at scale, but not through the self-serve plan. You'd need to contact Vapi's sales team to discuss pricing at higher volumes.

What happens to my call history after 14 days?

On the default plan, call history is purged after 14 days and chat history after 30 days. Extended data retention is available as an add-on — contact Vapi for pricing.

Is Vapi reliable enough for production use?

Vapi states their self-serve platform is production-ready, and it's used by a range of companies in live deployments. For enterprise-level SLA guarantees, you'd need to be on an enterprise contract.

What's the minimum enterprise plan cost?

Vapi doesn't publish enterprise pricing publicly. Based on available information and general market positioning, enterprise contracts typically start in the $1,000+/month range, but you'd need to get a quote directly from their sales team.