LLM Pricing Comparison — find the right AI model for your project.

Answer three quick questions — what you want to do, how heavy your usage is, and which models you’d like to compare. We’ll show you the real monthly cost and feature differences, in plain language.

  • Tracks 109+ AI models, with prices refreshed daily.
  • Plain-language labels — no technical jargon.
  • Share your results via link.

What the data shows

As of April 2026, the cheapest production-grade AI model in Buzzi.ai's pricing database is devstral-small-2505 at $0.00 per million input tokens. We track 97 production-ready models across pricing, quality benchmarks, context window, and data residency — refreshed daily.

How it works

Three quick questions. A real cost number in return.

No sign-up, no spreadsheet, no jargon. Built for founders, product teams, and engineers who need an answer in under a minute.

  1. Pick a scenario.

    Tell us what you want AI to do — chat, code, extract data, understand images, reason, or process data in bulk. We filter the model list to what matters.

  2. Set your usage.

    Share a rough sense of volume and message length. No tokens, no math — plain English with anchors like "a side project" or "production scale."

  3. Compare real costs.

    Every model card shows your personalized monthly cost. Side-by-side bars surface the cheapest pick and how much you’d save by switching.
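The personalized number in step 3 reduces to simple token arithmetic. A minimal sketch of that estimate — function and parameter names are illustrative, not the site's actual code:

```python
def monthly_cost(requests_per_day: int,
                 avg_input_tokens: int,
                 avg_output_tokens: int,
                 input_price_per_1m: float,
                 output_price_per_1m: float,
                 days: int = 30) -> float:
    """Estimated monthly spend in dollars for one model at list price."""
    input_tokens = requests_per_day * avg_input_tokens * days
    output_tokens = requests_per_day * avg_output_tokens * days
    return (input_tokens * input_price_per_1m
            + output_tokens * output_price_per_1m) / 1_000_000

# 1,000 chats/day, ~500 tokens in and ~700 out, at $3/$15 per 1M
# (a Claude Sonnet-class list price) -> $360/month
cost = monthly_cost(1000, 500, 700, 3.00, 15.00)
```

The same formula scales linearly, which is why a rough volume estimate in plain English is enough to rank models by cost.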

What we track

One database. Every provider worth watching.

97 production-ready models across 21 providers — pricing, context window, benchmarks, regions, and compliance on every row. Refreshed each morning from official pricing pages, cross-checked against third-party aggregators.

Production-ready models: 97 tracked

Providers covered: 21 worldwide

Quality benchmarks: 1 per model

Refresh cadence: daily price sync
Priced today

The latest flagship model from every major lab.

Prices are per 1 million tokens. Cached and batched rates apply when you reuse prompts or accept a delay. Click a row to open the full model page.

Provider | Model | Context | Input /1M | Output /1M
--- | --- | --- | --- | ---
OpenAI | GPT-4.5 | 128K | $75.00 | $150.00
OpenAI | GPT-5 pro | 400K | $15.00 | $120.00
OpenAI | o1 | 200K | $15.00 | $60.00
Anthropic | Claude Opus 4.7 | 200K | $5.00 | $25.00
Anthropic | Claude Opus 4.6 | 200K | $5.00 | $25.00
Anthropic | Claude Sonnet 4.6 | 200K | $3.00 | $15.00
Google | Gemini 3 Pro | 2M | $2.00 | $12.00
Google | Gemini 2 Pro | 2M | $3.50 | $10.50
Google | Gemini 2.5 Pro | 2M | $1.25 | $10.00
Meta | Llama 3.1 405B | 128K | $3.00 | $3.00
Meta | Llama 4 Maverick | 1M | $1.00 | $3.00
Meta | Llama 3.2 90B | 128K | $0.60 | $1.80
Mistral | Mistral Large | 128K | $2.00 | $6.00
Mistral | Mistral Medium | 32K | $0.40 | $2.00
Mistral | Mixtral 8x22B | 66K | $1.20 | $1.20
Alibaba (Qwen) | Qwen 2.5 | 131K | $0.50 | $1.50
Alibaba (Qwen) | qwen3.5-9b | 262K | $0.40 | $1.50
Alibaba (Qwen) | qwen3.5-4b | | $0.30 | $1.50
DeepSeek | DeepSeek R1 | 128K | $0.55 | $2.19
DeepSeek | DeepSeek V3.2 | 128K | $0.27 | $1.10
DeepSeek | DeepSeek V3 | 128K | $0.27 | $1.10
Amazon | Titan Text Premier | 32K | $0.50 | $1.50
Amazon | nova-micro-v1 | 128K | $0.35 | $1.40
Amazon | Titan Text Express | 8K | $0.20 | $0.60
xAI | Grok 4.1 | 256K | $3.00 | $15.00
xAI | Grok 4 | 256K | $3.00 | $15.00
xAI | Grok 3 | 131K | $3.00 | $15.00
NVIDIA | nemotron-nano-9b-v2 | 131K | $0.40 | $1.60
NVIDIA | Nemotron-4 | 128K | $1.00 | $1.00
MiniMax | MiniMax-Text-01 | 1M | $0.20 | $1.10
MiniMax | MiniMax-01 | 1M | $0.20 | $1.10
Moonshot AI | Kimi K2 | 2M | $0.15 | $2.50

Up to three models per provider, ranked by list price. Prices refreshed daily from each provider’s public pricing page.

Beyond sticker price

Five calculators that sit behind the main flow.

Once you’ve narrowed the field, dig deeper — migration math, real-prompt costs, curated stacks, lifecycle risk, and compliance.

  • Switch cost calculator

    Before you migrate, see how the engineering hours a switch requires weigh against the monthly savings over a 12-month horizon.

  • Prompt cost

    Paste a real prompt and reply. Get a per-provider cost at today’s rates, tokenized with the right family coefficient.

  • Model stacks

    Editorial picks for budget, balanced, and frontier use — curated by our applied-AI team, refreshed monthly.

  • Lifecycle timeline

    Which models are sunsetting, when, and what the provider is pushing customers toward.

  • Compliance matrix

    Regions and certifications (SOC 2, HIPAA, GDPR, FedRAMP) per provider, in one grid.
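The switch-cost calculator above reduces to a break-even check: one-off migration cost against recurring savings. A minimal sketch of that idea — the function name and the $150/hour rate are illustrative assumptions, not the site's actual code:

```python
def break_even_months(engineering_hours: float,
                      hourly_rate: float,
                      monthly_savings: float) -> float:
    """Months until the one-off migration cost is recovered
    by cheaper inference. Infinite if there are no savings."""
    if monthly_savings <= 0:
        return float("inf")  # the switch never pays off
    return (engineering_hours * hourly_rate) / monthly_savings

# 80 hours at $150/h against $1,000/month saved -> pays off in 12 months
months = break_even_months(80, 150, 1000)
```

If the break-even point lands beyond the 12-month horizon, the cheaper sticker price probably isn't worth the migration.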

FAQ

Questions we get asked most.

Pricing freshness, sourcing, cache and batch discounts, embedding, alerts — all the things teams ask before picking a model.


  • What’s the cheapest model right now?

    As of April 2026, the lowest input $/1M on our comparison is devstral-small-2505. Real-world cost depends on your cache hit rate and batch eligibility.

  • Where does the pricing data come from?

    We mirror pricing from official provider pricing pages and docs. Each model row has a "last verified" timestamp and a link to the source so you can check yourself.

  • How do you catch price changes?

    A nightly snapshot cron diffs against the previous day. When a change is detected we log it and email subscribed users within 24 hours.

  • How do cached input discounts work?

    Models that offer cached input pricing get a separate column. The volume calculator multiplies your cache hit rate by the cached price and the rest by the standard input price.

  • What about batch discounts?

    Providers that support async batch endpoints usually list a reduced price. If a model row has a batch price, you can set the "batch eligible" slider to model cost savings for that workload share.
  • How should I choose between two models?

    Our top recommendation: pick two candidates from the filtered shortlist, estimate break-even with the switch-cost calculator, and run your real prompts through "Compare my prompt" for a grounded test. Top 3 this month: devstral-small-2505, GPT-5 nano, Gemini 2.0 Flash-Lite.

  • Is it free?

    Yes. The comparison, calculators, and public JSON API are free. Signing in with Google enables "Compare my prompt", saved comparisons, and price alerts.

  • Do you cover open-weight models?

    We list the top open-weight models (Meta Llama, Mistral, DeepSeek) when a pay-per-token API exists. Self-host cost modeling is not included, since it depends on your GPU inventory.

  • Can I embed the comparison on my site?

    Yes — the /embed route renders a minimal iframe with attribution. Use the embed builder on the main page to generate the snippet.

  • Can I get price alerts?

    After signing in you can subscribe to any model. When the nightly snapshot detects a price change or deprecation, you get an email within 24 hours.

  • How are models scored for each task?

    Each task has a weighted score over benchmarks relevant to that task, plus a price pillar. We publish the exact weights on the methodology page.

  • Do providers pay for placement?

    No. The ranking is not pay-to-play. Providers pay us nothing.

Once a month

Get the LLM Market Pulse in your inbox.

New models, quiet deprecations, price moves — digested into one short email each month. No spam, unsubscribe anytime.

Deploying at scale?

Need help picking a model for your use case?

A 30-minute call with a Buzzi applied-AI lead. We’ll look at your volume, your data, and your constraints and recommend a stack you can actually ship.