Provider

IBM API Cost Calculator and Comparison

Every IBM model, side by side β€” current API rates, context window, benchmarks, and a live calculator that ranks them at your exact workload. 3 active models, 3 with public pricing, from $0.17/1M to $0.80/1M input. Prices refreshed daily from IBM’s official pricing page.

Models tracked

3

Active

3

With public pricing

3

Cheapest input

$0.17/1M

As of April 2026, IBM offers 3 active AI models via API, ranging from $0.17/1M to $0.80/1M input tokens. The most context-rich model handles up to 131K tokens. All prices are in USD per 1 million tokens; output tokens cost more than input on every model.

Interactive

Calculate your IBM API cost at your workload.

Set your workload β€” every priced model ranks in real time.

Adjust the workload

Every model below updates in real time.

1,00010,00050,000250,0001M10M

Ranked by your monthly bill

No models with public pricing available to compare right now.

Pricing at a glance

Blended $/1M tokens across the lineup.

Blended price uses a 3-to-1 input/output ratio β€” a common industry standard. Green bar = cheapest.

Every model

Every IBM model β€” pricing, context & capabilities.

ModelContextInput /1MOutput /1M
granite-4.0-h-micro131K$0.17$1.10
Granite 13B8K$0.6$0.6
Granite 20B8K$0.8$0.8

FAQ

IBM β€” questions we see most.

Pricing patterns, best-known use cases, and how this provider stacks up.

Get instant answers from our AI agent

IBM API pricing ranges from $0.17 to $0.80 per 1M input tokens. Output tokens cost more than input on every model. Prices are per 1 million tokens (1M β‰ˆ 750,000 words). Use the calculator above to estimate your monthly spend at your actual workload.
granite-4.0-h-micro is the lowest-priced IBM model with public pricing at $0.17/1M input tokens. It suits high-volume tasks where cost matters most β€” classification, extraction, summarization, and similar workloads that don't need frontier reasoning.
Granite 20B is IBM's highest-tier model at $0.80/1M input. It delivers the most sophisticated reasoning, instruction-following, and nuance. For workloads that don't require frontier performance, a mid-tier model typically cuts inference costs substantially.
IBM does not currently list cached or batch pricing in our database. Check IBM's official pricing page for the latest discount tiers β€” providers add these options regularly.
IBM has historically adjusted prices when launching new model generations, often cutting rates to stay competitive. Buzzi.ai snapshots pricing daily β€” you can subscribe to price-drop alerts on any IBM model using the "Alert me" button on its detail page.
Use the main comparison wizard to run the same calculator across IBM, Anthropic, Google, Meta, Mistral, and 20+ other providers. Set your exact workload and get a ranked cost chart in under a minute.

Look wider

Compare IBM against other providers.

Open the full wizard β€” pick a use case, set your usage, and cross-compare against OpenAI, Anthropic, Google, and 20+ more.