June 2026 · Updated LLM pricing

LLM API Cost Calculator

Compare GPT-4o, Claude 3.5 Sonnet, Gemini, and 5 more models. Enter your token volume and request count — see exact monthly costs. Free, no signup, runs in your browser.

Advertisement

🤖 LLM API Cost Calculator

Select model · Enter token volume · Results update live

Estimated cost
$0
— per month
Input / output per 1M tokens
Cost per request
Input cost
Output cost
Total tokens

2026 LLM API Pricing Table

All prices in USD per 1 million tokens, pay-as-you-go, June 2026.

ModelInput / 1MOutput / 1MContextBest for
GPT-4o miniCHEAPEST$0.15$0.60128KHigh-volume tasks
Gemini 1.5 Flash$0.075$0.301MUltra-high volume
Claude 3.5 Haiku$0.80$4.00200KQuality on a budget
GPT-4o$2.50$10.00128KGeneral flagship
Claude 3.5 Sonnet$3.00$15.00200KCoding & reasoning
Gemini 1.5 Pro$1.25$5.001MLong context tasks
OpenAI o1$15.00$60.00200KHard reasoning
Claude 3 Opus$15.00$75.00200KMost capable Claude
Advertisement
💡 50% off with Batch API

OpenAI, Anthropic, and Google all offer batch processing at 50% discount for async workloads (up to 24h latency). Zero quality difference.

Real-World Cost Examples

Monthly cost for common production workloads.

WorkloadVolumeGPT-4o miniGPT-4oClaude Sonnet
Customer support bot10K req/day · 1K+300 tok$4.05$82.50$97.50
Document summarizer1K docs/day · 4K+800 tok$9.36$204$246
Code review assistant500 req/day · 3K+1K tok$9.45$195$247.50
RAG Q&A system5K req/day · 2K+500 tok$22.50$450$562.50

Ready to build? Start with free cloud credits:

Frequently Asked Questions

How much does GPT-4o cost per 1 million tokens?+

GPT-4o costs $2.50 per million input tokens and $10.00 per million output tokens (pay-as-you-go, June 2026).

What is the cheapest LLM API in 2026?+

Gemini 1.5 Flash at $0.075/$0.30 per million tokens is the cheapest capable model. GPT-4o mini ($0.15/$0.60) is the cheapest OpenAI option.

How much does Claude 3.5 Sonnet cost?+

Claude 3.5 Sonnet costs $3.00 per million input tokens and $15.00 per million output tokens via the Anthropic API.

How do I reduce LLM API costs?+

Use cheaper models (GPT-4o mini vs GPT-4o is 16× cheaper). Enable Batch API for 50% off async workloads. Cache repeated prompts. Trim system prompts. Set max_tokens explicitly.

What is a token in LLM pricing?+

A token is roughly 4 characters or 0.75 words. A 1,000-word document is approximately 1,333 tokens. Most LLM APIs charge separately for input (prompt) and output (completion) tokens.

Advertisement

Related Calculators