Cloud API · Pay-per-token

Hosted AI API price comparison

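Don’t want to babysit a GPU? This page compares pay-per-token APIs from 11 providers, ranked by output-token price. Output prices span roughly 938× between the cheapest open-weight model ($0.08 per 1M tokens) and the most expensive closed-weight one ($75.00 per 1M tokens), so pick wisely.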


All models — sorted cheapest output first

“Example call” = 500 input tokens + 200 output tokens (a typical chat exchange). For scale, at $0.0001 per call, 1¢ buys 100 calls of this size.
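
The example-call figure follows directly from the two per-million prices. A minimal sketch of that arithmetic, assuming the 500/200 token split described above; the function name `example_call_cost` is illustrative and not part of any provider's SDK:

```python
def example_call_cost(price_in_per_m, price_out_per_m, tokens_in=500, tokens_out=200):
    """Cost in USD of one call, given USD prices per 1M input and output tokens."""
    return (tokens_in * price_in_per_m + tokens_out * price_out_per_m) / 1_000_000


# GPT-4o at $2.50 in / $10.00 out, as listed in the table below
cost = example_call_cost(2.50, 10.00)
print(f"${cost:.5f} per call")            # $0.00325 per call
print(f"{0.01 / cost:.1f} calls per 1¢")  # about 3.1 calls
```

Passing different `tokens_in` and `tokens_out` values gives a closer estimate for workloads that are prompt-heavy or output-heavy.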

| Provider | Model | $/1M in | $/1M out | Example call | Type |
|---|---|---|---|---|---|
| Groq | Llama 3.1 8B | $0.05 | $0.08 | $0.00004 | open |
| Cerebras | Llama 3.1 8B | $0.10 | $0.10 | $0.00007 | open |
| Google Gemini | Gemini 2.5 Nano | $0.04 | $0.15 | $0.00005 | closed |
| Together AI | Llama 3.1 8B | $0.18 | $0.18 | $0.00013 | open |
| Fireworks AI | Llama 3.1 8B | $0.20 | $0.20 | $0.00014 | open |
| Groq | Mixtral 8x7B | $0.24 | $0.24 | $0.00017 | open |
| Google Gemini | Gemini 2.5 Flash | $0.07 | $0.30 | $0.00010 | closed |
| OpenAI | GPT-4o mini | $0.15 | $0.60 | $0.00019 | closed |
| Together AI | Mixtral 8x7B | $0.60 | $0.60 | $0.00042 | open |
| Mistral | Mistral Small 24B | $0.20 | $0.60 | $0.00022 | open |
| Groq | Llama 3.3 70B | $0.59 | $0.79 | $0.00045 | open |
| Groq | Qwen 2.5 32B | $0.79 | $0.79 | $0.00055 | open |
| Cerebras | Qwen 3 32B | $0.40 | $0.80 | $0.00036 | open |
| Together AI | Qwen 3 32B | $0.40 | $0.80 | $0.00036 | open |
| OpenRouter | Llama 3.3 70B via OR | $0.60 | $0.80 | $0.00046 | open |
| Together AI | Llama 3.3 70B | $0.88 | $0.88 | $0.00062 | open |
| Fireworks AI | Llama 3.3 70B | $0.90 | $0.90 | $0.00063 | open |
| Fireworks AI | DeepSeek V3 | $0.90 | $0.90 | $0.00063 | open |
| Mistral | Codestral 25.01 | $0.30 | $0.90 | $0.00033 | open |
| DeepSeek | DeepSeek V3 | $0.27 | $1.10 | $0.00036 | open |
| DeepSeek | DeepSeek V3 (cache) | $0.07 | $1.10 | $0.00026 | open |
| OpenRouter | DeepSeek V3 via OR | $0.27 | $1.10 | $0.00036 | open |
| Cerebras | Llama 3.3 70B | $0.85 | $1.20 | $0.00067 | open |
| OpenRouter | Best LLM (auto-route) | $0.30 | $1.20 | $0.00039 | open |
| DeepSeek | DeepSeek R1 | $0.55 | $2.19 | $0.00071 | open |
| Replicate | Llama 3 70B | $0.65 | $2.75 | $0.00088 | open |
| Replicate | FLUX.1 Schnell (img) | | $3.00 | $0.00060 | open |
| OpenAI | o3-mini | $1.10 | $4.40 | $0.00143 | closed |
| Anthropic | Claude Haiku 4 | $1.00 | $5.00 | $0.00150 | closed |
| Google Gemini | Gemini 2.5 Pro | $1.25 | $5.00 | $0.00163 | closed |
| Mistral | Mistral Large | $2.00 | $6.00 | $0.00220 | open |
| OpenAI | GPT-4o | $2.50 | $10.00 | $0.00325 | closed |
| Anthropic | Claude Sonnet 4.6 | $3.00 | $15.00 | $0.00450 | closed |
| Together AI | FLUX.1 Schnell (img) | | $27.00 | $0.00540 | open |
| Replicate | FLUX.1 Pro (img) | | $55.00 | $0.01100 | open |
| OpenAI | o1 | $15.00 | $60.00 | $0.01950 | closed |
| Anthropic | Claude Opus 4 | $15.00 | $75.00 | $0.02250 | closed |
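
The ordering above, and the gap between the cheapest and priciest rows, can be reproduced from the listed prices. A rough sketch using a handful of rows copied from the table; the tuple layout and variable names are assumptions made for illustration:

```python
# (provider, model, usd_per_1m_in, usd_per_1m_out): a few rows copied from the table
rows = [
    ("Anthropic", "Claude Opus 4", 15.00, 75.00),
    ("Groq", "Llama 3.1 8B", 0.05, 0.08),
    ("OpenAI", "GPT-4o mini", 0.15, 0.60),
    ("DeepSeek", "DeepSeek V3", 0.27, 1.10),
]

# Sort cheapest output price first, matching the table's ordering
ranked = sorted(rows, key=lambda r: r[3])

cheapest, priciest = ranked[0], ranked[-1]
spread = priciest[3] / cheapest[3]  # ratio of highest to lowest output price

for provider, model, p_in, p_out in ranked:
    print(f"{provider:<10} {model:<14} in ${p_in:>5.2f}  out ${p_out:>5.2f}")
print(f"Output-price spread: {spread:.1f}x")
```

Sorting on the input price instead (`r[2]`) gives a different ranking, which matters for prompt-heavy workloads such as long-document summarization.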


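Last updated 2026-05-02. Prices in USD per million tokens. 'In' = input/prompt; 'Out' = output/completion. Where a provider charges one rate for both, that figure appears in both columns. Image models (img) list a single price, and the example-call figure treats it as the output price.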