Compare pricing, benchmarks, and capabilities across 17 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
DeepSeek R2 ★ | DeepSeek | $0.55 | $2.19 | 128K | 91% | 60 tok/s | — | |
GPT-4.1 ★ | OpenAI | $2 | $8 | 1M | 90.5% | 80 tok/s | — | |
Claude Sonnet 4.6 ★ | Anthropic | $3 | $15 | 200K | 86.8% | 100 tok/s | — | |
o4-mini | OpenAI | $1.1 | $4.4 | 200K | 93.4% | 100 tok/s | — | |
Grok 3 | xAI | $3 | $15 | 131K | 87.5% | 90 tok/s | — | |
Qwen3-Max | Alibaba Cloud | $0.4 | $1.2 | 32K | 87% | 90 tok/s | — | |
Gemini 3 Pro | Google DeepMind | $3.5 | $10.5 | 1M | 87% | 100 tok/s | — | |
Mistral Large | Mistral AI | $2 | $6 | 128K | 84% | 90 tok/s | — | |
Grok 3 Mini | xAI | $0.3 | $0.5 | 131K | 83% | 160 tok/s | — | |
GPT-4o mini | OpenAI | $0.15 | $0.6 | 128K | 82% | 200 tok/s | — | |
Gemini 3 Flash | Google DeepMind | $0.075 | $0.3 | 1M | 82% | 250 tok/s | — | |
Command R+ | Cohere | $2.5 | $10 | 128K | 78% | 80 tok/s | — | |
Claude Haiku 4.5 | Anthropic | $0.8 | $4 | 200K | 75.2% | 250 tok/s | — | |
Gemini 3.1 Flash-Lite | Google DeepMind | $0.01 | $0.04 | 1M | 72% | 500 tok/s | — | |
Mistral Small | Mistral AI | $0.1 | $0.3 | 32K | 72% | 200 tok/s | — | |
Command R | Cohere | $0.15 | $0.6 | 128K | 72% | 150 tok/s | — | |
Codestral | Mistral AI | $0.3 | $0.9 | 32K | — | 180 tok/s | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.