Compare pricing, benchmarks, and capabilities across 10 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
DeepSeek R2 ★ | DeepSeek | $0.55 | $2.19 | 128K | 91% | 60 tok/s | — | |
Qwen3-Max | Alibaba Cloud | $0.4 | $1.2 | 32K | 87% | 90 tok/s | — | |
Grok 3 Mini | xAI | $0.3 | $0.5 | 131K | 83% | 160 tok/s | — | |
GPT-4o mini | OpenAI | $0.15 | $0.6 | 128K | 82% | 200 tok/s | — | |
Gemini 3 Flash | Google DeepMind | $0.075 | $0.3 | 1M | 82% | 250 tok/s | — | |
Claude Haiku 4.5 | Anthropic | $0.8 | $4 | 200K | 75.2% | 250 tok/s | — | |
Mistral Small | Mistral AI | $0.1 | $0.3 | 32K | 72% | 200 tok/s | — | |
Command R | Cohere | $0.15 | $0.6 | 128K | 72% | 150 tok/s | — | |
Gemini 3.1 Flash-Lite | Google DeepMind | $0.01 | $0.04 | 1M | 72% | 500 tok/s | — | |
Codestral | Mistral AI | $0.3 | $0.9 | 32K | — | 180 tok/s | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.