Compare pricing, benchmarks, and capabilities across 18 AI models
| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context | Intelligence | Speed | Latency |
|---|---|---|---|---|---|---|---|
| DeepSeek R2 ★ | DeepSeek | $0.55 | $2.19 | 128K | 91% | 60 tok/s | — |
| Llama 3.3 70B (Open) ★ | Meta AI | $0.23 | $0.92 | 128K | 86% | 80 tok/s | — |
| DeepSeek V3 (Open) | DeepSeek | $0.27 | $1.1 | 128K | 88.5% | 80 tok/s | — |
| Qwen3-Max | Alibaba Cloud | $0.4 | $1.2 | 32K | 87% | 90 tok/s | — |
| Qwen3-72B (Open) | Alibaba Cloud | Free | Free | 32K | 85% | 100 tok/s | — |
| Phi-4 (Open) | Microsoft | $0.07 | $0.14 | 16K | 84.8% | 300 tok/s | — |
| Grok 3 Mini | xAI | $0.3 | $0.5 | 131K | 83% | 160 tok/s | — |
| Gemini 3 Flash | Google DeepMind | $0.075 | $0.3 | 1M | 82% | 250 tok/s | — |
| GPT-4o mini | OpenAI | $0.15 | $0.6 | 128K | 82% | 200 tok/s | — |
| Claude Haiku 4.5 | Anthropic | $0.8 | $4 | 200K | 75.2% | 250 tok/s | — |
| Gemma 3 27B (Open) | Google DeepMind | Free | Free | 128K | 75% | 120 tok/s | — |
| DBRX (Open) | Databricks | $0.75 | $2.25 | 33K | 73.7% | 100 tok/s | — |
| Llama 3.2 11B Vision (Open) | Meta AI | $0.18 | $0.18 | 128K | 73% | 150 tok/s | — |
| Command R | Cohere | $0.15 | $0.6 | 128K | 72% | 150 tok/s | — |
| Gemini 3.1 Flash-Lite | Google DeepMind | $0.01 | $0.04 | 1M | 72% | 500 tok/s | — |
| Mistral Small | Mistral AI | $0.1 | $0.3 | 32K | 72% | 200 tok/s | — |
| Falcon 180B (Open) | TII | Free | Free | 4K | 70.4% | 20 tok/s | — |
| Codestral | Mistral AI | $0.3 | $0.9 | 32K | — | 180 tok/s | — |
Enter your expected usage to compare costs across models
Input tokens: e.g. 1,000,000 tokens ≈ 750,000 words
Output tokens: usually 30–50% of input volume
Prices are approximate and may vary. Check provider documentation for current pricing.
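
The cost comparison described above comes down to simple arithmetic on the per-million-token prices. The following is a minimal sketch of that calculation, not the page's actual implementation: the function names `estimate_cost` and `compare`, the 40% output ratio, and the three-model subset are assumptions chosen for illustration, and the prices are copied from the table (which the page itself describes as approximate).

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
# Only a small illustrative subset of models is included here.
PRICES = {
    "DeepSeek R2": (0.55, 2.19),
    "GPT-4o mini": (0.15, 0.60),
    "Phi-4": (0.07, 0.14),
}


def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Estimated cost in USD, given prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def compare(input_tokens, output_ratio=0.4):
    """Return (model, cost) pairs sorted from cheapest to most expensive.

    output_ratio is an assumed share of output relative to input volume;
    the page suggests 30-50% is typical.
    """
    output_tokens = round(input_tokens * output_ratio)
    costs = {
        name: estimate_cost(input_tokens, output_tokens, p_in, p_out)
        for name, (p_in, p_out) in PRICES.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    for model, cost in compare(1_000_000):
        print(f"{model}: ${cost:.3f}")
```

With 1,000,000 input tokens and an output volume of 40% of that, the sketch ranks Phi-4 as the cheapest of the three and DeepSeek R2 as the most expensive. Models listed as Free could be represented with a price of 0, though hosting costs for open models are not captured by this formula.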