Compare pricing, benchmarks, and capabilities across 9 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
Llama 3.3 70B Open★ | Meta AI | $0.23 | $0.92 | 128K | 86% | 80 tok/s | — | |
DeepSeek V3 Open | DeepSeek | $0.27 | $1.1 | 128K | 88.5% | 80 tok/s | — | |
Llama 3.1 405B Open | Meta AI | $3 | $3 | 128K | 87.3% | 30 tok/s | — | |
Qwen3-72B Open | Alibaba Cloud | Free | Free | 32K | 85% | 100 tok/s | — | |
Phi-4 Open | Microsoft | $0.07 | $0.14 | 16K | 84.8% | 300 tok/s | — | |
Gemma 3 27B Open | Google DeepMind | Free | Free | 128K | 75% | 120 tok/s | — | |
DBRX Open | Databricks | $0.75 | $2.25 | 33K | 73.7% | 100 tok/s | — | |
Llama 3.2 11B Vision Open | Meta AI | $0.18 | $0.18 | 128K | 73% | 150 tok/s | — | |
Falcon 180B Open | TII | Free | Free | 4K | 70.4% | 20 tok/s | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.