Compare pricing, benchmarks, and capabilities across 58 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
Gemini 3 Pro Preview (low) | — | — | — | 41.3 | — | — | ||
Gemini 3 Flash Preview (Reasoning) | — | — | — | 46.4 | 195 tok/s | 5.9s | ||
Gemini 2.5 Pro Preview (Mar' 25) | — | — | — | 30.3 | — | — | ||
Gemini 2.5 Pro | — | — | — | 34.6 | 127 tok/s | 22.0s | ||
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | — | — | — | 25.7 | — | — | ||
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | — | — | — | 31.1 | — | — | ||
Gemini 2.5 Pro Preview (May' 25) | — | — | — | 29.5 | — | — | ||
Gemini 2.5 Flash (Reasoning) | — | — | — | 27 | 205 tok/s | 13.3s | ||
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | — | — | — | 21.6 | — | — | ||
Gemini 2.5 Flash (Non-reasoning) | — | — | — | 20.6 | 180 tok/s | 0.5s | ||
Gemini 2.0 Pro Experimental (Feb '25) | — | — | — | 18.1 | — | — | ||
Gemini 2.5 Flash Preview (Reasoning) | — | — | — | 24.3 | — | — | ||
Gemini 2.0 Flash Thinking Experimental (Jan '25) | — | — | — | 19.6 | — | — | ||
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | — | — | — | 19.4 | — | — | ||
Gemini 2.0 Flash (Feb '25) | — | — | — | 18.5 | — | — | ||
Gemini 2.0 Flash (experimental) | — | — | — | 16.8 | — | — | ||
Gemini 2.5 Flash Preview (Non-reasoning) | — | — | — | 17.8 | — | — | ||
Gemini 2.5 Flash-Lite (Reasoning) | — | — | — | 17.6 | 295 tok/s | 12.3s | ||
Gemini 1.5 Pro (Sep '24) | — | — | — | 16 | — | — | ||
Gemini 2.5 Flash-Lite (Non-reasoning) | — | — | — | 12.7 | 260 tok/s | 0.4s | ||
Gemini 2.0 Flash-Lite (Feb '25) | — | — | — | 14.7 | — | — | ||
Gemini 1.5 Flash (Sep '24) | — | — | — | 13.8 | — | — | ||
Gemini 1.5 Pro (May '24) | — | — | — | 12 | — | — | ||
Gemma 3 12B Instruct | — | — | — | 8.8 | 30 tok/s | 10.2s | ||
Gemini 1.5 Flash-8B | — | — | — | 11.1 | — | — | ||
Gemini 1.5 Flash (May '24) | — | — | — | 10.5 | — | — | ||
Gemma 3n E4B Instruct | — | — | — | 6.4 | 14 tok/s | 0.4s | ||
Gemma 3n E4B Instruct Preview (May '25) | — | — | — | 10.1 | — | — | ||
Gemini 1.0 Pro | — | — | — | 8.5 | — | — | ||
Gemma 3 4B Instruct | — | — | — | 6.3 | 30 tok/s | 1.1s | ||
Gemma 3n E2B Instruct | — | — | — | 4.8 | 51 tok/s | 0.5s | ||
Gemma 3 1B Instruct | — | — | — | 5.5 | 48 tok/s | 0.6s | ||
Gemma 3 270M | — | — | — | 7.7 | — | — | ||
Gemini 2.5 Flash TTS (Dec 2025) | — | — | — | — | — | — | ||
Gemini 3 Deep Think | — | — | — | — | — | — | ||
Gemma 4 31B (Reasoning) | — | — | — | 39.2 | 35 tok/s | 1.0s | ||
Gemma 4 26B A4B (Non-reasoning) | — | — | — | 27.1 | — | — | ||
Gemma 4 E2B (Non-reasoning) | — | — | — | 12.1 | — | — | ||
Gemma 4 E4B (Reasoning) | — | — | — | 18.8 | — | — | ||
Gemma 4 E4B (Non-reasoning) | — | — | — | 14.8 | — | — | ||
Gemma 4 E2B (Reasoning) | — | — | — | 15.2 | — | — | ||
Gemini 3.1 Pro Preview | — | — | — | 57.2 | 124 tok/s | 28.7s | ||
Gemini 3.1 Flash-Lite Preview | — | — | — | 33.5 | 319 tok/s | 5.7s | ||
Gemini 2.0 Flash-Lite (Preview) | — | — | — | 14.5 | — | — | ||
Gemini 2.0 Flash Thinking Experimental (Dec '24) | — | — | — | 12.3 | — | — | ||
PALM-2 | — | — | — | 8.6 | — | — | ||
Gemini 1.0 Ultra | — | — | — | 10.1 | — | — | ||
Studio | — | — | — | — | — | — | ||
Journey | — | — | — | — | — | — | ||
Gemini 2.5 Pro (Dec 2025) | — | — | — | — | — | — | ||
WaveNet | — | — | — | — | — | — | ||
Gemma 4 26B A4B (Reasoning) | — | — | — | 31.2 | — | — | ||
Chirp 3: HD | — | — | — | — | — | — | ||
Standard | — | — | — | — | — | — | ||
Neural2 | — | — | — | — | — | — | ||
Gemini 3.1 Flash TTS | — | — | — | — | — | — | ||
Gemini 2.5 Flash Lite TTS | — | — | — | — | — | — | ||
Gemma 4 31B (Non-reasoning) | — | — | — | 32.3 | — | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.