Compare pricing, benchmarks, and capabilities across 81 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
Qwen3 235B A22B 2507 (Reasoning) | Alibaba | — | — | — | 29.5 | 58 tok/s | 1.2s | |
Qwen3 Max (Preview) | Alibaba | — | — | — | 26.1 | 45 tok/s | 1.7s | |
Qwen3 VL 235B A22B (Reasoning) | Alibaba | — | — | — | 27.6 | 38 tok/s | 1.2s | |
Qwen3 235B A22B 2507 Instruct | Alibaba | — | — | — | 25 | 66 tok/s | 1.1s | |
Qwen3 235B A22B (Reasoning) | Alibaba | — | — | — | 19.8 | 61 tok/s | 1.2s | |
Qwen3 VL 235B A22B Instruct | Alibaba | — | — | — | 20.8 | 51 tok/s | 1.1s | |
Qwen3 Max Thinking (Preview) | Alibaba | — | — | — | 32.5 | 42 tok/s | 1.8s | |
Qwen3 Next 80B A3B (Reasoning) | Alibaba | — | — | — | 26.7 | 163 tok/s | 1.1s | |
Qwen3 VL 32B (Reasoning) | Alibaba | — | — | — | 24.7 | 96 tok/s | 1.3s | |
Qwen3 Next 80B A3B Instruct | Alibaba | — | — | — | 20.1 | 166 tok/s | 1.1s | |
Qwen3 30B A3B 2507 (Reasoning) | Alibaba | — | — | — | 22.4 | 148 tok/s | 1.1s | |
Qwen3 VL 30B A3B (Reasoning) | Alibaba | — | — | — | 19.7 | 125 tok/s | 1.1s | |
Qwen3 32B (Reasoning) | Alibaba | — | — | — | 16.5 | 104 tok/s | 1.0s | |
Qwen3 Omni 30B A3B (Reasoning) | Alibaba | — | — | — | 15.6 | 97 tok/s | 1.1s | |
Qwen3 Coder 480B A35B Instruct | Alibaba | — | — | — | 24.8 | 66 tok/s | 1.7s | |
Qwen3 VL 32B Instruct | Alibaba | — | — | — | 17.2 | 72 tok/s | 1.1s | |
Qwen3 30B A3B 2507 Instruct | Alibaba | — | — | — | 15 | 114 tok/s | 1.1s | |
Qwen3 30B A3B (Reasoning) | Alibaba | — | — | — | 15.3 | 71 tok/s | 1.3s | |
Qwen3 14B (Reasoning) | Alibaba | — | — | — | 16.2 | 64 tok/s | 1.0s | |
Qwen3 VL 30B A3B Instruct | Alibaba | — | — | — | 16.1 | 123 tok/s | 1.0s | |
QwQ 32B | Alibaba | — | — | — | 19.7 | 31 tok/s | 0.5s | |
Qwen2.5 Max | Alibaba | — | — | — | 16.3 | 49 tok/s | 1.2s | |
Qwen3 235B A22B (Non-reasoning) | Alibaba | — | — | — | 17 | 62 tok/s | 1.1s | |
Qwen3 VL 8B (Reasoning) | Alibaba | — | — | — | 16.7 | 131 tok/s | 1.1s | |
Qwen3 8B (Reasoning) | Alibaba | — | — | — | 13.2 | 91 tok/s | 1.0s | |
Qwen3 4B 2507 (Reasoning) | Alibaba | — | — | — | 18.2 | — | — | |
Qwen3 32B (Non-reasoning) | Alibaba | — | — | — | 14.5 | 104 tok/s | 1.1s | |
Qwen3 Omni 30B A3B Instruct | Alibaba | — | — | — | 10.7 | 108 tok/s | 0.9s | |
Qwen2.5 Instruct 72B | Alibaba | — | — | — | 15.6 | 55 tok/s | 1.3s | |
Qwen3 Coder 30B A3B Instruct | Alibaba | — | — | — | 20 | 111 tok/s | 1.5s | |
Qwen3 30B A3B (Non-reasoning) | Alibaba | — | — | — | 12.5 | 68 tok/s | 1.1s | |
Qwen3 VL 4B (Reasoning) | Alibaba | — | — | — | 13.7 | — | — | |
Qwen3 4B (Reasoning) | Alibaba | — | — | — | 14.2 | 104 tok/s | 1.1s | |
Qwen2.5 Instruct 32B | Alibaba | — | — | — | 13.2 | — | — | |
Qwen3 VL 8B Instruct | Alibaba | — | — | — | 14.3 | 144 tok/s | 0.9s | |
Qwen3 14B (Non-reasoning) | Alibaba | — | — | — | 12.8 | 64 tok/s | 1.0s | |
Qwen3 4B 2507 Instruct | Alibaba | — | — | — | 12.9 | — | — | |
QwQ 32B-Preview | Alibaba | — | — | — | 15.2 | — | — | |
Qwen3 8B (Non-reasoning) | Alibaba | — | — | — | 10.6 | 84 tok/s | 1.0s | |
Qwen2.5 Coder Instruct 32B | Alibaba | — | — | — | 12.9 | — | — | |
Qwen2.5 Turbo | Alibaba | — | — | — | 12 | 68 tok/s | 1.2s | |
Qwen3 VL 4B Instruct | Alibaba | — | — | — | 9.6 | — | — | |
Qwen2 Instruct 72B | Alibaba | — | — | — | 11.7 | — | — | |
Qwen3 4B (Non-reasoning) | Alibaba | — | — | — | 12.5 | 105 tok/s | 1.0s | |
Qwen3 1.7B (Reasoning) | Alibaba | — | — | — | 8 | 138 tok/s | 0.9s | |
Qwen2.5 Coder Instruct 7B | Alibaba | — | — | — | 10 | — | — | |
Qwen3 1.7B (Non-reasoning) | Alibaba | — | — | — | 6.8 | 139 tok/s | 1.0s | |
Qwen3 0.6B (Reasoning) | Alibaba | — | — | — | 6.5 | 225 tok/s | 0.9s | |
Qwen3 0.6B (Non-reasoning) | Alibaba | — | — | — | 5.7 | 222 tok/s | 0.9s | |
Qwen3.5 27B (Reasoning) | Alibaba | — | — | — | 42.1 | 92 tok/s | 1.4s | |
Qwen3.5 Omni Flash | Alibaba | — | — | — | 25.9 | 243 tok/s | 0.9s | |
Qwen3 Coder Next | Alibaba | — | — | — | 28.3 | 127 tok/s | 1.0s | |
Qwen3.5 122B A10B (Reasoning) | Alibaba | — | — | — | 41.6 | 162 tok/s | 1.1s | |
Qwen3.5 2B (Reasoning) | Alibaba | — | — | — | 16.3 | — | — | |
Qwen3.5 397B A17B (Reasoning) | Alibaba | — | — | — | 45 | 52 tok/s | 1.7s | |
Qwen3.5 122B A10B (Non-reasoning) | Alibaba | — | — | — | 35.9 | 163 tok/s | 1.1s | |
Qwen3.5 0.8B (Reasoning) | Alibaba | — | — | — | 10.5 | — | — | |
Qwen3.6 Plus | Alibaba | — | — | — | 50 | 53 tok/s | 1.7s | |
Qwen3.5 35B A3B (Reasoning) | Alibaba | — | — | — | 37.1 | 118 tok/s | 1.1s | |
Qwen3 Max Thinking | Alibaba | — | — | — | 39.9 | 46 tok/s | 1.5s | |
Qwen3.5 397B A17B (Non-reasoning) | Alibaba | — | — | — | 40.1 | 53 tok/s | 1.8s | |
Qwen1.5 Chat 110B | Alibaba | — | — | — | 9.5 | — | — | |
Qwen3.5 35B A3B (Non-reasoning) | Alibaba | — | — | — | 30.7 | 134 tok/s | 1.2s | |
Qwen3.5 27B (Non-reasoning) | Alibaba | — | — | — | 37.2 | 94 tok/s | 1.4s | |
Qwen3 TTS Flash | Alibaba | — | — | — | — | — | — | |
Qwen3 TTS | Alibaba | — | — | — | — | — | — | |
Qwen Chat 14B | Alibaba | — | — | — | 7.4 | — | — | |
Qwen3.5 9B (Non-reasoning) | Alibaba | — | — | — | 27.3 | — | — | |
Qwen3.6 27B (Reasoning) | Alibaba | — | — | — | 45.8 | 64 tok/s | 1.5s | |
Qwen3.5 4B (Non-reasoning) | Alibaba | — | — | — | 22.6 | 201 tok/s | 0.2s | |
Qwen3.6 35B A3B (Reasoning) | Alibaba | — | — | — | 43.5 | 189 tok/s | 1.5s | |
Qwen3.5 Omni Flash | Alibaba | — | — | — | — | — | — | |
Qwen3.6 35B A3B (Non-reasoning) | Alibaba | — | — | — | 31.5 | 182 tok/s | 1.4s | |
Qwen3.5 2B (Non-reasoning) | Alibaba | — | — | — | 14.7 | 343 tok/s | 0.2s | |
Qwen3.6 27B (Non-reasoning) | Alibaba | — | — | — | 37.1 | 61 tok/s | 1.4s | |
Qwen3.5 0.8B (Non-reasoning) | Alibaba | — | — | — | 9.9 | 356 tok/s | 0.2s | |
Qwen3.5 Omni Plus | Alibaba | — | — | — | 38.6 | 56 tok/s | 1.3s | |
Qwen3.5 4B (Reasoning) | Alibaba | — | — | — | 27.1 | 199 tok/s | 0.2s | |
Qwen3.6 Max Preview | Alibaba | — | — | — | 51.8 | 38 tok/s | 2.0s | |
Qwen Chat 72B | Alibaba | — | — | — | 8.8 | — | — | |
Qwen3.5 9B (Reasoning) | Alibaba | — | — | — | 32.4 | 71 tok/s | 0.4s |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.