Compare pricing, benchmarks, and capabilities across 76 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
Qwen3 VL 235B A22B (Reasoning) | Alibaba | — | — | — | 27.6 | 45 tok/s | 1.2s | |
Qwen3 Max (Preview) | Alibaba | — | — | — | 26.1 | 47 tok/s | 1.8s | |
Qwen3 235B A22B 2507 (Reasoning) | Alibaba | — | — | — | 29.5 | 51 tok/s | 1.3s | |
Qwen3 235B A22B 2507 Instruct | Alibaba | — | — | — | 25 | 70 tok/s | 1.2s | |
Qwen3 235B A22B (Reasoning) | Alibaba | — | — | — | 19.8 | 65 tok/s | 1.3s | |
Qwen3 Next 80B A3B Instruct | Alibaba | — | — | — | 20.1 | 166 tok/s | 1.0s | |
Qwen3 Max Thinking (Preview) | Alibaba | — | — | — | 32.5 | 43 tok/s | 1.8s | |
Qwen3 Next 80B A3B (Reasoning) | Alibaba | — | — | — | 26.7 | 164 tok/s | 1.1s | |
Qwen3 VL 32B (Reasoning) | Alibaba | — | — | — | 24.7 | 97 tok/s | 1.4s | |
Qwen3 VL 235B A22B Instruct | Alibaba | — | — | — | 20.8 | 57 tok/s | 1.2s | |
Qwen3 30B A3B 2507 (Reasoning) | Alibaba | — | — | — | 22.4 | 148 tok/s | 1.1s | |
Qwen3 VL 30B A3B (Reasoning) | Alibaba | — | — | — | 19.7 | 127 tok/s | 1.0s | |
Qwen3 32B (Reasoning) | Alibaba | — | — | — | 16.5 | 103 tok/s | 1.1s | |
Qwen3 VL 32B Instruct | Alibaba | — | — | — | 17.2 | 83 tok/s | 1.3s | |
Qwen3 Omni 30B A3B (Reasoning) | Alibaba | — | — | — | 15.6 | 93 tok/s | 1.0s | |
Qwen3 Coder 480B A35B Instruct | Alibaba | — | — | — | 24.8 | 65 tok/s | 1.7s | |
Qwen3 30B A3B 2507 Instruct | Alibaba | — | — | — | 15 | 92 tok/s | 1.3s | |
Qwen3 30B A3B (Reasoning) | Alibaba | — | — | — | 15.3 | 70 tok/s | 1.2s | |
Qwen3 14B (Reasoning) | Alibaba | — | — | — | 16.2 | 65 tok/s | 1.1s | |
Qwen3 VL 30B A3B Instruct | Alibaba | — | — | — | 16.1 | 123 tok/s | 1.0s | |
Qwen3 235B A22B (Non-reasoning) | Alibaba | — | — | — | 17 | 63 tok/s | 1.2s | |
QwQ 32B | Alibaba | — | — | — | 19.7 | 33 tok/s | 0.4s | |
Qwen2.5 Max | Alibaba | — | — | — | 16.3 | 46 tok/s | 1.1s | |
Qwen3 VL 8B (Reasoning) | Alibaba | — | — | — | 16.7 | 135 tok/s | 1.1s | |
Qwen3 8B (Reasoning) | Alibaba | — | — | — | 13.2 | 91 tok/s | 1.0s | |
Qwen3 4B 2507 (Reasoning) | Alibaba | — | — | — | 18.2 | — | — | |
Qwen3 32B (Non-reasoning) | Alibaba | — | — | — | 14.5 | 102 tok/s | 1.2s | |
Qwen3 Omni 30B A3B Instruct | Alibaba | — | — | — | 10.7 | 106 tok/s | 1.1s | |
Qwen2.5 Instruct 72B | Alibaba | — | — | — | 15.6 | 55 tok/s | 1.2s | |
Qwen3 Coder 30B A3B Instruct | Alibaba | — | — | — | 20 | 113 tok/s | 1.4s | |
Qwen3 30B A3B (Non-reasoning) | Alibaba | — | — | — | 12.5 | 67 tok/s | 1.2s | |
Qwen3 4B (Reasoning) | Alibaba | — | — | — | 14.2 | 104 tok/s | 1.0s | |
Qwen2.5 Instruct 32B | Alibaba | — | — | — | 13.2 | — | — | |
Qwen3 VL 4B (Reasoning) | Alibaba | — | — | — | 13.7 | — | — | |
Qwen3 VL 8B Instruct | Alibaba | — | — | — | 14.3 | 148 tok/s | 0.9s | |
Qwen3 14B (Non-reasoning) | Alibaba | — | — | — | 12.8 | 65 tok/s | 1.0s | |
Qwen3 4B 2507 Instruct | Alibaba | — | — | — | 12.9 | — | — | |
QwQ 32B-Preview | Alibaba | — | — | — | 15.2 | 43 tok/s | 0.5s | |
Qwen3 8B (Non-reasoning) | Alibaba | — | — | — | 10.6 | 94 tok/s | 0.9s | |
Qwen2.5 Coder Instruct 32B | Alibaba | — | — | — | 12.9 | — | — | |
Qwen3 VL 4B Instruct | Alibaba | — | — | — | 9.6 | — | — | |
Qwen2.5 Turbo | Alibaba | — | — | — | 12 | 68 tok/s | 1.2s | |
Qwen2 Instruct 72B | Alibaba | — | — | — | 11.7 | — | — | |
Qwen3 4B (Non-reasoning) | Alibaba | — | — | — | 12.5 | 105 tok/s | 1.0s | |
Qwen3 1.7B (Reasoning) | Alibaba | — | — | — | 8 | 138 tok/s | 1.0s | |
Qwen2.5 Coder Instruct 7B | Alibaba | — | — | — | 10 | — | — | |
Qwen3 1.7B (Non-reasoning) | Alibaba | — | — | — | 6.8 | 141 tok/s | 0.9s | |
Qwen3 0.6B (Reasoning) | Alibaba | — | — | — | 6.5 | 189 tok/s | 0.9s | |
Qwen3 0.6B (Non-reasoning) | Alibaba | — | — | — | 5.7 | 194 tok/s | 0.9s | |
Qwen3 Coder Next | Alibaba | — | — | — | 28.3 | 165 tok/s | 0.8s | |
Qwen3.5 9B (Reasoning) | Alibaba | — | — | — | 32.4 | 56 tok/s | 0.4s | |
Qwen3.5 4B (Reasoning) | Alibaba | — | — | — | 27.1 | 177 tok/s | 0.3s | |
Qwen3.5 122B A10B (Non-reasoning) | Alibaba | — | — | — | 35.9 | 152 tok/s | 1.1s | |
Qwen3.6 Plus | Alibaba | — | — | — | 50 | 53 tok/s | 1.6s | |
Qwen3.5 27B (Reasoning) | Alibaba | — | — | — | 42.1 | 92 tok/s | 1.4s | |
Qwen3.5 Omni Flash | Alibaba | — | — | — | 25.9 | 170 tok/s | 1.2s | |
Qwen Chat 72B | Alibaba | — | — | — | 8.8 | — | — | |
Qwen3.5 Omni Plus | Alibaba | — | — | — | 38.6 | 55 tok/s | 1.3s | |
Qwen3.5 2B (Reasoning) | Alibaba | — | — | — | 16.3 | — | — | |
Qwen3.5 35B A3B (Reasoning) | Alibaba | — | — | — | 37.1 | 149 tok/s | 1.2s | |
Qwen3 Max Thinking | Alibaba | — | — | — | 39.9 | 36 tok/s | 1.7s | |
Qwen3.5 397B A17B (Non-reasoning) | Alibaba | — | — | — | 40.1 | 52 tok/s | 1.4s | |
Qwen3.5 35B A3B (Non-reasoning) | Alibaba | — | — | — | 30.7 | 153 tok/s | 1.1s | |
Qwen3.5 27B (Non-reasoning) | Alibaba | — | — | — | 37.2 | 92 tok/s | 1.4s | |
Qwen3.6 Max Preview | Alibaba | — | — | — | 51.8 | 57 tok/s | 1.9s | |
Qwen3.5 4B (Non-reasoning) | Alibaba | — | — | — | 22.6 | 178 tok/s | 0.3s | |
Qwen3.5 2B (Non-reasoning) | Alibaba | — | — | — | 14.7 | 232 tok/s | 0.3s | |
Qwen Chat 14B | Alibaba | — | — | — | 7.4 | — | — | |
Qwen3.5 9B (Non-reasoning) | Alibaba | — | — | — | 27.3 | 143 tok/s | 0.3s | |
Qwen1.5 Chat 110B | Alibaba | — | — | — | 9.5 | — | — | |
Qwen3.5 0.8B (Non-reasoning) | Alibaba | — | — | — | 9.9 | 285 tok/s | 0.3s | |
Qwen3.6 35B A3B (Reasoning) | Alibaba | — | — | — | 43.5 | 238 tok/s | 1.7s | |
Qwen3.6 35B A3B (Non-reasoning) | Alibaba | — | — | — | 31.5 | 193 tok/s | 1.5s | |
Qwen3.5 397B A17B (Reasoning) | Alibaba | — | — | — | 45 | 52 tok/s | 1.5s | |
Qwen3.5 122B A10B (Reasoning) | Alibaba | — | — | — | 41.6 | 159 tok/s | 1.1s | |
Qwen3.5 0.8B (Reasoning) | Alibaba | — | — | — | 10.5 | — | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.