Compare pricing, benchmarks, and capabilities across 18 AI models
| Model | Provider | Input $/1M↕ | Output $/1M↕ | Context↕ | Intelligence↑ | Speed↕ | Latency | API |
|---|---|---|---|---|---|---|---|---|
GLM-4.7 (Reasoning) | Z AI | — | — | — | 42.1 | 109 tok/s | 0.7s | |
GLM-4.5 (Reasoning) | Z AI | — | — | — | 26.4 | 38 tok/s | 0.9s | |
GLM-4.6 (Reasoning) | Z AI | — | — | — | 32.5 | 36 tok/s | 0.9s | |
GLM-4.5-Air | Z AI | — | — | — | 23.2 | 65 tok/s | 1.3s | |
GLM-4.6V (Reasoning) | Z AI | — | — | — | 23.4 | 27 tok/s | 1.2s | |
GLM-4.5V (Reasoning) | Z AI | — | — | — | 15.1 | 45 tok/s | 1.0s | |
GLM-4.7 (Non-reasoning) | Z AI | — | — | — | 34.2 | 106 tok/s | 0.7s | |
GLM-4.6 (Non-reasoning) | Z AI | — | — | — | 30.2 | 67 tok/s | 0.9s | |
GLM-4.6V (Non-reasoning) | Z AI | — | — | — | 17.1 | 23 tok/s | 5.9s | |
GLM-4.5V (Non-reasoning) | Z AI | — | — | — | 12.7 | 39 tok/s | 29.9s | |
GLM-5.1 (Non-reasoning) | Z AI | — | — | — | 43.8 | 47 tok/s | 2.1s | |
GLM-5 (Non-reasoning) | Z AI | — | — | — | 40.6 | 53 tok/s | 1.4s | |
GLM 5V Turbo (Reasoning) | Z AI | — | — | — | 42.9 | — | — | |
GLM-5.1 (Reasoning) | Z AI | — | — | — | 51.4 | 43 tok/s | 1.2s | |
GLM-5-Turbo | Z AI | — | — | — | 46.8 | — | — | |
GLM-4.7-Flash (Reasoning) | Z AI | — | — | — | 30.1 | 91 tok/s | 0.9s | |
GLM-4.7-Flash (Non-reasoning) | Z AI | — | — | — | 22.1 | 105 tok/s | 1.0s | |
GLM-5 (Reasoning) | Z AI | — | — | — | 49.8 | 67 tok/s | 0.9s |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
6 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.