Compare pricing, benchmarks, and capabilities across 3 AI models
| Model | Provider | Type | ELO Rank↑ | ELO Score | Released |
|---|---|---|---|---|---|
StepAudio 2.5 TTS | StepFun | text-to-speech | #3 | 1187 | — |
Step TTS 2 (Mar 2026) | StepFun | text-to-speech | #9 | 1148 | — |
Step Audio EditX (Mar 2026) | StepFun | text-to-speech | #18 | 1104 | — |
Enter your expected usage to compare costs across models
e.g. 1,000,000 = ~750,000 words
Usually 30–50% of input volume
3 models selected
Prices are approximate and may vary. Check provider documentation for current pricing.