AI Model Comparison

Compare pricing, benchmarks, and capabilities across 58 AI models

58 models tracked0 open source

All Language Models Text → Image Text → Video Text → Speech Image → Video

Type

Provider

All AI21 Labs Alibaba Alibaba Cloud Allen Institute for AI Amazon Anthropic Arcee AI Baidu ByteDance Seed Cartesia China Mobile Cohere Coqui Databricks Deep Cogito DeepSeek ElevenLabs Fish Audio Google Google DeepMind Gradium Hume AI IBM Inception InclusionAI Inworld Kimi KlingAI Kokoro Korea Telecom KwaiKAT LG AI Research LMNT Liquid AI LongCat MBZUAI Institute of Foundation Models Maya Research Meta Meta AI MetaVoice Microsoft MiniMax Mistral Mistral AI Motif Technologies Murf AI NVIDIA Nanbeige Naver Neuphonic Nous Research OpenAI OpenChat OpenVoice Perplexity Prime Intellect Reka AI Resemble AI Rime Sarvam ServiceNow Smallest.ai Snowflake Speechify StepFun StyleTTS Swiss AI Initiative TII TII UAE Tencent Trillion Labs Upstage Xiaomi Z AI Zyphra async xAI

Price

Any Free <$1/M <$5/M <$20/M

Sort

Best Benchmark Cheapest First Most Expensive Largest Context Fastest

Clear all filters

Model	Provider	Input $/1M↕	Output $/1M↕	Context↕	Intelligence↑	Speed↕	Latency
Gemini 3 Pro Preview (low)	Google	—	—	—	41.3	—	—
Gemini 3 Flash Preview (Reasoning)	Google	—	—	—	46.4	203 tok/s	6.4s
Gemini 2.5 Pro Preview (Mar' 25)	Google	—	—	—	30.3	—	—
Gemini 2.5 Pro	Google	—	—	—	34.6	135 tok/s	20.0s
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)	Google	—	—	—	25.7	—	—
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)	Google	—	—	—	31.1	—	—
Gemini 2.5 Pro Preview (May' 25)	Google	—	—	—	29.5	—	—
Gemini 2.5 Flash (Reasoning)	Google	—	—	—	27	246 tok/s	14.3s
Gemini 2.0 Pro Experimental (Feb '25)	Google	—	—	—	18.1	—	—
Gemini 2.5 Flash (Non-reasoning)	Google	—	—	—	20.6	215 tok/s	0.5s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)	Google	—	—	—	21.6	—	—
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)	Google	—	—	—	19.4	—	—
Gemini 2.0 Flash Thinking Experimental (Jan '25)	Google	—	—	—	19.6	—	—
Gemini 2.5 Flash Preview (Reasoning)	Google	—	—	—	24.3	—	—
Gemini 2.0 Flash (Feb '25)	Google	—	—	—	18.5	—	—
Gemini 2.5 Flash Preview (Non-reasoning)	Google	—	—	—	17.8	—	—
Gemini 2.0 Flash (experimental)	Google	—	—	—	16.8	—	—
Gemini 2.5 Flash-Lite (Reasoning)	Google	—	—	—	17.6	302 tok/s	24.7s
Gemini 1.5 Pro (Sep '24)	Google	—	—	—	16	—	—
Gemini 2.0 Flash-Lite (Feb '25)	Google	—	—	—	14.7	—	—
Gemini 2.5 Flash-Lite (Non-reasoning)	Google	—	—	—	12.7	268 tok/s	1.7s
Gemini 1.5 Flash (Sep '24)	Google	—	—	—	13.8	—	—
Gemini 1.5 Pro (May '24)	Google	—	—	—	12	—	—
Gemma 3 12B Instruct	Google	—	—	—	8.8	—	—
Gemini 1.5 Flash-8B	Google	—	—	—	11.1	—	—
Gemini 1.5 Flash (May '24)	Google	—	—	—	10.5	—	—
Gemma 3n E4B Instruct	Google	—	—	—	6.4	30 tok/s	0.7s
Gemma 3n E4B Instruct Preview (May '25)	Google	—	—	—	10.1	—	—
Gemini 1.0 Pro	Google	—	—	—	8.5	—	—
Gemma 3 4B Instruct	Google	—	—	—	6.3	—	—
Gemma 3n E2B Instruct	Google	—	—	—	4.8	—	—
Gemma 3 1B Instruct	Google	—	—	—	5.5	—	—
Gemma 3 270M	Google	—	—	—	7.7	—	—
Gemini 2.5 Flash TTS (Dec 2025)	Google	—	—	—	—	—	—
Gemini 3 Deep Think	Google	—	—	—	—	—	—
Gemma 4 31B (Reasoning)	Google	—	—	—	39.2	35 tok/s	1.0s
Gemma 4 26B A4B (Non-reasoning)	Google	—	—	—	27.1	—	—
Gemma 4 E2B (Non-reasoning)	Google	—	—	—	12.1	—	—
Gemma 4 E4B (Reasoning)	Google	—	—	—	18.8	44 tok/s	1.0s
Gemma 4 E4B (Non-reasoning)	Google	—	—	—	14.8	55 tok/s	0.5s
Gemma 4 E2B (Reasoning)	Google	—	—	—	15.2	—	—
Gemini 3.1 Pro Preview	Google	—	—	—	57.2	130 tok/s	22.5s
Gemini 3.1 Flash-Lite Preview	Google	—	—	—	33.5	350 tok/s	5.1s
Gemini 2.0 Flash-Lite (Preview)	Google	—	—	—	14.5	—	—
Gemini 2.0 Flash Thinking Experimental (Dec '24)	Google	—	—	—	12.3	—	—
PALM-2	Google	—	—	—	8.6	—	—
Gemini 1.0 Ultra	Google	—	—	—	10.1	—	—
Studio	Google	—	—	—	—	—	—
Journey	Google	—	—	—	—	—	—
Gemini 2.5 Pro (Dec 2025)	Google	—	—	—	—	—	—
WaveNet	Google	—	—	—	—	—	—
Gemma 4 26B A4B (Reasoning)	Google	—	—	—	31.2	—	—
Chirp 3: HD	Google	—	—	—	—	—	—
Standard	Google	—	—	—	—	—	—
Neural2	Google	—	—	—	—	—	—
Gemini 3.1 Flash TTS	Google	—	—	—	—	—	—
Gemini 2.5 Flash Lite TTS	Google	—	—	—	—	—	—
Gemma 4 31B (Non-reasoning)	Google	—	—	—	32.3	—	—

Model	Input Cost	Output Cost	Total/Month	vs Cheapest
Gemini 3 Pro Preview (low) Google	—	—	—	—
Gemini 3 Flash Preview (Reasoning) Google	—	—	—	—
Gemini 2.5 Pro Preview (Mar' 25) Google	—	—	—	—
Gemini 2.5 Pro Google	—	—	—	—
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) Google	—	—	—	—
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) Google	—	—	—	—

AI Model Comparison

Estimate Your Monthly Cost