AI Model Comparison

Compare pricing, benchmarks, and capabilities across 18 AI models

18 models tracked8 open source

All Language Models Text → Image Text → Video Text → Speech Image → Video

Type

All Proprietary Open Source

Provider

All AI21 Labs Alibaba Alibaba Cloud Allen Institute for AI Amazon Anthropic Arcee AI Baidu ByteDance Seed Cartesia China Mobile Cohere Coqui Databricks Deep Cogito DeepSeek ElevenLabs Fish Audio Google Google DeepMind Gradium Hume AI IBM Inception InclusionAI Inworld Kimi KlingAI Kokoro Korea Telecom KwaiKAT LG AI Research LMNT Liquid AI LongCat MBZUAI Institute of Foundation Models Maya Research Meta Meta AI MetaVoice Microsoft MiniMax Mistral Mistral AI Motif Technologies Murf AI NVIDIA Nanbeige Naver Neuphonic Nous Research OpenAI OpenChat OpenVoice Perplexity Prime Intellect Reka AI Resemble AI Rime Sarvam ServiceNow Smallest.ai Snowflake Speechify StepFun StyleTTS Swiss AI Initiative TII TII UAE Tencent Trillion Labs Upstage Xiaomi Z AI Zyphra async xAI

Price

Any Free <$1/M <$5/M <$20/M

Sort

Best Benchmark Cheapest First Most Expensive Largest Context Fastest

Clear all filters

Model	Provider	Input $/1M↕	Output $/1M↕	Context↕	Intelligence↑	Speed↕	Latency	API
DeepSeek R2 ★	DeepSeek	$0.55	$2.19	128K	91%	60 tok/s	—
Llama 3.3 70B Open★	Meta AI	$0.23	$0.92	128K	86%	80 tok/s	—
DeepSeek V3 Open	DeepSeek	$0.27	$1.1	128K	88.5%	80 tok/s	—
Qwen3-Max	Alibaba Cloud	$0.4	$1.2	32K	87%	90 tok/s	—
Qwen3-72B Open	Alibaba Cloud	Free	Free	32K	85%	100 tok/s	—
Phi-4 Open	Microsoft	$0.07	$0.14	16K	84.8%	300 tok/s	—
Grok 3 Mini	xAI	$0.3	$0.5	131K	83%	160 tok/s	—
Gemini 3 Flash	Google DeepMind	$0.075	$0.3	1M	82%	250 tok/s	—
GPT-4o mini	OpenAI	$0.15	$0.6	128K	82%	200 tok/s	—
Claude Haiku 4.5	Anthropic	$0.8	$4	200K	75.2%	250 tok/s	—
Gemma 3 27B Open	Google DeepMind	Free	Free	128K	75%	120 tok/s	—
DBRX Open	Databricks	$0.75	$2.25	33K	73.7%	100 tok/s	—
Llama 3.2 11B Vision Open	Meta AI	$0.18	$0.18	128K	73%	150 tok/s	—
Command R	Cohere	$0.15	$0.6	128K	72%	150 tok/s	—
Gemini 3.1 Flash-Lite	Google DeepMind	$0.01	$0.04	1M	72%	500 tok/s	—
Mistral Small	Mistral AI	$0.1	$0.3	32K	72%	200 tok/s	—
Falcon 180B Open	TII	Free	Free	4K	70.4%	20 tok/s	—
Codestral	Mistral AI	$0.3	$0.9	32K	—	180 tok/s	—

Estimate Your Monthly Cost

Enter your expected usage to compare costs across models

Input tokens per month

e.g. 1,000,000 = ~750,000 words

Output tokens per month

Usually 30–50% of input volume

Select models to compare

DeepSeek R2DeepSeekLlama 3.3 70BMetaDeepSeek V3DeepSeekQwen3-MaxAlibabaQwen3-72BAlibabaPhi-4MicrosoftGrok 3 MinixAIGemini 3 FlashGoogleGPT-4o miniOpenAIClaude Haiku 4.5AnthropicGemma 3 27BGoogleDBRXDatabricksLlama 3.2 11B VisionMetaCommand RCohereGemini 3.1 Flash-LiteGoogleMistral SmallMistralFalcon 180BTIICodestralMistral

6 models selected

Model	Input Cost	Output Cost	Total/Month	vs Cheapest
Qwen3-72B Alibaba Cloud	$0.00	$0.00	$0.00	—
Phi-4 Microsoft	$0.07	$0.07	$0.14	✓ Best value
Llama 3.3 70B Meta AI	$0.23	$0.46	$0.69	4.9× more
DeepSeek V3 DeepSeek	$0.27	$0.55	$0.82	5.9× more
Qwen3-Max Alibaba Cloud	$0.40	$0.60	$1.00	7.1× more
DeepSeek R2 DeepSeek	$0.55	$1.09	$1.65	11.7× more

Prices are approximate and may vary. Check provider documentation for current pricing.