AI Model Comparison

Compare pricing, benchmarks, and capabilities across 17 AI models

17 models tracked0 open source

All Language Models Text → Image Text → Video Text → Speech Image → Video

Type

All Proprietary Open Source

Provider

All AI21 Labs Alibaba Alibaba Cloud Allen Institute for AI Amazon Anthropic Arcee AI Baidu ByteDance Seed Cartesia China Mobile Cohere Coqui Databricks Deep Cogito DeepSeek ElevenLabs Fish Audio Google Google DeepMind Gradium Hume AI IBM Inception InclusionAI Inworld Kimi KlingAI Kokoro Korea Telecom KwaiKAT LG AI Research LMNT Liquid AI LongCat MBZUAI Institute of Foundation Models Maya Research Meta Meta AI MetaVoice Microsoft MiniMax Mistral Mistral AI Motif Technologies Murf AI NVIDIA Nanbeige Naver Neuphonic Nous Research OpenAI OpenChat OpenVoice Perplexity Prime Intellect Reka AI Resemble AI Rime Sarvam ServiceNow Smallest.ai Snowflake Speechify StepFun StyleTTS Swiss AI Initiative TII TII UAE Tencent Trillion Labs Upstage Xiaomi Z AI Zyphra async xAI

Price

Any Free <$1/M <$5/M <$20/M

Sort

Best Benchmark Cheapest First Most Expensive Largest Context Fastest

Clear all filters

Model	Provider	Input $/1M↕	Output $/1M↕	Context↕	Intelligence↑	Speed↕	Latency	API
Llama 4 Maverick	Meta	—	—	—	18.4	113 tok/s	0.6s
Llama 4 Scout	Meta	—	—	—	13.5	134 tok/s	0.6s
Llama 3.1 Instruct 405B	Meta	—	—	—	17.4	66 tok/s	0.6s
Llama 3.3 Instruct 70B	Meta	—	—	—	14.5	93 tok/s	0.6s
Llama 3.1 Instruct 70B	Meta	—	—	—	12.5	34 tok/s	0.6s
Llama 3.2 Instruct 90B (Vision)	Meta	—	—	—	11.9	48 tok/s	0.6s
Llama 3 Instruct 70B	Meta	—	—	—	8.9	46 tok/s	0.7s
Llama 3.1 Instruct 8B	Meta	—	—	—	11.8	203 tok/s	0.5s
Llama 3.2 Instruct 11B (Vision)	Meta	—	—	—	8.7	86 tok/s	0.4s
Llama 2 Chat 70B	Meta	—	—	—	8.4	—	—
Llama 2 Chat 13B	Meta	—	—	—	8.4	—	—
Llama 3 Instruct 8B	Meta	—	—	—	6.4	82 tok/s	0.5s
Llama 3.2 Instruct 3B	Meta	—	—	—	9.7	52 tok/s	0.7s
Llama 3.2 Instruct 1B	Meta	—	—	—	6.3	98 tok/s	0.6s
Llama 2 Chat 7B	Meta	—	—	—	9.7	98 tok/s	10.3s
Muse Spark	Meta	—	—	—	52.1	—	—
Llama 65B	Meta	—	—	—	7.4	—	—

Estimate Your Monthly Cost

Enter your expected usage to compare costs across models

Input tokens per month

e.g. 1,000,000 = ~750,000 words

Output tokens per month

Usually 30–50% of input volume

Select models to compare

Llama 4 MaverickMetaLlama 4 ScoutMetaLlama 3.1 Instruct 405BMetaLlama 3.3 Instruct 70BMetaLlama 3.1 Instruct 70BMetaLlama 3.2 Instruct 90B (Vision)MetaLlama 3 Instruct 70BMetaLlama 3.1 Instruct 8BMetaLlama 3.2 Instruct 11B (Vision)MetaLlama 2 Chat 70BMetaLlama 2 Chat 13BMetaLlama 3 Instruct 8BMetaLlama 3.2 Instruct 3BMetaLlama 3.2 Instruct 1BMetaLlama 2 Chat 7BMetaMuse SparkMetaLlama 65BMeta

6 models selected

Model	Input Cost	Output Cost	Total/Month	vs Cheapest
Llama 4 Maverick Meta	—	—	—	—
Llama 4 Scout Meta	—	—	—	—
Llama 3.1 Instruct 405B Meta	—	—	—	—
Llama 3.3 Instruct 70B Meta	—	—	—	—
Llama 3.1 Instruct 70B Meta	—	—	—	—
Llama 3.2 Instruct 90B (Vision) Meta	—	—	—	—

Prices are approximate and may vary. Check provider documentation for current pricing.