∞AI
ToolsModelsJobsHackathons
SubmitSign In

AI Model Comparison

Compare pricing, benchmarks, and capabilities across 19 AI models

19 models tracked0 open source
AllLanguage ModelsText → ImageText → VideoText → SpeechImage → Video
Type
AllProprietaryOpen Source
Provider
AllAI21 LabsAlibabaAlibaba CloudAllen Institute for AIAmazonAnthropicArcee AIBaiduByteDance SeedCartesiaChina MobileCohereCoquiDatabricksDeep CogitoDeepSeekElevenLabsFish AudioGoogleGoogle DeepMindHume AIIBMInceptionInclusionAIInworldKimiKokoroKorea TelecomKwaiKATLG AI ResearchLMNTLiquid AILongCatMBZUAI Institute of Foundation ModelsMaya ResearchMetaMeta AIMetaVoiceMicrosoftMicrosoft AzureMiniMaxMistralMistral AIMotif TechnologiesMurf AINVIDIANanbeigeNaverNeuphonicNous ResearchOpenAIOpenChatOpenVoicePerplexityPrime IntellectReka AIResemble AIRimeSarvamServiceNowSmallest.aiSnowflakeSpeechifyStepFunStyleTTS Swiss AI InitiativeTIITII UAETrillion LabsUpstageXiaomiZ AIZyphraasyncxAI
Price
AnyFree<$1/M<$5/M<$20/M
Sort
Best BenchmarkCheapest FirstMost ExpensiveLargest ContextFastest
Clear all filters
ModelProviderInput $/1M↕Output $/1M↕Context↕Intelligence↑Speed↕LatencyAPI
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA———
15
42 tok/s0.7s
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA———
18.7
60 tok/s0.3s
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA———
24.3
133 tok/s1.3s
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA———
18.5
——
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA———
14.9
151 tok/s0.5s
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA———
13.2
153 tok/s0.7s
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA———
14.8
117 tok/s0.3s
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA———
14.3
——
Llama 3.1 Nemotron Instruct 70B
NVIDIA———
13.4
46 tok/s0.3s
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA———
14.6
58 tok/s0.3s
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA———
10.1
175 tok/s0.7s
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA———
13.2
78 tok/s0.3s
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA———
14.4
——
Magpie Multilingual
NVIDIA——————
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA———
36
154 tok/s1.1s
Nemotron Cascade 2 30B A3B
NVIDIA———
28.4
——
NVIDIA Nemotron 3 Nano 4B
NVIDIA———
14.7
——
Magpie-Multilingual 357M
NVIDIA——————
Magpie-Multilingual 357M (Feb 2026)
NVIDIA——————
∞AI

Everything AI. In one place.

Platform

ToolsModelsJobsHackathonsSubmit

Company

AboutContact

Stay updated

Get weekly AI news in your inbox

© 2026 ∞AI. Built for the AI community.everythingai.tech

Estimate Your Monthly Cost

Enter your expected usage to compare costs across models

e.g. 1,000,000 = ~750,000 words

Usually 30–50% of input volume

6 models selected

ModelInput CostOutput CostTotal/Monthvs Cheapest
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA
————
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA
————
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA
————
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA
————
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA
————
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA
————

Prices are approximate and may vary. Check provider documentation for current pricing.