Skip to main content
Back to Models
Live Rankings

AI Model Benchmarks

Compare 113 AI models side by side — arena rankings, pricing, context windows, and capabilities. Updated weekly.

Total Models

113

tracked & ranked

Open Source

58

freely available

Avg Input Price

$3.39

per 1M tokens

Providers

36

AI companies

Most Affordable (Input)

FREE

Models by Provider

OpenAI12
Google DeepMind11
Mistral AI10
Anthropic8
Meta AI7
Cohere6
xAI4
Microsoft4

All Models (113)

Model Provider Context Input / 1M Output / 1M Rank Best For
Claude Opus 4Anthropic200K$5.00$25.00#1Complex reasoning, coding, agentic tasks
GPT-o3OpenAI200K$2.00$8.00#2Advanced reasoning, agentic tasks, research
GPT-4oOpenAI128K$2.50$10.00#2General purpose, coding, analysis
GPT-o1OpenAI200K$15.00$60.00#3Complex reasoning, math, science, coding
Claude Sonnet 4Anthropic200K$3.00$15.00#3Coding, writing, long documents
DeepSeek R1OSSDeepSeek128K$0.55$2.19#3Complex reasoning, math, science, coding
GPT-4.5OpenAI128K$75.00$150.00#4Creative writing, nuanced understanding, EQ tasks
Gemini 1.5 ProGoogle DeepMind1M$3.50$10.50#4Long documents, multimodal analysis, coding
Gemini 2.5 ProGoogle DeepMind1M$1.25$10.00#4Long documents, multimodal, reasoning
Grok 3xAI128K$3.00$15.00#5Real-time info, reasoning, math
GPT-4 TurboOpenAI128K$10.00$30.00#5Complex tasks, coding, analysis, vision
DeepSeek V3OSSDeepSeek128K$0.27$1.10#5Coding, math, general reasoning
Qwen 2.5 72BOSSAlibaba DAMO128KFree (open)Free (open)#6Multilingual, coding, math, reasoning
GPT-o4 MiniOpenAI200K$1.10$4.40#6Affordable reasoning, coding, STEM tasks
Llama 4 MaverickOSSMeta1MFreeFree#7Open source, self-hosted, multilingual
Claude 3 OpusAnthropic200K$15.00$75.00#7Complex analysis, research, nuanced writing
Grok-2xAI128K$2.00$10.00#7Real-time information, reasoning, coding
Qwen 3OSSAlibaba128KFreeFree#7Multilingual, reasoning, agentic tasks
Mistral LargeMistral AI256K$0.50$1.50#8European privacy, multilingual, code
Gemini 2.0 FlashGoogle DeepMind1M$0.10$0.40#8Agentic tasks, multimodal, tool use
Mistral Large 2OSSMistral AI128K$2.00$6.00#8Multilingual, coding, complex reasoning
Moonshot Kimi k2OSSMoonshot AI131K$0.55$2.20#8Coding, agentic tasks, reasoning
Claude 3.5 SonnetAnthropic200K$3.00$15.00#8Coding, analysis, writing, vision
Llama 3.1 405BOSSMeta AI128KFree (open)Free (open)#9Complex reasoning, coding, multilingual tasks
Qwen 2.5 MaxAlibaba32K$1.60$6.40#9Multilingual, Chinese/English, reasoning
Gemini 1.5 FlashGoogle DeepMind1M$0.075$0.30#10High-volume tasks, summarization, chat
Gemini 2.5 FlashGoogle DeepMind1M$0.30$2.50#10Fast reasoning, cost-efficient, multimodal
MiniMax-01OSSMiniMax4M$0.50$1.10#11Ultra-long context, document analysis
Llama 3.2 90B VisionOSSMeta AI128KFree (open)Free (open)#11Image understanding, visual QA, multimodal tasks
Llama 4 ScoutOSSMeta10MFreeFree#12Long context, open source, multilingual
Claude 3.5 HaikuAnthropic200K$0.80$4.00#12Fast coding, data extraction, classification
Llama 3.3 70BOSSMeta AI128KFree (open)Free (open)#13Instruction following, coding, reasoning
Llama 3.3OSSMeta128KFreeFree#13General purpose, multilingual, coding
Llama 3.1 70BOSSMeta AI128KFree (open)Free (open)#14Balanced performance, fine-tuning, deployment
GPT-4o MiniOpenAI128K$0.15$0.60#15Fast responses, cost-efficient tasks, lightweight apps
Claude Haiku 4.5Anthropic200K$1.00$5.00#15Fast responses, classification, extraction
Mixtral 8x22BOSSMistral AI64K$0.90$2.70#16Efficient reasoning, multilingual, coding
Mistral MediumMistral AI128K$0.40$2.00#16Enterprise tasks, European languages
Grok 3 MinixAI128K$0.30$0.50#16Lightweight reasoning, fast responses, chat
Grok-2 MinixAI128K$0.30$1.50#16Fast responses, chat, lightweight tasks
Reka CoreReka AI128K$2.00$2.00#17Multimodal reasoning, video understanding, multilingual
Command R+OSSCohere128K$2.50$10.00#17RAG, enterprise search, multilingual
Gemma 2 27BOSSGoogle DeepMind8KFree (open)Free (open)#18Research, fine-tuning, on-premise deployment
Qwen 2.5 CoderOSSAlibaba128KFreeFree#18Code generation, code review, debugging
Gemma 3OSSGoogle DeepMind128KFreeFree#19Open source, on-device, research
Mistral SmallOSSMistral AI32K$0.20$0.60#19Fast inference, cost-effective tasks, chat
DBRXOSSDatabricks32KFree (open)Free (open)#20Enterprise AI, data analysis, coding
Claude 3 HaikuAnthropic200K$0.25$1.25#20Quick tasks, chatbots, content moderation
Gemini 2.0 Flash LiteGoogle DeepMind1M$0.075$0.30#22High-volume, low-cost tasks
Llama 3.1 8BOSSMeta AI128KFree (open)Free (open)#22Edge deployment, mobile, fast inference
Command ROSSCohere128K$0.15$0.60#23Cost-effective RAG, summarization, chat
Baichuan 4Baichuan AI128K$2.00$2.00#24Chinese language, enterprise tasks
GPT-3.5 TurboOpenAI16K$0.50$1.50#25Fast responses, chatbots, simple tasks
Gemma 2OSSGoogle DeepMind8KFreeFree#26On-device AI, research, fine-tuning
Mistral NemoOSSMistral AI128K$0.30$0.30#27Lightweight tasks, drop-in replacement
Phi-4OSSMicrosoft16KFreeFree#28Small model research, edge deployment, reasoning
Falcon 180BOSSTechnology Innovation Institute4KFree (open)Free (open)Research, multilingual generation, fine-tuning
Pika 1.5PikaN/A (video)Credits-basedCredits-basedVideo generation, video editing, effects
Whisper V3OSSOpenAIN/A (audio)FreeFreeSpeech-to-text, transcription, translation
Jamba 1.5 LargeOSSAI21 Labs256K$2.00$8.00Long documents, enterprise RAG, analysis
Falcon 40BOSSTechnology Innovation Institute2KFree (open)Free (open)General tasks, fine-tuning, research
Whisper Large v3OSSOpenAIN/A (audio)Free (open)Free (open)Speech recognition, transcription, translation
Llama 3 8BOSSMeta AI8KFree (open)Free (open)Edge deployment, fast inference, fine-tuning
Llama 3 70BOSSMeta AI8KFree (open)Free (open)General tasks, fine-tuning, instruction following
Mistral 7BOSSMistral AI32KFree (open)Free (open)Efficient tasks, fine-tuning, edge deployment
Mixtral 8x7BOSSMistral AI32KFree (open)Free (open)Efficient inference, multilingual, coding
GLM-4Zhipu AI128KUndisclosedUndisclosedChinese language tasks, reasoning, coding
Jamba 1.5 Mini (SSM)OSSAI21 Labs256K/bin/zsh.20/bin/zsh.40Efficient long-context processing, throughput
Stable Diffusion 3.5 LargeOSSStability AIN/A (image)FreeFreeOpen source image generation, customization, fine-tuning
FLUX.1 ProBlack Forest LabsN/A (image)API-basedAPI-basedProfessional image generation, design, marketing
Pixtral LargeOSSMistral AI128K$2.00$6.00Image understanding, visual reasoning, documents
Stable Diffusion 3OSSStability AIN/A (image)Free (open)Free (open)Image generation, art creation, design
Inflection 2.5Inflection AI8KN/AN/AConversational AI, emotional intelligence, empathy
Cohere Embed v4Cohere128K$0.12$0.12Semantic search, RAG embeddings, document retrieval
Stable Video DiffusionOSSStability AIN/A (video)Free (open)Free (open)Video generation, animation, visual effects
Gen-3 AlphaRunwayN/A (video)Credits-basedCredits-basedProfessional video generation, filmmaking
Eleven Turbo v2.5ElevenLabsN/A (audio)Credits-basedCredits-basedReal-time speech synthesis, conversational AI
Nemotron 4 340BOSSNVIDIA4KFree (open)Free (open)Synthetic data generation, training pipelines
Phi-3 MediumOSSMicrosoft128KFree (open)Free (open)Balanced performance, reasoning, coding
ArcticOSSSnowflake4KFree (open)Free (open)SQL generation, enterprise data tasks, coding
WizardLM-2 8x22BOSSMicrosoft64KFree (open)Free (open)Complex instructions, reasoning, coding
Claude 2.1Anthropic200K$8.00$24.00Long documents, analysis, reduced hallucinations
Gemini 1.0 UltraGoogle DeepMind32KSubscription-basedSubscription-basedComplex reasoning, multimodal understanding
KrutrimKrutrim128K$0.10$0.30Hindi, Indian languages
Flux 1.1 ProBlack Forest Labs$0.04/imgImage generation, design
FLUX.1 SchnellOSSBlack Forest LabsN/A (image)Free (open)Free (open)Fast image generation, prototyping, development
Midjourney V6.1MidjourneyN/A (image)$10/month (Basic)$30/month (Standard)Photorealistic images, artistic creation, visual design
Aya 23 35BOSSCohere8KFree (open)Free (open)Multilingual tasks, low-resource languages
Aya ExpanseOSSCohere128KFreeFreeMultilingual (23 languages), research
Solar 10.7BOSSUpstage4KFree (open)Free (open)Korean-English bilingual, fine-tuning, enterprise
StarCoder2 15BOSSHugging Face16KFree (open)Free (open)Code completion, code generation, development
Yi-Large01.AI32KUndisclosedUndisclosedComplex reasoning, multilingual, analysis
Eleven Multilingual v2ElevenLabsN/A (audio)Credits-basedCredits-basedMultilingual TTS, audiobooks, dubbing
Zephyr 7BOSSHugging Face32KFree (open)Free (open)Chat, instruction following, lightweight deployment
SoraOpenAIVideo generation from text
CodestralMistral AI32K$0.30$0.90Code generation, code completion, debugging
Veo 2Google DeepMindVideo generation, cinematic shots
DALL-E 3OpenAIN/A (image)$0.04/image (1024x1024)$0.08/image (1792x1024)Image generation, creative design, illustration
Phi-3 MiniOSSMicrosoft128KFree (open)Free (open)Edge deployment, mobile, on-device AI
Qwen 2.5 Coder 32BOSSAlibaba DAMO128KFree (open)Free (open)Code generation, code review, debugging
Cohere Embed v3Cohere512 tokens$0.10/1M tokensN/A (embeddings)Search, RAG, semantic similarity, clustering
Sarvam-MOSSSarvam AI32K$0.20$0.20Indian languages, Indic NLP
SDXL TurboOSSStability AIN/A (image)Free (open)Free (open)Real-time image generation, rapid prototyping
DeepSeek Coder V2OSSDeepSeek128K$0.14$0.28Code generation, debugging, code review
Jamba 1.5 MiniOSSAI21 Labs256K$0.20$0.40Cost-effective long-context, summarization
KimiMoonshot AI2MUndisclosedUndisclosedUltra-long documents, research, analysis
Midjourney v6MidjourneyN/A (image)Subscription-basedSubscription-basedArtistic image generation, creative design
Yi-1.5 34BOSS01.AI4KFree (open)Free (open)Bilingual tasks, fine-tuning, research
Ernie 4.0Baidu AI128KUndisclosedUndisclosedChinese language, enterprise AI, search
QwQ 32BOSSAlibaba DAMO32KFree (open)Free (open)Reasoning, math, logical problem-solving
Hailuo AIMiniMaxN/A (video)Free tier availableFree tier availableVideo generation, Chinese market content
Gen-2RunwayN/A (video)Credits-basedCredits-basedVideo generation, creative content, effects
Dream MachineLuma AIN/A (video)Credits-basedCredits-basedVideo generation, 3D content, visual effects

Frequently Asked Questions

What is the best AI model in 2026?
Based on arena rankings, Claude Opus 4 by Anthropic holds the #1 position. It excels at Complex reasoning, coding, agentic tasks with a 200K context window.
Which AI model is cheapest?
Llama 4 Maverick offers the lowest input pricing at Free/1M tokens. There are also 11 free models available.
Which AI model has the largest context window?
Llama 4 Scout leads with 10M context window, followed by MiniMax-01 at 4M.
How many open-source AI models are there?
Awaira tracks 58 open-source models out of 113 total. Open-source models include DeepSeek R1, DeepSeek V3, Qwen 2.5 72B and 55 more.
What is the average AI model pricing?
The average input pricing across paid models is $3.39 per 1M tokens. Output pricing is typically 2-5x higher than input pricing.