DeepSeek V3
DeepSeek V3 ranks #5 on the Chatbot Arena leaderboard. Context window: 128K tokens.
Context: 128K tokens
Input: $0.27 per 1M tokens
Key Specifications
Arena Rank: #5
Context Window: 128K tokens
Input Price: $0.27 per 1M tokens
Output Price: $1.10 per 1M tokens
Parameters: 671B total (37B active)
Open Source: Yes
Best For
About DeepSeek V3
DeepSeek V3, developed by DeepSeek, is a Mixture-of-Experts (MoE) model with 671 billion total parameters, of which 37 billion are active for any given token, and a 128K-token context window. It uses multi-head latent attention to shrink the key-value cache and auxiliary-loss-free load balancing to keep expert routing evenly distributed. Its final training run reportedly cost approximately $5.6 million in GPU compute, a figure that challenged industry assumptions about the compute budget required for frontier AI. It performs competitively with GPT-4o and Claude 3.5 Sonnet across general reasoning, coding, and multilingual benchmarks. At $0.27 per million input tokens and $1.10 per million output tokens, it pairs that capability with low API pricing. Because its weights are openly released, it can be self-hosted and fine-tuned. DeepSeek V3 ranks #5 on the Chatbot Arena leaderboard, placing it among the most capable open models available.
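To make the routing idea concrete, here is a minimal sketch of bias-based (auxiliary-loss-free) load balancing in plain Python. It is an illustration under stated assumptions, not DeepSeek's implementation: the function names `route_top_k` and `update_bias`, the step size, and the toy scores are all hypothetical. The sketch assumes each token has a positive affinity score per expert, that a per-expert bias influences only which experts are selected, and that the bias is nudged after each batch according to how loaded each expert was.

```python
def route_top_k(scores, bias, k):
    """Select k experts for one token.

    The bias is added only when ranking experts; the gate weights
    are computed from the original, unbiased scores.
    """
    ranked = sorted(
        range(len(scores)),
        key=lambda e: scores[e] + bias[e],
        reverse=True,
    )
    chosen = ranked[:k]
    total = sum(scores[e] for e in chosen)
    return {e: scores[e] / total for e in chosen}


def update_bias(bias, load, step=0.01):
    """Lower the bias of overloaded experts and raise it for underloaded ones.

    `load` holds the number of tokens each expert received in the last batch.
    `step` is an illustrative value, not a published hyperparameter.
    """
    mean_load = sum(load) / len(load)
    new_bias = []
    for b, n in zip(bias, load):
        if n > mean_load:
            new_bias.append(b - step)
        elif n < mean_load:
            new_bias.append(b + step)
        else:
            new_bias.append(b)
    return new_bias


if __name__ == "__main__":
    scores = [0.9, 0.5, 0.1]  # toy affinities for three experts
    print(route_top_k(scores, [0.0, 0.0, 0.0], k=2))  # experts 0 and 1
    print(route_top_k(scores, [0.0, 0.0, 1.0], k=2))  # bias pulls in expert 2
    print(update_bias([0.0, 0.0, 0.0], load=[3, 1, 2]))
```

The design point is that balance is steered by adjusting a selection bias rather than by adding a penalty term to the training loss, so the balancing mechanism does not compete with the language-modeling objective.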
Pricing per 1M tokens
Input Tokens: $0.27
Output Tokens: $1.10
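As a rough illustration of how these per-million-token rates translate into a bill, the sketch below estimates the cost of a workload from its token counts. The helper name `estimate_cost` and the example token counts are hypothetical; the two rates are the ones listed on this page, and the calculation assumes flat pricing with no caching or volume discounts.

```python
# Listed rates in USD per 1M tokens (from this page).
INPUT_PRICE_PER_M = 0.27
OUTPUT_PRICE_PER_M = 1.10


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    total = (
        input_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    cost = estimate_cost(2_000_000, 500_000)
    print(f"Estimated cost: ${cost:.2f}")
```

Because output tokens cost roughly four times as much as input tokens at these rates, workloads that generate long responses are dominated by the output side of the calculation.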