Skip to main content
DeepSeekReleased December 26, 2024

DeepSeek V3

Open Source#5 Arena Rank671B (37B active) parameters

DeepSeek V3 ranks in the top 10 on the Arena leaderboard. Context window: 0.128K tokens.

Context

128K

Input

$0.27

Key Specifications

🏆

Arena Rank

#5

📐

Context Window

128K

📥

Input Price

per 1M tokens

$0.27

📤

Output Price

per 1M tokens

$1.10

🧠

Parameters

671B (37B active)

🔓

Open Source

Yes

Best For

Codingmathgeneral reasoning

About DeepSeek V3

DeepSeek V3, developed by DeepSeek, is a Mixture-of-Experts model with 671 billion total parameters (37 billion active) and a 128K token context window. The model uses multi-head latent attention and auxiliary-loss-free load balancing for efficient expert routing. Reportedly trained for approximately $5.6 million, DeepSeek V3 challenged industry assumptions about the compute costs required for frontier AI. It performs competitively with GPT-4o and Claude 3.5 Sonnet across general reasoning, coding, and multilingual benchmarks. Priced at $0.27 per million input tokens and $1.10 per million output tokens, it offers strong capability at accessible pricing. As a fully open-source model, it can be self-hosted and fine-tuned. DeepSeek V3 ranks #5 on the Chatbot Arena leaderboard, reflecting its status as one of the most capable open models available.

Built byDeepSeek

Pricing per 1M tokens

Input Tokens

$0.27

Output Tokens

$1.10

Frequently Asked Questions

What is DeepSeek V3?
DeepSeek V3, developed by DeepSeek, is a Mixture-of-Experts model with 671 billion total parameters (37 billion active) and a 128K token context window. The model uses multi-head latent attention and auxiliary-loss-free load balancing for efficient expert routing. Reportedly trained for approximately $5.6 million, DeepSeek V3 challenged industry assumptions about the compute costs required for frontier AI. It performs competitively with GPT-4o and Claude 3.5 Sonnet across general reasoning, coding, and multilingual benchmarks. Priced at $0.27 per million input tokens and $1.10 per million output tokens, it offers strong capability at accessible pricing. As a fully open-source model, it can be self-hosted and fine-tuned. DeepSeek V3 ranks #5 on the Chatbot Arena leaderboard, reflecting its status as one of the most capable open models available.
How much does DeepSeek V3 cost?
DeepSeek charges $0.27 per 1M input tokens for DeepSeek V3, with output at $1.10. Competitive with other models in its tier.
What is DeepSeek V3's context window?
DeepSeek V3 supports up to 128K tokens per request. A larger context window allows the model to reason over longer inputs, which matters for document analysis, code review, and multi-turn conversations.
Is DeepSeek V3 open source?
Yes — DeepSeek released DeepSeek V3 as open source. That means you're free to deploy it however you want: cloud, on-prem, edge. No API lock-in.
What is DeepSeek V3 best for?
DeepSeek positions DeepSeek V3 for: Coding, math, general reasoning. Real-world performance will depend on your specific prompts and data, but these are the intended strengths.