OpenAIReleased November 6, 2023

Whisper Large v3

Open Source1.5B parameters

Context

N/A (audio)

Input

Free (open)

Key Specifications

🏆

Arena Rank

Not disclosed

📐

Context Window

N/A (audio)

📥

Input Price

per 1M tokens

Free (open)

📤

Output Price

per 1M tokens

Free (open)

🧠

Parameters

1.5B

🔓

Open Source

Yes

Best For

Speech recognitiontranscriptiontranslation

About Whisper Large v3

Whisper Large v3 is OpenAI's most capable automatic speech recognition model, supporting transcription and translation across 100+ languages. At 1.5 billion parameters, it delivers near-human accuracy on many languages and handles noisy, accented, and multilingual audio with remarkable robustness. It has become the de facto standard for open-source speech recognition.

Built byOpenAI

Pricing per 1M tokens

Input Tokens

Free (open)

Output Tokens

Free (open)

Frequently Asked Questions

What is Whisper Large v3?
Whisper Large v3 is OpenAI's most capable automatic speech recognition model, supporting transcription and translation across 100+ languages. At 1.5 billion parameters, it delivers near-human accuracy on many languages and handles noisy, accented, and multilingual audio with remarkable robustness. It has become the de facto standard for open-source speech recognition.
How much does Whisper Large v3 cost?
Whisper Large v3 costs Free (open) per 1 million input tokens and Free (open) per 1 million output tokens. Pricing is based on token usage, making it cost-effective for both small and large-scale applications.
What is Whisper Large v3's context window?
Whisper Large v3 has a context window of N/A (audio) tokens. This determines how much text the model can process in a single request — larger context windows allow the model to handle longer documents, maintain more conversation history, and reason over bigger codebases.
Is Whisper Large v3 open source?
Yes, Whisper Large v3 is open source. This means the model weights are publicly available, allowing developers and organizations to download, fine-tune, and self-host the model on their own infrastructure. Open-source models offer greater flexibility and data privacy control.
What is Whisper Large v3 best for?
Whisper Large v3 is best suited for: Speech recognition, transcription, translation. These use cases leverage the model's specific strengths in terms of capability, speed, and cost-effectiveness within OpenAI's model lineup.