Question 1

Is this actually a drop-in OpenAI Whisper replacement?

Accepted Answer

Yes. Same endpoint shape (/audio/transcriptions), same request fields, same response JSON. If you point the OpenAI SDK at https://www.tryspeakeasy.io/api/v1 your existing code keeps working — no rewrite, no new SDK.

Question 2

How is it 44% cheaper without losing quality?

Accepted Answer

Same Whisper model family, leaner deployment. OpenAI's $0.36/hr ($0.006/min list price) bakes in a heavy margin and brand premium on top of the inference cost. We run the same checkpoint on commodity GPUs with aggressive batching, charge $0.20/hr, and still run a sustainable margin. There's no quality trade-off because there's no model substitution — you're getting the same weights, just billed differently.

Question 3

What about Deepgram or AssemblyAI?

Accepted Answer

Both are great products but priced for enterprise — Deepgram Nova at ~$0.43/hr, AssemblyAI at ~$0.37/hr. Their billing is also opaque (per-second tiers, feature add-ons). SpeakEasy is hours-based and predictable. If you need diarization or real-time streaming, look at Deepgram. If you need cheap, accurate transcription with one API call, this is the answer.

Question 4

Are there rate limits I should worry about?

Accepted Answer

The free playground above is rate-limited (5 transcriptions/day/IP) to stop abuse. The paid API has generous per-account limits — multi-thousand RPM on the entry plan. If you hit them, we lift them on request.

Question 5

What languages does it handle?

Accepted Answer

Whisper supports 99 languages out of the box. Our deployment passes that through unchanged — set language='auto' to detect, or hint a specific language code (e.g. 'en', 'de', 'es') to skip detection and shave a few hundred ms.

Question 6

What's the catch?

Accepted Answer

Honestly, none. We don't do streaming yet (working on it), we don't do speaker diarization (also coming), and we don't do TTS on the same endpoint (separate /audio/speech endpoint exists). For batch transcription of recorded audio — meetings, podcasts, voice notes — there's no catch. It's just cheaper.

Provider	$ / hr	Notes
SpeakEasycheapest	$0.20	Whisper-quality, OpenAI-compatible API, hours-based billing
OpenAI Whisper API	$0.36	$0.006/min — the price floor we beat
Deepgram Nova	$0.43	$0.0072/min on pay-as-you-go
AssemblyAI	$0.37	$0.00062/sec on the universal model
Google Cloud Speech	$0.96	$0.016/min standard model

$0.20/hr vs OpenAI's $0.36/hr.

One-line swap from OpenAI

FAQ