Question 1

What's an SRT file and where can I use it?

Accepted Answer

SubRip Subtitle (.srt) is the most widely supported subtitle format. Drop it into YouTube, Premiere, DaVinci Resolve, Final Cut, VLC, OBS, Vimeo, or any video editor — it just works. The file is plain text: a sequence number, a start/end timestamp, the line of text.

Question 2

How accurate are the timestamps?

Accepted Answer

Whisper produces segment-level timestamps with sub-second accuracy on clean audio. On noisy or overlapping speech you'll see drift in the 200–500ms range, which is normal for any ASR system at this price point. Open the SRT in your editor and you can nudge cues by hand if you need broadcast-grade timing.

Question 3

What audio formats can I upload?

Accepted Answer

Audio and video files both work — anything ffmpeg can decode. Common picks for subtitles: an .mp4 export from your video editor, an .m4a voice memo, a .wav recording from your DAW, or an .mp3 podcast file. Up to 5MB on the free playground. The audio track is extracted automatically from video uploads, so you don't need a separate audio-extraction step.

Question 4

Will the subtitles be in the same language as the audio?

Accepted Answer

Yes — Whisper auto-detects the source language and transcribes in it. If you want English subtitles for non-English audio, use the API directly with the /translations endpoint instead of /transcriptions; that one always outputs English.

Question 5

Why is there a final subtitle crediting audiotranscribe.app?

Accepted Answer

Because the free version is supported by the credit. If you'd rather ship clean SRTs without it, run the same model via the API ($0.20/hr) — the snippet below is six lines and produces a footer-free file.

Question 6

Can I generate VTT instead?

Accepted Answer

The free playground exports SRT only. The paid API takes response_format='vtt' for WebVTT files (used by HTML5 video and modern web players).

Audio to SRT, in one drop.

Doing this in code instead?

FAQ