PST

Audio to subtitle

Upload an audio file, generate subtitles with Whisper, then review and export as SRT, VTT, or ASS. For video, use the Video to Subtitle tool.

Upload audio
Please upload audio only: MP3, WAV, M4A, AAC, FLAC, OGG, Opus.
No audio file selected.
Upload an audio file and generate subtitles to preview results here.

Convert Audio to Subtitles with AI

Powered by OpenAI Whisper, this free online tool transcribes your audio files into accurate, timestamped subtitles — then edit and download as SRT, VTT, or ASS in seconds.

Powered by OpenAI Whisper

Uses OpenAI's industry-leading Whisper model to deliver highly accurate speech-to-text transcription across dozens of languages and accents.

Common audio formats

Upload MP3, WAV, M4A, AAC, FLAC, OGG, Opus, and more up to 100 MB. This tool accepts audio files only—use Video to Subtitle for video uploads.

Automatic language detection

Let Whisper automatically detect the spoken language, or manually select from 20+ supported languages for more accurate results.

Export SRT, VTT, or ASS

Edit cues in the preview, pick SubRip, WebVTT, or Advanced SubStation Alpha, then download—ready for players, browsers, or stylized playback.

How to Generate Subtitles from Audio

  1. 1

    Upload your audio file

    Drag and drop or click to select an audio file from your device. Supports MP3, WAV, M4A, AAC, FLAC, OGG, and more up to 100 MB.

  2. 2

    Select language and transcribe

    Choose the spoken language or use auto-detect, then click Generate Subtitles. Whisper AI processes the audio and produces timestamped subtitles.

  3. 3

    Review, edit, and download

    Check the subtitle preview, double-click any cue to correct errors, choose SRT, VTT, or ASS, then download for your player or editor.

Frequently Asked Questions

What audio formats are supported?
The tool accepts common audio formats such as MP3, WAV, M4A, AAC, FLAC, OGG, and Opus. The maximum file size is 100 MB. Video files (e.g. MP4) are not supported on this page—use the Video to Subtitle tool instead.
How accurate is the automatic transcription?
Transcription accuracy depends on audio quality, background noise, and the clarity of speech. Whisper performs best with clear, single-speaker audio. You can always edit the generated subtitles before downloading.
Does it support multiple languages?
Yes. Whisper supports transcription in over 20 languages including English, Chinese, Japanese, Korean, Spanish, French, German, Arabic, Hindi, and more. You can select the language manually or let the tool detect it automatically.
Is there a file length limit?
Audio under 1 minute can be transcribed for free without signing in. For longer audio files, sign in with a free Google account. The maximum file size is 100 MB.
What subtitle formats can I download?
You can export SubRip (.srt), WebVTT (.vtt), or Advanced SubStation Alpha (.ass). SRT and VTT work almost everywhere; ASS is common when you need richer styling in compatible players.
Can I use this to add subtitles to a video?
Generate subtitles from your audio (or extract audio from your project first), edit if needed, then download SRT, VTT, or ASS and load it in your editor or player. To transcribe directly from a video file, use our Video to Subtitle tool.