Question 1

Is the audio uploaded to a server?

Accepted Answer

No. The transcription model runs entirely inside your browser using WebAssembly or WebGPU. Your audio file never leaves your device — there is no upload step, no API call, and no temporary server-side storage. You can verify this by disconnecting from the internet after the model is downloaded; transcription will continue to work.

Question 2

Which audio formats are supported?

Accepted Answer

We support every format your browser can decode natively: MP3, WAV, M4A, OGG, FLAC, and AAC. Video files (MP4, MKV, WebM) work if your browser can decode their audio track. If a file fails to decode, the tool will show a clear error with the supported format list.

Question 3

How accurate is the transcription?

Accepted Answer

We use OpenAI Whisper, the same speech recognition model that powers most professional transcription services. The Balanced level (default) is excellent for clear English speech and good for most other languages. The Best quality level noticeably improves accuracy for non-English audio, accents, and noisy recordings.

Question 4

How long can the audio file be?

Accepted Answer

There is no hard limit. For audio longer than 30 seconds, the tool automatically chunks the audio into 30-second windows with a 5-second overlap, then stitches the results together. We have tested files up to several hours long. Very large files may use a lot of memory, so 4 GB+ of free RAM is recommended for hour-long recordings.

Question 5

Why is the first transcription slow?

Accepted Answer

The first time you drop a file, the engine prepares itself in the background — your browser fetches the Whisper model (around 76 MB at Balanced quality) and warms up the runtime. This is a one-time cost; afterwards the model is cached by your browser, so subsequent visits load instantly. Warm-up adds another 1–7 seconds before the first transcription starts.

Question 6

What is the difference between Transcribe and Translate?

Accepted Answer

Transcribe keeps the source language: a Spanish recording becomes Spanish text. Translate converts any source language directly to English text in one pass. Translation works for the languages Whisper supports — Indo-European, East Asian, Arabic, Hindi, and more.

Question 7

Can I download the transcript?

Accepted Answer

Yes. The Copy button copies plain text to your clipboard. Download .txt saves a plain-text file. Download .srt produces a valid SubRip subtitle file with timestamps you can drop directly into video editors like Premiere, DaVinci Resolve, or YouTube Studio.

Question 8

Does it work on mobile?

Accepted Answer

Yes, though mobile devices generally run a slower fallback. We recommend the Fast quality level on mobile (about 4× smaller and quicker to prepare). Hour-long files may be too memory-intensive for older phones; for big jobs, use a desktop browser.

Transcribe Audio to Text

1. Quality

2. Language & task

3. Upload audio

How to Transcribe Audio to Text

Pick a quality tier

Drop your audio file

Copy or save the transcript

How It Works

Real Whisper, running locally

Why your audio stays on-device

Real auto-detect (not just defaulting to English)

Long-audio chunking

Who Uses Free Audio Transcription?

Journalists & researchers

Podcasters & content creators

Students

Legal & medical professionals

Frequently Asked Questions

Is the audio uploaded to a server?

Which audio formats are supported?

How accurate is the transcription?

How long can the audio file be?

Why is the first transcription slow?

What is the difference between Transcribe and Translate?

Can I download the transcript?

Does it work on mobile?

Privacy notice

ABN Validator

Age Calculator

AI Detector

Alcohol Calculator

Annual Leave Calculator UK

BAC Calculator

Kalcify

1. Quality

2. Language & task

3. Upload audio

How to Transcribe Audio to Text

Pick a quality tier

Drop your audio file

Copy or save the transcript

How It Works

Real Whisper, running locally

Why your audio stays on-device

Real auto-detect (not just defaulting to English)

Long-audio chunking

Who Uses Free Audio Transcription?

Journalists & researchers

Podcasters & content creators

Students

Legal & medical professionals

Frequently Asked Questions

Is the audio uploaded to a server?

Which audio formats are supported?

How accurate is the transcription?

How long can the audio file be?

Why is the first transcription slow?

What is the difference between Transcribe and Translate?

Can I download the transcript?

Does it work on mobile?

Privacy notice

Related Tools

ABN Validator

Age Calculator

AI Detector

Alcohol Calculator

Annual Leave Calculator UK

BAC Calculator