Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.wittify.ai/llms.txt

Use this file to discover all available pages before exploring further.

The Upload Audio modal is reached from the dashboard’s Upload File card and from the topbar’s + Transcribe Audio dropdown. It accepts a single audio file, validates the size and type, and queues it for transcription.

What this modal looks like

A 600 px popup over the dashboard with a header, a dropzone, an uploaded-file card, a language selector, two toggles, and a submit button.
RegionWhat you see
HeaderTitle Upload Audio with a short subtitle below. A small X in the upper end-edge corner closes the modal.
DropzoneA 166 px tall area with a dashed border. Click anywhere inside it or drag a file onto it to add the audio. While you’re dragging a file over it, the dropzone tints to the brand color and the label changes to Drop the file here…
Uploaded-file cardOnce you’ve added a file, the dropzone is replaced by a green card showing the file name, the size in MB, and the file type. A trash icon on the end edge removes the file so you can try again.
Language selectorA dropdown with 16 languages plus Auto Detect. Hovering it shows the tooltip Selecting a language increases the accuracy of the transcription.
Detect Speakers Automatically switchWhen on, the AI labels each speaker so you can rename them in the editor. Default: on.
Include subtitles switchWhen on, the engine produces a subtitle file alongside the transcript. Default: off.
Start transcribing buttonThe brand-gradient submit button. Disabled until a valid file is selected.

Using the modal

1

Open the modal

Click Upload File on the dashboard’s Start Transcribing grid, or + Transcribe Audio in the topbar then Upload File.
2

Add a file

Drag the file onto the dropzone, or click anywhere in the dropzone to browse. While you’re dragging over it the dropzone tints to the brand color and the label changes to Drop the file here… until you let go.
3

Confirm the file

The dropzone is replaced by a green confirmation card showing the file name, size in MB, and type. Click the trash icon to clear it and try again.
4

Pick a language

Choose a language from the dropdown or leave it on Auto Detect. Hover the tooltip for the accuracy hint.
5

Tune the toggles

Turn Detect Speakers Automatically on (default) or off. Turn Include subtitles on if you want subtitles produced.
6

Submit

Click Start transcribing. A toast confirms File uploaded successfully with the file name. You go straight to the Transcript Editor while the file processes in the background.

Validation

The modal checks the file before accepting it. If a check fails, a toast appears and the dropzone stays empty.
RuleLimitToast on failure
File size500 MB maximumFile too large. Maximum size is 500 MB.
File typeMust be audio (.mp3, .wav, .m4a, .aac, .ogg, .flac, .mp4, .webm)Please upload a valid audio file.

Settings

Detect Speakers Automatically

The AI separates voices by speaker. The transcript carries a label per speaker that you can rename later in the Transcript Editor. Default: on.

Include subtitles

The engine produces a subtitle file alongside the transcript. You can export the subtitles to SRT or VTT later. Default: off.

Supported languages

16 languages plus Auto Detect. The same list shows up in the Live Transcribe modal.
LanguageDefault flag
Auto Detect,
EnglishGB
ArabicSA
FrenchFR
SpanishES
GermanDE
TurkishTR
UrduPK
HindiIN
ChineseCN
JapaneseJP
KoreanKR
PortugueseBR
RussianRU
DutchNL
ItalianIT
Verify the detected language before submitting. Picking the right language is the single biggest accuracy improvement you can make.

What happens after submit

  1. A new file row appears under Your Files in the sidebar with the status Processing… (a pulsing amber chip).
  2. You’re pushed to the Transcript Editor, which starts loading the transcript as soon as it’s ready.
  3. When transcription finishes, the row flips to Completed and the audio duration replaces the Processing… chip.

Limits

LimitValue
Maximum file size500 MB
Maximum length5 hours
Maximum files in your library200 (oldest are removed when you go past this)

Common questions

The limit is 500 MB. If your file is over that, compress it (lower bit rate or convert to a more compact format like AAC) and try again. The 5-hour duration cap also applies regardless of file size.
Only .mp3, .wav, .m4a, .aac, .ogg, .flac, .mp4, and .webm are accepted. If you’re trying to upload a video, the .mp4 container works, the engine extracts the audio. Other formats (.mov, .avi, .wma) are not supported.
If your audio is mostly one language, picking it explicitly gives you the best accuracy. Use Auto Detect when your audio mixes languages or when you genuinely don’t know what’s on the recording.
Most audio with more than one speaker (interviews, calls, meetings) reads better when each speaker has their own line. You can always turn it off if your audio is a single narrator.
A rule of thumb is about 30 seconds of processing per minute of audio. Files near the 5-hour cap can take a while. The row in Your Files stays in Processing… until done. You can leave the page and come back, the file will be there with Completed.
Yes. Go to the File Library and use the 3-dot menu’s Delete action on the row, that stops the job and removes the row.
Files are tied to your Wittify account. Other users on your team can’t see them. If you need to share a transcript, export it to TXT, DOCX, SRT, or VTT from the Transcript Editor and send the file directly.
Your oldest unpinned file is removed automatically to make room for the new one. Pinned files (pinned via the 3-dot menu in File Library) are never auto-removed.
Don’t upload sensitive content. Audio is stored against your account, but treat any recording as you would any document, redact personal data before transcription if your policy requires it.