Documentation Index
Fetch the complete documentation index at: https://docs.wittify.ai/llms.txt
Use this file to discover all available pages before exploring further.
Page layout
The page is split into two panels. On screens narrower thanmd they stack with the transcript on top and the settings panel underneath.
| Panel | What it holds |
|---|---|
| Header bar (top) | File name + filename, then a Clock chip showing the live audio length, a Globe chip showing the language, Speakers + Subtitles chips when enabled, the file Status pill, and on the end edge a Transcript download dropdown (TXT / Word / PDF) and an Audio download button. |
| Editor toolbar (between header and transcript) | Merge, Split, Reassign (dropdown of speakers), Undo, Save changes. The buttons enable based on what you have selected. |
| Transcript (left, scrollable) | The list of speaker turn rows. Click any row to select it. Click any word to edit it. The audio timeline sticks to the bottom of this panel. The Save bar slides in below the timeline whenever you have unsaved edits. |
| Settings (right) | Transcribe Settings at the top, then a tab switcher between Summary and Intent, then Cue Properties at the bottom. |
Default speakers
Two speakers ship by default: Speaker 1 and Speaker 2. Each name is editable in place via the start-edge input on its Timeline Row. The name you type is what appears everywhere: the transcript bubble label, the side-panel Summary rows, the Reassign dropdown in the toolbar. Names render correctly in mixed Arabic and Latin scripts. The transcript text itself can be in any language. The dashboard chrome (button labels, dialog copy, toast text) translates between English and Arabic, but the transcript content stays exactly as the audio came in — Arabic, English, a third language, or a mix.Header — Downloads
Two downloads in the file header:- Transcript
- Audio
- Plain text (.txt) — UTF-8 with BOM so Excel can open Arabic correctly.
- Word (.doc) — opens directly in Microsoft Word.
- PDF — opens the formatted transcript in a new browser tab. Use File → Print → Save as PDF in your browser to save it.
Transcript pane
Speaker turn rows
Each row is one speaker turn. The row carries:- A selection checkbox at the start edge. Click anywhere on the row (except a word) to toggle selection.
- A vertical accent line: blue for Speaker 1, green for Speaker 2, red when the turn has an error.
- A speaker label showing the current speaker name (whatever you typed in the timeline), with the start time of that turn formatted
MM:SS Sec. - The transcript text as a list of clickable words separated by spaces.
Editable words
Every word is independently clickable. Click any word to flip it into a small inline input:- The input grows or shrinks to fit the text you type.
- The background tints to a light brand color and the input has a thin brand border so you know it’s active.
- Press Enter or click anywhere outside the input to commit the change. Press Escape to cancel the edit.
Karaoke highlight
While the audio plays, the word currently being spoken is highlighted in amber. The highlight follows along automatically — no setup needed. The estimate is based on the message’s start time and the length of the current message; over time, as our transcription engine gets more confident, the highlight will land on the exact spoken word.Top toolbar
A row of actions sits between the file header and the transcript:| Action | Enables when | What it does |
|---|---|---|
| Merge | 2 or more consecutive lines are selected | Combines the selected lines into a single transcript bubble. Uses the first line’s speaker and start time. The waveforms on the timeline re-bind to the merged bubble automatically. |
| Split | exactly 1 line is selected and it has at least 2 words | Splits the selected bubble in half (at the midpoint by word count). The left half keeps any waveforms attached; the right half starts unattached — drag waves onto it from the timeline to assign them. |
| Reassign | at least 1 line is selected | Opens a dropdown of speakers; pick a speaker to reassign every selected line to. The waveforms on the timeline change color in lockstep. |
| Undo | at least one change has been made | Reverts the last change. History is bounded at the last 50 changes. Cmd + Z (macOS) / Ctrl + Z (Windows / Linux) does the same thing — except when you’re editing a word, where it falls through to the browser’s native text-input undo. |
| Save changes | the editor is dirty | Persists your edits and clears the dirty flag. |
Audio Timeline
A sticky strip at the bottom of the transcript pane. The timeline always renders left-to-right, even on Arabic pages, because timestamps and waveforms only make sense in that direction.| Sub-component | What it does |
|---|---|
| Timeline Header | A ruler with ticks. The interval auto-scales: a tick every 5 s for files under 1 minute, every 15 s for files under 2 minutes, every 30 s for longer files. |
| Timeline Row (one per speaker) | An editable speaker name input on the start edge plus the speaker’s waveform segments laid out by time. Each segment’s width is proportional to its duration. |
| Waveform | Animated bars drawn for each segment. Speaker 1 is blue. Speaker 2 is green. Any segment that the engine flagged as “negative” (frustrated / problematic) renders red, regardless of which speaker it belongs to. The colors swap automatically when you toggle dark mode. |
| Timeline Controls | A primary Play / Pause circular button (40 px) and a small speed toggle that cycles 1.0x → 1.5x → 2.0x → 0.5x → 1.0x. |
| Timeline Scrubber | Click anywhere on the timeline to jump the audio to that moment. The position indicator slides to where you clicked. |
Drag a wave to fix a wrong speaker or wrong time
A single drag does both:- Drag down or up to move the wave to a different speaker.
- Drag left or right to move the wave to a different time.
- Both at once is also supported — diagonal drags work.
Save bar
Whenever the editor has unsaved edits, a sliding bar appears below the audio timeline:- A pulsing amber dot + Unsaved changes label so you can’t miss it.
- Undo — reverts the last change.
- Discard — opens a confirmation dialog (“All edits since the last save will be lost. This can’t be undone.”). Confirming reverts everything since the last save.
- Save changes — persists everything and the bar disappears.
Unsaved-changes guard
If you try to navigate away (click another sidebar item, the dashboard logo, your profile menu, etc.) while you have unsaved edits, a dialog appears with three options:- Save and continue — saves your edits, then takes you to where you wanted to go.
- Discard (red) — drops your edits, then takes you to where you wanted to go.
- Cancel — keeps you on the page; nothing happens.
Right panel
Transcribe Settings (top of the panel)
| Field | What it does |
|---|---|
| Language Detection | An outline button showing the language detection setting you chose pre-processing (defaults to Auto-Detect). The Globe chip in the file header shows the actual detected language separately. |
| Include subtitles switch | Turn on to generate subtitles for export to SRT or VTT. The AI labels each speaker so you can rename them on the fly. |
| Detect words button | An outline button that runs word-level detection over the audio. |
| Run spell check button | A brand-gradient primary button that runs spell check across the transcript. |
Tabs section (middle of the panel)
Two tabs: Summary and Intent.- Summary
- Intent
- A row per speaker showing Speaker 1 / Speaker 2 label and the editable name.
- Executive Overview heading with a thin bordered underline, followed by the conversation’s narrative paragraph.
Cue Properties (bottom of the panel)
A static card showing the time bounds of the current audio:| Field | Format | Example |
|---|---|---|
| Start time | HH:MM:SS.cc | 00:00:00.00 |
| End time | HH:MM:SS.cc | matches the live audio duration |
| Delete button | An outline button styled red (red text, red border). Removes the active cue. |
Common questions
Drag-and-drop isn't working — clicking a wave just seeks the audio.
Drag-and-drop isn't working — clicking a wave just seeks the audio.
I dragged a wave to another speaker but the transcript bubble didn't change color.
I dragged a wave to another speaker but the transcript bubble didn't change color.
My speaker name change didn't stick.
My speaker name change didn't stick.
Cmd + Z / Ctrl + Z doesn't undo when I'm editing a word.
Cmd + Z / Ctrl + Z doesn't undo when I'm editing a word.
I lost my edits when I navigated away.
I lost my edits when I navigated away.
The Audio download fails with a generic error.
The Audio download fails with a generic error.
My PDF download just opened a new tab — where's the file?
My PDF download just opened a new tab — where's the file?
.pdf download once the server-side export ships.The Word file Microsoft Word opens looks like an HTML preview.
The Word file Microsoft Word opens looks like an HTML preview.
.doc is a Word-readable HTML document. Microsoft Word opens it natively. We’ll switch to a real .docx once the server-side export ships.I exported a very long transcript and it's missing the end.
I exported a very long transcript and it's missing the end.
The waveform is showing the same color for both speakers.
The waveform is showing the same color for both speakers.
Play / Pause doesn't work.
Play / Pause doesn't work.
My language detection is wrong.
My language detection is wrong.

