Documentation Index
Fetch the complete documentation index at: https://docs.wittify.ai/llms.txt
Use this file to discover all available pages before exploring further.
Text to Speech Studio is the main workspace for Faseeh. The page subtitle reads Turn text into lifelike speech in any voice, any language. On the left you write or paste text. On the right you pick a voice, a language, optionally a dialect, tune Generation Settings, and click Generate speech. The audio plays back in a card directly under the text editor, and the generation is automatically saved to your History in the sidebar.
Page anatomy
The page has three regions plus a History sidebar that the dashboard provides automatically.| Region | What you see |
|---|---|
| Topbar (provided by the dashboard) | A + Generate New TTS button that wipes the editor for a fresh project, and a + Create Voice button that opens the Create a new voice picker. |
| Sidebar (provided by the dashboard) | The sidebar shows History: a list of every speech project you have generated. Click any row to load it back into the editor. |
| Center column | Page title, the text editor, and the generated-audio card (only shows up after you generate something or load a project from History). |
| Right column | The voice card, the language picker, the dialect picker (only when the chosen language has dialects), the Generation Settings panel, and the Generate speech button. |
The text editor
The editor is one big textarea. It does not change size when the right panel expands or collapses.| Element | What it does |
|---|---|
| Placeholder | Start typing here or paste any text you want to turn into lifelike speech… |
| Character counter | A small footer at the bottom-end edge that reads {used} / {max} characters, where the cap is shared across the app. The counter uses Arabic-Indic digits in Arabic mode. |
| Length warning | When you reach the cap, the counter turns red. If you paste a longer string, the editor silently keeps only the first {max} characters. |
| Disabled state | The editor goes faded while a speech is being generated. You cannot type or paste during generation. |
The voice card
The voice card sits at the top of the right column. The label above it reads VOICE.| Element | What it does |
|---|---|
| Avatar circle | A round tile with the voice’s first initial and a deterministic color tint generated from the voice’s id. The color is the same every time the same voice appears. |
| Name | The voice name. In Arabic mode, the Arabic name is used when the voice ships one. |
| Subtitle | The voice’s gender (Male or Female) and, when present, the dialect code in uppercase, separated by a middle dot. |
| Chevron | A small chevron on the end edge. |
| Click | Opens the Voice Picker modal. The “change voice” hint reads Change voice. |
| Disabled state | The whole card is dimmed while a speech is being generated. |
The language picker
The language picker is the next field on the right column. The label above it reads LANGUAGE. The field shows the active language with a flag emoji on the start edge and a chevron on the end edge. The default value is Auto (detect from text), with a 🌐 globe glyph as the flag. Click the field to open the language popover.| Inside the popover | Behaviour |
|---|---|
| Search box | Type to filter the list by English name, Arabic name, or language code. Up to 200 rows are rendered at a time. The search input is capped at 80 characters. |
| Auto row | First in the list, with the 🌐 globe glyph. The default value. The model auto-detects the language from your text. |
| Per-language rows | Each row shows the country flag, the language name (English or Arabic depending on UI language), and a small + badge on the end edge when the language has a dialect list. |
| Empty state | When no row matches the search, the popover shows No voices match your filters. |
The dialect picker
The DIALECT field only shows up when the chosen language has a dialect list. When it does, the field works the same way as the language picker.| Inside the popover | Behaviour |
|---|---|
| Auto row | First in the list, using the parent language’s flag. Default. |
| Per-dialect rows | Each row shows the dialect’s regional flag and name. |
Generation Settings
This is the collapsible panel below the dialect picker. Its header reads Generation Settings (optional) with a chevron on the end edge. Click to expand or collapse. Default state is collapsed. Every control inside has its own small reset arrow that only resets that one control. There is also a Reset link at the bottom of the panel that resets all controls at once. The panel exposes six controls, no more.| Control | Default | What it does |
|---|---|---|
| Speed | Normal | Slider with three internal tiers, Slow, Normal, Fast. The track shows a smooth gradient and the thumb anchors to the active tier. End labels read Slow and Fast. Help text: Adjust the pace of speech. |
| Quality | Balanced | Slider with three tiers, Faster, Balanced, Higher quality. End labels read Speed and Quality. Help text: Move toward speed for faster generation, or toward quality for higher fidelity. |
| Similarity | Balanced | Slider with three tiers, Stable, Balanced, Similar. End labels read Stable and Similar. Help text: Lower for stable, consistent delivery. Higher for closer match to the reference voice. |
| Denoise | On | Checkbox. When on, the engine removes background hiss from cloned voices. Help text: Default: enabled. Uncheck to disable denoising. |
| Preprocess Prompt | On | Checkbox. When on, the engine trims silence and adds punctuation at the end of reference text if it is missing. Help text: Apply silence removal and trimming to the reference audio, add punctuation at the end of reference text (if not already). |
| Postprocess Output | On | Checkbox. When on, the engine removes long silences from the generated audio. Help text: Remove long silences from generated audio. |
Generate speech
The Generate speech button sits at the bottom of the right column.| Button state | Label | Triggered by |
|---|---|---|
| Idle | Generate speech | Default, when the editor has text and a voice is selected. |
| Disabled | Generate speech (faded) | The editor is empty, or no voice is selected, or a generation is already running. |
| Generating | Generating… with a spinner | A request is in flight. The whole right column dims. |
| Condition | Toast |
|---|---|
| Editor is empty | Add some text before generating. |
| Voice is missing | Pick a voice before generating. |
| Generation succeeded | Speech ready to play. |
| Generation failed | Generation failed. Please try again. |
The generated-audio card
A new card appears in the center column (under the text editor) the moment you start generating, or when you click a project in History. The card is titled Generated Speech. The card has three states.Generating
A row with a spinner, the label Generating…, and a one-line preview of the text being rendered. No player and no actions yet.Failed
A red-tinted strip with the warning text Generation failed. Please try again. and an alert icon. Use the Generate speech button again to retry.Ready
The full player. From start-edge to end-edge:| Element | What it does |
|---|---|
| Play / Pause button | A round brand-gradient button with Play or Pause icon. The aria label flips between Play preview and Pause preview. |
| Progress bar | Brand-gradient fill on a muted track. Click anywhere on the bar to scrub to that position. The track flips for Arabic so playback always reads start-to-end visually. |
| Position counter | Mono digits, : format, on the start edge of the time row. |
| Duration counter | Mono digits, : format, on the end edge of the time row. |
| Download | Small icon button on the card header. Downloads the audio with a sanitized file name based on the project’s text. The file extension is .wav or .mp3 depending on what the engine returned. A toast confirms Download started. |
| Delete | Small icon button on the card header. Removes the project from your History. A toast confirms Removed from history. There is no confirmation dialog on Studio, deletion is one click. |
Loading a project from History
Click any row in the History sidebar (or follow a?project=<id> link) and the studio rehydrates the editor. The text comes back, the voice flips back to the saved one, the language and dialect are split back into the two-step picker, and the Generation Settings card is restored.
The voice card uses the saved voice even if you have since deleted it from your Voice Library, the row simply shows the saved name with a fallback color tint.
Account isolation
Cloned voices, designed voices, TTS projects, and your Favorites are all private to your account. Other users signed in to the same workspace cannot see them. The History sidebar, the Voice Library, and the Favorites tab in the picker all filter strictly by your account.Common questions
Why do I see a different default voice the first time I open Studio?
Why do I see a different default voice the first time I open Studio?
A built-in Sarah voice is preselected for new accounts. Once you pick a different voice, your last selection is what comes back next time.
The Generate speech button is greyed out.
The Generate speech button is greyed out.
My generation finished, where did the audio go?
My generation finished, where did the audio go?
Look at the Generated Speech card directly under the text editor. The audio is also a row at the top of the History sidebar. Click the row to come back to it later.
The text I pasted got cut off.
The text I pasted got cut off.
The editor caps the text at the shared length limit. The counter shows you how close you are. Long inputs get silently trimmed to fit.
Can I generate the same text with a different voice without retyping it?
Can I generate the same text with a different voice without retyping it?
Yes. Change the voice in the voice card, click Generate speech again. A second project lands in History. You can also click the row of an old project, switch the voice, then Generate speech to make a sibling.
Where is the Stability or Style Exaggeration slider that I see in older screenshots?
Where is the Stability or Style Exaggeration slider that I see in older screenshots?
Removed. The current Generation Settings panel only exposes Speed, Quality, Similarity, Denoise, Preprocess Prompt, and Postprocess Output. Older docs and tutorial videos referencing those sliders are stale.
How do I create a new voice?
How do I create a new voice?
Click the topbar’s + Create Voice button. The Create a new voice picker opens. Pick Clone a voice to upload or record a sample, or Design a voice to describe one from attributes. Both flows are documented in the Clone Wizard and Design Wizard pages.
My generation failed. What now?
My generation failed. What now?
A red Generation failed. Please try again. strip replaces the player. The most common causes are a network hiccup, a voice that was momentarily unavailable, or text that was too long for the chosen voice to handle in one pass. Click Generate speech again, the original project is replaced.
Does my speech ever get used to train models?
Does my speech ever get used to train models?
No. Cloned voices, designed voices, and generated speech are tied to your account and never used outside it. The History sidebar, Voice Library, and Favorites are all scoped to you.
Where to go next
Voice Picker
Browse the full voice catalog: Explore, My Voices, Favorites, All Voices.
Voice Library
See and manage your cloned and designed voices.
Clone Wizard
Clone a voice from a 3-to-10-second sample.
Design Wizard
Design a voice from gender, age, pitch.
History
Rehydrate any past project.

