Documentation Index
Fetch the complete documentation index at: https://docs.wittify.ai/llms.txt
Use this file to discover all available pages before exploring further.
This step is for Voice and Hybrid agents. Text-only agents skip it entirely. Voice-only agents start here, so they get the Agent Name field on this page (Hybrid agents already named the agent in Step 1).
What this page looks like
Five cards stacked in the main column, in order:- Voice Setting , the voice picker, language detection toggle, and interruption toggle
- Core Instructions , the voice version of the system prompt
- Creativity , temperature slider for voice answers
- Advanced Voice Settings , collapsed by default, 7 fine-tuning fields
- Background Sounds , 5 ambient tracks plus a volume slider
Agent name (Voice-only variant only)
Voice-only agents add the Agent name field at the top of this page, since they skipped Text Agent Settings. Same rules as the text variant: internal use only, max 100 characters, error Agent name is required appears if you blur it empty. Hybrid agents don’t see this field here, they named the agent in Step 1.Voice Setting card
Selected voice
A panel shows the currently selected voice with:- A Play button on the start edge (turns into a Stop square while audio is playing)
- The voice’s display name (Arabic or English depending on the wizard locale)
- A Change Voice button on the end edge (brand-gradient styling)
Voice Picker modal
Click Change Voice to open the Voice Picker modal. It’s 900px wide and works as a flat paginated list, 10 voices per page.| Control | Behavior |
|---|---|
| Search box | Searches by voice name, voice ID, or Arabic name |
| Gender dropdown | Filters Male / Female / All |
| Preview text area | Up to 140 characters. Type your own copy and click any voice’s play button to hear it speak that text. |
| Pagination | Brand-gradient button on the active page, prev / next arrows. The currently-selected voice always pins to the top. |
| Use action per row | Picks that voice and closes the modal. |
Advanced voice tuning (where available)
Some voices expose extra tuning knobs. When such a voice is selected, a chevron expands the row to show:| Setting | Range | What it does |
|---|---|---|
| Model | dropdown | Picks the model variant that powers the voice (each variant trades quality for speed differently). |
| Stability | 0 to 1 | Lower = more emotion and variation, higher = more consistent reads. |
| Similarity | 0 to 1 | How closely the output matches the original voice. |
| Style | 0 to 1 | Style exaggeration. Higher pushes the voice’s personality further. |
| Speaker Boost | toggle | Enhances the speaker’s distinctive characteristics. |
Auto-detect language
A toggle below the voice picker. When on, the agent listens for the language the caller speaks and replies in the same language. Helpful for multilingual customer bases.Allow interruption
A toggle. When on, the caller can talk over the agent and the agent will stop speaking and listen. When off, the agent always finishes its sentence before listening. Most natural-feeling calls have this on.Core Instructions card (Required)
The voice version of the system prompt. This is the same idea as in Text Settings, but the prompt is tuned for spoken output.Buttons in this card
- Edit Instructions , a pencil icon button that opens the Prompt Editor (the same editor used in Text Settings, with up to 2,500 characters, Enhance with AI, Keep / Revert, quick template chips, and the More templates dropdown).
- Copy from Text , for Hybrid agents, copies whatever you wrote in the Text Settings core instructions into the voice prompt. Disabled until the Text prompt has content. A toast confirms Copied from Text, and a tooltip explains the action: Copy your text agent instructions and tune them for voice.
Phrasing for voice
- Shorter sentences than text (a long sentence sounds robotic out loud).
- No markdown (asterisks and headings are read literally).
- Natural pacing , add commas where you want pauses.
- Spell out abbreviations the voice should pronounce (e.g. write VAT if you want it spelled out, write value-added tax if you want it spoken).
Creativity card
Same as the text creativity card, but it controls voice temperature specifically. The voice agent can be tuned to a different creativity level than the text agent (helpful when, for example, you want voice replies to be more formal than chat replies).| Slider value | Label | Use for |
|---|---|---|
| 0.0 to 0.3 | Precise (green) | Policy bots, IVR menus, formal customer service |
| 0.4 to 0.6 | (middle) | Balanced. Most general voice agents. |
| 0.7 to 1.0 | Creative (purple) | Storytelling, character voices, marketing |
Advanced Voice Settings card (collapsible)
Closed by default. Click to expand. Seven fields in a 2-column grid, each with a help icon and tooltip.| Setting | Range | Default | What it does |
|---|---|---|---|
| Speaking speed | 0.5 to 2.0 | 1.0 | How fast the agent speaks. Slower for narration, faster for ads. |
| Min silence duration | 0.5 to 5.0 seconds | 1.0 | How long the agent waits after the caller stops talking before treating it as the agent’s turn. Lower = more responsive (but may interrupt). |
| End call on silence | seconds | 60 | Auto-hangup if neither party has spoken for this long. Set to 0 to disable. |
| Max call duration | 1 to 180 minutes | (unset) | Hard ceiling on how long any call can go. The agent ends the call when the limit is hit. |
| Min speech duration | seconds | 0.35 | Filters out very short sounds (coughs, “um”, lip-smacks). Anything shorter than this isn’t treated as speech. |
| Widget popup delay | seconds | 0.1 | How long after page load the voice widget appears on your website. |
| Ask for feedback | toggle | on | Whether the agent asks for a 1-to-5 rating after the call ends. |
Background Sounds card
A grid of five ambient tracks. Pick one, or pick None to disable. A volume slider sets the level from 0 to 100.| Sound | Best for |
|---|---|
| None | Sterile, minimal, headset-only feel |
| Contact Center | Ambient call-center hum (other operators in the background, occasional ringing) |
| Keyboard | Light typing in the background , makes the agent feel like a real person taking notes |
| Hold Music | Soft instrumental music, useful for hold loops |
| Cafe | Subtle cafe noise, casual feel |
The bottom action bar
Same as every other wizard step:| Button | Behavior |
|---|---|
| Cancel / Exit | Cancel before first save, Exit (with unsaved-changes guard) after |
| Back | Returns to the previous step (Text Settings for Hybrid, dashboard for Voice-only) |
| Step dots | Jump to any unlocked step |
| Save draft | Saves what you have without moving forward |
| Next , Appearance | Saves this step and moves to Step 3 (or Step 2 for Voice-only) |
The wizard sidebar
The current step is highlighted in the sidebar’s step list. The agent’s name shows in the Agent name card at the top after your first save. For a full walk-through of the wizard sidebar, topbar, tutorial, and bottom nav, see Text Agent Settings.Variant differences
- Hybrid
- Voice only
- Text only
This is Step 2. The agent name was set in Step 1, so this card doesn’t show on Hybrid. The next step is Appearance. Hybrid agents have 7 steps total.
Common questions
How do I preview a voice without committing to it?
How do I preview a voice without committing to it?
Open the Voice Picker, type some sample text in the Preview text area (up to 140 characters), and click the play icon next to any voice. The audio plays in your browser. Click Use when you’ve found one you like.
Can I import my own voice?
Can I import my own voice?
Voice cloning is part of the Wittify Faseeh product. See Text to Speech , Clone Wizard. Once cloned, the voice appears in the Voice Picker.
The agent talks over the caller.
The agent talks over the caller.
Lower Min silence duration further. The agent treats this much consecutive silence as “your turn”.
The caller talks over the agent and is ignored.
The caller talks over the agent and is ignored.
Turn Allow interruption on. By default, voice agents finish their sentence before listening.
The agent ends calls too quickly when there's a pause.
The agent ends calls too quickly when there's a pause.
Raise End call on silence (default 60 seconds). For sales calls, 90 to 120 is common.
I want different creativity for voice vs chat (Hybrid).
I want different creativity for voice vs chat (Hybrid).
The Creativity slider on this page controls voice only. The chat creativity is on the Text Agent Settings page. They can be different.
Why does the chevron with extra settings not show on some voices?
Why does the chevron with extra settings not show on some voices?
The Stability / Similarity / Style / Speaker Boost knobs are only available on certain Wittify voices. Voices that don’t expose those knobs simply hide the chevron, the voice still works exactly as well, you just don’t tune the extra parameters.
Can I use voice cloning for telephony?
Can I use voice cloning for telephony?
Yes. Once a voice is cloned in Faseeh, it shows up in this picker for both web and telephony deployments. Telephony quality is also tuned by the Speaking speed and Stability values.
Next
Step 3: Appearance
Brand the chat widget, set per-language text, configure the voice call screen.

