Looking for a quick, high-quality way to convert text to speech online? This tool turns any text into clear, realistic speech in 54 languages with no extra steps. Enter your text, choose a voice, adjust the tone or speed, and get the result in seconds. It works well for ad creatives.
Speech synthesis runs directly in your browser: no software to install, no sign-up, and completely free. Choose from male and female voices, and control emotions, pauses, and pitch accents, with support for SSML markup.
Instruction: How To Use SSML For Text-To-Speech
SSML (Speech Synthesis Markup Language) lets you control pauses, emphasis, intonation, speed, and how numbers/dates are read.
Most-Used Tags
Review the essentials below. You can paste the examples into the field and listen to the result.
Pause — <break>
- Exact pause: time="400ms" (typical 100–2000 ms)
- Preset strength: strength="weak|medium|strong|x-strong"
Example:
<speak>Hi there! <break time="400ms"/> Let’s begin.</speak>
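If you generate SSML from a script rather than typing it by hand, a small helper can insert the pauses for you. The following is a minimal Python sketch, not part of this tool: the function names ssml_break and join_with_pauses are hypothetical, and the clamp simply reuses the typical 100–2000 ms range quoted above.

```python
from xml.sax.saxutils import escape


def ssml_break(ms: int) -> str:
    """Return a <break> tag, clamped to the typical 100-2000 ms range."""
    clamped = max(100, min(2000, ms))
    return f'<break time="{clamped}ms"/>'


def join_with_pauses(parts, ms=400):
    """Join text fragments with a fixed pause and wrap them in <speak>."""
    # escape() protects &, < and > so plain text cannot break the markup
    body = f" {ssml_break(ms)} ".join(escape(p) for p in parts)
    return f"<speak>{body}</speak>"


print(join_with_pauses(["Hi there!", "Let's begin."]))
```

The printed string can be pasted into the field just like the hand-written example.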
Emphasis — <emphasis>
- level="moderate|strong|reduced"
Example:
<speak>This is <emphasis level="strong">very important</emphasis> to hear.</speak>
Intonation / Speed / Volume — <prosody>
- Speed: rate="1.1" (range ~0.25…4.0)
- Pitch: pitch="+2st" (−20…+20 semitones; not all voices support it)
- Volume: volume="+2dB" (−96…+16 dB)
Example:
<speak><prosody rate="1.1" pitch="+2st" volume="+2dB">Listen carefully.</prosody></speak>
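When prosody values come from sliders or user input, it can help to clamp them to the ranges listed above before building the tag. This is a rough Python sketch under that assumption; the prosody and clamp helpers are illustrative names, not an API of this tool or of any TTS service.

```python
from xml.sax.saxutils import escape


def clamp(value, low, high):
    """Keep value inside the closed interval [low, high]."""
    return max(low, min(high, value))


def prosody(text, rate=1.0, pitch_st=0.0, volume_db=0.0):
    """Build a <prosody> element using the ranges quoted in this guide."""
    rate = clamp(rate, 0.25, 4.0)          # speed multiplier
    pitch_st = clamp(pitch_st, -20, 20)    # semitones
    volume_db = clamp(volume_db, -96, 16)  # decibels
    return (
        f'<prosody rate="{rate:g}" pitch="{pitch_st:+g}st" '
        f'volume="{volume_db:+g}dB">{escape(text)}</prosody>'
    )


print("<speak>" + prosody("Listen carefully.", 1.1, 2, 2) + "</speak>")
```

Out-of-range inputs such as rate=10 are pulled back to the nearest limit instead of being sent as they are.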
How To Read — <say-as>
interpret-as="characters|cardinal|ordinal|date|time|telephone|currency"
Example:
<speak>
Code: <say-as interpret-as="characters">TTS</say-as>.
Number: <say-as interpret-as="cardinal">1234</say-as>.
Ordinal: <say-as interpret-as="ordinal">21</say-as>.
Phone: <say-as interpret-as="telephone">+1 800 555 1212</say-as>.
</speak>
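A misspelled interpret-as value is easy to miss by eye. The sketch below, in Python, rejects anything outside the seven values listed above before the tag is built. The say_as function and the ALLOWED set are hypothetical helpers for illustration only.

```python
from xml.sax.saxutils import escape

# The interpret-as values listed in this guide
ALLOWED = {
    "characters", "cardinal", "ordinal",
    "date", "time", "telephone", "currency",
}


def say_as(text, interpret_as):
    """Wrap text in <say-as>, refusing values this guide does not list."""
    if interpret_as not in ALLOWED:
        raise ValueError(f"unsupported interpret-as value: {interpret_as!r}")
    return f'<say-as interpret-as="{interpret_as}">{escape(text)}</say-as>'


print("<speak>Code: " + say_as("TTS", "characters") + ".</speak>")
```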
Pronunciation Alias — <sub>
Reads the alias instead of the original word.
Example:
<speak>Company <sub alias="Google">G00gle</sub> announced a new feature.</speak>
Structure — <p> and <s>
Paragraphs and sentences add natural pauses.
Example:
<speak><p>First idea. <s>Short pause.</s></p><p>Second idea.</p></speak>
Phonemes — <phoneme>
IPA/X-SAMPA phonemes; supported for some languages only.
Example:
<speak>This word: <phoneme alphabet="ipa" ph="ˈɡuːɡl">Google</phoneme>.</speak>
Timing Markers — <mark>
Adds anchors for synchronization (not spoken).
Example:
<speak>Start <mark name="m1"/> and continue.</speak>
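To synchronize captions or animations, you usually need the list of marker names in document order. As a small Python sketch using only the standard library (the mark_names function is an illustrative name, not part of this tool), the names can be read straight from the SSML:

```python
import xml.etree.ElementTree as ET


def mark_names(ssml: str):
    """Return the name attribute of every <mark>, in document order."""
    root = ET.fromstring(ssml)
    return [el.get("name") for el in root.iter("mark")]


print(mark_names('<speak>Start <mark name="m1"/> and continue.</speak>'))
```

How the matching timestamps are returned depends on the TTS service, so that part is left out here.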
Note: the <audio> tag is not supported by Google Cloud TTS.
Ready-To-Use Snippets
Pauses & Accents:
<speak>
Welcome! <break time="350ms"/>
<emphasis level="moderate">Today</emphasis> we’ll cover the basics.
<prosody rate="1.05">Let’s go.</prosody>
</speak>
IVR / Auto-Attendant:
<speak>
You reached <sub alias="C-P-A Live">CPA.LIVE</sub>.
Support: <say-as interpret-as="telephone">+1 800 555 3535</say-as>.
Press <say-as interpret-as="characters">0</say-as> to talk to an agent.
</speak>
Date, Time & Amount:
<speak>
Meeting on <say-as interpret-as="date">2025-09-10</say-as> at <say-as interpret-as="time">14:30</say-as>.
Total: <say-as interpret-as="currency">1500 USD</say-as>.
</speak>
Step-By-Step Instructions:
<speak>
Step one: open settings. <break time="500ms"/>
Step two: choose “Audio”. <break time="600ms"/>
Step three: click “Save”.
</speak>
Brand Pronunciation + Tempo:
<speak>
<sub alias="Google">Google</sub> released updates.
<prosody rate="0.95">Stay tuned.</prosody>
</speak>
Tips & Limits
- Always wrap content in <speak>…</speak>, close all tags, and use straight quotes (") around attribute values, not curly ones.
- Keep values reasonable: rate 0.25–4.0, pitch −20…+20 st, volume −96…+16 dB.
- Some voices don’t support pitch and/or SSML; if that happens, synthesize without those parameters or pick another voice (Standard / Wavenet / Neural2 are usually safest).
- Natural pauses are typically 100–700 ms; keep very long pauses ≤ 2–3 seconds.
- Character counting: SSML tags count toward usage; <mark> adds anchors but produces no audio.
- Avoid HTML tags like <b>, <i>, <div> — they’re not SSML.
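Several of these tips can be checked automatically before you send the text for synthesis. The Python sketch below is one possible pre-flight check, assuming only the tags covered in this guide are in use; check_ssml and SSML_TAGS are hypothetical names, and a real service may accept more tags than this set.

```python
import re
import xml.etree.ElementTree as ET

# Tags covered in this guide; an assumption, not a full SSML tag list
SSML_TAGS = {"speak", "break", "emphasis", "prosody", "say-as",
             "sub", "p", "s", "phoneme", "mark"}


def check_ssml(ssml: str):
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    # Curly quotes only matter inside tags, not in the spoken text
    if any(ch in tag for tag in re.findall(r"<[^>]*>", ssml) for ch in "“”″"):
        problems.append("curly quotes inside a tag; use straight quotes")
    try:
        root = ET.fromstring(ssml)
    except ET.ParseError as exc:
        problems.append(f"not well-formed: {exc}")
        return problems
    if root.tag != "speak":
        problems.append("root element must be <speak>")
    for el in root.iter():
        if el.tag not in SSML_TAGS:
            problems.append(f"non-SSML tag <{el.tag}>")
    return problems


print(check_ssml("<speak><b>Hi</b> there.</speak>"))
```

An empty list means the snippet passed these basic checks; it does not guarantee that a particular voice supports every attribute.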
Mini-Cheatsheet
- Pause: <break time="400ms"/>
- Emphasis: <emphasis level="strong">text</emphasis>
- Prosody: <prosody rate="1.1" pitch="+2st" volume="+2dB">text</prosody>
- Telephone: <say-as interpret-as="telephone">+1 800 555 1212</say-as>
- Alias: <sub alias="correct pronunciation">word</sub>
- Paragraph/Sentence: <p>…</p>, <s>…</s>
- Marker: <mark name="m1"/>
Common Errors (And How To Fix Them)
- SSML / INVALID_ARGUMENT — check closing tags, attribute names, and quotes; remove non-SSML HTML.
- “Pitch not supported” — remove pitch or choose another voice (Standard/Wavenet/Neural2).
- Too loud/quiet — keep volume near −6…+6 dB for natural sound.
- Too fast/slow — start with a rate between 0.9 and 1.1 (for example rate="1.05") and adjust.
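Two of these fixes are mechanical enough to script: replacing curly quotes that a word processor put inside tags, and removing the pitch attribute when a voice rejects it. The following Python sketch shows one way to do both. The function names are made up for this example, and the regular expressions assume simple, hand-written SSML like the snippets in this guide.

```python
import re

# Map typographic quotes and the double prime to a straight quote
_QUOTES = str.maketrans({"“": '"', "”": '"', "″": '"'})


def straighten_quotes_in_tags(ssml: str) -> str:
    """Replace curly quotes inside tags only; spoken text is left alone."""
    return re.sub(r"<[^>]*>", lambda m: m.group(0).translate(_QUOTES), ssml)


def drop_pitch(ssml: str) -> str:
    """Remove pitch="..." attributes for voices that do not support pitch."""
    return re.sub(r'\s+pitch="[^"]*"', "", ssml)


fixed = straighten_quotes_in_tags("<speak>Wait <break time=”400ms″/></speak>")
print(fixed)
print(drop_pitch('<prosody rate="1.1" pitch="+2st">Listen.</prosody>'))
```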