Free Text-to-Speech (54 Languages)

Looking for a quick and high-quality way to convert text to speech online? This tool lets you turn any text into realistic, clear speech in 54 languages — no extra steps required. Just enter your text, choose a voice, adjust tone or speed — and get the result in seconds. Perfect for ad creatives.

The speech synthesis runs directly in your browser — no software installation, no sign-up, completely free. Choose from male and female voices, emotions, pauses, pitch accents, and SSML format support.

Instruction: How To Use SSML For Text-To-Speech

SSML (Speech Synthesis Markup Language) lets you control pauses, emphasis, intonation, speed, and how numbers/dates are read.

Wrap your text in <speak>…</speak> to enable SSML. If you omit it, speech still works but without fine-grained controls.

Most-Used Tags

Review the essentials below. You can paste the examples into the field and listen to the result.

Pause — <break>

  • Exact pause: time=”400ms” (typical 100–2000 ms)
  • Preset strength: strength=”weak|medium|strong|x-strong”

Example:

<speak>Hi there! <break time="400ms"/> Let’s begin.</speak>

Emphasis — <emphasis>

  • level=”moderate|strong|reduced”

Example:

<speak>This is <emphasis level="strong">very important</emphasis> to hear.</speak>

Intonation / Speed / Volume — <prosody>

  • Speed: rate=”1.1″ (range ~0.25…4.0)
  • Pitch: pitch=”+2st” (−20…+20 semitones; not all voices support it)
  • Volume: volume=”+2dB” (−96…+16 dB)

Example:

<speak><prosody rate="1.1" pitch="+2st" volume="+2dB">Listen carefully.</prosody></speak>

How To Read — <say-as>

interpret-as=”characters|cardinal|ordinal|date|time|telephone|currency”

Example:

<speak>

Code: <say-as interpret-as="characters">TTS</say-as>. 

Number: <say-as interpret-as="cardinal">1234</say-as>. 

Ordinal: <say-as interpret-as="ordinal">21</say-as>. 

Phone: <say-as interpret-as="telephone">+1 800 555 1212</say-as>.

</speak>

Pronunciation Alias — <sub>

Reads the alias instead of the original word.

Example:

<speak>Company <sub alias="Google">G00gle</sub> announced a new feature.</speak>

Structure — <p> and <s>

Paragraphs and sentences add natural pauses.

Example:

<speak><p>First idea. <s>Short pause.</s></p><p>Second idea.</p></speak>

Phonemes — <phoneme>

IPA/X-SAMPA phonemes; supported for some languages only.

Example:

<speak>This word: <phoneme alphabet="ipa" ph="ˈɡuːɡl">Google</phoneme>.</speak>

Timing Markers — <mark>

Adds anchors for synchronization (not spoken).

Example:

<speak>Start <mark name="m1"/> and continue.</speak>

Note: the <audio> tag is not supported by Google Cloud TTS.

Ready-To-Use Snippets

Pauses & Accents:

<speak>

Welcome! <break time="350ms"/>

<emphasis level="moderate">Today</emphasis> we’ll cover the basics.

<prosody rate="1.05">Let’s go.</prosody>

</speak>

IVR / Auto-Attendant:

<speak>

You reached <sub alias="C-P-A Live">CPA.LIVE</sub>.

Support: <say-as interpret-as="telephone">+1 800 555 3535</say-as>.

Press <say-as interpret-as="characters">0</say-as> to talk to an agent.

</speak>

Date, Time & Amount:

<speak>

Meeting on <say-as interpret-as="date">2025-09-10</say-as> at <say-as interpretas="time">14:30</say-as>.

Total: <say-as interpret-as="currency">1500 USD</say-as>.

</speak>

Step-By-Step Instructions:

<speak>

Step one: open settings. <break time="500ms"/>

Step two: choose “Audio”. <break time="600ms"/>

Step three: click “Save”.

</speak>

Brand Pronunciation + Tempo:

<speak>

<sub alias="Google">Google</sub> released updates.

<prosody rate="0.95">Stay tuned.</prosody>

</speak>

Tips & Limits

  • Always wrap content in <speak>…</speak>, close all tags, and use straight quotes (“).
  • Keep values reasonable: rate 0.25–4.0, pitch −20…+20 st, volume −96…+16 dB.
  • Some voices don’t support pitch and/or SSML; if that happens, synthesize without those parameters or pick another voice (Standard / Wavenet / Neural2 are usually safest).
  • Natural pauses are typically 100–700 ms; keep very long pauses ≤ 2–3 seconds.
  • Character counting: SSML tags count toward usage; <mark> adds anchors but produces no audio.
  • Avoid HTML tags like <b>, <i>, <div> — they’re not SSML.

Mini-Cheatsheet

  • Pause: <break time=”400ms”/>
  • Emphasis: <emphasis level=”strong”>text</emphasis>
  • Prosody: <prosody rate=”1.1″ pitch=”+2st” volume=”+2dB”>text</prosody>
  • Telephone: <say-as interpret-as=”telephone”>+1 800 555 1212</say-as>
  • Alias: <sub alias=”correct pronunciation”>word</sub>
  • Paragraph/Sentence: <p>…</p>, <s>…</s>
  • Marker: <mark name=”m1″/>

Common Errors (And How To Fix Them)

  • SSML / INVALID_ARGUMENT — check closing tags, attribute names, and quotes; remove non-SSML HTML.
  • “Pitch not supported” — remove pitch or choose another voice (Standard/Wavenet/Neural2).
  • Too loud/quiet — keep volume near −6…+6 dB for natural sound.
  • Too fast/slow — start near rate=”0.9–1.1″ and adjust.
Share to friends
CPA.LIVE
Subscribe
Notify of
0 комментариев
Most Voted
Newest Oldest
Inline Feedbacks
View all comments