Text To Speech

Quick guide: Understand what Text-to-Speech is, how it functions, and how to manage its settings and downloads.

Updated over 2 weeks ago

TTS stands for Text-to-Speech. It's a technology that converts written text into spoken audio using synthetic voices.



What Does TTS Do?

TTS reads digital text aloud. For example:

  • You type "Hello, how are you?"

  • The TTS system uses a computer-generated voice to say it out loud.

🧠 How It Works

1. Text Input

You enter or upload the text you want to convert to speech.

2. Linguistic Analysis

The system analyzes pronunciation, punctuation, and intonation to prepare the text for natural-sounding speech.

3. Speech Synthesis

It generates audio using either pre-recorded voice samples or AI-generated voices.
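
To make the analysis step more concrete, here is a toy sketch in Python. It is only an illustration of the idea, not how this product works internally: the abbreviation table, the pause lengths, and the function names `normalize` and `plan_speech` are all assumptions made up for the example. It expands a few abbreviations and uses punctuation to decide how long to pause after each phrase.

```python
import re

# Hypothetical abbreviation table, for illustration only.
ABBREVIATIONS = {"dr.": "doctor", "mr.": "mister", "etc.": "et cetera"}

# Hypothetical pause lengths (milliseconds) for each punctuation mark.
PAUSE_MS = {",": 150, ".": 400, "?": 400, "!": 400}


def normalize(text: str) -> str:
    """Lowercase the text and expand known abbreviations."""
    words = [ABBREVIATIONS.get(word, word) for word in text.lower().split()]
    return " ".join(words)


def plan_speech(text: str) -> list[tuple[str, int]]:
    """Split normalized text into (phrase, pause_after_ms) pairs."""
    normalized = normalize(text)
    # Each match is a run of non-punctuation plus one optional mark.
    pieces = re.findall(r"([^,.?!]+)([,.?!]?)", normalized)
    return [
        (phrase.strip(), PAUSE_MS.get(mark, 0))
        for phrase, mark in pieces
        if phrase.strip()
    ]


if __name__ == "__main__":
    print(plan_speech("Hello, how are you?"))
```

A real synthesis engine would pass each phrase, together with its pause and intonation hints, to the voice model that produces the audio.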


Settings

  • Gender: Select the desired voice gender (Male / Female / Neutral).

  • Language: Choose the language of the input text.

  • Output Format: Select the desired audio format from the following:

      • .flac – Free Lossless Audio Codec

      • .wav – Waveform Audio File Format

      • .adts – Audio Data Transport Stream

      • .pcm_s16le – 16-bit signed little-endian PCM

      • .pcm_s32le – 32-bit signed little-endian PCM
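
If you are unsure how the raw PCM formats differ from .wav, the following Python sketch may help. It is a minimal illustration using only the standard library, not the tool's own export code: the 16 kHz sample rate, the mono channel layout, and the function names are assumptions chosen for the example. It builds a short tone as 16-bit signed little-endian samples (the layout that .pcm_s16le refers to) and then wraps the same bytes in a WAV container.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 16000  # assumed sample rate, for illustration only


def sine_pcm_s16le(freq_hz: float, seconds: float) -> bytes:
    """Return a sine tone as raw 16-bit signed little-endian PCM."""
    count = int(SAMPLE_RATE * seconds)
    samples = (
        int(32767 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE))
        for i in range(count)
    )
    # "<h" packs one signed 16-bit integer in little-endian byte order.
    return b"".join(struct.pack("<h", sample) for sample in samples)


def wrap_in_wav(pcm: bytes) -> bytes:
    """Wrap raw mono 16-bit PCM bytes in a WAV container."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono (assumed)
        wav_file.setsampwidth(2)   # 2 bytes per sample = 16 bits
        wav_file.setframerate(SAMPLE_RATE)
        wav_file.writeframes(pcm)
    return buffer.getvalue()


if __name__ == "__main__":
    raw = sine_pcm_s16le(440.0, 0.5)
    wav_bytes = wrap_in_wav(raw)
    print(len(raw), len(wav_bytes))
```

The raw PCM bytes carry no header, so a player has to be told the sample rate and channel count separately. The .wav version stores that information in its header, which is why it is usually the easier choice for general playback.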

✅ Generate Audio

Click "Generate Speech" to create the audio from your text input.

⬇️ Download

Once the audio is generated, you can listen to it and download it by clicking "Download".


📂 TTS History

You can also view your TTS history below to access previously generated audio files.