Generate Speech


Enter the text you want to convert to speech. For audiobooks, you can paste long chapters.



Generation Parameters

Seed: an integer that makes results reproducible. Some engines treat 0 or -1 as a request for a random seed.
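
The seed convention can be sketched as follows. This is a minimal illustration, not the server's implementation: the helper names `resolve_seed` and `sample_noise` are hypothetical, and treating both 0 and -1 as "random" is an assumption that varies by engine.

```python
import random

# Assumption: values an engine may interpret as "pick a random seed".
RANDOM_SENTINELS = {0, -1}


def resolve_seed(seed: int) -> int:
    """Return the seed to use, drawing a fresh one when a sentinel is given."""
    if seed in RANDOM_SENTINELS:
        # SystemRandom draws from the OS entropy source, so it is not
        # affected by any earlier call to random.seed().
        return random.SystemRandom().randrange(1, 2**31)
    return seed


def sample_noise(seed: int, n: int = 3) -> list[float]:
    """Stand-in for a generation step: the same seed gives the same values."""
    rng = random.Random(resolve_seed(seed))
    return [rng.random() for _ in range(n)]
```

Calling `sample_noise(42)` twice returns identical lists, while `sample_noise(-1)` will almost certainly differ between calls.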

MP3 is recommended for smaller file sizes (e.g., audiobooks).

Server Configuration

These settings are loaded from config.yaml via an API call. Changes to Host, Port, Model, or Path settings, whether made here or directly in the file, take effect only after the server is restarted.

Tips & Tricks

  • For Audiobooks, use MP3 format, enable Split text, and set a chunk size of roughly 250-500 characters.
  • Use Predefined Voices for consistent, high-quality output. You can import new ones.
  • For Voice Cloning, upload clean reference audio (.wav/.mp3). The quality of the reference audio largely determines the quality of the clone.
  • Experiment with Temperature and other generation parameters to fine-tune output.
  • Adjusting Speed Factor away from 1.0 is experimental and may cause echo.
  • When using the Turbo model, you can insert paralinguistic tags like [laugh] or [sigh] for natural vocal reactions.
  • Check the /docs endpoint for API details.
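
To illustrate what splitting text into chunks involves, here is a minimal sketch that groups whole sentences into chunks of at most `chunk_size` characters. The function name `split_into_chunks`, the sentence-boundary rule, and measuring chunk size in characters are all assumptions made for illustration; the server's own Split text logic may work differently.

```python
import re


def split_into_chunks(text: str, chunk_size: int = 400) -> list[str]:
    """Greedily pack whole sentences into chunks of at most chunk_size characters.

    A single sentence longer than chunk_size is kept whole rather than cut
    mid-sentence, so such a chunk can exceed the limit.
    """
    # Split after '.', '!' or '?' when followed by whitespace (a simple heuristic).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

For example, `split_into_chunks("One. Two. Three.", chunk_size=10)` returns `["One. Two.", "Three."]`. Splitting at sentence boundaries rather than at a fixed character offset avoids cutting a word or phrase in half between two separately generated audio segments.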