Frequently Asked Questions
What is AI Voice?
AI Voice is a computer generated voice powered by machine learning and can generate speech from text with natural intonation and real accents. AI Voices are created by machine learning models that process hundreds of hours of voice recordings from real voiceover artists and then learn to speak based on the audio recordings. Today AI Voices are used in several applications due to their natural-sounding tone.
How long does it take to synthesize text into speech?
The text to speech synthesis is realtime in most cases, and only takes a couple of minutes to convert the input text into audio. Our TTS software runs in the cloud, so if you are converting large amounts of text then you can paste it in our voice generator's interface and start the conversion. There's no need for you to wait for the conversion to finish. Once the audio is ready, the files will be available in your dashboard to download.
What customizations can I do with the AI Voices?
All our AI Voices support SSML features - rate, pitch, volume and pronunciations. You can add custom pauses for different punctuation marks to create a more natural speaking tone. Adjust the pitch of the voice to make it sound more deeper or child-like. The speaking rate allows you to increase or decrease the speed of voice. With our pronunciations library you can save custom pronunciations and use them whenever you create speech.
Can I use the voices for commercial purpose?
Yes, all our voices can be used for commercial purposes. Please refer to our Pricing page to select the appropriate plan that offers commercial rights.
Do you offer a free version?
Yes, we do offer a free version that allows you to preview all the available voices and convert a few words to audio.