Engage Users with an Easy-to-Use Text to Speech API

Integrate real-time voice synthesis into your devices and applications with our easy-to-use Generative AI Text to Speech API. Create delightful user experiences in real time with our ultra-realistic Voice AI models.


Generate spoken audio from input text


    // Placeholder values ('<api-key>', '<string>') must be replaced before sending.
    const options = {
        method: 'POST',
        headers: {
            AUTHORIZATION: '<api-key>',
            'X-USER-ID': '<api-key>',
            'Content-Type': 'application/json',
        },
        body: JSON.stringify({
            model: 'PlayDialog',
            // Each speaker turn starts with the matching turnPrefix / turnPrefix2.
            text: `Country Mouse: Welcome to my humble home, cousin!
            Town Mouse: Thank you, cousin. It's quite... peaceful here.
            Country Mouse: It is indeed. I hope you're hungry.
            I've prepared a simple meal of beans, barley, and fresh roots.
            Town Mouse: Well, it's... earthy. Do you eat this every day?`,
            voice: 's3://voice-cloning-zero-shot/baf1ef41-36b6-428c-9bdf-50ba54682bd8/original/manifest.json',
            voice2: 's3://voice-cloning-zero-shot/baf1ef41-36b6-428c-9bdf-50ba54682bd8/original/manifest.json',
            outputFormat: 'mp3',
            speed: 1,
            sampleRate: 44100,
            seed: null,
            temperature: null,
            turnPrefix: 'Country Mouse:',
            turnPrefix2: 'Town Mouse:',
            prompt: '<string>',
            prompt2: '<string>',
            voiceConditioningSeconds: 20,
            voiceConditioningSeconds2: 20,
            language: 'english',
            webHookUrl: '<string>',
        }),
    };

    fetch('https://api.play.ai/api/v1/tts', options)
        .then(response => {
            // Surface HTTP errors instead of parsing an error page as a result.
            if (!response.ok) throw new Error(`Request failed: ${response.status}`);
            return response.json();
        })
        .then(response => console.log(response))
        .catch(err => console.error(err));
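
The `text` field in the request above interleaves two speakers, with each turn starting with the prefix declared in `turnPrefix` or `turnPrefix2`. As a minimal sketch, a transcript in that shape could be assembled from structured turns as follows. The `buildDialogText` helper and the `{ speaker, line }` turn shape are hypothetical, introduced only for illustration; they are not part of the PlayAI API.

```javascript
// Hypothetical helper (not part of the PlayAI API): joins structured turns
// into one transcript where every line starts with "<speaker>:".
function buildDialogText(turns) {
  return turns
    .map(({ speaker, line }) => `${speaker}: ${line.trim()}`)
    .join('\n');
}

const turns = [
  { speaker: 'Country Mouse', line: 'Welcome to my humble home, cousin!' },
  { speaker: 'Town Mouse', line: "Thank you, cousin. It's quite... peaceful here." },
];

// The resulting string can be passed as the `text` field of the request body.
const text = buildDialogText(turns);
console.log(text);
```

Keeping turns as data rather than as one hand-written string makes it harder to mistype a prefix so that it no longer matches `turnPrefix` or `turnPrefix2`.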

Use the Best Text-to-Speech AI Voices in Your Devices and Applications

Choose from an expansive library of 900+ natural-sounding Generative and Neural AI Voices. Offer a rich, multilingual voice experience to your users in real time with our voices in 142 languages and accents.

Conversational Voices

Perfect for entertainment videos, podcasts and audiobooks

Narrative Voices

Ideal for audiobooks, explainer videos and documentary videos

Explainer Voices

Ideal for entertainment videos, explainer videos, podcasts and audiobooks

Children Voices

Perfect for audiobooks, explainer videos and e-learning

Local Accents

Localize your entertainment videos, adverts and audiobooks

Emotions

Ideal for gaming, creative videos and ads

Character Voices

Perfect for gaming, creative videos and ads

Training Voices

Suitable for training videos, L&D and e-learning

Access 140+ Languages and Accents with Our Voice API

Offer multilingual voice experiences to your users in real time with our voices in 142 languages and accents. Create localized speech content in almost every language using our API.

PlayAI Offers the Best Text to Speech API

Access all the best text-to-speech AI Voices from PlayAI, Google, Amazon, IBM and Microsoft using PlayAI's text-to-speech API. The API provides a single interface for converting text to audio with AI Voices from different providers. Check out our text-to-speech API for Python and JavaScript.

Real-time latency

PlayAI's new Turbo voice models can generate speech in under 300 ms.

Low maintenance

Because you use a single text-to-speech API, you have only one integration to maintain.

Automatic updates

We make sure you’re always up to date with all the improvements made by the TTS providers.

Latest voices

All the latest voices added by the TTS providers are synced and ready for you to use.

Integrate Simply, Scale Efficiently

Subscribe to a plan

The API is a premium feature, available on all subscription plans.

Generate secret key

Go to your dashboard and generate your unique secret key.

Synthesize speech

Convert text to speech and start integrating it into your applications.
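
The three steps above can be sketched in code as follows. This is a minimal sketch under stated assumptions, not a definitive integration: the environment variable names `PLAYAI_API_KEY` and `PLAYAI_USER_ID` and the `buildTtsRequest` helper are hypothetical, and passing a user ID in `X-USER-ID` is inferred from the header name. The endpoint, header names, and body fields are taken from the request shown earlier on this page.

```javascript
// Endpoint taken from the sample request on this page.
const TTS_URL = 'https://api.play.ai/api/v1/tts';

// Hypothetical helper: builds fetch() options from credentials and input text.
function buildTtsRequest({ apiKey, userId, text, voice }) {
  if (!apiKey || !userId) {
    throw new Error('Missing API credentials');
  }
  return {
    method: 'POST',
    headers: {
      AUTHORIZATION: apiKey,
      'X-USER-ID': userId, // assumption: the header carries your user ID
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({ model: 'PlayDialog', text, voice, outputFormat: 'mp3' }),
  };
}

// Assumed environment variable names; the fallbacks are placeholders only.
const request = buildTtsRequest({
  apiKey: process.env.PLAYAI_API_KEY ?? '<api-key>',
  userId: process.env.PLAYAI_USER_ID ?? '<user-id>',
  text: 'Country Mouse: Welcome to my humble home, cousin!',
  voice: 's3://voice-cloning-zero-shot/baf1ef41-36b6-428c-9bdf-50ba54682bd8/original/manifest.json',
});

console.log(TTS_URL, request.method);
// To send it: fetch(TTS_URL, request).then(res => res.json()).then(console.log);
```

Reading the secret key from the environment keeps it out of source control, and building the options in a separate function lets you inspect the request before any network call is made.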

Key Features

Leverage futuristic text-to-speech features to create the most realistic speech for your applications.

829 Voices across 142 Languages

Access a growing library of 829 high-quality male, female and kids voices in 142 languages.

Expressive Voice Styles

Explore expressive voice styles such as narrative, conversational, cheerful, angry, sad and empathetic.

Manipulate Voice Tones

Manipulate the volume, rate and pitch of words or even entire sentences to create unique voice effects.

Text and SSML Support

Add pauses, number, date and time formatting, and other advanced pronunciation instructions.
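
To illustrate the kind of markup this refers to, here is a minimal sketch that builds an SSML string with a pause and per-phrase rate, pitch and volume settings. It uses only the standard W3C SSML elements `<speak>`, `<prosody>` and `<break>`. The helper names are hypothetical, and which request field accepts SSML, and which elements PlayAI supports, is not stated on this page, so treat that part as an assumption.

```javascript
// Escape XML special characters so input text cannot break the markup.
function escapeXml(s) {
  return s
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&apos;');
}

// Hypothetical helper: wraps text in a W3C SSML <prosody> element.
function prosody(text, { rate = 'medium', pitch = 'medium', volume = 'medium' } = {}) {
  return `<prosody rate="${rate}" pitch="${pitch}" volume="${volume}">${escapeXml(text)}</prosody>`;
}

// Hypothetical helper: a pause of the given length in milliseconds.
function pause(ms) {
  return `<break time="${ms}ms"/>`;
}

const ssml =
  '<speak>' +
  prosody('Welcome back.', { rate: 'slow' }) +
  pause(500) +
  prosody('Dinner is ready!', { volume: 'loud' }) +
  '</speak>';

console.log(ssml);
```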


Build Real-Time Voice Applications with PlayAI's Text-to-Speech API
