Text to Speech API for Python

Unleash the power of real-time voice synthesis in your applications with our simple-to-use Text to SpeechPython API. Deliver exceptional user experiences with low latency, lifelike AI voices using our advanced AI models.

Try API playground →Contact sales

Generate spoken audio from input text


    const options = {
    method: 'POST',
    headers: {
        AUTHORIZATION: '<api-key>',
        'X-USER-ID': '<api-key>',
        'Content-Type': 'application/json',
    },
    body: JSON.stringify({
        model: 'PlayDialog',
        text: `Country Mouse: Welcome to my humble home, cousin! 
            Town Mouse: Thank you, cousin. It's quite... peaceful here. 
            Country Mouse: It is indeed. I hope you're hungry. 
            I've prepared a simple meal of beans, barley, and fresh roots. 
            Town Mouse: Well, it's... earthy. Do you eat this every day?`,
        voice: 's3://voice-cloning-zero-shot/baf1ef41-36b6-428c-9bdf-50ba54682bd8/original/manifest.json',
        voice2: 's3://voice-cloning-zero-shot/baf1ef41-36b6-428c-9bdf-50ba54682bd8/original/manifest.json',
        outputFormat: 'mp3',
        speed: 1,
        sampleRate: 44100,
        seed: null,
        temperature: null,
        turnPrefix: 'Country Mouse:',
        turnPrefix2: 'Town Mouse:',
        prompt: '<string>',
        prompt2: '<string>',
        voiceConditioningSeconds: 20,
        voiceConditioningSeconds2: 20,
        language: 'english',
        webHookUrl: '<string>',
        }),
    };

    fetch('https://api.play.ai/api/v1/tts', options)
    .then(response => response.json())
    .then(response => console.log(response))
    .catch(err => console.error(err));

Best AI Voices, Lowest Latency, Python API for Text to Speech in Your Applications

Choose from a vast selection of over 900 lifelike generative and neural AI voices. Provide your users with a seamless, multilingual voice experience in real time, featuring support for 142 languages and native accents.

Conversational Voices

Perfect for entertainment videos, podcasts and audiobooks

Narrative Voices

Ideal for audiobooks, explainer videos and documentary videos

Explainer Voices

Ideal for entertainment videos, explainer videos, podcasts and audiobooks

Children Voices

Perfect for audiobooks, explainer videos and e-learning

Local Accents

Localize your entertainment videos, adverts and audiobooks

Emotions

Ideal for gaming, creative videos and ads

Character Voices

Perfect for gaming, creative videos and ads

Training Voices

Suitable for training videos, L&D and E-learning

Access 140+ Languages and Accents with the Lowest Latency Python Text to Speech Voice API

Offer multilingual voice experiences to your users in real time with our voices in 142 languages and accents. Create localized speech content in almost every language using our Python API.

PlayAI Offers the Best Text to Speech Python API

Our natural sounding voices with local and regional accents are trained on our own models. Get access to these unique, high quality voices via our very low latency text to speech Python API. No matter the project, the character, or the situation, you’re sure to find the perfect voice.

Optionally, unlock access to other top providers to manage multiple AI voices via a single API. Also, check out our text to speech JavaScript API.

Real-time latency

PlayAI's new Turbo voice models can generate speech in <300ms.

Low maintenance

As you’ll be using a single text to speech API, you just have one integration to maintain.

Automatic updates

We make sure you’re always up to date with all the improvements made by the TTS providers.

Latest voices

All the latest voices added by the TTS providers are synced and ready for you to use.

Integrate Simply, Scale Efficiently

Subscribe to a plan

The Python API is a premium feature and is available across all the subscription plans

Generate secret key

Go to your dashboard and acquire your unique secret key

Synthesize speech

Convert text to speech and start integrating in your applications

Key Features

Leverage futuristic text to speech features to create the most realistic speech for your applications.

829 Voices across
142 Languages

Access a growing library of 829 high-quality male, female and kids voices in 142 languages.

Expressive Voice
Styles

Explore expressive voice styles such as narrative, conversational, cheerful, angry, sad and empathetic.

Manipulate Voice
Tones

Manipulate the volume, rate and pitch of words or even entire sentences to create unique voice effects.

Text and SSML
Support

Add pauses, numbers, date, time formatting, and other advanced, pronunciation instructions.

Frequently Asked Questions

Build Real-Time Voice Applications with PlayAI's Text to Speech Python API

Contact sales