Engage users with AI Voices in your devices and applications.

Integrate real-time voice synthesis into your applications with an easy-to-use API. Access state-of-the-art AI voices from Google, Amazon, IBM and Microsoft in 142 languages and accents.

Rated 4.8/5 based on 75+ reviews


Trusted by 7000+ users and teams of all sizes


Use the Best Text-to-Speech AI Voices in Your Devices and Applications

Choose from a growing library of 907 natural sounding voices with humanlike intonation in 142 languages and accents powered by machine learning technology.

Sample voices: Narrative (US), Marketing (GB), Promo (US), Podcast (GB), Kids (US), Support (US), Converse (CA).

AI Voices in Every Language and Accent in the World

Create natural sounding speech in 142 languages & accents.

Why Play.ht’s API?

Access all the best text-to-speech AI voices from Google, Amazon, IBM and Microsoft using Play.ht's text-to-speech API. Our text-to-speech API provides a single interface to convert text to audio using AI voices across different providers.

Using a single text-to-speech API in your projects saves you time and offers many benefits.

All the leading AI voices

Access all the AI voices from leading TTS providers such as Google, Amazon, IBM and Microsoft.

Low maintenance

As you’ll be using a single text-to-speech API, you just have one integration to maintain.

Automatic updates

We make sure you’re always up to date with all the improvements made by the TTS providers.

Latest voices

All the latest voices added by the TTS providers are synced and ready for you to use.

Take a look at the voices reference file to see a list of the available voices and languages.

Key Features

Use advanced text-to-speech features to create realistic speech for your applications.

907 Voices across 142 Languages

Access a growing library of 907 high-quality male, female and children's voices available in 142 languages.

Expressive Voice

Explore expressive voice styles such as narrative, conversational, cheerful, angry, sad and empathetic.

Manipulate Voice

Manipulate the volume, rate and pitch of words or even entire sentences to create unique voice effects.
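
In SSML, the W3C speech markup that most TTS providers accept, volume, rate and pitch changes are usually expressed with the `prosody` element. The following is a minimal Python sketch that only builds the markup. The helper name `with_prosody` is illustrative and not part of Play.ht's API, and the accepted attribute values can differ by provider and voice.

```python
import xml.etree.ElementTree as ET


def with_prosody(text, rate="medium", pitch="medium", volume="medium"):
    """Wrap text in an SSML <prosody> element inside a <speak> root.

    Hypothetical helper for illustration. The attribute names come from
    the W3C SSML 1.1 spec; supported values vary by provider and voice.
    """
    speak = ET.Element("speak")
    prosody = ET.SubElement(
        speak, "prosody", {"rate": rate, "pitch": pitch, "volume": volume}
    )
    prosody.text = text  # ElementTree escapes &, < and > when serializing
    return ET.tostring(speak, encoding="unicode")


# Slow down, raise the pitch and increase the volume of one sentence.
ssml = with_prosody("Sale ends today & tomorrow!", rate="slow", pitch="high", volume="loud")
print(ssml)
```

Building the markup with an XML library rather than string concatenation keeps the result well-formed even when the text contains characters such as `&` or `<`.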

Text and SSML

Use plain text or SSML to add pauses, control how numbers, dates and times are read, and give other advanced pronunciation instructions.
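
As an illustration of these instructions, the sketch below builds an SSML snippet with a pause (`break`) and a date hint (`say-as`). It assumes the selected voice accepts both elements. The `interpret-as="date"` and `format="mdy"` values are widely supported but provider-dependent, and `build_announcement` is a hypothetical helper name, not part of Play.ht's API.

```python
import xml.etree.ElementTree as ET


def build_announcement(intro, date_text, outro, pause="500ms"):
    """Return SSML that reads intro, pauses, reads a date, then reads outro.

    Hypothetical helper for illustration; element names follow W3C SSML 1.1.
    """
    speak = ET.Element("speak")
    speak.text = intro
    # Insert a pause of the requested length after the intro.
    brk = ET.SubElement(speak, "break", {"time": pause})
    brk.tail = " "
    # Ask the engine to read the text as a month/day/year date.
    date = ET.SubElement(speak, "say-as", {"interpret-as": "date", "format": "mdy"})
    date.text = date_text
    date.tail = outro
    return ET.tostring(speak, encoding="unicode")


print(build_announcement("Your order ships on", "10/12/2024", " by courier."))
```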


Ready to take a deeper dive? Here’s the full API documentation on GitHub.

Integrate Simply, Scale Efficiently

Subscribe to a plan

The API is a premium feature, available on every subscription plan.

Generate secret key

Go to your dashboard and get your unique secret key.

Synthesize speech

Convert text to speech and start integrating it into your applications.
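
The steps above can be sketched in Python as preparing an authenticated HTTP request. Everything specific here is an assumption made for illustration: the endpoint URL, the `Authorization` header scheme and the JSON field names are placeholders, not Play.ht's actual API, so check the API documentation for the real values.

```python
import json
import urllib.request

# Placeholder endpoint: NOT Play.ht's real URL.
API_URL = "https://api.example.com/v1/synthesize"


def build_tts_request(secret_key, text, voice):
    """Prepare (but do not send) a POST request for a generic TTS API.

    The header scheme and the "text"/"voice" field names are hypothetical.
    """
    payload = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {secret_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_tts_request("YOUR_SECRET_KEY", "Hello, world!", "example-voice")
# Sending is left out on purpose. With a real endpoint it would look like:
#   with urllib.request.urlopen(req) as resp:
#       audio = resp.read()
```

Keeping the secret key in an environment variable or a secrets manager, rather than in source code, is generally the safer choice.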

Start creating engaging voiceovers for your projects