Can modern AI Voices replace professional voice over actors?

One area of technology that is seeing rapid development is the Text to Speech technology. The four tech giants – Amazon, Google, Microsoft, and IBM along with a lot of other opensource projects are silently competing with each other to create better and more realistic voices. But are the Text to Speech voices replace the professional voice-over actors? Let’s find out.

There are four key things one would consider while hiring a voice-over actor:
1. Quality of voice
2.Cost
3.Delivery time
4.Commercial rights

Let’s compare these things and understand why we say Text to Speech voices will replace voice over actors.

Quality of AI Voice

Early AI Voices

The early Artificial Intelligence voices (AI Voices) sounded extremely robotic because they were generated through a process known as a concatenative approach where sounds of words would be fist recorded and later stitched together to create audio. The resulting voice would sound monotonous and lacked any intonation or expression.

Here is a sample of a male AI voice using the concatenative approach –

TTS male voice introducing himself

Here is a female TTS voice reading an excerpt from an article using the same concatenative approach –

TTS female voice reading out an article paragraph

 

Modern Text to Speech Voices

The latest AI voices, however, are dynamically generated based on a process called neural learning which is based on machine learning. A computer model is first trained using a high-quality dataset which then learns to predict the speech based on the context of input texts.

The resulting voices sound shockingly real.

Here’s one of the neural AI Voice male voices –

A male AI voice introducing himself

A neural AI Voice female voice reading out the same article excerpt –

A female AI voice reading a paragraph

Here’s a AI voice that’s designed to read out news –

A male AI voice reading out the news

As you can notice the newer Text to Speech voices don’t sound anything like the older voices. And in some cases they sound so real, it’s hard to identify if it’s a machine or human.

Cost of creating audio

On average, a professional voice actor charges $10 for 100 words. Text to Speech, on the other hand, costs a fraction of that price.

There are two types of Text to Speech voices that are available – standard and neural. The standard voices cost around $0.04 for 1000 words and the neural voices cost around $0.16 for 1000 words.

Delivery of time

A voice actor typically takes around 3-4 days to create and deliver the audio. With Text to Speech technology, you can create the audio in almost real-time.

You also have the benefit of doing unlimited revisions that are limited and time consuming with a voice actor.

Broadcast and Commercial rights

Although a voice actor grants all the rights you would require to commercially use the audio, they typically charge to provide these rights.

With Text to Speech voices though, you don’t have to worry or pay any extra fees to use the audio commercially.


Applications more suited for AI voices

There are some of the applications that are more suited Text to Speech technology than hiring voice actors:

  1. Creating audio versions of articles and blog posts to repurpose content boost and user engagement.
  2. To create voice-over audio for YouTube videos.
  3. Creating voice-over audio for presentations and product demos.
  4. Create announcements.
  5. To create audio for avatars for VR or video games.
  6. Create audio content for courses and eLearning material.

All in all, the significant improvements that you see in today’s Text to Speech voices have definitely made them end-user-consumable, and have opened up a plethora of applications for them but they are still not applicable to certain use cases such as creating commercials, narrating audiobooks, etc wherein a human voice is needed to convey an emotion in the audio.

We believe it’s just a matter of time that AI voices will catchup to sound exactly alike, or even better than professional voice actors.

18 thoughts on “Can modern AI Voices replace professional voice over actors?”

  1. I do not even know how I ended up here, but I thought this post was great. I do not know who you are but definitely you’re going to a famous blogger if you are not already 😉 Cheers!

  2. I feel that is among the most significant information for me. And i’m happy reading your article. However wanna commentary on some normal things, The site taste is ideal, the articles is in reality great : D. Just right job, cheers

  3. I’m not that much of a online reader to be honest but your sites really nice, keep it up! I’ll go ahead and bookmark your website to come back later on. Many thanks

  4. Pingback: Are Audio Articles the next norm in content marketing? - Play.ht

  5. A lot of thanks for your own hard work on this website. Betty enjoys making time for investigation and it’s easy to see why. My spouse and i hear all regarding the lively tactic you produce worthwhile items via this web blog and even strongly encourage contribution from others about this area of interest so my princess is certainly studying a lot of things. Enjoy the rest of the year. You’re the one performing a powerful job.

  6. A lot of thanks for all of your hard work on this web page. My daughter really likes engaging in research and it’s really easy to understand why. Many of us hear all relating to the compelling ways you present priceless guidelines by means of the web site and improve contribution from other ones about this content while our own princess is in fact discovering a great deal. Take pleasure in the remaining portion of the new year. You are doing a splendid job.

Leave a Reply

Your email address will not be published. Required fields are marked *