March 6, 2025 We’re announcing a partnership between LiveKit and PlayAI to give developers the tools to build high-performance voice...
April 4, 2025
PlayAI is partnering with Groq to deliver Dialog, our market-leading voice AI model, using fast AI inference from GroqCloud™. Click here to learn more.
The gap between human-like voice interaction and machine-generated speech has been steadily closing, but existing models have been locked in a dilemma between delivering quality or speed. Real-time applications, particularly agents, can’t sacrifice either of these options. Developers need snappiness and emotiveness, with their apps feeling robotic if either one is missing.
Today, we’re thrilled to announce a game-changing partnership that eliminates this compromise.
PlayAI is partnering with Groq to deliver Dialog, our market-leading voice AI model, using fast AI inference from GroqCloud™. This collaboration represents a fundamental shift in what’s possible with conversational AI, combining PlayAI’s advanced text-to-speech (TTS) technology with Groq LPU-based AI inference infrastructure. Plus, we’ve made it really easy to get started with Dialog on Groq, get up and running in seconds on their console & API, or our new Dialog Turbo endpoint.
Dialog has already set new standards for natural-sounding AI speech, outperforming competitive models by 3:1 in blind testing. Now, running on GroqCloud, Groq is delivering up to 215 characters/s on PlayAI’s Dialog model, a significant boost compared to the same model running on GPUs at 80 characters/s. That means that Dialog generates text up to 15 times faster than real-time. All without sacrificing speech quality. Paired with Time to First Audio as low as 200 milliseconds (and dropping by the day), your users will feel the difference with Dialog on Groq.
In addition to the speed, efficiency, and natural voice breakthroughs, PlayAI is announcing the launch of the first Arabic generative voice AI for the Middle East, and one capturing the nuances of Saudi Arabian Arabic.
What makes Dialog different is its unique ability to understand and maintain conversational context. Unlike traditional TTS models that process each sentence in isolation, Dialog was built with a novel architecture that considers the entire conversation history. This means every response is enriched with:
Trained on millions of conversations across over 30 languages, Dialog captures the subtle nuances that make human speech feel natural and engaging. This extensive training allows the model to handle everything from casual conversations to professional narrations with appropriate style and tone.
The partnership with Groq represents a strategic leap forward in our ability to deliver Dialog at scale. GroqCloud infrastructure provides:
This means developers can now build voice applications that respond as quickly as humans do, maintaining the natural flow of conversation without sacrificing speech quality.
At launch, Dialog on GroqCloud supports both English and Arabic languages, with several additional languages coming soon. The service is available through an API (documentation here), and GroqCloud Developer Console, a simple front end (GUI) with embedded code examples for using the Groq SDK.
Dialog via Groq is priced at $50 per million characters.
You can also use the same Groq silicon through your existing Play.ai account to supercharge your TTS generations! Check out our API docs here.
Dialog running on GroqCloud enables a new generation of voice applications, including:
Create voice agents that respond naturally and emotionally appropriately to customer inquiries, maintaining context throughout the entire conversation.
Generate synthetic podcasts where multiple speakers sound like they’re in the same room, with natural interaction patterns and emotional engagement.
Produce high-quality voiceovers that maintain the emotional nuance and timing of the original performance.
Build interactive voice experiences that respond instantly while maintaining natural prosody and emotional authenticity.
Our partnership with Groq is just the beginning. We’re excited about the future possibilities as we continue to work together to push the boundaries of what’s possible in voice AI.
This partnership marks the beginning of what’s possible when you combine state-of-the-art voice AI with ultra-fast inference infrastructure. We’re excited about the future possibilities as we continue to work together to push the boundaries of what’s possible in voice AI.
Experience the next generation of voice AI for yourself. Developers can access Dialog powered by Groq on GroqCloud Developer Console, Groq TTS API, or our new Dialog Turbo endpoint.
For enterprise solutions and custom implementations, contact our team.
March 6, 2025
March 6, 2025 We’re announcing a partnership between LiveKit and PlayAI to give developers the tools to build high-performance voice...
March 4, 2025
We’re thrilled to announce a major upgrade to the Play.ai Studio, bringing together our most requested features and capabilities into...
February 3, 2025
February 3, 2025. PlayAI’s Dialog Text-to-Speech model is now in general availability, bringing multilingual capabilities, and exceptional performance to applications...
October 14, 2024
Today we’re releasing our most capable and conversational voice model that can speak in 30+ languages using any voice or...
October 12, 2023
TL;DR We are thrilled to announce the release of the FASTEST Voice LLM to date! Experience real-time speech streaming from...
August 9, 2023
Today we’re introducing the first ever Generative Text to Voice AI model that’s capable of synthesizing humanlike speech with incredible...
August 7, 2023
Today we’re announcing a new feature that enables non-English speakers to clone their voices to create English speaking clones of...
August 6, 2023
Today we’re introducing a new Generative Text-to-Voice AI Model that’s trained and built to generate conversational speech. This model also...
March 29, 2023
PlayHT at GDC 2023. A full recap. We believe that AI voices have a bright future in game development. With...