Custom Voice
Build a recognizable, one-of-a-kind voice for your text-to-speech apps, as unique as your business.
Get started
New to Speech Services? Create a Speech resource
Create a brand voice that supports custom personas
To customize your voice agent, record audio and upload it as training data. We do the rest, creating a unique voice font tuned to your recordings. You can also develop a highly realistic, humanlike custom voice by using Microsoft’s neural text-to-speech models.
Hear Custom Voice for yourself
Build a highly natural voice without a single line of code, starting from just a few minutes of audio.
1 hour of speech data
8 hours of speech data
3 hours of speech data
2 hours of speech data or less
The Custom Neural Voice capability is now in public preview, with limited access
These voices are so lifelike that they must be designed in a way that earns the trust of others. To learn about the principles of building synthetic voices that create confidence in your company and services, and about deploying these voices responsibly, visit Microsoft's Responsible AI guidelines.
Develop a highly realistic custom voice for your business
Brand identity
Design and implement a voice model that strengthens your brand strategy.
Custom persona
Put your analytics to work by accommodating customer sentiment with custom voice characteristics.
Natural interaction
Strengthen your customers’ emotional connection to your applications and increase their interactions with them.
Creating a custom voice model
Custom voice Diagram
1 Prepare your training data and create a Speech resource before you start training a custom voice.
2 Upload your data through the Custom Voice portal or the Custom Voice API, and check its quality.
3 Use your data to train a custom model. When the model is ready, test it with your own script.
4 Deploy the voice model to get your custom API endpoint. Test the endpoint before you integrate it into your system.
5 Use the voice in your apps, starting from the code samples provided with the endpoint.
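As an illustration of step 5, the sketch below shows roughly what a call to a deployed custom voice endpoint could look like in Python. It is not the code sample generated for your endpoint. The endpoint URL, deployment ID, voice name, and key are placeholders, and the header names and output format are assumptions based on common Speech service REST conventions, so check them against the samples shown for your own deployment. The request is only built here, not sent.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text: str, voice_name: str, lang: str = "en-US") -> str:
    """Wrap plain text in a minimal SSML document that selects one voice."""
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>{escape(text)}</voice>"
        "</speak>"
    )


def build_request(endpoint_url: str, key: str, ssml: str) -> urllib.request.Request:
    """Prepare (but do not send) a POST request carrying the SSML body."""
    return urllib.request.Request(
        endpoint_url,
        data=ssml.encode("utf-8"),
        method="POST",
        headers={
            # Header names and the output format below are assumptions;
            # confirm them against the samples for your own endpoint.
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/ssml+xml",
            "X-Microsoft-OutputFormat": "riff-24khz-16bit-mono-pcm",
            "User-Agent": "custom-voice-sketch",
        },
    )


# Placeholder values: substitute the ones shown for your own deployment.
ENDPOINT_URL = (
    "https://example.invalid/cognitiveservices/v1"
    "?deploymentId=YOUR_DEPLOYMENT_ID"
)
ssml = build_ssml("Thanks for calling. How can I help?", "YourCustomVoiceName")
request = build_request(ENDPOINT_URL, "YOUR_SPEECH_KEY", ssml)

# To synthesize audio for real, send the request and save the returned bytes:
# with urllib.request.urlopen(request) as resp, open("out.wav", "wb") as f:
#     f.write(resp.read())
```

Building the SSML separately from the request keeps the voice name and text easy to change, and escaping the text avoids malformed markup when a script contains characters such as "&" or "<".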