Get started with
Azure Cognitive Services Speech
Give your apps the ability to hear, understand, and even talk to your customers with features like speech to text and text to speech.
Speech capabilities by scenario
Explore, try out, and view sample code for some of common use cases using Azure Speech Services features like speech to text and text to speech.
Captioning with speech to text
Convert the audio content of TV broadcast, webcast, film, video, live event or other productions into text to make your content more accessible to your audience.
Post call transcription and analytics
Batch transcribe call center recordings and extract valuable information such as Personal Identifiable Information (PII), sentiment, and call summary.
Live chat avatar
Engage in natural conversations with an avatar that recognizes users' speech input and responds fluently with realistic AI voice.
Language learning Preview
Get instant feedback on pronunciation accuracy, fluency, prosody, grammar, and vocabulary from your chatting experience.
Video translation Preview
Effortlessly translate and apply AI voice dubbing to your videos across more than 100 languages, with a choice of over 400 prebuilt voices or using the personal voice across languages.
Speech to text
Quickly and accurately transcribe in more than 100 languages and dialects. Enhance the accuracy of your transcriptions by creating a custom speech model that can handle domain-specific terminology, background noise, and accents. Learn more about speech to text
Real-time speech to text
Quickly test live transcription capabilities on your own audio without writing any code.
Whisper Model in Azure OpenAI Service
Quickly test live transcription capabilities on your own audio utilizing your Azure OpenAI resource and use prompts to improve the quality of the transcripts.
Batch speech to text
Quickly test batch transcription capabilities to transcribe a large amount of audio in storage and receive results asynchronously using Azure Speech models or OpenAI Whisper model.
Custom Speech
Add your own data and adapt to specific speaking styles, vocabulary, and more with a customized speech to text model.
Pronunciation Assessment with speech to text
Get instant feedback on pronunciation accuracy and fluency by reading a script aloud.
Speech Translation
Translate speech into other languages of your choice with low latency.
Text to speech
Build apps and services that speak naturally with more than 150 voices across 500 languages and dialects. Create a customized voice to differentiate your brand and use various speaking styles to bring a sense of emotion to your spoken content. Learn more about text to speech
Voice Gallery
Browse expressive voices with humanlike speech to find the perfect speaker for your project.
Custom Voice
Use your own audio recordings to create a distinct, one-of-a-kind voice for your text to speech apps.
Personal Voice
Create an AI voice easily from a human voice sample, providing your users with a personalized voice experience across 100 languages.
Audio Content Creation
Craft nuanced speech by adjusting the speaking style, pacing, and pronunciation of your spoken content.
Text to speech Avatar
Bring text to life with natural-sounding voices and photorealistic talking avatars, creating a more engaging and delightful communication experience.
Voice assistant
Enrich your app or experience with a conversational interface to activate and control your product. Learn more about Voice assistant
Custom Keyword
Create a unique keyword or short phrase to activate your product by voice.
Responsible AI in Speech
We offer guidance for responsible use of these capabilities based on Microsoft AI’s principles of fairness, reliability and safety, privacy and security, inclusiveness, transparency, and human accountability.
Learning resources
Documentation
Understand the details of how to recognize speech, synthesize speech, get real-time translations, transcribe conversations, or integrate speech into your automated experiences.
Quick start guides
Use the SDK to get started with samples in a variety of languages and platforms to discover what you can build.
Microsoft Q&A
For quick and reliable answers, engage with us on Azure's preferred destination for community support.
Microsoft Learn
Discover new skills, find certifications, and advance your career in minutes with interactive, hands-on learning paths.