Microsoft MAI-Voice-2 logo

Microsoft MAI-Voice-2

Expressive TTS with voice cloning in 15 languages

Artificial Intelligence Developer Tools Productivity

Microsoft's most expressive TTS model yet — voice cloning from short samples, fine-grained emotional control, and consistent voice identity across 15 languages. Now live in Azure AI Foundry at $22 per million characters, with integrations rolling out in VSCode, Dynamics 365 Contact Center, and Teams. For builders shipping voice agents who need production-grade prosody without the OpenAI Realtime API price tag.

投票数: 0
← 投稿一覧に戻る