TADA
ストックにはログインが必要です
1:1 text-acoustic alignment for 5x faster speech generation
Artificial Intelligence
Open Source
Audio
TADA (Text-Acoustic Dual Alignment) is Hume AI's open-source speech-language model that synchronizes text and audio one-to-one. TADA synchronizes text and speech into a single continuous stream via 1:1 token alignment. Generating audio at 5x the speed of conventional LLM-based TTS systems completely eliminates skipped words and content hallucinations across 1000+ tests.
投票数: 87