Product Launch | tts | voice cloning | multilingual | open source
Qwen3-TTS Releases Open-Source Voice-Cloning And Generation
Relevance Score: 9.2
Qwen releases the Qwen3-TTS family of open-source text-to-speech models, publishing both the tokenizers and the models under the Apache 2.0 license. The models are trained on over 5 million hours of speech across 10 languages. They support voice cloning from three seconds of audio, description-based voice control, and real-time streaming synthesis via a dual-track LM, and they report state-of-the-art results on multilingual and long-speech benchmarks. Hugging Face hosts the 0.6B (2.52 GB) and 1.7B (4.54 GB) variants, along with a browser demo that supports voice cloning.


