Two months ago, we did something the industry said was impossible. Our team released Chatterbox—an open-source text-to-speech model that outperformed ElevenLabs with 63.75% user preference in blind evaluations.
The response was overwhelming. Within weeks, Chatterbox exploded to over 1 Million downloads on Hugging Face and above 11,000 GitHub stars. Developers built audiobook generators, game studios integrated it for NPC dialogue, and educators created language learning tools. DigitalOcean called it “significant progress in personalized Voice AI.” The community had spoken: open-source TTS had finally arrived.
But in every GitHub issue, Reddit thread, and Discord conversation, one request dominated: “When will Chatterbox support other languages?”
Today, we answer that call.
Languages Supported
Arabic (ar) • Danish (da) • German (de) • Greek (el) • English (en) • Spanish (es) • Finnish (fi) • French (fr) • Hebrew (he) • Hindi (hi) • Italian (it) • Japanese (ja) • Korean (ko) • Malay (ms) • Dutch (nl) • Norwegian (no) • Polish (pl) • Portuguese (pt) • Russian (ru) • Swedish (sv) • Swahili (sw) • Turkish (tr) • Chinese (zh)
State of the Art Voice Cloning in 23 Languages
Demand for multilingual TTS is skyrocketing. Brands, educators, and developers are no longer limited to English-only voice AI. Users expect apps and agents to sound human, speak in their native language, and deliver content with authentic tone and emotion.
Chatterbox Multilingual was built to meet that demand. It combines:
- Breadth of languages: 23 supported languages from launch.
- Expressive control: Fine-tune delivery with emotion and intensity settings.
- Enterprise reliability: Ultra-stable inference and built-in watermarking.
Whether you’re designing a voice AI agent for customer support, a language-learning app, or a global gaming experience, Chatterbox Multilingual gives you the tools to build voices that resonate everywhere.
Chatterbox Multilingual Samples
Language | Preview |
---|
Get Started from HuggingFace
Six lines of code to access Hollywood-quality voice synthesis in 23 languages.
import torchaudio as ta
from chatterbox.tts import ChatterboxTTS
model = ChatterboxTTS.from_pretrained(
repo_id="ResembleAI/chatterbox-multilingual",
device="cuda" # or "cpu"
)
text = "Bienvenue dans Chatterbox Multilingual. Rapide, expressif et prêt pour la production."
wav = model.generate(text, lang="fr")
ta.save("sample_fr.wav", wav, model.sr)