Introducing Chatterbox Multilingual: Open Source TTS for 23 Languages

Sep 4, 2025

Two months ago, we did something the industry said was impossible. Our team released Chatterbox—an open-source text-to-speech model that outperformed ElevenLabs with 63.75% user preference in blind evaluations.

The response was overwhelming. Within weeks, Chatterbox passed 1 million downloads on Hugging Face and more than 11,000 GitHub stars. Developers built audiobook generators, game studios integrated it for NPC dialogue, and educators created language learning tools. DigitalOcean called it “significant progress in personalized Voice AI.” The community had spoken: open-source TTS had finally arrived.

But in every GitHub issue, Reddit thread, and Discord conversation, one request dominated: “When will Chatterbox support other languages?”

Today, we answer that call.

Languages Supported

Arabic (ar) • Danish (da) • German (de) • Greek (el) • English (en) • Spanish (es) • Finnish (fi) • French (fr) • Hebrew (he) • Hindi (hi) • Italian (it) • Japanese (ja) • Korean (ko) • Malay (ms) • Dutch (nl) • Norwegian (no) • Polish (pl) • Portuguese (pt) • Russian (ru) • Swedish (sv) • Swahili (sw) • Turkish (tr) • Chinese (zh)
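
If your application accepts a language code from user input, it can help to check it against the list above before calling the model. The following is a minimal sketch: the names `SUPPORTED_LANGUAGES` and `resolve_language` are illustrative helpers we made up for this example and are not part of the Chatterbox package.

```python
# Language codes and names copied from the supported-languages list above.
# SUPPORTED_LANGUAGES and resolve_language are hypothetical helper names,
# not part of the Chatterbox API.
SUPPORTED_LANGUAGES = {
    "ar": "Arabic", "da": "Danish", "de": "German", "el": "Greek",
    "en": "English", "es": "Spanish", "fi": "Finnish", "fr": "French",
    "he": "Hebrew", "hi": "Hindi", "it": "Italian", "ja": "Japanese",
    "ko": "Korean", "ms": "Malay", "nl": "Dutch", "no": "Norwegian",
    "pl": "Polish", "pt": "Portuguese", "ru": "Russian", "sv": "Swedish",
    "sw": "Swahili", "tr": "Turkish", "zh": "Chinese",
}


def resolve_language(code: str) -> str:
    """Return the normalized two-letter code, or raise ValueError if unsupported."""
    normalized = code.strip().lower()
    if normalized not in SUPPORTED_LANGUAGES:
        supported = ", ".join(sorted(SUPPORTED_LANGUAGES))
        raise ValueError(
            f"Unsupported language {code!r}; expected one of: {supported}"
        )
    return normalized
```

Normalizing before the lookup means inputs such as " FR " resolve to "fr", while unknown codes fail early with a readable message instead of surfacing as a model error.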

State-of-the-Art Voice Cloning in 23 Languages

Demand for multilingual TTS is skyrocketing. Brands, educators, and developers are no longer limited to English-only voice AI. Users expect apps and agents to sound human, speak in their native language, and deliver content with authentic tone and emotion.

Chatterbox Multilingual was built to meet that demand. It combines:

  • Breadth of languages: 23 supported languages from launch.
  • Expressive control: Fine-tune delivery with emotion and intensity settings.
  • Enterprise reliability: Ultra-stable inference and built-in watermarking.

Whether you’re designing a voice AI agent for customer support, a language-learning app, or a global gaming experience, Chatterbox Multilingual gives you the tools to build voices that resonate everywhere.

Get Started on Hugging Face

Six lines of code to access Hollywood-quality voice synthesis in 23 languages.

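import torchaudio as ta
from chatterbox.mtl_tts import ChatterboxMultilingualTTS

# Load the multilingual model; weights are downloaded from Hugging Face on first use.
model = ChatterboxMultilingualTTS.from_pretrained(device="cuda")  # or "cpu"

text = "Bienvenue dans Chatterbox Multilingual. Rapide, expressif et prêt pour la production."
# language_id takes one of the two-letter codes from the supported-languages list.
wav = model.generate(text, language_id="fr")
# Write the waveform at the model's native sample rate.
ta.save("sample_fr.wav", wav, model.sr)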
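
To produce samples in several languages in one pass, the generation call can be wrapped in a small loop. The sketch below is deliberately decoupled from the model: `synthesize_batch` is a hypothetical helper that accepts any `generate(text, lang_code)` callable and any `save(path, audio)` callable, so it does not assume the exact Chatterbox signatures.

```python
from pathlib import Path
from typing import Any, Callable, Dict, List


def synthesize_batch(
    texts: Dict[str, str],
    generate: Callable[[str, str], Any],
    save: Callable[[str, Any], None],
    out_dir: str = "samples",
) -> List[str]:
    """Generate one audio file per language and return the written paths.

    texts maps a language code (e.g. "fr") to the sentence to speak.
    generate and save are supplied by the caller; both are assumptions of
    this sketch rather than Chatterbox APIs.
    """
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    written: List[str] = []
    for lang, text in texts.items():
        audio = generate(text, lang)
        path = str(Path(out_dir) / f"sample_{lang}.wav")
        save(path, audio)
        written.append(path)
    return written
```

With a model loaded as in the snippet above, `generate` could be a thin wrapper around `model.generate`, and `save` could be `lambda path, wav: ta.save(path, wav, model.sr)`.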

Enterprise-Level Voice Cloning with Chatterbox Multilingual Pro

While the open-source release brings world-class TTS to everyone, many enterprises need fine-tuned performance and strict service guarantees. That’s where Chatterbox Multilingual Pro comes in.

🎯 Custom fine-tuning: Train on your brand’s vocabulary, product names, and domain terms.
Sub-200 ms latency: Real-time streaming with global infrastructure.
📈 SLAs and reliability: Guaranteed uptime and throughput.
🔒 Advanced watermarking & detection: Enhanced protection against deepfakes.

For call centers, financial services, healthcare platforms, and any organization where accuracy and compliance matter, Chatterbox Multilingual Pro closes the last mile.

Get started with Chatterbox Multilingual for free
