Coqui TTS

Open-source Text-to-Speech (TTS) deep learning toolkit

Basic Information

  • Original Developer: Coqui AI (Company closed in December 2023)
  • Current Maintenance: Community fork by Idiap Research Institute
  • Country/Region: Germany (original company) / Switzerland (Idiap)
  • GitHub: https://github.com/coqui-ai/TTS (original) / https://github.com/idiap/coqui-ai-TTS (community-maintained)
  • Type: Open-source Text-to-Speech (TTS) deep learning toolkit
  • First Release: 2021
  • Latest Version: 0.27.3 (January 2026)
  • License: MPL 2.0 (framework); some models carry separate licenses (for example, XTTS-v2 is released under the Coqui Public Model License)
  • HuggingFace: https://huggingface.co/coqui/XTTS-v2

Product Description

Coqui TTS is a widely used open-source text-to-speech framework built on deep learning. Coqui AI announced its closure in December 2023, but the project continues as a community fork maintained by the Idiap Research Institute. The toolkit supports multilingual speech synthesis, voice cloning from about 6 seconds of reference audio, cross-lingual voice transfer, and emotion and style transfer from reference audio.

Core Features

  • Multilingual Support: Natural speech synthesis in multiple languages
  • Voice Cloning: Clone a voice with just 6 seconds of audio
  • Cross-Lingual Voice Transfer: Transfer a voice from one language to another
  • Emotion/Style Transfer: Transfer emotions and speaking styles from reference audio
  • XTTS-v2: Flagship model supporting high-quality multilingual speech synthesis
  • Local Deployment: Runs entirely on the user's own hardware, so text and audio stay on the machine and no network round-trip is needed
  • Multi-Model Architecture: Supports several model architectures, including Tacotron2, VITS, and Glow-TTS
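As an illustration of the voice cloning feature listed above, the following is a minimal sketch using the toolkit's Python API (`TTS.api.TTS` and its `tts_to_file` method). It assumes the package is installed (for the community fork, typically via `pip install coqui-tts`) and that a short reference recording is available. The helper name `clone_voice` and its input checks are illustrative and not part of the library.

```python
from pathlib import Path

# Identifier of the XTTS-v2 model in the Coqui TTS model catalog.
XTTS_V2 = "tts_models/multilingual/multi-dataset/xtts_v2"


def clone_voice(text: str, reference_wav: str, language: str, out_path: str) -> str:
    """Synthesize `text` in the voice heard in `reference_wav` and write a WAV file.

    `clone_voice` is a hypothetical helper for this sketch, not a library function.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    if not Path(reference_wav).is_file():
        raise FileNotFoundError(f"reference audio not found: {reference_wav}")

    # Imported lazily so the input checks above work without the package installed.
    import torch
    from TTS.api import TTS

    device = "cuda" if torch.cuda.is_available() else "cpu"
    tts = TTS(XTTS_V2).to(device)  # the model is downloaded on first use
    tts.tts_to_file(
        text=text,
        speaker_wav=reference_wav,  # short sample of the voice to clone
        language=language,          # language code of the output, e.g. "en"
        file_path=out_path,
    )
    return out_path
```

A call such as `clone_voice("Hello there.", "reference.wav", "en", "out.wav")` would then write the synthesized audio to `out.wav`. Passing a different `language` code with the same reference file is how cross-lingual voice transfer is usually exercised with this model.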

Supported Model Architectures

  • Tacotron / Tacotron2
  • Glow-TTS
  • VITS / VITS2
  • FastSpeech / FastSpeech2
  • YourTTS
  • XTTS / XTTS-v2
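Pretrained checkpoints for these architectures are addressed by four-part identifiers of the form `<model_type>/<language>/<dataset>/<model_name>`. The sketch below splits such an identifier into its parts; the `ModelId` type and `parse_model_id` function are hypothetical helpers written for this example, while the identifiers in `EXAMPLES` are entries from the project's model catalog.

```python
from typing import NamedTuple


class ModelId(NamedTuple):
    """Parts of a catalog identifier (hypothetical helper type)."""
    model_type: str  # e.g. "tts_models"
    language: str    # e.g. "en" or "multilingual"
    dataset: str     # e.g. "ljspeech"
    name: str        # e.g. "vits"


def parse_model_id(model_id: str) -> ModelId:
    """Split 'type/language/dataset/name' into its four components."""
    parts = model_id.split("/")
    if len(parts) != 4 or not all(parts):
        raise ValueError(f"expected 'type/language/dataset/name', got {model_id!r}")
    return ModelId(*parts)


# Catalog entries covering several of the architectures listed above.
EXAMPLES = [
    "tts_models/en/ljspeech/tacotron2-DDC",
    "tts_models/en/ljspeech/glow-tts",
    "tts_models/en/ljspeech/vits",
    "tts_models/multilingual/multi-dataset/your_tts",
    "tts_models/multilingual/multi-dataset/xtts_v2",
]

if __name__ == "__main__":
    for example in EXAMPLES:
        parsed = parse_model_id(example)
        print(f"{parsed.name:15s} language={parsed.language} dataset={parsed.dataset}")
```

Any of these strings can be passed as the model name when constructing `TTS.api.TTS`, which then fetches the matching checkpoint.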

Business Model

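  • Framework Free and Open Source: MPL 2.0 license
  • Model Licensing: Some pre-trained models carry separate usage restrictions; XTTS-v2, for example, is released under the Coqui Public Model License, which restricts commercial use
  • Commercial Use: The MPL 2.0 license permits commercial use of the framework; each model's own license should be checked before it is used in a product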

Current Status

  • Company has closed, but the open-source project continues to be maintained
  • Community fork by Idiap Research Institute maintains regular updates
  • Development pace has slowed compared to the company era
  • Still one of the most popular open-source TTS frameworks in 2026

Relationship with OpenClaw Ecosystem

Coqui TTS is one of the supported open-source speech synthesis solutions in the OpenClaw ecosystem. For users who prioritize privacy and local deployment, OpenClaw can use Coqui TTS to synthesize high-quality speech locally, without relying on cloud APIs. The multilingual support and voice cloning capabilities of the XTTS-v2 model let OpenClaw agents speak to users in personalized voices.