Clone a voice in 5 seconds to generate arbitrary speech in real time
A sound cloning tool with a web interface, using your voice
Instant voice cloning by MIT and MyShell. Audio foundation model
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
1 minute of voice data can also be used to train a good TTS model
The open-source voice synthesis studio powered by Qwen3-TTS
A high-quality rapid TTS voice cloning model
High-Quality Voice Cloning TTS for 600+ Languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Generate audiobooks from e-books, voice cloning & 1107+ languages
Industrial-level controllable zero-shot text-to-speech system
Official PyTorch Implementation
One-stop AI digital human system with video voice synthesis tools
A lightweight text-to-speech model with zero-shot voice cloning
Real-time voice interactive digital human
Tokenizer-Free TTS for Multilingual Speech Generation
Foundational model for human-like, expressive TTS
Spark-TTS Inference Code
Open-source framework for intelligent speech interaction
The official Python SDK for the ElevenLabs API
Video translation and dubbing tool powered by LLMs
Multilingual large voice generation model, providing inference
Controllable & emotion-expressive zero-shot TTS
ComfyUI integration for Microsoft's VibeVoice text-to-speech model