A high-quality rapid TTS voice cloning model
High-Quality Voice Cloning TTS for 600+ Languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Industrial-level controllable zero-shot text-to-speech system
One-stop AI digital human system with video voice synthesis tools
A lightweight text-to-speech model with zero-shot voice cloning
Real-time voice interactive digital human
Tokenizer-Free TTS for Multilingual Speech Generation
Foundational model for human-like, expressive TTS
Multi-lingual large voice generation model, providing inference
Controllable & emotion-expressive zero-shot TTS
MARS5 speech model (TTS) from CAMB.AI
Long-form streaming TTS system for multi-speaker dialogue generation
An open-source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Multi-Voice and Prompt-Controlled TTS Engine
A web UI for different audio-related neural networks
Dia-1.6B generates lifelike English dialogue and vocal expressions