Cross-platform AI language practice app
A simple, high-quality voice conversion tool focused on ease of use
StreamSpeech is a seamless model for offline speech recognition
TTS with Kokoro and ONNX Runtime
Automatic Speech Recognition with Word-level Timestamps
Port of OpenAI's Whisper model in C/C++
Open Source Speech Language Model
Offline text-to-speech synthesis for Python
Spark-TTS Inference Code
PersonaPlex code
Industrial-level controllable zero-shot text-to-speech system
Qwen3-TTS is an open-source series of TTS models
Multilingual Automatic Speech Recognition with word-level timestamps
The behavior guidance framework for customer-facing LLM agents
A TTS that fits in your CPU (and pocket)
Speech recognition for your site
A fast TTS architecture with conditional flow matching
Long-form streaming TTS system for multi-speaker dialogue generation
Audio foundation model excelling in audio understanding
Open-source multi-speaker long-form text-to-speech model
A high-quality rapid TTS voice cloning model
End-to-end speech processing toolkit
Captcha solver extension for humans
Fast multimodal LLM for real-time voice interaction and AI apps
Faster Whisper transcription with CTranslate2