Voice Recognition to Text Tool
The free, Open Source alternative to OpenAI, Claude and others
Open source text-to-speech tool, supports extra-long text
A nearly-live implementation of OpenAI's Whisper
1 min voice data can also be used to train a good TTS model
OpenVINO™ Toolkit repository
AI teacher that lives as a buddy next to your cursor
SOTA discrete acoustic codec models with 40/75 tokens per second
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Controllable & emotion-expressive zero-shot TTS
The official Python SDK for the ElevenLabs API
Qwen3-ASR is an open-source series of ASR models
Video translation and dubbing tool powered by LLMs
Real-time voice interactive digital human
Generate audiobooks from e-books, voice cloning & 1107+ languages
Converts text to speech in realtime
Offline inference engine for art, real-time voice conversations
Dicio assistant app for Android
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Official PyTorch Implementation
Open source assistive on-screen keyboard that runs on Windows
Speakr is a personal, self-hosted web application
Open source AI VTuber platform with voice chat and Live2D avatars
The python library for real-time communication
Repo of Qwen2-Audio chat & pretrained large audio language model