Interface for OuteTTS models
EPUB to audiobook converter, optimized for Audiobookshelf
Production-ready toolkit to run AI locally
Framework for building neural networks
AI-powered tool for generating, optimizing, and translating subtitles
Instant voice cloning by MIT and MyShell. Audio foundation model
Workflow and speech recognition app
LLM-based Reinforcement Learning audio edit model
Automagically synchronize subtitles with video
Large language model for a livestream sales anchor
Textream is a free macOS teleprompter app for streamers, interviewers
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
A GPT-4o-level MLLM for Vision, Speech and Multimodal Live Streaming
Subtitle Creation Assistant
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Scalable generative AI framework built for researchers and developers
Lightning-fast, on-device TTS, running natively via ONNX
Replace OpenAI GPT with another LLM in your app
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
A sound cloning tool with a web interface, using your voice
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Build your own AI friend
Transcribe on your own
NLP Cloud serves high performance pre-trained or custom models