AI video generator optimized for low-VRAM and older GPUs
A natural language interface for computers
A simple, high-quality voice conversion tool focused on ease of use
A text-to-speech, speech-to-text and speech-to-speech library
Modular AI image and video generation web UI with extensible tools
OCR software, free and offline
The Clay Foundation Model - An open source AI model and interface
Open-Sora: Democratizing Efficient Video Production for All
A research prototype of a human-centered web agent
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
A simple screen-parsing tool towards a pure vision-based GUI agent
Evaluation and Tracking for LLM Experiments
Open source AI pair programmer for coding, debugging, automation
Collect, organize, use, and share, all in OmniBox
Implementation of Recurrent Interface Network (RIN)
Image polygonal annotation with Python
Gen-AI Chat for Teams
Python Stream Processing
Agent-ready RPA suite with a visual workflow automation engine
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Browser userscript that enhances ChatGPT reliability and usability
Unified terminal AI tool for exploring and editing codebases
Python library and CLI tool to interface with Google Translate
The terminal client for Ollama
Convert codebases into structured prompts optimized for LLM analysis