Implementation of Make-A-Video, new SOTA text to video generator
AI tool for automatic batch short video creation and editing
Official Python inference and LoRA trainer package
Open-Sora: Democratizing Efficient Video Production for All
Multimodal-Driven Architecture for Customized Video Generation
The python library for real-time communication
Generate blog articles from video or audio
Synchronized Translation for Videos
The media player for language learning, with dual subtitles
Framework for building real-time voice and multimodal AI agents
AI-powered tool for generating, optimizing, and translating subtitles
Build Vision Agents quickly with any model or video provider
Large Multimodal Models for Video Understanding and Editing
A suite of advanced multi-modal LLMs
A python tool that uses GPT-4, FFmpeg, and OpenCV
AI-assisted storyboard and video generation tool
Search all of YouTube from the command line
Code and models for ICML 2024 paper, NExT-GPT
Open source text-to-speech tool, supports extra-long text
Instantly generate AI-powered subtitles on your device
Code for running inference and finetuning with SAM 3 model
AI framework for automated short video creation and editing tools
Self-hosted collection of powerful web-based tools for everyday tasks
Workflow and speech recognition app
Multimodal embedding and reranking models built on Qwen3-VL