Long-form streaming TTS system for multi-speaker dialogue generation
The knowledge and task management backbone for AI coding assistants
LTX-Video Support for ComfyUI
Solve end-to-end problems using the Llama model family
Framework to easily create LLM-powered bots over any dataset
An open source implementation of CLIP
Deep learning library
GLM-4 series: Open Multilingual Multimodal Chat LMs
A Powerful Native Multimodal Model for Image Generation
Replace OpenAI GPT with another LLM in your app
Tool for visualizing and tracking your machine learning experiments
Tutorial tailored for Chinese beginners on rapid fine-tuning
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
Concatenate a directory full of files into a single prompt
Library for OCR-related tasks powered by deep learning
An MCP server for interacting with Google Colab
Framework for building AI agents that automate complex web tasks
Local RAG engine for private multimodal knowledge search on devices
Collection of Kaggle Solutions and Ideas
An agentless approach to automatically solve software development problems
A new open-source framework to build and deploy intelligent agents
Follow along with my AI Agents Masterclass videos
4M: Massively Multimodal Masked Modeling
PyTorch code and models for V-JEPA self-supervised learning from video
The Unified Machine Learning Framework