High-Performance Face Recognition Library on PaddlePaddle & PyTorch
A collection of scientific methods, processes, algorithms
Collection of Kaggle Solutions and Ideas
The SOTA Open-Source Browser Agent
Open Source Deep Research Alternative to Reason and Search
slime is an LLM post-training framework for RL Scaling
A simple yet powerful agent framework for personal assistants
Photorealistic Synthetic Dataset for Holistic Indoor Scene
FAIR Sequence Modeling Toolkit 2
ICLR 2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Machine Learning Pipelines for Kubeflow
Hindsight: Agent Memory That Learns
An AI for Music Generation
Implementation of Vision Transformer, a simple way to achieve SOTA
The largest collection of PyTorch image encoders / backbones
Benchmarking Multimodal Agents for Open-Ended Tasks
MiniSom is a minimalistic implementation of Self-Organizing Maps
Interpretable prompting and models for NLP
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
AI agents running research on single-GPU nanochat training
Building a Secure and Interoperable Future for AI-Driven Payments
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
An AI-powered security review GitHub Action using Claude
Evals is a framework for evaluating LLMs and LLM systems