An LLM Compiler for Parallel Function Calling
A high-throughput and memory-efficient inference and serving engine
A Simple and Universal Swarm Intelligence Engine
Ongoing research training transformer models at scale
A state-of-the-art open visual language model
Development repository for the Triton language and compiler
Block Diffusion for Ultra-Fast Speculative Decoding
Stanford NLP Python library for many human languages
Large-language-model & vision-language-model based on Linear Attention
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Making large AI models cheaper, faster and more accessible
Parallax is a distributed model serving framework
Powerful framework for controlling Android and iOS devices
The official repository for ERNIE 4.5 and ERNIEKit
Language Model Reinforcement Learning Environments frameworks
Seamlessly integrate LLMs as Python functions
FAIR Sequence Modeling Toolkit 2
Build production-ready AI agents in both Python and Typescript
A Python library for extracting structured information
Fault-tolerant, highly scalable GPU orchestration
Chat language model that can use tools and interpret the results
A software construction tool
Your Automatic Prompt Engineering Assistant for GenAI Applications
Best practice TTS based on BERT and VITS