Stars
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Entropy Based Sampling and Parallel CoT Decoding
A high-throughput and memory-efficient inference and serving engine for LLMs
A throughput-oriented high-performance serving framework for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Composable building blocks to build Llama Apps
A new local-first, privacy-focused and open-source home for your markdown notes
Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation