Lei Zhang leeeizhang

🚀

Ignition sequence start

CS student focused on LLM systems and AI agents.

35 followers · 154 following

Starred repositories

microsoft / vidur

A large-scale simulation framework for LLM inference

Python · 243 stars · 28 forks · Updated Oct 1, 2024

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript · 5,809 stars · 544 forks · Updated Oct 3, 2024

traceloop / openllmetry

Open-source observability for your LLM application, based on OpenTelemetry

Python · 1,895 stars · 176 forks · Updated Oct 3, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python · 3,114 stars · 159 forks · Updated Oct 3, 2024

alibaba / BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ · 802 stars · 160 forks · Updated Aug 28, 2024

NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python · 1,848 stars · 149 forks · Updated Sep 25, 2024

pytorch / PiPPy

Pipeline Parallelism for PyTorch

Python · 715 stars · 86 forks · Updated Aug 21, 2024

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python · 9,344 stars · 493 forks · Updated Oct 2, 2024

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,470 stars · 143 forks · Updated Sep 25, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python · 5,390 stars · 393 forks · Updated Oct 3, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python · 1,117 stars · 113 forks · Updated Oct 3, 2024

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,067 stars · 22 forks · Updated Jul 31, 2024

bytedance / byteir

A model compilation solution for various hardware

MLIR · 362 stars · 38 forks · Updated Sep 30, 2024

MLSysOps / Code-Agent-Survey

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

10 stars · Updated Aug 20, 2024

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python · 3,238 stars · 205 forks · Updated Oct 3, 2024

pytorch / tensordict

TensorDict is a pytorch dedicated tensor container.

Python · 816 stars · 66 forks · Updated Oct 2, 2024

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook · 2,609 stars · 261 forks · Updated Oct 1, 2024

siliconflow / triton

Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ · 1 star · 1 fork · Updated Apr 4, 2024

microsoft / triton-shared

Shared Middle-Layer for Triton Compilation

MLIR · 163 stars · 34 forks · Updated Oct 2, 2024

meta-llama / llama-stack-apps

Agentic components of the Llama Stack APIs

Python · 3,625 stars · 468 forks · Updated Oct 3, 2024

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ · 5,448 stars · 923 forks · Updated Sep 25, 2024

pytorch-labs / applied-ai

Applied AI experiments and examples for PyTorch

Python · 138 stars · 12 forks · Updated Sep 30, 2024

Qualcomm-AI-research / FP8-quantization

Python · 114 stars · 8 forks · Updated Mar 9, 2023

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ · 1,176 stars · 486 forks · Updated Oct 3, 2024

neuralmagic / AutoFP8

Python · 151 stars · 17 forks · Updated Oct 1, 2024

pytorch-labs / float8_experimental

This repository contains the experimental PyTorch native float8 training UX

Python · 212 stars · 20 forks · Updated Aug 1, 2024

Doragd / Algorithm-Practice-in-Industry

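A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)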

Python · 2,258 stars · 288 forks · Updated Oct 3, 2024

bytedance / flux

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ · 198 stars · 13 forks · Updated Sep 18, 2024

e2b-dev / awesome-ai-agents

A list of AI autonomous agents

9,938 stars · 725 forks · Updated Sep 28, 2024

mit-han-lab / qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python · 406 stars · 19 forks · Updated Sep 5, 2024

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack