Stars
C implementation of gRPC layered on top of the core library
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals
Robust recipes to align language models with human and AI preferences
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
The official evaluation suite and dynamic data release for MixEval.
A generative speech model for daily dialogue.
Notes taken while reading the PyTorch source code
microsoft / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM. Ongoing research training transformer language models at scale, including: BERT & GPT-2
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
Custom console scripts for Dota 2.
Open-Sora: Democratizing Efficient Video Production for All
AI Infra mainly refers to AI infrastructure: full-stack, low-level AI technologies including AI chips, AI compilers, and AI inference and training frameworks.
Reaching LLaMA2 Performance with 0.1M Dollars
Benchmark LLMs by making them fight in Street Fighter 3! A new way to evaluate the quality of an LLM
Longitudinal Evaluation of LLMs via Data Compression (a bits-per-byte sketch follows this list)
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Code examples and resources for DBRX, a large language model developed by Databricks
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (see the merge-loop sketch after this list).
The simplest, fastest repository for training/finetuning medium-sized GPTs.
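The compression-based evaluation entry above rests on a standard identity: a model's negative log-likelihood on a text is the code length an arithmetic coder driven by that model would spend, so models can be compared by bits per byte on data from different time periods. A minimal sketch of that metric, assuming per-token natural-log probabilities have already been obtained from some model (the function name and the toy numbers are illustrative, not that repo's API):

```python
import math

def bits_per_byte(token_logprobs, text):
    # Total negative log-likelihood in nats, converted to bits,
    # normalized by the UTF-8 byte length of the scored text.
    nll_bits = -sum(token_logprobs) / math.log(2)
    return nll_bits / len(text.encode("utf-8"))

# Toy usage with made-up natural-log token probabilities:
print(bits_per_byte([-2.1, -0.7, -1.3], "hello"))  # ~1.18 bits/byte
```

Lower bits per byte means the model compresses the data better; tracked on text written after a model's training cutoff, it presumably gives the longitudinal signal the title refers to.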
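Likewise, the BPE entry names a fully specified algorithm: start from raw bytes and repeatedly merge the most frequent adjacent token pair. A minimal training-loop sketch, not that repo's actual code; `train_bpe`, `num_merges`, and the toy corpus are made up for illustration:

```python
from collections import Counter

def train_bpe(text, num_merges):
    ids = list(text.encode("utf-8"))        # start from raw bytes (ids 0..255)
    merges = {}                             # (left_id, right_id) -> new token id
    for new_id in range(256, 256 + num_merges):
        pairs = Counter(zip(ids, ids[1:]))  # frequency of each adjacent pair
        if not pairs:
            break
        top = max(pairs, key=pairs.get)     # most frequent pair wins the merge
        merges[top] = new_id
        out, i = [], 0
        while i < len(ids):                 # rewrite the sequence with the merge applied
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == top:
                out.append(new_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
    return merges

print(train_bpe("low lower lowest", num_merges=5))
```

Encoding new text then replays the learned merges in order; real tokenizers typically add vocabulary handling, special tokens, and regex pre-splitting on top of this core loop.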