Skip to content
View Orion-Zheng's full-sized avatar

Highlights

  • Pro

Block or report Orion-Zheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

C implementation of gRPC layered on top of core library

C 220 59 Updated Feb 6, 2019

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

370 9 Updated Oct 6, 2024
Python 116 3 Updated Jun 23, 2024

Robust recipes to align language models with human and AI preferences

Python 4,544 393 Updated Sep 23, 2024

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

122 5 Updated Jun 12, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,903 408 Updated Sep 6, 2024
Python 310 16 Updated Jul 16, 2024

The official evaluation suite and dynamic data release for MixEval.

Python 209 31 Updated Sep 29, 2024
Python 294 23 Updated Apr 6, 2023

A generative speech model for daily dialogue.

Python 31,233 3,386 Updated Sep 21, 2024

learning notes when learning the source code of pytorch

24 7 Updated Apr 3, 2019

React web interface for the OpenDota platform

JavaScript 1,088 392 Updated Sep 14, 2024

Python tools for Dota 2

Protocol Buffer 115 36 Updated Sep 27, 2019

Dota 2 replay knowledge in book form.

24 4 Updated Apr 30, 2014

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,852 342 Updated Oct 4, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

296 11 Updated Apr 18, 2024

Custom console scripts for Dota 2.

Python 88 10 Updated Apr 16, 2014

Open-Sora: Democratizing Efficient Video Production for All

Python 21,771 2,110 Updated Aug 9, 2024

AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。

162 3 Updated Mar 26, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 959 79 Updated Jul 23, 2024

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter Notebook 1,319 156 Updated Sep 6, 2024

Longitudinal Evaluation of LLMs via Data Compression

Python 25 Updated May 29, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,598 176 Updated Oct 6, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,498 236 Updated May 1, 2024

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 828 42 Updated Sep 19, 2024

[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)

Python 22 2 Updated Feb 27, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,080 839 Updated Jul 1, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,563 5,758 Updated Aug 19, 2024

Implementation of DoRA

Python 278 18 Updated Jun 7, 2024
Next