Shuai-Xie

🌏

Exploring

Shuai Xie Shuai-Xie

🌏

Exploring

Keep your pace

115 followers · 115 following

JD.com
北京市
https://shuai-xie.github.io/

Achievements

Block or Report

Block or report Shuai-Xie

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (7)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

zju-vipa / Odyssey

Odyssey: Empowering Agents with Open-World Skills

Python 114 2 Updated Jul 24, 2024

meta-llama / llama-agentic-system

Agentic components of the Llama Stack APIs

Python 1,392 134 Updated Jul 26, 2024

NVlabs / Minitron

A family of compressed models obtained via pruning and knowledge distillation

62 5 Updated Jul 25, 2024

NVIDIA / pyxis

Container plugin for Slurm Workload Manager

C 268 30 Updated Jul 23, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,060 2,309 Updated Jul 26, 2024

microsoft / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,782 167 Updated Jul 25, 2024

xai-org / grok-1

Grok open release

Python 49,207 8,311 Updated May 29, 2024

samoshkin / tmux-config

Tmux configuration, that supercharges your tmux to build cozy and cool terminal environment

Shell 2,125 496 Updated Jul 10, 2024

bojone / rerope

Rectified Rotary Position Embeddings

Python 324 27 Updated May 20, 2024

CrazyBoyM / llama3-Chinese-chat

Llama3、Llama3.1 中文仓库（聚合资料，各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

Python 3,432 278 Updated Jul 25, 2024

huggingface / optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Python 2,342 415 Updated Jul 26, 2024

NVIDIA / TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 341 20 Updated Jul 26, 2024

HandH1998 / QQQ

QQQ is an innovative and hardware-optimized W4A8 quantization solution.

Python 31 2 Updated Jul 24, 2024

Cornell-RelaxML / QuIP

Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"

Python 329 30 Updated Feb 24, 2024

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,268 112 Updated Apr 17, 2024

Infini-AI-Lab / TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Python 145 12 Updated Jul 4, 2024

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 679 39 Updated Jul 26, 2024

bytedance / decoupleQ

A quantization algorithm for LLM

Cuda 87 5 Updated Jun 21, 2024

jy-yuan / KIVI

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 173 15 Updated Jul 23, 2024

taishan1994 / Llama3.1-Finetuning

对llama3进行全参微调、lora微调以及qlora微调。

Python 76 5 Updated Jul 25, 2024

pcg-mlp / KsanaLLM

C++ 209 25 Updated Jul 15, 2024

Leeroo-AI / mergoo

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

Python 375 22 Updated Jun 2, 2024

Macaronlin / LLaMA3-Quantization

A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..

Python 139 4 Updated May 27, 2024

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,397 357 Updated Jul 26, 2024