Skip to content
View Shuai-Xie's full-sized avatar
🌏
Exploring
🌏
Exploring
Block or Report

Block or report Shuai-Xie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Odyssey: Empowering Agents with Open-World Skills

Python 114 2 Updated Jul 24, 2024

Agentic components of the Llama Stack APIs

Python 1,392 134 Updated Jul 26, 2024

A family of compressed models obtained via pruning and knowledge distillation

62 5 Updated Jul 25, 2024

Container plugin for Slurm Workload Manager

C 268 30 Updated Jul 23, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,060 2,309 Updated Jul 26, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,782 167 Updated Jul 25, 2024

Grok open release

Python 49,207 8,311 Updated May 29, 2024

Tmux configuration, that supercharges your tmux to build cozy and cool terminal environment

Shell 2,125 496 Updated Jul 10, 2024

Rectified Rotary Position Embeddings

Python 324 27 Updated May 20, 2024

Llama3、Llama3.1 中文仓库(聚合资料,各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Python 3,432 278 Updated Jul 25, 2024

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Python 2,342 415 Updated Jul 26, 2024

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 341 20 Updated Jul 26, 2024

QQQ is an innovative and hardware-optimized W4A8 quantization solution.

Python 31 2 Updated Jul 24, 2024

Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"

Python 329 30 Updated Feb 24, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,268 112 Updated Apr 17, 2024

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Python 145 12 Updated Jul 4, 2024

A pytorch quantization backend for optimum

Python 679 39 Updated Jul 26, 2024

A quantization algorithm for LLM

Cuda 87 5 Updated Jun 21, 2024

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 173 15 Updated Jul 23, 2024

对llama3进行全参微调、lora微调以及qlora微调。

Python 76 5 Updated Jul 25, 2024
C++ 209 25 Updated Jul 15, 2024

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

Python 375 22 Updated Jun 2, 2024

A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..

Python 139 4 Updated May 27, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,397 357 Updated Jul 26, 2024

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

651 41 Updated May 8, 2024

FacTool: Factuality Detection in Generative AI

Python 788 60 Updated Jul 19, 2024

Must-read Papers on LLM Agents.

1,497 80 Updated Jul 8, 2024
Python 1,416 121 Updated Jul 18, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 427 26 Updated May 20, 2024

[ACL'24 Oral] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 315 13 Updated Jul 9, 2024
Next