A simple C++11 Thread Pool implementation

C++ 7,820 2,233 Updated Jul 20, 2024

Reference implementations of MLPerf™ inference benchmarks

Python 1,188 519 Updated Sep 17, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,314 1,081 Updated Sep 2, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,851 1,532 Updated Sep 17, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,415 306 Updated Jan 4, 2024

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,108 213 Updated Sep 7, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,803 803 Updated Aug 15, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,329 461 Updated Aug 19, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,334 178 Updated Jul 16, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 132,132 26,323 Updated Sep 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 26,871 3,946 Updated Sep 18, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 30,912 3,812 Updated Sep 16, 2024

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,334 1,547 Updated Jul 10, 2024

The Mojo Programming Language

Mojo 22,888 2,578 Updated Sep 16, 2024

Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators

C 1,519 219 Updated Aug 28, 2019

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,169 475 Updated Sep 18, 2024

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,579 985 Updated Sep 18, 2024

Seamless operability between C++11 and Python

C++ 15,430 2,080 Updated Sep 17, 2024

A code repository for a PyTorch C++ (LibTorch) tutorial.

C++ 725 122 Updated Nov 2, 2021

Keyword spotting on Arm Cortex-M Microcontrollers

C 1,123 414 Updated Apr 10, 2019

Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769

Jupyter Notebook 121 33 Updated Apr 29, 2022

Transformer related optimization, including BERT, GPT

C++ 5,776 882 Updated Mar 27, 2024

Visualizer for Valgrind Massif data files

C++ 304 18 Updated Sep 18, 2024

WebRTC audio processing

C++ 375 136 Updated May 10, 2020

Rocket Chip Generator

Scala 3,166 1,115 Updated Sep 17, 2024

Nuclei Microcontroller Software Interface Standard Development Repo

C 60 16 Updated Jul 1, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,632 7,980 Updated Sep 10, 2024

FLOPs counter for convolutional networks in the PyTorch framework

Python 2,783 309 Updated Jul 16, 2024

FFT generator using Chisel

Verilog 55 17 Updated Sep 26, 2021