Starred repositories
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…
Material for cuda-mode lectures
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
A collection of benchmarks to measure basic GPU capabilities
A llama3 implementation, one matrix multiplication at a time
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Standalone Flash Attention v2 kernel without libtorch dependency
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
A Native-PyTorch Library for LLM Fine-tuning
Awesome LLM compression research papers and tools.
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, continuous batching, FlashAttention, PagedAttention, etc.
A general 2-8 bit quantization toolbox with GPTQ/AWQ/HQQ, with easy export to ONNX/ONNX Runtime.
wyf9661 / typora-free
Forked from zogodo/typora-0.11.18
typora-0.11.18 (last free version)
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 A list of projects by independent developers in China -- sharing what everyone is working on
Accessible large language models via k-bit quantization for PyTorch.
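This is the bitsandbytes library; the common entry point today is through transformers' `BitsAndBytesConfig`. A minimal sketch of loading a model with 4-bit NF4 weights (the model id is a placeholder, not a recommendation):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 4-bit NF4 weight quantization with fp16 compute, via bitsandbytes.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

# Placeholder model id; any causal LM on the Hub works the same way.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
```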
Code associated with the paper "Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding"
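The loop behind the paper's title is simple to state: draft a few tokens cheaply (in the paper, by skipping layers of the same model), then verify them with one full forward pass and keep the matching prefix. A minimal greedy sketch, with `draft_step` and `target_step` as hypothetical stand-ins for the cheap and full models:

```python
def speculative_generate(tokens, draft_step, target_step, k=4, max_new=64):
    """Greedy draft-&-verify loop. `tokens` is a list of token ids."""
    target_len = len(tokens) + max_new
    while len(tokens) < target_len:
        # Draft phase: propose k tokens autoregressively with the cheap model.
        ctx = list(tokens)
        draft = []
        for _ in range(k):
            t = draft_step(ctx)
            draft.append(t)
            ctx.append(t)
        # Verify phase: one full forward pass scores the drafted span.
        # verified[i] is the full model's greedy token after tokens + draft[:i],
        # so it has length k + 1.
        verified = target_step(tokens, draft)
        n = 0
        while n < k and draft[n] == verified[n]:
            n += 1
        tokens.extend(draft[:n])    # accepted prefix
        tokens.append(verified[n])  # bonus token from the full pass
    return tokens
```

Because verification is exact-match against the full model's greedy choice, the output is identical to plain decoding, which is what makes the acceleration lossless.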
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
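The core trick is small enough to sketch: keep the KV entries of the first few tokens (the "attention sinks") plus a window of the most recent tokens, and evict everything in between. A toy eviction policy over a list-like cache; the real repo operates on per-layer key/value tensors:

```python
def evict_kv_cache(cache, n_sink=4, window=1020):
    """Toy StreamingLLM-style eviction: keep the first n_sink entries
    (attention sinks) and the most recent `window` entries, drop the middle.
    `cache` is a list of per-token KV entries for illustration only."""
    if len(cache) <= n_sink + window:
        return cache
    return cache[:n_sink] + cache[-window:]
```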
A high-throughput and memory-efficient inference and serving engine for LLMs
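This is vLLM; its offline API is a few lines. A sketch with a placeholder model id and arbitrary sampling settings:

```python
from vllm import LLM, SamplingParams

# Placeholder model id; vLLM batches requests continuously and pages the
# KV cache (PagedAttention) under the hood.
llm = LLM(model="meta-llama/Llama-2-7b-hf")
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

outputs = llm.generate(["Explain PagedAttention in one sentence."], params)
print(outputs[0].outputs[0].text)
```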
A CMake toolchain file for iOS/iPadOS, visionOS, macOS, watchOS & tvOS C/C++/Obj-C++ development
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
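NVTX ranges bracket regions of code so they show up named on Nsight Systems timelines. The SDK itself is C, but PyTorch ships thin bindings in `torch.cuda.nvtx`; a sketch (requires a CUDA device):

```python
import torch

x = torch.randn(1024, 1024, device="cuda")

# The pushed range appears as a named span around the matmul in the profiler.
torch.cuda.nvtx.range_push("matmul")
y = x @ x
torch.cuda.nvtx.range_pop()
```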
The official implementation of the EMNLP 2023 paper LLM-FP4
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
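A quantization sketch following AutoAWQ's documented flow; the model path, output path, and exact config keys are assumptions and may shift between releases:

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "meta-llama/Llama-2-7b-hf"   # placeholder
quant_path = "llama-2-7b-awq"             # placeholder output dir
quant_config = {"zero_point": True, "q_group_size": 128,
                "w_bit": 4, "version": "GEMM"}

# Load fp16 weights, run AWQ calibration, and save 4-bit weights.
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```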