- Zihuatanejo
- 08:40 (UTC +08:00)
- https://nagi.fun/
- @Nag1ovo
- in/jesse-zhang-83bb20291
LLM
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
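The core of byte-pair encoding is a simple loop: count adjacent token pairs, merge the most frequent pair into a new token, and repeat. A minimal byte-level sketch (a hypothetical `bpe_train` helper for illustration, not the repo's actual code):

```python
from collections import Counter

def bpe_train(text: str, num_merges: int):
    """Learn BPE merges over a UTF-8 byte sequence (minimal sketch)."""
    ids = list(text.encode("utf-8"))  # start from raw bytes (ids 0..255)
    merges = {}                       # (a, b) -> new token id
    next_id = 256
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]  # most frequent adjacent pair
        merges[(a, b)] = next_id
        # replace every occurrence of the pair with the new token id
        out, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and ids[i] == a and ids[i + 1] == b:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
        next_id += 1
    return ids, merges
```

Production tokenizers such as tiktoken implement the same idea with precomputed merge tables and much faster data structures.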
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
LlamaIndex is a data framework for your LLM applications
A lightweight, standalone C++ inference engine for Google's Gemma models.
The official PyTorch implementation of Google's Gemma models
A repo for pretraining a small-parameter Chinese LLaMa2 from scratch plus SFT; a single 24 GB GPU is enough to train a chat-llama2 with basic Chinese Q&A ability.
Modeling, training, eval, and inference code for OLMo
A universal local knowledge base solution based on a vector database and GPT-3.5.
A fast cross-platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮
A from-scratch implementation of a sparse mixture-of-experts language model, inspired by Andrej Karpathy's makemore :)
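The routing step that makes a mixture of experts "sparse" is small enough to sketch: a linear router scores each expert, only the top-k experts run, and their outputs are combined with renormalized gate weights. A toy illustration with a hypothetical `moe_forward` and scalar-output experts (not the repo's implementation):

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, gate_w, experts, top_k=2):
    """Sparse MoE layer sketch: route input x to its top-k experts.

    gate_w:  one router weight vector per expert (hypothetical linear router)
    experts: list of callables, each mapping x -> scalar output
    """
    # router logits: dot product of the input with each expert's gate vector
    logits = [sum(xi * wi for xi, wi in zip(x, w)) for w in gate_w]
    probs = softmax(logits)
    # keep only the top-k experts and renormalize their gate weights
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in top)
    # weighted sum of the selected experts' outputs; the rest never run
    return sum(probs[i] / norm * experts[i](x) for i in top)
```

In a real model the experts are feed-forward blocks and the router is trained jointly, often with an auxiliary loss to keep expert load balanced.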
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
An unnecessarily tiny implementation of GPT-2 in NumPy.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
LLM fine-tuning with PEFT
<Beat AI>, also known as <零生万物>, is an AI primer written for software development engineers that walks you through building AI by hand. From neural networks to large models, from high-level design to low-level principles, from engineering implementation to algorithms: after finishing it, you'll find AI isn't as unattainable or unbeatable as you imagined. Just beat it!
Chinese NLP solutions (large models, data, models, training, inference)
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Doing simple retrieval from LLMs at various context lengths to measure accuracy
Video+code lecture on building nanoGPT from scratch
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence