📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,659 182 Updated Oct 15, 2024

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 2,902 359 Updated Sep 26, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 32,516 3,981 Updated Oct 15, 2024

microsoft / autogen

A programming framework for agentic AI 🤖

C# 31,986 4,653 Updated Oct 16, 2024

public-apis / public-apis

A collective list of free APIs

Python 315,508 33,636 Updated Sep 25, 2024

liguodongiot / llm-action

本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）

HTML 9,604 944 Updated Oct 13, 2024

jackaduma / awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

1,158 265 Updated Dec 14, 2023

elviswf / DeepLearningBookQA_cn

深度学习面试问题回答对应的DeepLearning中文版页码

866 193 Updated Nov 2, 2017

songyingxin / NLPer-Interview

该仓库主要记录 NLP 算法工程师相关的面试题

2,598 506 Updated Apr 12, 2022

PPshrimpGo / AIinterview

算法工程师面试题整理

875 145 Updated Jan 14, 2022

Mikoto10032 / DeepLearning

深度学习入门教程, 优秀文章, Deep Learning Tutorial

Jupyter Notebook 14,249 3,527 Updated Apr 21, 2022

Rayrtfr / fastertransformer_backend

Forked from void-main/fastertransformer_backend

Python 9 Updated Jan 23, 2024

Rayrtfr / FasterTransformer

Forked from void-main/FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 17 1 Updated Jul 29, 2023

LlamaFamily / Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

Python 13,827 1,240 Updated Sep 5, 2024

youngyangyang04 / leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Shell 51,226 11,415 Updated Oct 16, 2024

bojone / NBCE

Naive Bayes-based Context Extension

Python 311 22 Updated May 31, 2023

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,285 1,865 Updated Apr 30, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 7,863 755 Updated Oct 16, 2024

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 539 60 Updated May 9, 2024

zhulei227 / ML_Notes

机器学习算法的公式推导以及numpy实现

Jupyter Notebook 2,011 478 Updated May 2, 2023

FMInference / FlexiGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,169 547 Updated Oct 8, 2024

facebookresearch / metaseq

Repo for external large-scale work

Python 6,474 723 Updated Apr 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

whcisci

Achievements

Achievements

Block or report whcisci

Lists (1)

llm

Stars

xianshang33 / llm-paper-daily

HuangLK / transpeeder

CoinCheung / gdGPT

wdndev / llm_interview_note

amusi / Deep-Learning-Interview-Book

bighuang624 / AI-research-tools

sweetice / Deep-reinforcement-learning-with-pytorch

lyhue1991 / eat_pytorch_in_20_days

DefTruth / Awesome-LLM-Inference