IIE, CAS
Beijing, China
lijian.ac.cn
Stars
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
✨✨ Latest Papers and Benchmarks in Reasoning with Foundation Models
Unofficial PyTorch/🤗 Transformers (Gemma/Llama 3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction (see the rank-reduction sketch after this list)
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
👨‍💻 An awesome, curated list of the best code LLMs for research.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting (a toy prompt example follows this list)
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Awesome-LLM: a curated list of Large Language Model resources
General technology for enabling AI capabilities with LLMs and MLLMs
LLMs interview notes and answers: a repository of interview questions and reference answers for large language model (LLM) algorithm engineer roles
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and trained at low cost, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
This project shares the technical principles behind large language models together with hands-on experience (LLM engineering and putting LLM applications into production)
The official GitHub page for the survey paper "A Survey of Large Language Models".
Python code for the book Deep Learning (《深度学习》, the "flower book"): mathematical derivations, analysis of the underlying principles, and source-level code implementations
Introductory, advanced, and specialty deep learning courses, academic and industrial case studies, a deep learning knowledge encyclopedia, and an interview question bank: the courses, cases, and knowledge of deep learning and AI
[ICLR 2022] Official implementation of cosFormer attention from "cosFormer: Rethinking Softmax in Attention" (a linear-attention sketch follows this list)
Awesome LLM compression research papers and tools.
An annotated implementation of the Transformer paper.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning (see the LoRA sketch after this list).
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, … (a minimal WKV recurrence sketch follows this list)
Dive into Deep Learning (《动手学深度学习》): an interactive book for Chinese readers with runnable code and open discussion; its Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use. We welcome open-source enthusiasts…
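The layer-selective rank reduction (LASER) entry above hinges on one operation: replacing a single weight matrix with its truncated-SVD approximation. Below is a minimal PyTorch sketch of that operation under assumed names (`rank_reduce` is a hypothetical helper, not code from the paper's repository); the paper additionally searches over which layer and which matrix to reduce.

```python
# Minimal sketch of layer-selective rank reduction: keep only the top-k
# singular components of one weight matrix.
import torch

def rank_reduce(weight: torch.Tensor, k: int) -> torch.Tensor:
    """Return the best rank-k approximation of `weight` (Eckart-Young theorem)."""
    U, S, Vh = torch.linalg.svd(weight, full_matrices=False)
    return U[:, :k] @ torch.diag(S[:k]) @ Vh[:k, :]

W = torch.randn(1024, 4096)        # e.g. one MLP projection inside a block
W_low = rank_reduce(W, k=64)       # same shape, rank at most 64
print(W_low.shape, torch.linalg.matrix_rank(W_low).item())
```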
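For the chain-of-thought benchmarking entry above, the prompting trick itself is simple: each in-context demonstration spells out its intermediate reasoning before the final answer, which encourages the model to reason step by step on the new question. A toy prompt (the worked example is the classic one from the chain-of-thought literature):

```python
# Toy chain-of-thought prompt: the demonstration answer shows its reasoning,
# so the model tends to produce reasoning before its own final answer.
cot_prompt = """Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 tennis balls. 5 + 6 = 11. The answer is 11.

Q: The cafeteria had 23 apples. If they used 20 to make lunch and bought 6 more, how many apples do they have?
A:"""
print(cot_prompt)  # send this string to any LLM completion endpoint
```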
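The cosFormer entry above replaces softmax attention with a non-negative (ReLU) feature map plus a cosine re-weighting over relative positions; the identity cos(a − b) = cos a cos b + sin a sin b splits that re-weighting into two linear-attention terms. A minimal non-causal sketch with illustrative names, not the official ICLR 2022 code:

```python
# Minimal non-causal sketch of cosFormer-style linear attention.
import math
import torch

def cosformer_attention(q, k, v):
    """q, k, v: (batch, seq_len, dim) -> (batch, seq_len, dim)."""
    n = q.size(1)
    idx = torch.arange(n, dtype=q.dtype, device=q.device)
    w = (math.pi / 2) * idx / n                    # position-dependent angle
    q, k = torch.relu(q), torch.relu(k)            # non-negative feature map
    cos, sin = torch.cos(w)[None, :, None], torch.sin(w)[None, :, None]
    q_cos, q_sin, k_cos, k_sin = q * cos, q * sin, k * cos, k * sin
    # Linear attention: O(n * d^2) rather than the O(n^2 * d) of softmax.
    num = q_cos @ (k_cos.transpose(1, 2) @ v) + q_sin @ (k_sin.transpose(1, 2) @ v)
    den = q_cos @ k_cos.sum(1, keepdim=True).transpose(1, 2) \
        + q_sin @ k_sin.sum(1, keepdim=True).transpose(1, 2)
    return num / den.clamp(min=1e-6)               # normalize as softmax would

out = cosformer_attention(torch.rand(2, 128, 64), torch.rand(2, 128, 64),
                          torch.randn(2, 128, 64))
```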
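The parameter-efficient fine-tuning entries above (🤗 PEFT, Delta Tuning, the unified fine-tuning toolkit) all build on ideas like LoRA: freeze the pretrained weight and train only a low-rank additive update. A minimal sketch of that idea, assuming PyTorch and a hypothetical `LoRALinear` module, not the 🤗 PEFT API:

```python
# Minimal LoRA sketch: y = W x + (alpha / r) * B A x, with W frozen and
# only A, B trainable.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)   # frozen pretrained weight
        self.base.bias.requires_grad_(False)
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))  # zero init: no change at step 0
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

layer = LoRALinear(768, 768)
y = layer(torch.randn(2, 10, 768))   # only lora_A / lora_B receive gradients
```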
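Finally, the RWKV entry above claims GPT-style parallel training with RNN-style inference; the piece that makes RNN inference possible is the per-channel WKV recurrence, whose state is just two running sums. A minimal, numerically unstabilized sketch with illustrative names (the real kernel rescales the exponentials to avoid overflow):

```python
# Minimal WKV recurrence sketch (RWKV-4 style, no numerical stabilization).
import torch

def wkv_recurrent(k, v, w, u):
    """k, v: (seq_len, dim); w (log decay), u (first-token bonus): (dim,)."""
    T, d = k.shape
    num = torch.zeros(d)                      # running exp-weighted sum of values
    den = torch.zeros(d)                      # running sum of exp weights
    decay = torch.exp(-torch.exp(w))          # per-channel decay in (0, 1)
    out = []
    for t in range(T):
        out.append((num + torch.exp(u + k[t]) * v[t]) /
                   (den + torch.exp(u + k[t])))
        num = decay * num + torch.exp(k[t]) * v[t]
        den = decay * den + torch.exp(k[t])
    return torch.stack(out)

y = wkv_recurrent(torch.randn(16, 32), torch.randn(16, 32),
                  torch.zeros(32), torch.zeros(32))
```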