Stars
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Reference implementation for DPO (Direct Preference Optimization)
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
[LLMs九层妖塔] Shares hands-on practice and experience with LLMs across natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, and multimodal domains (Stable Diffusion, MiniGPT-4, VisualGLM-6B, Ziya-Visual, etc.).
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
Chinese-LLaMA 1&2 and Chinese-Falcon base models; the ChatFlow Chinese dialogue model; the Chinese OpenLLaMA model; NLP pretraining and instruction fine-tuning datasets.
Code for the paper "Closing the Curious Case of Neural Text Degeneration"
A collection of AWESOME things about mixture-of-experts
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Make huge neural nets fit in memory
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical LLMs, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
The first open-source, commercially usable dialogue model supporting bilingual (Chinese/English) speech-text multimodal conversation. Convenient speech input greatly improves the experience of text-input LLMs, while avoiding the cumbersome pipeline and potential errors of ASR-based solutions.
Train transformer language models with reinforcement learning.
Generative Agents: Interactive Simulacra of Human Behavior
Llama Chinese community: an online Llama3 demo and fine-tuned models are now open, with the latest Llama3 learning resources compiled in real time; all code has been updated for Llama3, building the best Chinese Llama LLM, fully open source and commercially usable.
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
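The DPO reference implementation starred above centers on a single closed-form objective from the DPO paper. As a minimal, self-contained sketch (not that repo's actual code; the function name and the scalar log-probabilities below are illustrative):

```python
import math

def dpo_loss(pi_logp_chosen, pi_logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """DPO loss for one preference pair.

    Inputs are summed token log-probabilities of the chosen/rejected
    responses under the trained policy (pi) and a frozen reference model.
    """
    # Implicit rewards: beta * (log pi(y|x) - log pi_ref(y|x))
    chosen_reward = beta * (pi_logp_chosen - ref_logp_chosen)
    rejected_reward = beta * (pi_logp_rejected - ref_logp_rejected)
    # -log sigmoid(margin), written as log1p(exp(-margin)) for stability
    margin = chosen_reward - rejected_reward
    return math.log1p(math.exp(-margin))

# Illustrative (made-up) log-probabilities: the policy prefers the
# chosen response more strongly than the reference does, so the loss
# falls below log(2), the value at zero margin.
loss = dpo_loss(pi_logp_chosen=-12.0, pi_logp_rejected=-15.0,
                ref_logp_chosen=-13.0, ref_logp_rejected=-14.0)
```

In practice the log-probabilities come from per-token logits of a causal LM, and the loss is averaged over a batch of preference pairs.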