Skip to content
View BaileyWei's full-sized avatar
Block or Report

Block or report BaileyWei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 105 16 Updated Jan 16, 2024

Machine Learning and Computer Vision Engineer - Technical Interview Questions

2,759 471 Updated May 22, 2024

Linear Algebra Quick Review

36 5 Updated Dec 20, 2019

Reference implementation for DPO (Direct Preference Optimization)

Python 1,914 149 Updated May 23, 2024

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…

Python 52,324 5,413 Updated Jul 29, 2024

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

1,634 165 Updated Mar 30, 2024

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

1,227 94 Updated Mar 31, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

5,883 354 Updated Jul 28, 2024
Jupyter Notebook 482 64 Updated Mar 14, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,032 3,436 Updated Jul 30, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 189 13 Updated Jun 14, 2023

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,017 235 Updated Apr 14, 2024

Code for the paper "Closing the Curious Case of Neural Text Degeneration"

Python 7 2 Updated Oct 21, 2023

A collection of AWESOME things about mixture-of-experts

867 65 Updated Jul 20, 2024

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

869 46 Updated Apr 5, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,423 437 Updated May 3, 2024

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,336 86 Updated Jun 3, 2024
Python 871 66 Updated May 22, 2024

Make huge neural nets fit in memory

Python 2,668 271 Updated Apr 26, 2020

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Python 3,033 457 Updated Jul 25, 2024

第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。

Python 501 53 Updated Sep 11, 2023
Python 694 75 Updated Sep 14, 2023

Train transformer language models with reinforcement learning.

Python 8,897 1,093 Updated Jul 29, 2024

Generative Agents: Interactive Simulacra of Human Behavior

15,970 2,029 Updated Jun 3, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 13,178 1,202 Updated Jul 25, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 10,943 1,550 Updated Jul 29, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,155 819 Updated Jul 27, 2024
Python 20 Updated Oct 18, 2023

README文件语法解读,即Github Flavored Markdown语法介绍

6,771 7,251 Updated Mar 8, 2023
Next