
Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please give it a 🌟

947 37 Updated Jul 31, 2024

train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism

Python 208 18 Updated Nov 21, 2023

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

Python 90 8 Updated Feb 5, 2024

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 3,149 372 Updated Aug 19, 2024

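Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)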

7,550 1,316 Updated Apr 24, 2024

🔨 Handy research tools for the AI field

2,338 348 Updated Jun 10, 2024

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 3,897 846 Updated Mar 24, 2023

Pytorch🍊🍉 is delicious, just eat it! 😋😋

Jupyter Notebook 5,183 1,136 Updated Sep 11, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,659 182 Updated Oct 15, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 2,902 359 Updated Sep 26, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 32,516 3,981 Updated Oct 15, 2024

A programming framework for agentic AI 🤖

C# 31,986 4,653 Updated Oct 16, 2024

A collective list of free APIs

Python 315,508 33,636 Updated Sep 25, 2024

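This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and putting large-model applications into production)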

HTML 9,604 944 Updated Oct 13, 2024

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers

1,158 265 Updated Dec 14, 2023

Deep learning interview questions, with answers mapped to the corresponding page numbers in the Chinese edition of Deep Learning

866 193 Updated Nov 2, 2017

This repository mainly records interview questions for NLP algorithm engineers

2,598 506 Updated Apr 12, 2022

A collection of interview questions for algorithm engineers

875 145 Updated Jan 14, 2022

Introductory deep learning tutorials and recommended articles (Deep Learning Tutorial)

Jupyter Notebook 14,249 3,527 Updated Apr 21, 2022

Transformer related optimization, including BERT, GPT

C++ 17 1 Updated Jul 29, 2023

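Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use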

Python 13,827 1,240 Updated Sep 5, 2024

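《代码随想录》 LeetCode practice guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video breakdowns of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages. No more feeling lost when learning algorithms! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀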

Shell 51,226 11,415 Updated Oct 16, 2024

Naive Bayes-based Context Extension

Python 311 22 Updated May 31, 2023

Chinese LLaMA & Alpaca large language models (LLMs), plus local CPU/GPU training and deployment

Python 18,285 1,865 Updated Apr 30, 2024

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue LLM)

HTML 7,863 755 Updated Oct 16, 2024

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Python 539 60 Updated May 9, 2024

Formula derivations of machine learning algorithms, with NumPy implementations

Jupyter Notebook 2,011 478 Updated May 2, 2023

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,169 547 Updated Oct 8, 2024

Repo for external large-scale work

Python 6,474 723 Updated Apr 27, 2024