A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,398 200 Updated Nov 6, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

445 49 Updated Jul 10, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 344 30 Updated Apr 23, 2024

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Python 367 28 Updated Jul 9, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 33,806 4,160 Updated Nov 4, 2024

A Python crypto library for SM2/SM3/SM4

Python 485 140 Updated May 20, 2024

👨‍💻 An awesome, curated list of the best code LLMs for research.

932 52 Updated Jun 29, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,569 129 Updated Aug 4, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,451 718 Updated May 31, 2024

Paper List for In-context Learning 🌷

815 60 Updated Oct 8, 2024

Awesome-LLM: a curated list of Large Language Models

18,654 1,521 Updated Nov 5, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,676 280 Updated Oct 2, 2024

LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.

1,161 270 Updated Dec 14, 2023

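A collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are relatively cheap to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.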

15,839 1,460 Updated Sep 19, 2024

Paragraph-by-paragraph close readings of classic and recent deep learning papers

26,992 2,438 Updated Aug 8, 2024

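This project aims to share the technical principles behind large models along with hands-on experience (large model engineering and deploying large model applications).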

HTML 10,375 1,031 Updated Nov 3, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,338 811 Updated Aug 20, 2024

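Python for "Deep Learning": mathematical derivations, analysis of principles, and source-level code implementations for the book "Deep Learning" (the "flower book")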

Python 6,491 1,339 Updated Jun 23, 2020

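Introductory, advanced, and specialty deep learning courses, academic case studies, industry practice cases, a deep learning knowledge encyclopedia, and an interview question bank. Courses, cases, and knowledge of deep learning and AI.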

Jupyter Notebook 3,102 838 Updated Jul 25, 2024

[ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention

Python 179 26 Updated Dec 2, 2022

Awesome LLM compression research papers and tools.

1,171 74 Updated Nov 5, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,699 1,232 Updated Apr 7, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 134,527 26,903 Updated Nov 6, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,844 1,979 Updated Apr 16, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,338 1,605 Updated Nov 5, 2024

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 995 80 Updated Sep 19, 2024

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,350 1,866 Updated Apr 30, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,626 858 Updated Oct 31, 2024

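"Dive into Deep Learning" (动手学深度学习): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.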

Python 63,370 11,036 Updated Jul 30, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,610 246 Updated Dec 12, 2023