Skip to content
View Longyichen's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report Longyichen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]

Python 18 1 Updated May 28, 2024

Continual Learning of Large Language Models: A Comprehensive Survey

170 12 Updated Jul 26, 2024

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 119 19 Updated Mar 13, 2024

崩坏:星穹铁道脚本 | Honkai: Star Rail auto bot (简体中文/繁體中文/English/Español)

Python 2,815 138 Updated Jul 24, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,145 119 Updated Jun 26, 2024

An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star

Python 156 17 Updated Aug 19, 2022
Python 26 3 Updated Apr 11, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,072 141 Updated Jul 28, 2024

[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"

Python 62 9 Updated Jun 6, 2024

LaTeX Proposal Template for the University of Chinese Academy of Sciences

TeX 576 135 Updated Oct 29, 2021

LaTeX Thesis Template for the University of Chinese Academy of Sciences

TeX 3,397 924 Updated Feb 29, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

873 73 Updated Jul 25, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 303 15 Updated Jul 18, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 498 38 Updated Mar 4, 2024

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,131 146 Updated Jun 1, 2024

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 794 38 Updated May 26, 2024

Notebooks for training universal 0-shot classifiers on many different tasks

Jupyter Notebook 99 6 Updated Apr 3, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,054 134 Updated Jun 25, 2024

Token Omission Via Attention

Python 114 6 Updated Feb 11, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Python 811 44 Updated Jun 25, 2024

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 329 29 Updated Jul 26, 2024

fastHan是基于fastNLP与pytorch实现的中文自然语言处理工具,像spacy一样调用方便。

Python 744 86 Updated Dec 9, 2023

中英文信息抽取数据集整理

12 Updated May 15, 2022
Python 7 1 Updated Oct 31, 2022

Codebase for Merging Language Models (ICML 2024)

Python 713 40 Updated May 5, 2024

Supercharge Your Model Training

Python 5,077 408 Updated Jul 27, 2024

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 2,994 624 Updated Jan 22, 2024

Codes and checkpoints of paper "Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules"

Python 7 Updated Oct 24, 2023

Framework for BLOOM probing

Python 8 3 Updated Oct 17, 2023

A curated list of neural network pruning resources.

2,295 327 Updated Apr 4, 2024
Next