Skip to content
View Jiacheng-Zhu-AIML's full-sized avatar
🚀
🚀

Highlights

  • Pro

Block or report Jiacheng-Zhu-AIML

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 1,310 58 Updated Aug 26, 2024

Codebase for Merging Language Models (ICML 2024)

Python 732 41 Updated May 5, 2024

System prompts from Apple's new Apple Intelligence on MacOS Sequoia

JavaScript 64 4 Updated Aug 21, 2024

[ICCV 2017] Torch code for Grad-CAM

Lua 1,471 222 Updated Sep 17, 2022

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Python 36 3 Updated Aug 14, 2024

Building modular LMs with parameter-efficient fine-tuning.

Python 68 5 Updated Aug 26, 2024

Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind

Python 104 1 Updated Aug 23, 2024

A collection of AWESOME things about mixture-of-experts

898 68 Updated Jul 31, 2024
Jupyter Notebook 4 Updated Apr 30, 2023

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Python 52 2 Updated Aug 24, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 6,476 813 Updated Aug 22, 2024
Python 5 1 Updated Jul 30, 2024

Code for paper: "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models"

Python 2 Updated Jul 3, 2024

Awesome-LLM: a curated list of Large Language Model

16,971 1,367 Updated Aug 19, 2024

LaTeX style file for the Journal of Machine Learning Research

TeX 109 112 Updated Jul 16, 2024
Jupyter Notebook 5 2 Updated Aug 6, 2024

Code and example data for the paper: Rule Based Rewards for Language Model Safety

Jupyter Notebook 116 8 Updated Jul 19, 2024
Python 248 26 Updated Jul 19, 2024
Python 1 Updated Jul 23, 2024

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 354 19 Updated Aug 24, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,711 107 Updated Jul 29, 2024

Bridging LLM and Recommender System.

Jupyter Notebook 494 44 Updated Aug 24, 2024

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 122 19 Updated Mar 13, 2024

State-of-the-art Parameter-Efficient MoE Fine-tuning Method

Python 42 7 Updated Aug 22, 2024

Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).

Python 153 15 Updated Apr 12, 2024

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Cuda 136 4 Updated Aug 5, 2024

Official PyTorch Code for SAFF (ICCV 2023)

Python 1 Updated Jul 15, 2024

Sparse Autoencoder for Mechanistic Interpretability

Python 166 38 Updated Jul 20, 2024

Using sparse coding to find distributed representations used by neural networks.

Jupyter Notebook 153 27 Updated Nov 10, 2023

The official implementation of the paper "Demystifying the Compression of Mixture-of-Experts Through a Unified Framework".

Python 31 4 Updated Aug 8, 2024
Next