Skip to content
View Jiacheng-Zhu-AIML's full-sized avatar
🚀
🚀

Highlights

  • Pro

Block or report Jiacheng-Zhu-AIML

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Python 507 43 Updated Sep 19, 2024

An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT

Python 24 4 Updated Sep 13, 2024
1 Updated Sep 28, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 945 50 Updated Sep 3, 2024

LLaMA 2 implemented from scratch in PyTorch

Python 226 46 Updated Sep 25, 2023

CoMamba: Real-time Cooperative Perception Unlocked with State Space Models

Python 5 Updated Sep 20, 2024

GRadient-INformed MoE

247 14 Updated Sep 25, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 1 Updated May 9, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,971 214 Updated Oct 1, 2024

A curated list of resources for using LLMs to develop more competitive grant applications.

Python 1,927 254 Updated Mar 1, 2024

Implementation of PhysioMTL (CHIL 2022)

Python 2 Updated Apr 4, 2023

The official implementation of Self-Play Preference Optimization (SPPO)

Python 477 61 Updated Aug 4, 2024
HTML 2 Updated Sep 28, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 398 30 Updated Sep 17, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,450 283 Updated Sep 27, 2024

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

Python 397 25 Updated Aug 26, 2024

MetaDrive-0.2.6.0 that is compatible with newest gym version.

Python 8 1 Updated Jul 21, 2024

Gin provides a lightweight configuration framework for Python

Python 2,049 120 Updated Aug 19, 2024

Plots from "Can AI Help Reduce Disparities in General Medical and Mental Health Care?"

Jupyter Notebook 7 1 Updated Jul 6, 2023

Efficient Triton Kernels for LLM Training

Python 3,111 159 Updated Oct 2, 2024

Codebase for Merging Language Models (ICML 2024)

Python 749 44 Updated May 5, 2024

System prompts from Apple's new Apple Intelligence on MacOS Sequoia

JavaScript 92 9 Updated Sep 29, 2024

[ICCV 2017] Torch code for Grad-CAM

Lua 1,484 224 Updated Sep 17, 2022

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Python 41 3 Updated Aug 14, 2024

Building modular LMs with parameter-efficient fine-tuning.

Python 75 7 Updated Sep 29, 2024

Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind

Python 108 3 Updated Aug 23, 2024

A collection of AWESOME things about mixture-of-experts

936 70 Updated Jul 31, 2024
Jupyter Notebook 4 Updated Apr 30, 2023

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Python 60 5 Updated Sep 30, 2024
Next