Stars
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
LLaMA 2 implemented from scratch in PyTorch
CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
onkarbhardwaj / vllm
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
A curated list of resources for using LLMs to develop more competitive grant applications.
The official implementation of Self-Play Preference Optimization (SPPO)
OLMoE: Open Mixture-of-Experts Language Models
A library for mechanistic interpretability of GPT-style language models
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
MetaDrive 0.2.6.0, made compatible with the newest gym version.
Gin provides a lightweight configuration framework for Python (see the usage sketch after this list).
Plots from "Can AI Help Reduce Disparities in General Medical and Mental Health Care?"
Efficient Triton Kernels for LLM Training
Codebase for Merging Language Models (ICML 2024)
System prompts from Apple's new Apple Intelligence on macOS Sequoia
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Building modular LMs with parameter-efficient fine-tuning.
PyTorch implementation of the PEER block from the paper "Mixture of A Million Experts" by Xu Owen He at DeepMind
A collection of AWESOME things about mixture-of-experts
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
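
As a quick illustration of the Gin entry above: with gin-config, functions decorated with @gin.configurable get their keyword arguments bound from a .gin file or an inline binding string. This is a minimal sketch; the train function and its parameters are made up for illustration, only the gin calls themselves come from the library.

```python
import gin


@gin.configurable  # keyword arguments of this function become bindable via Gin
def train(learning_rate=1e-3, num_layers=2):
    # Hypothetical training entry point, used only to show how bindings apply.
    print(f"training with lr={learning_rate}, layers={num_layers}")


# Bindings normally live in a .gin file parsed with gin.parse_config_file();
# an inline binding string behaves the same way.
gin.parse_config("""
train.learning_rate = 3e-4
train.num_layers = 6
""")

train()  # -> training with lr=0.0003, layers=6
```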