Official PyTorch Code for SAFF (ICCV 2023)

Python 1 Updated Jul 15, 2024

Sparse Autoencoder for Mechanistic Interpretability

Python 137 35 Updated Jul 20, 2024

Using sparse coding to find distributed representations used by neural networks.

Jupyter Notebook 139 25 Updated Nov 10, 2023

The official implementation of the paper "Demystifying the Compression of Mixture-of-Experts Through a Unified Framework".

Python 22 3 Updated Jul 19, 2024

A Closer Look into Mixture-of-Experts in Large Language Models

Python 29 Updated Jun 29, 2024

Mode Connectivity and Fast Geometric Ensembles in PyTorch

Python 256 40 Updated Oct 24, 2022

A 100x faster SVD for PyTorch⚡️

C++ 423 35 Updated Oct 10, 2022

🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024

Python 34 1 Updated Jul 18, 2024

Implementation of LPLR algorithm for matrix compression

Python 20 1 Updated Nov 21, 2023

A method of ensemble learning for heterogeneous large language models.

Python 20 3 Updated Jul 20, 2024

[CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations

Python 3 Updated Apr 29, 2024
Jupyter Notebook 53 16 Updated Feb 1, 2024

LLM101n: Let's build a Storyteller

24,890 1,310 Updated Jul 21, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 306 17 Updated May 29, 2024

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Jupyter Notebook 7 Updated Jun 7, 2024

Analysis of token routing for different implementations of Mixture of Experts

Jupyter Notebook 6 Updated Mar 22, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,753 1,028 Updated Jun 27, 2024

Conversions of Fairseq models in HuggingFace-style

Python 4 Updated Dec 6, 2023

[SIGIR'24] The official implementation code of MOELoRA.

Python 105 11 Updated Jul 22, 2024

DeepSeek LLM: Let there be answers

Makefile 1,346 88 Updated Feb 4, 2024
Python 23 2 Updated Sep 28, 2023
Python 2 Updated Jul 19, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,331 204 Updated Jul 18, 2024

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Python 75 8 Updated Jun 8, 2023
Python 42 5 Updated Oct 17, 2023

AdaMoLE: Adaptive Mixture of Low-Rank Adaptation Experts

Python 11 Updated May 27, 2024
32 Updated Apr 14, 2024

LLM-Merging: Building LLMs Efficiently through Merging

Python 128 19 Updated Jul 15, 2024

The official code for AISTATS 2024 "Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations"

Python 1 Updated May 1, 2024