Jiacheng-Zhu-AIML

Follow

🚀

Jiacheng Zhu Jiacheng-Zhu-AIML

🚀

Follow

Intern Bayesian

61 followers · 114 following

Achievements

Achievements

Highlights

Pro

Stars

alexrame / diwa

DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization

Python 27 6 Updated Jan 31, 2023

khanrc / swad

Official Implementation of SWAD (NeurIPS 2021)

Python 151 18 Updated Dec 10, 2022

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,672 87 Updated Jan 21, 2024

bigscience-workshop / promptsource

Toolkit for creating, sharing and using natural language prompts.

Python 2,629 345 Updated Oct 23, 2023

lucidrains / st-moe-pytorch

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch

Python 270 23 Updated Jun 17, 2024

XueFuzhao / OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,330 64 Updated Mar 8, 2024

mistralai / mistral-common

Python 481 44 Updated Aug 15, 2024

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,614 50 Updated Aug 12, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 25,719 2,863 Updated Aug 12, 2024

databricks / megablocks

Python 1,146 163 Updated Aug 21, 2024

UNITES-Lab / MC-SMoE

[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"

Python 62 9 Updated Jun 6, 2024

naver-ai / model-stock

Model Stock: All we need is just a few fine-tuned models

75 Updated Mar 29, 2024

google-deepmind / opro

official code for "Large Language Models as Optimizers"

Python 362 38 Updated Aug 16, 2024

neubig / research-career-tools

Python 133 3 Updated Apr 23, 2024

databricks / dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,493 234 Updated May 1, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,584 783 Updated Aug 15, 2024

yegcjs / mixinglaws

Jupyter Notebook 83 6 Updated Apr 27, 2024

SakanaAI / evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,173 81 Updated Mar 30, 2024

readthedocs / sphinx_rtd_theme

Sphinx theme from Read the Docs

Sass 4,728 1,728 Updated Aug 20, 2024

PolinaKirichenko / deep_feature_reweighting

Jupyter Notebook 101 11 Updated Sep 20, 2023

izmailovpavel / spurious_feature_learning

Python 40 5 Updated Jan 17, 2023

xai-org / grok-1

Grok open release

Python 49,374 8,328 Updated Aug 7, 2024

openai / grok

Python 4,076 509 Updated Mar 19, 2024

szc12153 / sparse_meta_tuning

Official implementation for Sparse MetA-Tuning (SMAT)

Python 9 Updated Jun 29, 2024

jiaweizzhao / GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,321 139 Updated Jun 3, 2024

Jason-Qiu / Embodied_Policy_Learning

[NAACL 2024] Embodied Executable Policy Learning with Language-based Scene Summarization

Python 4 Updated Mar 13, 2024

prateeky2806 / ComPEFT

Python 24 1 Updated Nov 23, 2023

SkunkworksAI / hydra-moe

Python 409 15 Updated Nov 2, 2023

wpeebles / G.pt

Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"

Python 332 21 Updated Oct 3, 2022

uukuguy / multi_loras

Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answer based on user queries.

Python 134 9 Updated Feb 9, 2024