Stars
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
LLM finetuning with PEFT
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Run Mixtral-8x7B models in Colab or on consumer desktops
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
[ICLR 2024] Code for the paper "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"
[ICML 2024] Selecting High-Quality Data for Training Language Models
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
What would you do with 1000 H100s...
Official implementation of the paper "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Modeling, training, eval, and inference code for OLMo
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
[TMLR 2024] Efficient Large Language Models: A Survey
Awesome LLM Plaza: daily tracking of awesome LLM topics, e.g., LLMs for coding, robotics, reasoning, multimodality, etc.
A comparison of the Lion and Adam optimizers
A Data Streaming Library for Efficient Neural Network Training
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
GradAttack is a Python library for easily evaluating the privacy risks of public gradients in federated learning, as well as corresponding mitigation strategies.