Stars
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
LLM Finetuning with peft
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Run Mixtral-8x7B models in Colab or on consumer desktops
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"
[ICML 2024] Selecting High-Quality Data for Training Language Models
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
What would you do with 1000 H100s...
Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation
Modeling, training, eval, and inference code for OLMo
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
[TMLR 2024] Efficient Large Language Models: A Survey
Awesome LLM plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodality, etc.
Lion and Adam optimization comparison
A Data Streaming Library for Efficient Neural Network Training
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning