[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 597 38 Updated May 2, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,357 2,543 Updated Jun 24, 2024

LLM Finetuning with peft

Jupyter Notebook 1,815 498 Updated Jul 8, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 8,172 828 Updated Jul 8, 2024

Run Mixtral-8x7B models in Colab or consumer desktops

Python 2,271 224 Updated Apr 8, 2024

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 897 93 Updated Apr 19, 2024

Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"

114 2 Updated Apr 28, 2024

[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"

Python 87 14 Updated Apr 10, 2024

Grok open release

Python 49,151 8,309 Updated May 29, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 115 9 Updated Jun 20, 2024

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"

Python 26 Updated May 16, 2024

What would you do with 1000 H100s...

Jupyter Notebook 788 47 Updated Jan 10, 2024

Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Python 75 13 Updated May 29, 2024

Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"

Python 79 8 Updated Nov 24, 2022

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,120 819 Updated Jul 7, 2024

Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"

Python 55 7 Updated Sep 7, 2023

A collection of RL baselines in Jax.

Python 8 Updated Nov 18, 2023
Python 128 12 Updated Jun 15, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 922 63 Updated Jul 7, 2024

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Python 153 10 Updated Apr 22, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,192 391 Updated Jul 7, 2024
Python 6,998 541 Updated Jun 14, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

937 53 Updated Jun 24, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

833 69 Updated Jul 8, 2024

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

90 5 Updated Jul 6, 2024

Lion and Adam optimization comparison

Jupyter Notebook 54 7 Updated Feb 23, 2023

Repository for G-Retriever

Python 208 40 Updated Jun 26, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,016 125 Updated Jul 8, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 276 21 Updated Jun 29, 2024