Skip to content
View sglucas's full-sized avatar

Block or report sglucas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
166 results for source starred repositories
Clear filter

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 632 43 Updated Aug 13, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,797 2,583 Updated Sep 30, 2024

LLM Finetuning with peft

Jupyter Notebook 2,101 587 Updated Jul 8, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,317 1,021 Updated Oct 10, 2024

Run Mixtral-8x7B models in Colab or consumer desktops

Python 2,294 227 Updated Apr 8, 2024

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 956 101 Updated Apr 19, 2024

[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"

Python 94 15 Updated Apr 10, 2024

Grok open release

Python 49,473 8,322 Updated Aug 30, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 137 10 Updated Jun 20, 2024

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"

Python 38 1 Updated Sep 30, 2024

What would you do with 1000 H100s...

Jupyter Notebook 883 52 Updated Jan 10, 2024

Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Python 98 15 Updated May 29, 2024

Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"

Python 81 8 Updated Nov 24, 2022

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,235 824 Updated Oct 5, 2024

Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"

Python 55 7 Updated Sep 7, 2023

A collection of RL baselines in Jax.

Python 8 Updated Nov 18, 2023
Python 143 17 Updated Jul 13, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,873 149 Updated Sep 25, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 203 13 Updated Apr 22, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,499 453 Updated Oct 9, 2024
Python 7,102 549 Updated Aug 12, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,373 71 Updated Oct 9, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

978 83 Updated Oct 9, 2024

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

138 11 Updated Sep 27, 2024

Lion and Adam optimization comparison

Jupyter Notebook 56 7 Updated Feb 23, 2023

Repository for G-Retriever

Python 292 51 Updated Sep 24, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,098 137 Updated Oct 8, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 352 28 Updated Jun 29, 2024

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning, as well as corresponding mitigation strategies.

Python 183 38 Updated May 7, 2024
Next