Skip to content
View Muennighoff's full-sized avatar

Block or report Muennighoff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 397 30 Updated Sep 17, 2024

🙌 OpenHands: Code Less, Make More

Python 32,280 3,697 Updated Oct 1, 2024

[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623

Python 53 4 Updated Sep 26, 2024

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Python 44 Updated Jul 23, 2024

Data for the MTEB leaderboard

Python 4 5 Updated Oct 1, 2024

Code for the MTEB leaderboard

Python 10 9 Updated Oct 1, 2024

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Jupyter Notebook 81 3 Updated Sep 19, 2024

DataComp for Language Models

HTML 1,121 99 Updated Sep 5, 2024

Code for the MTEB Arena

Python 14 6 Updated Sep 17, 2024

AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark

Python 98 8 Updated Sep 28, 2024

BigCodeBench: Benchmarking Code Generation Towards AGI

Python 191 22 Updated Sep 17, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,273 517 Updated Feb 27, 2024

Home of StarCoder2!

Python 1,731 158 Updated Mar 21, 2024

Language models scale reliably with over-training and on downstream tasks

Jupyter Notebook 91 4 Updated Apr 2, 2024

A Survey on Data Selection for Language Models

151 8 Updated Jun 4, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 537 38 Updated Sep 22, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 710 39 Updated Sep 28, 2024

Evaluation of BLOOM on the HumanEval benchmark

Shell 6 Updated Aug 18, 2022

A Scandinavian Benchmark for sentence embeddings

Python 27 3 Updated Aug 24, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Language Models

Jupyter Notebook 57 2 Updated Apr 10, 2024

A framework for few-shot evaluation of language models.

Python 6,575 1,742 Updated Sep 30, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,463 446 Updated Oct 1, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 928 100 Updated Sep 30, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 781 206 Updated Sep 26, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,000 511 Updated Sep 26, 2024

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 425 27 Updated Sep 22, 2024

Scaling Data-Constrained Language Models

Jupyter Notebook 316 19 Updated Sep 22, 2024

BLOOM+1: Adapting BLOOM model to support a new unseen language

Python 70 16 Updated Mar 2, 2024

Crosslingual Generalization through Multitask Finetuning

Jupyter Notebook 513 37 Updated Sep 22, 2024
Next