Skip to content
View mmarius's full-sized avatar
👨‍💻
👨‍💻

Highlights

  • Pro
Block or Report

Block or report mmarius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Improving Alignment and Robustness with Circuit Breakers

Jupyter Notebook 89 10 Updated Jul 12, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,676 309 Updated Jul 27, 2024

Machine Learning Engineering Open Book

Python 10,302 618 Updated Jul 27, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,810 805 Updated Jul 1, 2024

Minimalistic large language model 3D-parallelism training

Python 1,004 91 Updated Jul 25, 2024

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,206 216 Updated Jul 11, 2024

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,414 377 Updated Jul 20, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,161 156 Updated Jul 12, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,226 1,462 Updated Jul 26, 2024

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,263 101 Updated Jun 13, 2024

Algorithmically create or extend categorical colour palettes

Python 170 7 Updated Jun 18, 2024

An autoregressive character-level language model for making more things

Python 2,349 605 Updated Jun 4, 2024

Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.

Python 651 25 Updated Apr 23, 2024

Code to run the TILT transfer learning experiments

Python 32 10 Updated Feb 13, 2021

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Python 52 8 Updated Jun 10, 2024

A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.

Python 67 8 Updated Feb 28, 2024

An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"

Python 129 18 Updated Apr 23, 2022

This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.

Python 12 Updated Aug 23, 2021

Train Dense Passage Retriever (DPR) with a single GPU

Python 127 20 Updated Jun 16, 2021

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Python 335 19 Updated Mar 26, 2024

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,569 349 Updated Jul 22, 2024
Python 94 7 Updated Oct 27, 2022

An Open-Source Framework for Prompt-Learning.

Python 4,253 436 Updated Jul 16, 2024

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,047 751 Updated Jun 28, 2024

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Python 83 8 Updated May 10, 2022

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,325 505 Updated Jul 2, 2024

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Python 548 49 Updated Nov 10, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,289 210 Updated Mar 20, 2024

Model parallel transformers in JAX and Haiku

Python 6,248 889 Updated Jan 21, 2023
Next