Skip to content
View StellaAthena's full-sized avatar

Organizations

@EleutherAI
Block or Report

Block or report StellaAthena

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 251 26 Updated Jun 18, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,418 514 Updated Mar 12, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 458 31 Updated Jun 16, 2024

The Art of Debugging

C 741 28 Updated May 17, 2024

Machine Learning Engineering Open Book

Python 10,095 595 Updated Jun 6, 2024
Python 4 2 Updated Dec 6, 2023

A framework for few-shot evaluation of language models.

Python 5,601 1,487 Updated Jun 18, 2024

LLM vulnerability scanner

Python 983 117 Updated Jun 19, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,102 154 Updated Jun 18, 2024

Tools for understanding how transformer predictions are built layer-by-layer

Python 373 37 Updated Jun 2, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,501 1,098 Updated Apr 25, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,377 465 Updated Jan 8, 2024

Toolkit for creating, sharing and using natural language prompts.

Python 2,565 342 Updated Oct 23, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 10,931 1,433 Updated Feb 29, 2024

A dataset of alignment research and code to reproduce it

HTML 65 17 Updated Jun 22, 2023

A framework for few-shot evaluation of autoregressive language models.

Python 97 31 Updated May 9, 2023
Python 3 1 Updated May 4, 2022

CLOOB training (JAX) and inference (JAX and PyTorch)

Python 70 9 Updated May 16, 2022

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 469 54 Updated Mar 20, 2023

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 513 106 Updated Apr 20, 2024

Implementation of LogAvgExp for Pytorch

Python 33 2 Updated Mar 28, 2022

GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.

Cuda 293 24 Updated May 13, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,279 1,156 Updated Apr 7, 2024

Annotated transformer blog

2 Updated Nov 22, 2021

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Python 404 73 Updated Mar 30, 2024

v objective diffusion inference code for PyTorch.

Python 708 108 Updated Nov 29, 2022

Code and explanation for IEEE CoG paper "Predicting Human Card Selection in Magic: The Gathering with Contextual Preference Ranking"

Python 5 5 Updated Dec 13, 2022

State of the Art Magic: the Gathering Draft and DeckBuilder AI.

Python 137 36 Updated Mar 30, 2024

An efficient interactive zero-knowledge proof scheme based on GKR in terms of unlayered circuit.

C++ 14 5 Updated May 23, 2023
Next