Stars

Python 150 10 Updated Feb 22, 2024

Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State

Jupyter Notebook 12 1 Updated Jan 8, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,215 247 Updated Jul 18, 2024

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 531 113 Updated Apr 20, 2024

Find context neurons in Pythia models.

Python 9 1 Updated Jun 13, 2023

Sparse probing paper full code.

Jupyter Notebook 45 10 Updated Dec 17, 2023

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 1,943 165 Updated Jul 13, 2024

Code for the paper "Multiple Physics Pretraining for Physical Surrogate Models"

Python 105 17 Updated May 4, 2024
Jupyter Notebook 3 3 Updated Jan 31, 2023

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,149 157 Updated Jul 12, 2024

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Jupyter Notebook 43 10 Updated Mar 11, 2024

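Llama3 Chinese repository (a collection of resources: interesting fine-tuned and modified weights from community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)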

Python 3,334 266 Updated Jul 12, 2024

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 79,143 6,035 Updated Jul 18, 2024
Jupyter Notebook 11 2 Updated Apr 17, 2023

Demo code for the paper -- Spatial interpolation using conditional generative adversarial neural networks

Jupyter Notebook 56 19 Updated Dec 7, 2020

Tools for understanding how transformer predictions are built layer-by-layer

Python 391 39 Updated Jun 2, 2024

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

Python 76 5 Updated Sep 5, 2021

The Python programming language

Python 61,193 29,527 Updated Jul 19, 2024

The official Meta Llama 3 GitHub site

Python 23,351 2,504 Updated Jul 17, 2024

GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)

Python 2,963 678 Updated Oct 30, 2023

awesome papers in LLM interpretability

197 10 Updated Jun 19, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,901 5,370 Updated Jul 14, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 80,533 21,631 Updated Jul 19, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,562 246 Updated Jul 5, 2024

Inference code for Llama models

Python 54,272 9,331 Updated Jul 13, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,174 491 Updated Jul 11, 2024