Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 1,943 165 Updated Jul 13, 2024

PolymathicAI / multiple_physics_pretraining

Code for paper "Multiple Physics Pretraining for Physical Surrogate Models"

Python 105 17 Updated May 4, 2024

arnab-api / Logit-Lens-Interpreting-GPT-2

Jupyter Notebook 3 3 Updated Jan 31, 2023

EleutherAI / pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,149 157 Updated Jul 12, 2024

epfl-dlab / llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Jupyter Notebook 43 10 Updated Mar 11, 2024

CrazyBoyM / llama3-Chinese-chat

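Llama3 Chinese repository (aggregated resources: fine-tuned and modified versions with interesting weights from community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)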

Python 3,334 266 Updated Jul 12, 2024

ollama / ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 79,143 6,035 Updated Jul 18, 2024

omarmohamed15 / DIP_for_3D_Seismic_Denoising

Jupyter Notebook 11 2 Updated Apr 17, 2023

dizhu-gis / cedgan-interpolation

Demo code for the paper -- Spatial interpolation using conditional generative adversarial neural networks

Jupyter Notebook 56 19 Updated Dec 7, 2020

AlignmentResearch / tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

Python 391 39 Updated Jun 2, 2024

mega002 / ff-layers

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

Python 76 5 Updated Sep 5, 2021

python / cpython

The Python programming language

Python 61,193 29,527 Updated Jul 19, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 23,351 2,504 Updated Jul 17, 2024

yangjianxin1 / GPT2-chitchat

GPT2 model for Chinese chitchat (implements the MMI idea from DialoGPT)

Python 2,963 678 Updated Oct 30, 2023

zepingyu0512 / awesome-llm-understanding-mechanism

Awesome papers in LLM interpretability

197 10 Updated Jun 19, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,901 5,370 Updated Jul 14, 2024

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 80,533 21,631 Updated Jul 19, 2024

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,562 246 Updated Jul 5, 2024

google-deepmind / alphageometry

Python 3,831 422 Updated Jul 9, 2024

meta-llama / llama

Inference code for Llama models

Python 54,272 9,331 Updated Jul 13, 2024

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,174 491 Updated Jul 11, 2024

Ray1234a

Stars

openai / transformer-debugger

davidbau / baukit

KoyenaPal / future-lens

TransformerLensOrg / TransformerLens

redwoodresearch / Easy-Transformer

kmeng01 / rome

wesg52 / llm-context-neurons

wesg52 / sparse-probing-paper

openai / automated-interpretability

jalammar / ecco