Skip to content
View slavachalnev's full-sized avatar

Block or report slavachalnev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 5 1 Updated Sep 24, 2024

Steering Llama 2 with Contrastive Activation Addition

Jupyter Notebook 92 29 Updated May 23, 2024

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 379 35 Updated Oct 16, 2024

Code for reproducing our paper "Not All Language Model Features Are Linear"

Jupyter Notebook 58 5 Updated Sep 30, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,177 409 Updated Oct 12, 2024
Jupyter Notebook 2 Updated Jan 20, 2024

Training Sparse Autoencoders on Language Models

Jupyter Notebook 407 110 Updated Oct 15, 2024

Using sparse coding to find distributed representations used by neural networks.

Jupyter Notebook 173 28 Updated Nov 10, 2023

Sparse probing paper full code.

Jupyter Notebook 49 10 Updated Dec 17, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 6,165 618 Updated Oct 14, 2024

Convenience functions for working with pytorch hooks.

Python 6 Updated May 28, 2023

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,212 271 Updated Jul 15, 2024

Interpreting how transformers simulate agents performing RL tasks

Jupyter Notebook 66 16 Updated Oct 23, 2023

Graph-based LLM power tool for exploring many completions in parallel.

TypeScript 784 108 Updated Jun 11, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,114 5,144 Updated Oct 16, 2024

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,166 419 Updated Jun 28, 2023

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,001 2,475 Updated Aug 15, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,581 2,937 Updated Oct 16, 2024

Mechanistic Interpretability for Transformer Models

Python 48 6 Updated Jun 1, 2022

Model parallel transformers in JAX and Haiku

Python 6,285 892 Updated Jan 21, 2023

A library for mechanistic interpretability of GPT-style language models

Python 1,480 289 Updated Oct 16, 2024

A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations

Python 171 35 Updated Dec 22, 2021

Inference code for Llama models

Python 56,030 9,523 Updated Aug 18, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,169 547 Updated Oct 8, 2024

Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.

Python 145 8 Updated Jul 26, 2021

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,728 5,800 Updated Aug 19, 2024

Inquisitive Parrots for Search

Python 177 18 Updated Feb 29, 2024

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Python 55,897 3,978 Updated Oct 16, 2024
Next