Skip to content
View pavanyellow's full-sized avatar
Block or Report

Block or report pavanyellow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Sparse Autoencoders for extracting new superhuman concepts from AlphaZero and Pluribus

    Python Updated Jun 19, 2024
  • Controlling LLM outputs by activating/suppressing feature vectors

    Jupyter Notebook Apache License 2.0 Updated Jun 14, 2024
  • games-sae Public

    Extract concepts from superhuman game playing AIs using SAE

    Updated Jun 13, 2024
  • A library for mechanistic interpretability of GPT-style language models

    Python MIT License Updated Apr 5, 2024
  • Jupyter Notebook MIT License Updated Feb 8, 2024
  • Jupyter Notebook Apache License 2.0 Updated Feb 8, 2024
  • Interpretability of 2*2*2 layer model calculating the number of non zero entries in the input

    Jupyter Notebook Updated Feb 8, 2024
  • Exploring the sudden incoherence of language models at higher sampling temperatures

    Jupyter Notebook Apache License 2.0 Updated Feb 2, 2024
  • Python MIT License Updated Jan 7, 2024
  • Interpreting the ultra-low density cluster in sparse autoencoders from Anthropic's Towards Monosemanticity work

    Jupyter Notebook Updated Dec 6, 2023
  • Examples in the MLX framework

    Python MIT License Updated Dec 6, 2023
  • Attention Public

    A simple, pure Python implementation of the original attention mechanism with no PyTorch or NumPy dependencies

    Python Updated Dec 6, 2023
  • HTML MIT License Updated Dec 5, 2023
  • A simple Transformer trained on Tinystories Dataset for 5min on M1 Air that can produce coherant looking language

    Jupyter Notebook Updated Dec 4, 2023
  • My Working codebase for exploratory Interpretability work and replicating important papers in the field

    Jupyter Notebook Updated Dec 2, 2023
  • A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.

    Python MIT License Updated Feb 22, 2023