Skip to content
View KshitijAggarwal's full-sized avatar
Block or Report

Block or report KshitijAggarwal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

287 results for source starred repositories
Clear filter

This repository collects all relevant resources about interpretability in LLMs

111 6 Updated Jul 6, 2024

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 96 22 Updated Jul 4, 2024

Latent Large Language Models

C++ 11 Updated Jul 2, 2024

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Python 1,065 127 Updated Apr 22, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 8,484 1,334 Updated Jul 1, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,302 426 Updated May 3, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 414 17 Updated Jun 20, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,356 484 Updated Jul 3, 2024

Transformers trained on Tiny ImageNet

Python 47 9 Updated Aug 6, 2022

Python logging made (stupidly) simple

Python 18,720 682 Updated Jun 23, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 3,253 252 Updated Jul 5, 2024

This implements training of popular model architectures, such as AlexNet, ResNet and VGG on the ImageNet dataset(Now we supported alexnet, vgg, resnet, squeezenet, densenet)

Python 380 81 Updated Jun 29, 2018

Claudette is Claude's friend

Jupyter Notebook 84 10 Updated Jun 24, 2024

LLM101n: Let's build a Storyteller

14,835 705 Updated Jun 28, 2024

Sparse autoencoders

Python 110 11 Updated Jul 7, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 21,607 2,196 Updated Jul 6, 2024

Let's train vision transformers (ViT) for cifar 10!

Python 492 104 Updated May 15, 2024

Implementation of papers in 100 lines of code.

Python 645 94 Updated May 4, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 669 35 Updated Jun 27, 2024

gpt-2 from scratch in mlx

Python 318 21 Updated Jun 12, 2024

Implementation for MatMul-free LM.

Python 2,609 151 Updated Jun 27, 2024

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 282 19 Updated Jul 2, 2024

The Mojo Programming Language

Mojo 22,101 2,536 Updated Jul 7, 2024

Video+code lecture on building nanoGPT from scratch

Python 2,884 341 Updated Jul 4, 2024

A fast Lomb-Scargle periodogram. It's nifty, and uses a NUFFT!

Python 20 1 Updated Jul 2, 2024

Training Sparse Autoencoders on Language Models

HTML 201 69 Updated Jul 7, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,175 242 Updated Jul 6, 2024

Deep learning at the speed of light.

Rust 1,392 86 Updated Jul 4, 2024

use candle to implement some of the d2l.ai

Rust 2 Updated Jun 15, 2024
Next