Skip to content
View karpathy's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report karpathy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 614 30 Updated Jun 14, 2024

gpt-2 from scratch in mlx

Python 300 19 Updated Jun 12, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 29,054 3,129 Updated Jun 26, 2024

Implementation of Diffusion Transformer (DiT) in JAX

Python 225 4 Updated Jun 11, 2024

Implementation for MatMul-free LM.

Python 2,364 132 Updated Jun 21, 2024

Schedule-Free Optimization in PyTorch

Python 1,531 48 Updated May 30, 2024

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python 6,147 422 Updated Jun 26, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 10,407 885 Updated Jun 25, 2024

My favorite C programming practices.

1,906 94 Updated Oct 1, 2020

Tile primitives for speedy kernels

Cuda 1,328 43 Updated Jun 22, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,578 489 Updated Jun 14, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,675 478 Updated Jun 25, 2024

LLM inference in C/C++

C++ 60,811 8,681 Updated Jun 26, 2024

Distribute and run LLMs with a single file.

C++ 16,576 824 Updated Jun 24, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 413 17 Updated Jun 20, 2024

Fast bare-bones BPE for modern tokenizer training

Python 127 2 Updated Dec 19, 2023

The official PyTorch implementation of Google's Gemma models

Python 5,094 483 Updated Jun 25, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 793 59 Updated Jun 23, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,275 233 Updated May 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,682 3,048 Updated Jun 26, 2024

MLX: An array framework for Apple silicon

C++ 15,565 883 Updated Jun 26, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 198 15 Updated Jun 21, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,075 4,013 Updated Mar 12, 2024

Fine-tune mistral-7B on 3090s, a100s, h100s

Python 696 62 Updated Oct 11, 2023

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,313 482 Updated Jun 26, 2024

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 11,964 772 Updated Jun 26, 2024

A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

Python 404 26 Updated Dec 2, 2023

Master programming by recreating your favorite technologies from scratch.

278,769 26,204 Updated Jun 26, 2024

Xv6 for RISC-V

C 6,490 2,353 Updated Jun 24, 2024

Load, pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Python 8,060 810 Updated Jun 26, 2024
Next