Skip to content
View karpathy's full-sized avatar

Highlights

  • Pro

Block or report karpathy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NanoGPT (124M) quality in 7.8 8xH100-minutes

Python 1,031 83 Updated Nov 18, 2024

A native PyTorch Library for large model training

Python 2,620 204 Updated Nov 19, 2024

Efficient Triton Kernels for LLM Training

Python 3,442 204 Updated Nov 19, 2024

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python 985 59 Updated Nov 18, 2024

Official inference repo for FLUX.1 models

Python 15,955 1,159 Updated Nov 14, 2024

the scott CPU from "But How Do It Know?" by J. Clark Scott

Go 1,891 158 Updated Oct 21, 2020

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,382 221 Updated Nov 18, 2024

Animation engine for explanatory math videos

Python 70,922 6,239 Updated Oct 27, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,753 177 Updated Nov 18, 2024

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

C 120 3 Updated Nov 11, 2024

UNet diffusion model in pure CUDA

Cuda 584 28 Updated Jun 28, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 803 48 Updated Aug 21, 2024

gpt-2 from scratch in mlx

Python 358 23 Updated Jun 12, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 47,365 5,788 Updated Nov 18, 2024

Implementation of Diffusion Transformer (DiT) in JAX

Python 252 4 Updated Jun 11, 2024

Implementation for MatMul-free LM.

Python 2,920 183 Updated Nov 5, 2024

Schedule-Free Optimization in PyTorch

Python 1,897 65 Updated Nov 6, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 6,805 512 Updated Nov 19, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 15,516 1,469 Updated Nov 17, 2024

My favorite C programming practices.

2,003 98 Updated Oct 1, 2020

Tile primitives for speedy kernels

Cuda 1,658 70 Updated Nov 19, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 7,084 533 Updated Aug 18, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,991 509 Updated Nov 18, 2024

LLM inference in C/C++

C++ 68,026 9,756 Updated Nov 19, 2024

Distribute and run LLMs with a single file.

C++ 20,517 1,031 Updated Nov 16, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 483 22 Updated Oct 25, 2024

Fast bare-bones BPE for modern tokenizer training

Python 142 2 Updated Oct 21, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,290 508 Updated Jul 31, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 916 65 Updated Nov 4, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,637 271 Updated Oct 29, 2024
Next