karpathy

Andrej karpathy

I like to train Deep Neural Nets on large datasets.

80k followers · 8 following

Stanford
https://twitter.com/karpathy

Achievements

x2 x2 x4

BetaSend feedback

Achievements

x2 x2 x4

BetaSend feedback

Highlights

Block or Report

Block or report karpathy

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

microsoft / Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 614 30 Updated Jun 14, 2024

pranavjad / mlx-gpt2

gpt-2 from scratch in mlx

Python 300 19 Updated Jun 12, 2024

open-webui / open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 29,054 3,129 Updated Jun 26, 2024

kvfrans / jax-diffusion-transformer

Implementation of Diffusion Transformer (DiT) in JAX

Python 225 4 Updated Jun 11, 2024

ridgerchu / matmulfreellm

Implementation for MatMul-free LM.

Python 2,364 132 Updated Jun 21, 2024

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 1,531 48 Updated May 30, 2024

skypilot-org / skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python 6,147 422 Updated Jun 26, 2024

ItzCrazyKns / Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 10,407 885 Updated Jun 25, 2024

mcinglis / c-style

My favorite C programming practices.

1,906 94 Updated Oct 1, 2020

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,328 43 Updated Jun 22, 2024

adam-maj / tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,578 489 Updated Jun 14, 2024

google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,675 478 Updated Jun 25, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 60,811 8,681 Updated Jun 26, 2024

Mozilla-Ocho / llamafile

Distribute and run LLMs with a single file.

C++ 16,576 824 Updated Jun 24, 2024

BobMcDear / attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 413 17 Updated Jun 20, 2024

gautierdag / bpeasy

Fast bare-bones BPE for modern tokenizer training

Python 127 2 Updated Dec 19, 2023

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,094 483 Updated Jun 25, 2024

carlini / yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 793 59 Updated Jun 23, 2024

Codium-ai / AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,275 233 Updated May 17, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,682 3,048 Updated Jun 26, 2024

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 15,565 883 Updated Jun 26, 2024

normster / llm_rules

RuLES: a benchmark for evaluating rule-following in language models

Python 198 15 Updated Jun 21, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,075 4,013 Updated Mar 12, 2024

abacaj / fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Python 696 62 Updated Oct 11, 2023

pytorch-labs / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,313 482 Updated Jun 26, 2024

unslothai / unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 11,964 772 Updated Jun 26, 2024

isafulf / inbox_cleaner

A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

Python 404 26 Updated Dec 2, 2023

codecrafters-io / build-your-own-x

Master programming by recreating your favorite technologies from scratch.

278,769 26,204 Updated Jun 26, 2024

mit-pdos / xv6-riscv

Xv6 for RISC-V

C 6,490 2,353 Updated Jun 24, 2024

Lightning-AI / litgpt

Load, pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Python 8,060 810 Updated Jun 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Andrej karpathy

Achievements

Achievements

Highlights

Block or report karpathy

Stars

microsoft / Samba

pranavjad / mlx-gpt2

open-webui / open-webui

kvfrans / jax-diffusion-transformer

ridgerchu / matmulfreellm

facebookresearch / schedule_free

skypilot-org / skypilot

ItzCrazyKns / Perplexica

mcinglis / c-style

HazyResearch / ThunderKittens

adam-maj / tiny-gpu

google / gemma.cpp

ggerganov / llama.cpp

Mozilla-Ocho / llamafile

BobMcDear / attorch

gautierdag / bpeasy

google / gemma_pytorch

carlini / yet-another-applied-llm-benchmark

Codium-ai / AlphaCodium

vllm-project / vllm

ml-explore / mlx

normster / llm_rules

tatsu-lab / stanford_alpaca

abacaj / fine-tune-mistral

pytorch-labs / gpt-fast

unslothai / unsloth

isafulf / inbox_cleaner

codecrafters-io / build-your-own-x

mit-pdos / xv6-riscv

Lightning-AI / litgpt