Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,418 2,052 Updated Jul 18, 2024

basalt-org / basalt

A Machine Learning framework from scratch in Pure Mojo 🔥

Mojo 366 24 Updated Jul 19, 2024

Ceyron / machine-learning-and-simulation

All the handwritten notes 📝 and source code files 🖥️ used in my YouTube Videos on Machine Learning & Simulation (https://www.youtube.com/channel/UCh0P7KwJhuQ4vrzc3IRuw4Q)

Jupyter Notebook 797 176 Updated Jul 29, 2024

geohot / fromthetransistor

From the Transistor to the Web Browser, a rough outline for a 12 week course

5,111 426 Updated Oct 12, 2021

arpitingle / gpu-alpha

High Quality Resources on GPU Programming/Architecture

548 16 Updated Jul 26, 2024

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,283 1,334 Updated Jan 20, 2024

exo-explore / mlx-bitnet

1.58 Bit LLM on Apple Silicon using MLX

Python 93 4 Updated May 10, 2024

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python 1,324 263 Updated Aug 14, 2024

interpretingdl / eacl2024_transformer_interpretability_tutorial

Materials for EACL2024 tutorial: Transformer-specific Interpretability

Jupyter Notebook 31 1 Updated Mar 26, 2024

jiesutd / Text-Attention-Heatmap-Visualization

Plot the vector graph of attention based text visualisation

Python 363 58 Updated Apr 12, 2019

nivibilla / build-nanogpt

Forked from karpathy/build-nanogpt

Video+code lecture on building nanoGPT from scratch

Python 63 10 Updated Jun 14, 2024

pranavjad / mlx-gpt2

gpt-2 from scratch in mlx

Python 341 22 Updated Jun 12, 2024

Qualcomm-AI-research / transformer-quantization

Python 182 21 Updated Nov 9, 2021

karpathy / build-nanogpt

Video+code lecture on building nanoGPT from scratch

Python 3,230 427 Updated Aug 13, 2024

rbiswasfc / llm-detect-ai

1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition

Python 137 24 Updated May 20, 2024

JonasGeiping / cramming

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,278 100 Updated Jun 13, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,800 919 Updated May 23, 2024

sdan / vlite

fast vector database made in numpy

Python 734 37 Updated Apr 29, 2024

rogeriochaves / langstream

Build robust LLM applications with true composability 🔗

Python 407 28 Updated Jan 3, 2024

leptonai / search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,639 965 Updated Jul 10, 2024

lucidrains / triton-transformer

Implementation of a Transformer, but completely in Triton

Python 233 13 Updated Apr 5, 2022

unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,253 940 Updated Aug 14, 2024

lhyfst / knowledge-distillation-papers

knowledge distillation papers

730 82 Updated Feb 10, 2023

hkproj / pytorch-transformer

Attention is all you need implementation

Jupyter Notebook 503 218 Updated Jun 8, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,774 344 Updated May 8, 2024

jiaweizzhao / GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,310 139 Updated Jun 3, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 24,757 2,613 Updated Aug 14, 2024