- San Francisco
- pavankatta.com
Stars
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
An easy Python framework to build distributed systems
Alex Krizhevsky's original code from Google Code
Chess reinforcement learning by AlphaGo Zero methods.
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row
PyTorch implementation of AlphaZero Chess from scratch
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
Open source code for AlphaFold 2.
The nnsight package enables interpreting and manipulating the internals of deep learned models.
Extracting spatial and temporal world models from LLMs
Practice The CodeSignal Pre-screen for the Industry Coding Framework.
"Nobody ever figures out what life is all about, and it doesn't matter. Explore the world. Nearly everything is really interesting if you go into it deeply enough."― Richard P. Feynman
Solve puzzles. Improve your pytorch.
A library for mechanistic interpretability of GPT-style language models
Language model alignment-focused deep learning curriculum
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
antimatter15 / alpaca.cpp
Forked from ggerganov/llama.cppLocally run an Instruction-Tuned Chat-Style LLM
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🦜🔗 Build context-aware reasoning applications