Stars
ALIEN is a CUDA-powered artificial life simulation program.
StoneSoup is an open-source, flexible platform for AI workflows and experimentation. It focuses on plugin-centric development and extensibility, enabling users to easily test various methods and pa…
GPU programming related news and material links
A few scripts to be run on a fresh-off-the-presses Ubuntu VM, in order to get its shell nice 'n purdy.
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Lightning fast C++/CUDA neural network framework
Playing Pokemon Red with Reinforcement Learning
Python project and library template for clean, reliable, open-source projects.
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
megastep helps you build 1-million FPS reinforcement learning environments on a single GPU
Experiments with applying Fourier transforms to various plane-filling curves and patterns
Baba Is You simulator using C++ with some reinforcement learning
Really Fast End-to-End Jax RL Implementations
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
SwissGL is a minimalistic wrapper on top of WebGL2 JS API. It's designed to reduce the amount of boilerplate code required to manage GLSL shaders, textures and framebuffers when making procedural v…
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A Library for Differentiable Logic Gate Networks
Performance-portable, length-agnostic SIMD with runtime dispatch
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
A book teaching assembly language programming on the ARM 64-bit ISA. Along the way, good programming practices and insights into code development are offered which apply directly to higher level la…
An introduction to ARM64 assembly on Apple Silicon Macs