Stars
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
A guide to help developers get up and running quickly with the OpenCL programming framework
Python interface for MLIR - the Multi-Level Intermediate Representation
UBGen can generate programs with undefined behaviors (e.g., buffer overflow, use-after-free)
The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI
A curated list of automated machine learning papers, articles, tutorials, slides and projects
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
A book about compiling Racket and Python to x86-64 assembly
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Language definitions and styles for listings in LaTeX.
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Fourier ACcelerator Compiler Framework. Efforts have been made to anonymize the code for submission.
PyTorch Extension Library of Optimized Scatter Operations
NPBench - A Benchmarking Suite for High-Performance NumPy
The financial transactions database designed for mission-critical safety and performance.
Lightweight, standalone C++ inference engine for Google's Gemma models.
PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
Implementation of IR2Vec, published in ACM TACO
Strategies for Pre-training Graph Neural Networks