dmarx

Organizations

@pytti-tools

Starred repositories

26 stars written in Cuda

LLM training in simple, raw C/CUDA

Cuda 22,412 2,486 Updated Jul 31, 2024

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 15,678 1,891 Updated Apr 18, 2024

A massively parallel, optimal functional runtime in Rust

Cuda 10,339 390 Updated Jul 25, 2024

Squeeze-and-Excitation Networks

Cuda 3,350 834 Updated Feb 25, 2019

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda 1,760 126 Updated Apr 5, 2024

Tile primitives for speedy kernels

Cuda 1,425 53 Updated Jul 30, 2024

CUDA accelerated rasterization of gaussian splatting

Cuda 1,380 166 Updated Aug 1, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 909 83 Updated Jul 31, 2024

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 699 183 Updated Aug 1, 2024
Cuda 664 46 Updated Oct 20, 2023

UNet diffusion model in pure CUDA

Cuda 540 27 Updated Jun 28, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 507 42 Updated Apr 7, 2024

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

Cuda 470 52 Updated Jun 16, 2021

NeRFshop: Interactive Editing of Neural Radiance Fields

Cuda 442 23 Updated Mar 27, 2023

Fast CUDA matrix multiplication from scratch

Cuda 373 49 Updated Dec 28, 2023

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

Cuda 325 24 Updated Jul 17, 2024

Code for "Representing Volumetric Videos as Dynamic MLP Maps" CVPR 2023

Cuda 233 10 Updated Dec 6, 2023

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda 229 15 Updated Jul 2, 2024

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Cuda 201 33 Updated May 12, 2024

Learning Deformable Tetrahedral Meshes for 3D Reconstruction (NeurIPS 2020)

Cuda 163 11 Updated Oct 23, 2023

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 101 5 Updated Jul 3, 2024

Blazingly fast encoding for neural networks based on permutohedral lattices

Cuda 94 10 Updated May 10, 2023

LLM training in simple, raw C/CUDA

Cuda 76 4 Updated May 1, 2024

3D Gaussian Splatting in JAX

Cuda 49 2 Updated May 30, 2024

A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!

Cuda 33 5 Updated Jul 25, 2024