David Marx dmarx

Engineer / Machine Learning Researcher interested in deep learning, probabilistic ML, generative models, multi-modal SSL, visual understanding, geometric

473 followers · 327 following

Stability.ai, Eleuther.ai
Seattle, WA
https://dmarx.github.io
@DigThatData

Lists (32)

agentic

167 repositories

data visualization

165 repositories

Generative Art

529 repositories

image processing

210 repositories

Knowledge Management

214 repositories

Liked

174 repositories

linux-tweaks

398 repositories

ml audio

223 repositories

ML Depth

406 repositories

ML diffusion

218 repositories

ML Explainability

158 repositories

ML finetune

247 repositories

ML image segmentation

142 repositories

ML Implicit Representations

221 repositories

ML Performance

616 repositories

ML Research

3093 repositories

ML Research - overflow

117 repositories

ML super resolution

41 repositories

ML Tools

671 repositories

ML Tools - Non-Python languages

132 repositories

ML Video

351 repositories

MLOps - tools

141 repositories

Multi-modal

198 repositories

needs-attention

highlighting projects that aren't integrated into tools I like (e.g. ComfyUI) and would benefit from community attention

15 repositories

NLP

767 repositories

Pedagogical

430 repositories

prompting

320 repositories

Python Tools

457 repositories

scene synthesis

109 repositories

SD Public Projects

747 repositories

swarm intelligence / ML Climate

TTIv2.0 wishlist

316 repositories

Starred repositories

27 stars written in Cuda

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda · 23,194 stars · 2,572 forks · Updated Aug 26, 2024

NVlabs / instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Cuda · 15,788 stars · 1,898 forks · Updated Apr 18, 2024

HigherOrderCO / HVM

A massively parallel, optimal functional runtime in Rust

Cuda · 10,417 stars · 393 forks · Updated Sep 4, 2024

hujie-frank / SENet

Squeeze-and-Excitation Networks

Cuda · 3,363 stars · 835 forks · Updated Feb 25, 2019

CannyLab / tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda · 1,777 stars · 126 forks · Updated Apr 5, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda · 1,474 stars · 55 forks · Updated Sep 4, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda · 1,106 stars · 100 forks · Updated Sep 5, 2024

rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda · 729 stars · 187 forks · Updated Sep 6, 2024

princeton-vl / lietorch

Cuda · 670 stars · 50 forks · Updated Oct 20, 2023

clu0 / unet.cu

UNet diffusion model in pure CUDA

Cuda · 560 stars · 28 forks · Updated Jun 28, 2024

tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda · 549 stars · 48 forks · Updated Apr 7, 2024

creiser / kilonerf

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

Cuda · 471 stars · 51 forks · Updated Jun 16, 2021

graphdeco-inria / nerfshop

NeRFshop: Interactive Editing of Neural Radiance Fields

Cuda · 444 stars · 23 forks · Updated Mar 27, 2023

siboehm / SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Cuda · 411 stars · 53 forks · Updated Dec 28, 2023

SHI-Labs / NATTEN

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

Cuda · 339 stars · 25 forks · Updated Aug 20, 2024

efeslab / Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda · 253 stars · 20 forks · Updated Jul 2, 2024

zju3dv / mlp_maps

Code for "Representing Volumetric Videos as Dynamic MLP Maps" CVPR 2023

Cuda · 233 stars · 10 forks · Updated Dec 6, 2023

BlinkDL / RWKV-CUDA

The CUDA version of the RWKV language model (https://github.com/BlinkDL/RWKV-LM)

Cuda · 208 stars · 34 forks · Updated May 12, 2024

CisMine / Parallel-Computing-Cuda-C

CUDA Learning guide

Cuda · 198 stars · 19 forks · Updated Jun 20, 2024

nv-tlabs / DefTet

Learning Deformable Tetrahedral Meshes for 3D Reconstruction (NeurIPS 2020)

Cuda · 165 stars · 11 forks · Updated Oct 23, 2023

mit-han-lab / Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda · 150 stars · 6 forks · Updated Jul 3, 2024

RaduAlexandru / permutohedral_encoding

Blazingly fast encoding for neural networks based on permutohedral lattices

Cuda · 94 stars · 10 forks · Updated May 10, 2023

gevtushenko / llm.c

Forked from karpathy/llm.c

LLM training in simple, raw C/CUDA

Cuda · 77 stars · 6 forks · Updated May 1, 2024

yklcs / jaxsplat

3D Gaussian Splatting in JAX

Cuda · 52 stars · 2 forks · Updated May 30, 2024

mobiusml / gemlite

Simple and fast low-bit matmul kernels in CUDA

Cuda · 46 stars · 4 forks · Updated Aug 21, 2024

morousg / cvGPUSpeedup

A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!

Cuda · 34 stars · 5 forks · Updated Sep 6, 2024

huggingface / candle-paged-attention

Cuda · 12 stars · 3 forks · Updated Jan 4, 2024

Starred topics

t-sne

latent-dirichlet-allocation

end-to-end

transformer

encoder-decoder

sequence-to-sequence

bayes

boosting

classification-trees

lightgbm
