NVIDIA / CUDALibrarySamples
CUDA Library Samples
See what the GitHub community is most excited about today.
CUDA Library Samples
CUDA Kernel Benchmarking Library
NCCL Tests
Tile primitives for speedy kernels
Causal depthwise conv1d in CUDA, with a PyTorch interface
cuVS - a library for vector search and clustering on the GPU
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
LLM training in simple, raw C/CUDA
cuGraph - RAPIDS Graph Analytics Library
A massively parallel, optimal functional runtime in Rust
Instant neural graphics primitives: lightning fast NeRF and more
WholeGraph - large scale Graph Neural Networks
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl