- Minneapolis, MN
accelerated-computing-hub Public
Forked from NVIDIA/accelerated-computing-hub
NVIDIA curated collection of educational resources related to general purpose GPU programming.
Jupyter Notebook Other Updated Jul 29, 2024
llm.c Public
Forked from gevtushenko/llm.c
LLM training in simple, raw C/CUDA
Cuda MIT License Updated May 1, 2024
cutlass Public
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
C++ Other Updated Oct 10, 2023
NVTX Public
Forked from NVIDIA/NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
C Apache License 2.0 Updated Sep 22, 2023
compiler-explorer Public
Forked from compiler-explorer/compiler-explorer
Run compilers interactively from your web browser and interact with the assembly
Assembly BSD 2-Clause "Simplified" License Updated Sep 18, 2023
infra Public
Forked from compiler-explorer/infra
Infrastructure to set up the public Compiler Explorer instances and compilers
Python BSD 2-Clause "Simplified" License Updated Sep 13, 2023
cuda-quantum Public
Forked from NVIDIA/cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
C++ Other Updated Aug 10, 2023
libcudacxx Public
Forked from NVIDIA/libcudacxx
The NVIDIA C++ Standard Library
C++ Updated Feb 21, 2023
thrust Public
Forked from NVIDIA/thrust
Thrust is a C++ parallel programming library which resembles the C++ Standard Library.
C++ Other Updated Feb 16, 2023
cub Public
Forked from NVIDIA/cub
Cooperative primitives for CUDA C++.
Cuda BSD 3-Clause "New" or "Revised" License Updated Feb 16, 2023
stdexec Public
Forked from NVIDIA/stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
C++ Apache License 2.0 Updated Jan 19, 2023
cuda_scalar_result Public
Answering "What is the faster way to return a single scalar from a kernel to host?"
two_largest Public
Adventure in profiling and optimization.
nvbench Public
Forked from NVIDIA/nvbench
CUDA Kernel Benchmarking Library
Cuda Apache License 2.0 Updated Oct 8, 2021
cudf Public
Forked from rapidsai/cudf
Python GPU DataFrame Library
Cuda Apache License 2.0 Updated Jun 8, 2021
rmm Public
Forked from rapidsai/rmm
RAPIDS Memory Manager
C++ Apache License 2.0 Updated Apr 22, 2021
nvtx_wrappers Public
This repository is deprecated and the code has moved to the official NVIDIA NVTX github repository: https://github.com/NVIDIA/NVTX
example_cuda_benchmark Public
Template repository for CUDA enabled benchmarks using Google Benchmark
creduce-example Public
Examples on how to use C-Reduce to create minimal compiler bug reproducers
cuda-api-wrappers Public
Forked from eyalroz/cuda-api-wrappers
Thin C++-flavored wrappers for the CUDA Runtime API
C++ BSD 3-Clause "New" or "Revised" License Updated Oct 14, 2020