Skip to content
View jrhemstad's full-sized avatar
🏠
⬇️ 👢
🏠
⬇️ 👢

Block or report jrhemstad

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • NVIDIA curated collection of educational resources related to general purpose GPU programming.

    Jupyter Notebook Other Updated Jul 29, 2024
  • jrhemstad Public

    Updated May 1, 2024
  • llm.c Public

    Forked from gevtushenko/llm.c

    LLM training in simple, raw C/CUDA

    Cuda MIT License Updated May 1, 2024
  • cccl Public

    Forked from NVIDIA/cccl

    CUDA C++ Core Libraries

    C++ 1 Other Updated Feb 23, 2024
  • Updated Nov 18, 2023
  • Updated Nov 3, 2023
  • cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ Other Updated Oct 10, 2023
  • NVTX Public

    Forked from NVIDIA/NVTX

    The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

    C Apache License 2.0 Updated Sep 22, 2023
  • Run compilers interactively from your web browser and interact with the assembly

    Assembly BSD 2-Clause "Simplified" License Updated Sep 18, 2023
  • infra Public

    Forked from compiler-explorer/infra

    Infrastructure to set up the public Compiler Explorer instances and compilers

    Python BSD 2-Clause "Simplified" License Updated Sep 13, 2023
  • C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ Other Updated Aug 10, 2023
  • Shell Updated Jun 7, 2023
  • libcudacxx Public

    Forked from NVIDIA/libcudacxx

    The NVIDIA C++ Standard Library

    C++ Updated Feb 21, 2023
  • thrust Public

    Forked from NVIDIA/thrust

    Thrust is a C++ parallel programming library which resembles the C++ Standard Library.

    C++ Other Updated Feb 16, 2023
  • cub Public

    Forked from NVIDIA/cub

    Cooperative primitives for CUDA C++.

    Cuda BSD 3-Clause "New" or "Revised" License Updated Feb 16, 2023
  • stdexec Public

    Forked from NVIDIA/stdexec

    `std::execution`, the proposed C++ framework for asynchronous and parallel programming.

    C++ Apache License 2.0 Updated Jan 19, 2023
  • Answering "What is the faster way to return a single scalar from a kernel to host?"

    CMake 7 1 Apache License 2.0 Updated Sep 9, 2022
  • Shell 4 Updated Aug 10, 2022
  • C++ Apache License 2.0 Updated Jul 28, 2022
  • .github Public

    Forked from rapidsai/.github
    Updated Jun 23, 2022
  • two_largest Public

    Adventure in profiling and optimization.

    C++ 7 1 Apache License 2.0 Updated Apr 21, 2022
  • gil_preload Public

    Add NVTX ranges to Python GIL

    C++ Updated Mar 1, 2022
  • nvbench Public

    Forked from NVIDIA/nvbench

    CUDA Kernel Benchmarking Library

    Cuda Apache License 2.0 Updated Oct 8, 2021
  • cudf Public

    Forked from rapidsai/cudf

    Python GPU DataFrame Library

    Cuda Apache License 2.0 Updated Jun 8, 2021
  • rmm Public

    Forked from rapidsai/rmm

    RAPIDS Memory Manager

    C++ Apache License 2.0 Updated Apr 22, 2021
  • This repository is deprecated and the code has moved to the official NVIDIA NVTX github repository: https://github.com/NVIDIA/NVTX

    C++ 2 Apache License 2.0 Updated Apr 19, 2021
  • Template repository for CUDA enabled benchmarks using Google Benchmark

    CMake 7 2 Apache License 2.0 Updated Apr 14, 2021
  • link_test Public

    Testing linkage of function local statics

    C++ 1 1 Updated Mar 31, 2021
  • Examples on how to use C-Reduce to create minimal compiler bug reproducers

    Shell 1 Apache License 2.0 Updated Oct 21, 2020
  • Thin C++-flavored wrappers for the CUDA Runtime API

    C++ BSD 3-Clause "New" or "Revised" License Updated Oct 14, 2020