8 repositories starred by qtwang, written in Cuda

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Cuda · 4,317 stars · 908 forks · Updated Aug 30, 2024

Squeeze-and-Excitation Networks

Cuda · 3,379 stars · 839 forks · Updated Feb 25, 2019

Fast CUDA matrix multiplication from scratch

Cuda · 454 stars · 61 forks · Updated Dec 28, 2023

Step-by-step optimization of CUDA SGEMM

Cuda · 219 stars · 36 forks · Updated Mar 30, 2022

Examples from Programming in Parallel with CUDA

Cuda · 104 stars · 41 forks · Updated Mar 17, 2023

GPU-Suite

Cuda · 82 stars · 13 forks · Updated Aug 25, 2016

cuDTW++: Ultra-Fast Dynamic Time Warping on CUDA-enabled GPUs

Cuda · 21 stars · 2 forks · Updated May 11, 2020

This is the CUDA GPU implementation + Python interface (using PyTorch) of DCI. The paper can be found at https://arxiv.org/abs/1512.00442.

Cuda · 12 stars · 4 forks · Updated Dec 20, 2023