A fast MoE impl for PyTorch

Python 1,561 188 Updated Jul 5, 2024

Let us control diffusion models!

Python 30,362 2,729 Updated Feb 25, 2024

Introduction to Parallel Programming class code

Cuda 1,296 1,140 Updated Jun 27, 2022

Assembly GEMM implementation on Vega64 for 4096x4096 FP32 matrices

C++ 20 7 Updated Oct 12, 2019

14 basic topics for VEGA64 performance optimization

C++ 50 23 Updated Mar 18, 2021

CentOS cloud images

781 563 Updated Mar 25, 2024

TensorFlow ROCm port

C++ 688 94 Updated Nov 14, 2024

ROCm SMI LIB

C++ 123 50 Updated Nov 13, 2024

Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster

Go 273 47 Updated Oct 22, 2024

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Python 1,079 310 Updated May 2, 2024

Code for training py-faster-rcnn and py-R-FCN on multiple GPUs in caffe

Jupyter Notebook 193 97 Updated Jun 6, 2017

Caffe models in TensorFlow

Python 2,798 1,034 Updated Jul 18, 2019

Caffe for YOLO

C++ 231 161 Updated Jan 7, 2017

YOLO reimplementation in Caffe, written with a Python layer.

Python 13 1 Updated Apr 11, 2017

A TensorFlow implementation of SqueezeDet, a convolutional neural network for object detection.

Python 739 306 Updated Nov 22, 2022

Caffe: a fast open framework for deep learning.

C++ 4,770 1,674 Updated Apr 21, 2023

Fast R-CNN

Python 3,344 1,567 Updated Jan 23, 2018