
OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,708 104 Updated Nov 22, 2024

CUDA Kernel Benchmarking Library

Cuda 520 66 Updated Nov 20, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,555 4,136 Updated Nov 23, 2024

Test suite for probing the numerical behavior of NVIDIA tensor cores

Cuda 30 12 Updated Jul 24, 2024

A collection of compiler learning resources.

Python 2,168 332 Updated May 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 30,693 4,656 Updated Nov 23, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,622 206 Updated Nov 22, 2024

OpenCL extension for csmith.

C++ 23 11 Updated Apr 1, 2017

A CUDA compiler fuzzer

C++ 22 9 Updated Oct 12, 2023

A primitive library for neural networks

C++ 1,295 216 Updated Nov 7, 2024

Optimized primitives for collective multi-GPU communication

C++ 3,262 827 Updated Sep 17, 2024

Converter from the cppreference.com HTML archive to Microsoft Help (for Visual Studio 2012+) and CHM help (for Windows)

CSS 967 166 Updated Oct 7, 2024

Csmith, a random generator of C programs

C++ 1,024 147 Updated Jan 26, 2024

A list of tutorials, papers, talks, and open-source projects for emerging compilers and architectures

397 35 Updated Oct 14, 2024

First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.

C 350 72 Updated Nov 10, 2014

CUDA Templates for Linear Algebra Subroutines

C++ 5,695 978 Updated Nov 18, 2024

Python bindings for FFmpeg - with complex filtering support

Python 10,072 893 Updated Aug 4, 2024

This repository is a home to Intel® Deep Learning Streamer (Intel® DL Streamer) Pipeline Framework. Pipeline Framework is a streaming media analytics framework, based on GStreamer* multimedia frame…

C++ 529 171 Updated Oct 31, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 31,583 3,649 Updated Nov 23, 2024

Notes on "The Linux/UNIX System Programming Handbook" (《Linux/Unix系统编程手册》)

C 88 24 Updated Apr 16, 2024

2 2 Updated Jun 23, 2024

Khronos Vulkan, OpenGL, and OpenGL ES Conformance Tests

C++ 527 295 Updated Nov 23, 2024

SPIR-V specs

HTML 112 77 Updated Nov 20, 2024

The WebRTC project

310 59 Updated Jan 15, 2018

OpenCL for Rust

Rust 6 5 Updated Feb 24, 2021

OpenCL for Rust

Rust 736 75 Updated Apr 5, 2024

Rust tools for OpenCL and GPU management.

Rust 81 38 Updated Sep 26, 2024

zk-SNARK library

Rust 190 120 Updated Oct 1, 2024