gccn

gccn

Organizations

8 stars written in C++

LLM inference in C/C++

C++ 65,763 9,444 Updated Oct 3, 2024

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

C++ 9,865 1,686 Updated Jun 19, 2023

纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行

C++ 3,294 334 Updated Sep 26, 2024

C++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.

C++ 1,744 338 Updated Sep 28, 2024

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 387 50 Updated Sep 5, 2024

Microsoft Collective Communication Library

C++ 309 29 Updated Sep 20, 2023

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 235 33 Updated Sep 25, 2024

Automatic virtualization of (general) accelerators.

C++ 40 20 Updated Nov 28, 2022