UranusSeven

🎯

Focusing

Uranus UranusSeven

🎯

Focusing

42 followers · 12 following

https://www.zhihu.com/people/840445

Achievements

x2 x3 x3

Achievements

x2 x3 x3

Block or Report

Block or report UranusSeven

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (9)

Sort

🚀 My stack

Stable Diffusion⭐️

3 repositories

Tools🔨

7 repositories

Training

1 repository

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

16 stars written in C++

Clear filter

ggerganov / llama.cpp

LLM inference in C/C++

C++ 61,436 8,783 Updated Jul 10, 2024

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 33,038 3,304 Updated Jul 9, 2024

apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

C++ 13,903 3,384 Updated Jul 10, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ 11,928 1,414 Updated Jul 10, 2024

BYVoid / OpenCC

Conversion between Traditional and Simplified Chinese

C++ 8,215 972 Updated Jun 19, 2024

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,647 407 Updated Jul 1, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,432 802 Updated Jul 10, 2024

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 5,643 877 Updated Mar 27, 2024

facebookincubator / velox

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

C++ 3,288 1,086 Updated Jul 10, 2024

li-plus / chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4

C++ 2,830 327 Updated Jun 24, 2024

microsoft / msccl

Microsoft Collective Communication Library

C++ 271 26 Updated Sep 20, 2023

KnowingNothing / MatmulTutorial

A Easy-to-understand TensorOp Matmul Tutorial

C++ 221 22 Updated Jun 15, 2024

microsoft / mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 183 27 Updated Jul 9, 2024

bytedance / flux

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 80 7 Updated Jul 9, 2024

tlc-pack / libflash_attn

Standalone Flash Attention v2 kernel without libtorch dependency

C++ 79 12 Updated May 21, 2024

LLMServe / SwiftTransformer

High performance Transformer implementation in C++.

C++ 43 2 Updated Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uranus UranusSeven

Achievements

Achievements

Block or report UranusSeven

Lists (9)

Big data💿

Gen AI Applications

GGML

HPC💻

Local GPT🤖

🚀 My stack

Stable Diffusion⭐️

Tools🔨

Training

Starred repositories

ggerganov / llama.cpp

ggerganov / whisper.cpp

apache / arrow

triton-lang / triton

BYVoid / OpenCC

SJTU-IPADS / PowerInfer

NVIDIA / TensorRT-LLM

NVIDIA / FasterTransformer

facebookincubator / velox

li-plus / chatglm.cpp

microsoft / msccl

KnowingNothing / MatmulTutorial

microsoft / mscclpp

bytedance / flux

tlc-pack / libflash_attn

LLMServe / SwiftTransformer

Starred topics

Deep learning

C++

python3

Database