
A throughput-oriented high-performance serving framework for LLMs

Cuda 589 24 Updated Sep 21, 2024

Low-bit LLM inference on CPU with lookup table

C++ 476 34 Updated Oct 12, 2024

LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers

289 95 Updated Oct 16, 2023

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda 267 24 Updated Jul 2, 2024

A minimal flash-attention implementation using cutlass, intended for teaching

Cuda 29 1 Updated Aug 12, 2024

learning how CUDA works

Cuda 155 20 Updated Aug 16, 2024

An fp8 flash attention implementation for the Ada architecture, built with the cutlass repository

Cuda 48 3 Updated Aug 12, 2024

Python 75 7 Updated Sep 9, 2024

LLM101n: Let's build a Storyteller

29,301 1,602 Updated Aug 1, 2024

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Python 548 75 Updated May 8, 2024

Ring attention implementation with flash attention

Python 552 42 Updated Oct 8, 2024

A lightweight mobile face detection and landmark localization model based on ncnn

C++ 48 14 Updated May 25, 2021

A simple attempt at knowledge distillation.

Python 2 Updated Apr 2, 2021

Hardware decoding of RTSP streams with ffmpeg + cuvid

C++ 25 8 Updated May 7, 2021

Material for gpu-mode lectures

Jupyter Notebook 2,681 267 Updated Oct 11, 2024

A face recognition algorithm usable in browsers and WeChat Mini Programs. This is a WASM implementation of the Retinaface face detection algorithm.

C 38 3 Updated Apr 22, 2024

This project aims to share the technical principles behind large models, along with hands-on experience.

HTML 9,535 934 Updated Sep 22, 2024

🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.

Cuda 1,257 135 Updated Oct 13, 2024

A Detailed Cplusplus Concurrency Tutorial (C++ Concurrent Programming Guide)

C++ 5,303 1,487 Updated Dec 29, 2022

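Erye Fanqiang (二爷翻墙): focused on free firewall circumvention for 30 years, but without mastering the core technology. Everything has already begun! ^_^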

1,924 244 Updated Sep 11, 2024

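A Chinese-language teaching guide to C++ templates. Unlike the well-known book C++ Templates, this tutorial series teaches C++ templates as a Turing-complete language, aiming to help readers gain a thorough command of metaprogramming. (Work in progress)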

C++ 9,546 1,555 Updated Aug 20, 2024

Simple samples for TensorRT programming

Python 1,492 338 Updated Sep 5, 2024

Homework showcases from the "losers" (卢瑟), answer walkthroughs, and some C++ knowledge

C++ 639 134 Updated Sep 22, 2024

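Face detection and face recognition: includes face detection (retinaface, yolov5face, yolov7face, yolov8face), face detection tracking (ByteTracker), face angle calculation (Face_Angle), face alignment (Face_Aligner), face recognition (Arcface), mask detection (MaskRecognitiion), age and gender detection (Gender_age), …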

C++ 285 61 Updated Mar 4, 2024

Code and information for face image quality assessment with SER-FIQ

Python 534 90 Updated Dec 9, 2022

Solutions and Notes for Labs of Computer Systems: A Programmer's Perspective, 3rd Edition // lab files, solutions, and notes for the third edition of the book

C 2,412 438 Updated Feb 15, 2023

A personally tested, working VPN. Verified circumvention access that supports Windows, macOS, Linux, iOS, and Android, with browser plugins for Chrome, Firefox, Opera, and others.

805 120 Updated Aug 16, 2024

Examples from Programming in Parallel with CUDA

Cuda 104 41 Updated Mar 17, 2023