Block or Report
Block or report woaihekele
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
This is a list of useful libraries and resources for CUDA development.
😆 国内外互联网技术大牛们都写了哪些书籍:计算机基础、网络、前端、后端、数据库、架构、大数据、深度学习...
A collection of resources for learning type theory and type theory adjacent fields.
DSPy: The framework for programming—not prompting—foundation models
A massively parallel, optimal functional runtime in Rust
A massively parallel, high-level programming language
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻中国独立开发者项目列表 -- 分享大家都在做什么
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Play ChatGPT and other LLM with Xiaomi AI Speaker
🦀 Small exercises to get you used to reading and writing Rust code!
🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
<<Rust算法题解>>,用Rust语言实现常见的算法和数据结构,以及leetcode题解,algos = algorithms,written with ❤️ by course.rs team
OpenGFW is a flexible, easy-to-use, open source implementation of GFW (Great Firewall of China) on Linux
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
The book "Performance Analysis and Tuning on Modern CPU"
🚂 🦀 The one-person framework for Rust for side-projects and startups
Next generation face swapper and enhancer
FlashInfer: Kernel Library for LLM Serving
A Java version of simdjson, a high-performance JSON parser utilizing SIMD instructions
Learn Rust dark magics by implementing an expression framework in database systems