Stars
🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
2021年最新整理, C++ 学习资料,含C++ 11 / 14 / 17 / 20 / 23 新特性、入门教程、推荐书籍、优质文章、学习笔记、教学视频等
Development repository for the Triton language and compiler
《Machine Learning Systems: Design and Implementation》- Chinese Version
A new markup-based typesetting system that is powerful and easy to learn.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
CTPAPI的Python接口,使用Swig技术制作,支持pip install。
This repository contains demos I made with the Transformers library by HuggingFace.
Transformer seq2seq model, program that can build a language translator from parallel corpus
Playing Pokemon Red with Reinforcement Learning
Implementation of the LLVM tutorial in Python
LLVM Tutorial: Kaleidoscope (Implementing a Language with LLVM)
clang & llvm examples, e.g. AST Interpreter, Function Pointer Analysis, Value Range Analysis, Data-Flow Analysis, Andersen Pointer Analysis, LLVM Backend...
A step-by-step tutorial for building an LLVM sample pass
Simplified Chinese translation for the LLVM Tutorial