Material for cuda-mode lectures

Jupyter Notebook 2,271 229 Updated Aug 25, 2024

🎉 CUDA/C++ notes / hand-written CUDA kernels for large models / tech blog, updated sporadically: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 1,084 104 Updated Aug 27, 2024

LLM training in simple, raw C/CUDA

Cuda 23,011 2,564 Updated Aug 26, 2024

A code generator from ONNX to PyTorch code

Python 132 30 Updated Nov 15, 2022

An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git

TeX 9,083 2,575 Updated Mar 15, 2024

Python 6 Updated Aug 23, 2023

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 131,345 26,121 Updated Aug 30, 2024

A unified evaluation library for multiple machine learning libraries

Python 251 48 Updated Mar 29, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,561 94 Updated Aug 26, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 5,846 659 Updated Aug 21, 2024