
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

Python 2,538 84 Updated Apr 25, 2023

A playbook for systematically maximizing the performance of deep learning models.

26,642 2,214 Updated Jun 18, 2024

Submanifold sparse convolutional networks

C++ 2,031 332 Updated Jan 9, 2024

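AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks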

Jupyter Notebook 10,683 1,539 Updated Sep 29, 2024

Fast and memory-efficient exact attention

Python 13,662 1,252 Updated Oct 8, 2024

Roadmap to becoming an Artificial Intelligence Expert in 2022

JavaScript 29,070 2,472 Updated Dec 31, 2023

Paragraph-by-paragraph close reading of classic and new deep learning papers

26,538 2,408 Updated Aug 8, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 30,127 2,756 Updated Oct 8, 2024

List of Computer Science courses with video lectures.

66,874 9,091 Updated Sep 13, 2024

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 42,592 5,432 Updated Oct 4, 2024

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,005 123 Updated Apr 17, 2024

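Get C++ done :punch:. A study repository for the Chinese edition of C++ Primer, 5th Edition, including notes and answers to the end-of-chapter exercises.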

C++ 7,901 1,964 Updated Sep 12, 2024

RapidStream TAPA compiles task-parallel HLS programs into high-frequency FPGA accelerators.

C++ 153 30 Updated Oct 8, 2024

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

14,557 1,040 Updated Jul 26, 2024

Development repository for the Triton language and compiler

C++ 12,950 1,575 Updated Oct 8, 2024

A programmer's guide to living longer

29,840 2,091 Updated Jan 30, 2024

Lingvo

Python 2,811 445 Updated Oct 4, 2024

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

Python 11,670 3,450 Updated Oct 8, 2024

FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks

C++ 42 6 Updated Apr 12, 2022

Vitis HLS LLVM source code and examples

379 56 Updated Sep 30, 2024

Reinforcement learning environments for compiler and program optimization tasks

Python 906 127 Updated Jun 11, 2024

Brevitas: neural network quantization in PyTorch

Python 1,163 192 Updated Oct 7, 2024

Dataflow compiler for QNN inference on FPGAs

Python 724 230 Updated Oct 7, 2024

Vitis HLS Library for FINN

C++ 174 65 Updated Sep 23, 2024

Stencil with Optimized Dataflow Architecture Compiler

Python 16 5 Updated May 4, 2020

Training neural models with structured signals.

Python 982 190 Updated Jun 18, 2024

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing

Python 322 92 Updated Apr 20, 2024

Machine learning on FPGAs using HLS

C++ 1,241 402 Updated Oct 7, 2024

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Python 1,486 320 Updated Sep 25, 2024