- Tesla, Sunnyvale
- zhuangh.github.io
- https://linkedin.com/zhuangh
- @zhuangh
Stars
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
A playbook for systematically maximizing the performance of deep learning models.
Submanifold sparse convolutional networks
AISystem refers to AI systems broadly, covering full-stack foundational technologies such as AI chips, AI compilers, and AI inference and training frameworks
Fast and memory-efficient exact attention
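The key idea behind fast, memory-efficient exact attention is to process keys and values in tiles with an online softmax, so the full n×n score matrix is never materialized. A minimal NumPy sketch of that tiling idea follows; the function names and block size are illustrative, not the repo's actual API.

```python
import numpy as np

def naive_attention(Q, K, V):
    # Reference: materializes the full n x n score matrix (O(n^2) memory).
    S = Q @ K.T / np.sqrt(Q.shape[-1])
    P = np.exp(S - S.max(axis=-1, keepdims=True))
    P /= P.sum(axis=-1, keepdims=True)
    return P @ V

def tiled_attention(Q, K, V, block=4):
    # Online-softmax tiling: only one (n_q x block) score tile is live
    # at a time; running max, denominator, and output are updated per tile.
    n_q, d = Q.shape
    m = np.full(n_q, -np.inf)            # running row-wise max
    l = np.zeros(n_q)                    # running softmax denominator
    acc = np.zeros((n_q, V.shape[-1]))   # unnormalized output accumulator
    for start in range(0, K.shape[0], block):
        Kb, Vb = K[start:start + block], V[start:start + block]
        S = Q @ Kb.T / np.sqrt(d)
        m_new = np.maximum(m, S.max(axis=-1))
        scale = np.exp(m - m_new)        # rescale previous partial results
        P = np.exp(S - m_new[:, None])
        acc = acc * scale[:, None] + P @ Vb
        l = l * scale + P.sum(axis=-1)
        m = m_new
    return acc / l[:, None]
```

Both functions compute the same exact attention output; only the memory footprint differs.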
Roadmap to becoming an Artificial Intelligence Expert in 2022
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
List of Computer Science courses with video lectures.
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Master C++ :punch:. A study repository for C++ Primer (5th Edition, Chinese edition), including notes and answers to the end-of-chapter exercises.
RapidStream TAPA compiles task-parallel HLS programs into high-frequency FPGA accelerators.
By ex-Googlers, for ex-Googlers: a lookup table of similar tech & services
Development repository for the Triton language and compiler
程序员延寿指南 | A programmer's guide to living longer
Open deep learning compiler stack for cpu, gpu and specialized accelerators
FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks
Reinforcement learning environments for compiler and program optimization tasks
Brevitas: neural network quantization in PyTorch
Stencil with Optimized Dataflow Architecture Compiler
Training neural models with structured signals.
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing
A toolkit for Keras and TensorFlow that optimizes ML models for deployment, including quantization and pruning.
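Post-training quantization, one of the techniques such toolkits apply, maps float weights to low-bit integers via a scale and zero point. A minimal NumPy sketch of affine (asymmetric) uint8 quantization follows; the function names are illustrative and not any toolkit's actual API.

```python
import numpy as np

def quantize_affine(w, num_bits=8):
    # Affine quantization: q = clip(round(w / scale) + zero_point),
    # where scale and zero_point map the float range onto [0, 2^bits - 1].
    qmin, qmax = 0, 2 ** num_bits - 1
    lo, hi = float(w.min()), float(w.max())
    scale = (hi - lo) / (qmax - qmin)
    zero_point = int(round(qmin - lo / scale))
    q = np.clip(np.round(w / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    # Recover an approximation of the original floats.
    return (q.astype(np.float32) - zero_point) * scale
```

The round trip quantize-then-dequantize introduces an error of at most about one quantization step, which is the trade-off these deployment toolkits manage.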