Stars
Inference Vision Transformer (ViT) in plain C/C++ with ggml
Try to track the available stencil implementations
1
Updated Jul 17, 2024
深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
fujitsu / pytorch
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Main repo to keep scripts, dockerfiles, wiki, etc
Instruction latency & throughput profiler for AArch64
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Minimal PyTorch implementation of YOLOv3