Stars
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Transformer related optimization, including BERT, GPT
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
A tensorflow implementation of EAST text detector
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
💃 Real-time single person pose estimation for Android and iOS.
Realtime C++ code for multi-person pose estimation
CNN architecture for articulated human pose estimation
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Pelee(NeurIPS'18)-TensorRT Implementation (Caffe Parser)
https://www.datafountain.cn/competitions/334/details 赛题的baseline开源
Object Detection and Tracking based on C++ and OpenCV
shihenw / caffe
Forked from BVLC/caffeCaffe: a fast open framework for deep learning. Fork maintained by shihenw in CMU.
DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
based on official caffe_train, i change some cudnn*.cpp and cudnn*.hpp for cuda9.