Starred repositories

Tengine is a lite, high-performance, modular inference engine for embedded devices

C++ 4,591 995 Updated Dec 24, 2023

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 390 8 Updated Jul 4, 2024

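AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks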

Jupyter Notebook 9,677 1,393 Updated Jul 17, 2024

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 661 161 Updated Jul 23, 2024

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,386 428 Updated Jul 23, 2024

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 766 121 Updated Jul 29, 2023

Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU).

C++ 93 102 Updated Jul 23, 2024

StyleGAN - Official TensorFlow Implementation

Python 14,037 3,166 Updated Apr 10, 2024

CUDA programming learning materials

Cuda 30 9 Updated Apr 4, 2020

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,660 445 Updated Oct 9, 2023

A PyTorch Library for Accelerating 3D Deep Learning Research

Python 4,325 537 Updated Jul 22, 2024

C++ Lightweight Utility Extensions

C++ 74 20 Updated Nov 28, 2021

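pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly; all of the PyTorch tutorials it contains have been tested to ensure they run successfully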

Jupyter Notebook 19,883 5,377 Updated Aug 2, 2023

Seamless operability between C++11 and Python

C++ 15,190 2,061 Updated Jul 23, 2024

Read-only mirror of https://gitlab.gnome.org/GNOME/libxml2

C 560 361 Updated Jul 22, 2024

nniefacelib is a face algorithm library that runs on HiSilicon 35xx series chips

C 552 180 Updated Jan 27, 2023

RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4/MKV/WebM

C 3,000 1,064 Updated Jul 21, 2024

Project moved to: https://github.com/llvm/llvm-project

LLVM 4,599 2,097 Updated Sep 2, 2020

A hands-on tutorial on the core principles of TVM

Python 58 16 Updated Dec 4, 2020

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 11,447 3,409 Updated Jul 23, 2024

An open optimized software library project for the ARM® Architecture

C 1,449 404 Updated Dec 9, 2022

AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.

Python 207 35 Updated Mar 21, 2024

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C++ 4,172 688 Updated Jul 23, 2024

Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure framework to speed up Deep Learning on Arm-based heterogeneous em…

C++ 270 101 Updated Oct 16, 2018

MegEngine is a fast, scalable, easy-to-use numerical computing framework with support for automatic differentiation

C++ 1 Updated Mar 25, 2020

Caffe: a fast open framework for deep learning.

C++ 33,985 18,725 Updated Feb 21, 2024

WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11

C++ 13,331 3,288 Updated Jul 23, 2024

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

C++ 41,292 10,436 Updated Jul 23, 2024

Fast C++ logging library.

C++ 23,137 4,391 Updated Jul 22, 2024