Skip to content
View richjjj's full-sized avatar
Block or Report

Block or report richjjj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.

Python 57 9 Updated Jun 26, 2024

This repository is based on shouxieai/tensorRT_Pro, with adjustments to support YOLOv8.

C++ 147 28 Updated Jun 3, 2024

LLM inference in C/C++

C++ 60,893 8,688 Updated Jun 28, 2024

Cartographer is a system that provides real-time simultaneous localization and mapping (SLAM) in 2D and 3D across multiple platforms and sensor configurations.

C++ 7,017 2,237 Updated Jan 5, 2024

Autoware - the world's leading open-source software project for autonomous driving

Shell 8,571 2,862 Updated Jun 27, 2024

跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : ) 。**VideoPipe下一版本正在开发中,在保证跨平台、易上手的前提下,预计性能直逼deepstream等各硬件平台官方框架**。

C++ 1,145 162 Updated Jun 6, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 476 35 Updated Apr 7, 2024

A simple implementation of Tensorrt YOLOv8

Cuda 73 16 Updated Apr 24, 2023

[ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers

Python 102 8 Updated Jan 10, 2024

The official implementation of the NeurIPS 2022 paper Q-ViT.

Python 76 7 Updated May 22, 2023

🚀🚀🚀This is an AI high-performance reasoning C++ library, Currently supports the deployment of yolov5, yolov7, yolov7-pose, yolov8, yolov8-seg, yolov8-pose, yolov8-obb, yolox, RTDETR, DETR, depth-an…

C++ 97 15 Updated May 4, 2024

This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes

Shell 28 2 Updated May 23, 2024

ONNX-compatible DeDoDe 🎶 Detect, Don't Describe - Describe, Don't Detect, for Local Feature Matching. Supports TensorRT 🚀

Python 57 4 Updated Aug 21, 2023

C++ application to perform computer vision tasks using Nvidia Triton Server for model inference

C++ 13 1 Updated Jun 19, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,174 318 Updated Jun 27, 2024

Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function independently without continuous internet access.

55 3 Updated Mar 23, 2024

Mamba SSM architecture

Python 11,370 924 Updated Jun 24, 2024

A C++ implementation for UCMCTrack (SOTA in MOT17)

C++ 7 3 Updated Apr 5, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 1,994 175 Updated Jun 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,774 3,066 Updated Jun 28, 2024

TensorRT+YOLO系列的 多路 多卡 多实例 并行视频分析处理案例

C++ 194 34 Updated Jun 25, 2024

gstreamer rtsp client support rockchip and jetson nx for C/C++ Python

C++ 53 18 Updated Jan 22, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,200 1,222 Updated Jun 28, 2024

彻底弄懂BP反向传播,15行代码,C++实现也简单,MNIST分类98.29%精度

C++ 32 5 Updated Apr 2, 2022

A toolkit showing GPU's all-round capability in video processing

C 167 40 Updated Aug 7, 2023

An onnx-based quantitation tool.

Python 68 10 Updated Jan 8, 2024

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 429 35 Updated Jun 24, 2024

提供多款 Shadowrocket 规则,拥有强劲的广告过滤功能。每日8时重新构建规则。

11,242 682 Updated Jun 27, 2024

分流规则、重写写规则及脚本。

JavaScript 16,190 2,601 Updated Jun 27, 2024

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,637 149 Updated May 27, 2024