Cartographer is a system that provides real-time simultaneous localization and mapping (SLAM) in 2D and 3D across multiple platforms and sensor configurations.

C++ 7,017 2,237 Updated Jan 5, 2024

autowarefoundation / autoware

Autoware - the world's leading open-source software project for autonomous driving

Shell 8,571 2,862 Updated Jun 27, 2024

sherlockchou86 / VideoPipe

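A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star : ). **The next version of VideoPipe is under development; while remaining cross-platform and easy to pick up, its performance is expected to approach that of the official frameworks for each hardware platform, such as DeepStream.**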

C++ 1,145 162 Updated Jun 6, 2024

tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 476 35 Updated Apr 7, 2024

Monday-Leo / YOLOv8_Tensorrt

A simple TensorRT implementation of YOLOv8

Cuda 73 16 Updated Apr 24, 2023

zkkli / RepQ-ViT

[ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers

Python 102 8 Updated Jan 10, 2024

YanjingLi0202 / Q-ViT

The official implementation of the NeurIPS 2022 paper Q-ViT.

Python 76 7 Updated May 22, 2023

yhwang-hub / dl_model_infer

🚀🚀🚀 This is a high-performance AI inference library in C++. It currently supports the deployment of yolov5, yolov7, yolov7-pose, yolov8, yolov8-seg, yolov8-pose, yolov8-obb, yolox, RTDETR, DETR, depth-an…

C++ 97 15 Updated May 4, 2024

levipereira / triton-server-yolo

This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes

Shell 28 2 Updated May 23, 2024

fabio-sim / DeDoDe-ONNX-TensorRT

ONNX-compatible DeDoDe 🎶 Detect, Don't Describe - Describe, Don't Detect, for Local Feature Matching. Supports TensorRT 🚀

Python 57 4 Updated Aug 21, 2023

olibartfast / computer-vision-triton-cpp-client

C++ application to perform computer vision tasks using Nvidia Triton Server for model inference

C++ 13 1 Updated Jun 19, 2024

ztxz16 / fastllm

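A pure C++ cross-platform LLM acceleration library that can be called from Python. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.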

C++ 3,174 318 Updated Jun 27, 2024

BestAnHongjun / LMDeploy-Jetson

Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function independently without continuous internet access.

55 3 Updated Mar 23, 2024

state-spaces / mamba

Mamba SSM architecture

Python 11,370 924 Updated Jun 24, 2024

LSH9832 / UCMCTrack-cpp

A C++ implementation for UCMCTrack (SOTA in MOT17)

C++ 7 3 Updated Apr 5, 2024

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 1,994 175 Updated Jun 27, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,774 3,066 Updated Jun 28, 2024

1461521844lijin / trt_yolo_video_pipeline

An example of multi-stream, multi-GPU, multi-instance parallel video analysis and processing with TensorRT and the YOLO series

C++ 194 34 Updated Jun 25, 2024

zhuyuliang / gst_rtsp_client

GStreamer RTSP client supporting Rockchip and Jetson NX, for C/C++ and Python

C++ 53 18 Updated Jan 22, 2024

intel-analytics / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,200 1,222 Updated Jun 28, 2024