Stars
Images to inference with no labeling (use foundation models to train supervised models).
【grps接入trtllm】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
BaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Official release of InternLM2.5 base and chat models. 1M context support
Provides an ensemble model to deploy a YOLOv8 TensorRT model to Triton
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
📚 The list of vision-based SLAM / Visual Odometry open source, blogs, and papers
BEVDet online real-time inference using CUDA, TensorRT, ROS1 & C++.
😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
基于TensorRT的C++高性能推理库,Yolov10, YoloPv2,Yolov5/7/X/8,RT-DETR,单目标跟踪OSTrack、LightTrack。
Open deep learning compiler stack for cpu, gpu and specialized accelerators
This Repository is implementation of majority of Semantic Segmentation Loss Functions
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Python implementation of Philip J. Schneider's "Algorithm for Automatically Fitting Digitized Curves" from the book "Graphics Gems"
A parser, editor and profiler tool for ONNX models.
[CVPR 2023] DepGraph: Towards Any Structural Pruning
You Only Look Once for Panopitic Driving Perception.(MIR2022)
🍅 Deploy ncnn on mobile phones. Support Android and iOS. 移动端ncnn部署,支持Android与iOS。
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Baidu Rope3d detector based on yolov7