Stars
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
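Provides a practical interactive interface for large language models (LLMs) such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons & function plugins, project analysis & self-translation for Python, C++, and other projects, PDF/LaTeX paper translation & summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…
[CVPR 2024] Official implementation of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation>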
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Segment Anything in Medical Images
Official implementation of SAM-Med2D
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Helper function for working with the REAL-Colon Dataset
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Effortless data labeling with AI support from Segment Anything and other awesome models.
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Spatial-Temporal Feature Transformation for Video Object Detection, MICCAI2021
Code for MICCAI2023: YONA: You Only Need One Adjacent Reference-frame for Accurate and Fast Video Polyp Detection
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
State-of-the-art 2D and 3D Face Analysis Project
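Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.
ChatReviewer: Use ChatGPT to analyze a paper's strengths and weaknesses and suggest improvements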
OpenMMLab Detection Toolbox and Benchmark
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
A repository of models, textual inversions, and more
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.