Stars
A latent text-to-image diffusion model
Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection
Some code for processing and analysing UK Biobank cardiac MR images.
Official code of ACM MM2024 paper- Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models
Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation
Code repository for the 2024 MICCAI Paper "TabMixer: Noninvasive Estimation of the Mean Pulmonary Artery Pressure via Imaging and Tabular Data Mixing"
This is an implementation of zero-shot instance segmentation using Segment Anything.
SAM model finetuned for Skin Instance segmentation tasks
This is an official repo for fine-tuning SAM to customized medical images.
Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios
This is the pytorch implement of our paper "RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model"
Evaluation of Segment Anything Model 2: The Role of SAM2 in the Underwater Environment
Instance Shadow Detection with A Single-Stage Detector [SSIS & SSISv2] (CVPR 2021 Oral & TPAMI 2022)
Video Instance Shadow Detection Under the Sun and Sky (IEEE TIP 2024)
Task-Customized Mixture of Adapters for General Image Fusion (CVPR 2024)
[ECCV 2024] TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data (an official implementation)