![:octocat: :octocat:](https://github.githubassets.com/images/icons/emoji/octocat.png)
-
The Intelligent Media Analysis Group (IMAG), Nanjing University of Science and Technology
- Nanjing, Jiangsu, China
Block or Report
Block or report WayneTomas
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
✨✨Latest Advances on Multimodal Large Language Models
[CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
[ICML2024]The official implementation of SemiRES in PyTorch.
📚 A collection of papers about Referring Image Segmentation.
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
A benchmark dataset for GRES and GREC [CVPR2023 Highlight]
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
OpenMMLab Detection Toolbox and Benchmark
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A lightweight codebase for referring expression comprehension and segmentation
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
library supporting NLP and CV research on scientific papers
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Recent Advances in Vision and Language Pre-training (VLP)
[TPAMI]CTNet: Context-based Tandem Network for Semantic Segmentation
This project, pdf2md, transforms academic paper PDF files into digestible text files. By analyzing the layout of the PDF file, the application restructures paragraphs and translates desired content…
opengovsg / pdf2md
Forked from jzillmann/pdf-to-markdownA PDF to Markdown converter
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
SAM with text prompt
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
Bringing Old Photo Back to Life (CVPR 2020 oral)