Unified optimal transport framework for cross-modal retrieval
-
Updated
Jun 26, 2024 - OCaml
Unified optimal transport framework for cross-modal retrieval
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval" (ACM TOMM 2024).
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
The Unified Code of Image-Text Retrieval for Further Exploration.
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
An AI-powered interactive video retrieval system
Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
Add a description, image, and links to the cross-modal-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the cross-modal-retrieval topic, visit your repo's landing page and select "manage topics."