eric-ai-lab / ComCLIP (27 stars)
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Topics: causality, clip, svo, slip, vision-and-language, compositionality, flickr8k-dataset, image-text-matching, flickr30k, image-text-retrieval, winoground, blip2
Updated Apr 10, 2024 · Python

juletx / spatial-reasoning (12 stars)
Grounding Language Models for Compositional and Spatial Reasoning
Topics: nlp, computer-vision, deep-learning, dataset, image-captioning, image-retrieval, spatial-reasoning, multimodal, vision-and-language, grounding, vsr, caption-retrieval, winoground, visual-spatial-reasoning
Updated Oct 26, 2022 · Jupyter Notebook