Stars
encoder
3 repositories
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
MLCD & UNICOM: Large-Scale Visual Representation Model
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"