This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video""

Python 58 8 Updated May 17, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,759 452 Updated Sep 19, 2024

deepcs233 / Visual-CoT

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 113 6 Updated Oct 12, 2024

mhamilton723 / STEGO

Unsupervised Semantic Segmentation by Distilling Feature Correspondences

Jupyter Notebook 720 145 Updated Mar 24, 2023

wusize / CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Python 167 9 Updated Feb 5, 2024

jianzongwu / Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

817 47 Updated Oct 8, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,331 208 Updated Oct 8, 2024

cvlab-kaist / CAT-Seg

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Python 260 25 Updated Apr 11, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,998 791 Updated Aug 7, 2024

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,250 1,429 Updated Aug 8, 2024

Developer-Y / cs-video-courses

List of Computer Science courses with video lectures.

66,979 9,088 Updated Sep 13, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,904 194 Updated Sep 19, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,423 3,267 Updated Jul 23, 2024

OpenDriveLab / maskalign

[CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"

Python 64 5 Updated Dec 6, 2023

YanhaoWu / STSSL

Code for **Spatiotemporal Self-supervised Learning for Point Clouds in the Wild** (STSSL) CVPR2023

Python 43 4 Updated Mar 4, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,801 356 Updated May 8, 2024

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python 1,439 84 Updated Jan 23, 2024

facebookresearch / PLRC

Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)

Python 32 3 Updated Dec 23, 2022

facebookresearch / fair_self_supervision_benchmark

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Python 587 63 Updated Oct 12, 2021

hologerry / SoCo

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Python 172 21 Updated Nov 17, 2021

extreme-assistant / CVPR2024-Paper-Code-Interpretation

cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集，极市团队整理

12,428 2,311 Updated Apr 25, 2024

abess-team / abess

Fast Best-Subset Selection Library

C++ 472 41 Updated Sep 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CongpeiQiu

Highlights

Block or report CongpeiQiu

Stars

sihyun-yu / REPA

3DTopia / OpenLRM

justimyhxu / GRM

Lingzhi-Pan / PILOT

szymanowiczs / splatter-image

BradyFU / Awesome-Multimodal-Large-Language-Models

ztt1024 / denseSSL

shashankvkt / DoRA_ICLR24