Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,379 247 Updated Apr 24, 2024

facebookresearch / Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,837 207 Updated Mar 21, 2024

salesforce / BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,542 610 Updated Aug 5, 2024

tsujuifu / pytorch_violet

A PyTorch implementation of VIOLET

Python 136 6 Updated Dec 17, 2023

bytedance / ibot

iBOT 🤖: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)

Jupyter Notebook 651 76 Updated Apr 14, 2022

microsoft / GLIP

Grounded Language-Image Pre-training

Python 2,110 187 Updated Jan 24, 2024

easonnie / mlp-vil

MLPs for Vision and Langauge Modeling (Coming Soon)

27 Updated Dec 9, 2021

zdou0830 / METER

METER: A Multimodal End-to-end TransformER Framework

Python 357 30 Updated Nov 16, 2022

gaopengcuhk / Stable-Pix2Seq

A full-fledged version of Pix2Seq

Python 235 20 Updated Nov 6, 2021

VITA-Group / CV_A-FAN

[TMLR] "Adversarial Feature Augmentation and Normalization for Visual Recognition", Tianlong Chen, Yu Cheng, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zhangyang Wang, Jingjing Liu

Python 20 2 Updated Nov 27, 2022

j-min / VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Python 356 57 Updated Jul 29, 2023

zinengtang / VidLanKD

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))

Python 56 8 Updated Feb 6, 2023

THUDM / P-tuning

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Python 910 111 Updated Oct 6, 2022

salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

Python 1,470 191 Updated Sep 20, 2022

KaiyangZhou / CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,629 190 Updated May 20, 2024

rabeehk / compacter

Python 126 15 Updated Aug 18, 2022

microsoft / Focal-Transformer

[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

Python 543 59 Updated Mar 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhe Gan zhegan27

Achievements

Achievements

Block or report zhegan27

Stars

microsoft / X-Decoder

JialianW / GRiT

buxiangzhiren / DDCap

microsoft / FIBER

NoviScl / GPT3-Reliability

kakaobrain / mindall-e

microsoft / GenerativeImage2Text

microsoft / UniTAB

xyzforever / BEVT

czczup / ViT-Adapter

microsoft / SwinBERT

facebookresearch / SLIP

j-min / DallEval

OFA-Sys / OFA