
Larger model #6

Open
yixuan-qiao opened this issue Feb 7, 2022 · 4 comments

@yixuan-qiao

Awesome work, thanks for releasing!

Are there any plans to release larger models in the future, such as BLIP-large or BLIP-xxlarge?

@LiJunnan1992
Contributor

Hi, we have released a larger model which uses ViT-L as the vision encoder (the text encoder is still bert-base). Currently we do not have plans to train models larger than that.

Thanks!
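
For reference, here is a minimal sketch of loading that ViT-L checkpoint for captioning through the repo's `blip_decoder` API. The checkpoint filename is a placeholder (take the actual URL from the README), and the preprocessing mirrors the repo's demo:

```python
import torch
from PIL import Image
from torchvision import transforms
from models.blip import blip_decoder  # from this repo

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Preprocessing matching the BLIP demo: 384px input, CLIP-style normalization.
preprocess = transforms.Compose([
    transforms.Resize((384, 384), interpolation=transforms.InterpolationMode.BICUBIC),
    transforms.ToTensor(),
    transforms.Normalize((0.48145466, 0.4578275, 0.40821073),
                         (0.26862954, 0.26130258, 0.27577711)),
])

# vit='large' selects the ViT-L vision encoder; the text side stays bert-base.
CHECKPOINT = 'model_large_caption.pth'  # placeholder -- see the README for the real URL
model = blip_decoder(pretrained=CHECKPOINT, image_size=384, vit='large')
model.eval().to(device)

image = preprocess(Image.open('example.jpg').convert('RGB')).unsqueeze(0).to(device)
with torch.no_grad():
    caption = model.generate(image, sample=False, num_beams=3,
                             max_length=20, min_length=5)[0]
print(caption)
```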

@christophschuhmann

In case you change your mind: we at LAION can provide compute and have 6B as-yet-unreleased image-text pairs, 2.3B of them in English.
(https://laion.ai)

We are currently busy preparing the training of CLIP versions, but we could simply scale up the ViT & LM with the existing code and cooperate on pulling off the training.

Btw, here is a Colab with pretty impressive captioning results I got with BLIP, by generating many candidate captions and filtering them with CLIP ViT-L & ResNet 50x64 (a sketch of the reranking step follows below): https://colab.research.google.com/drive/1fKxiDMa-9uu1A6XiYjxTbYxSagvbZ8Fb?usp=sharing
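
For readers who want the gist of that filtering step without opening the Colab: the idea is to sample many captions from BLIP and keep the one CLIP scores highest against the image. A minimal sketch with OpenAI's `clip` package; it uses a single reranker model (the Colab combines ViT-L/14 and RN50x64), and the sampling parameters are assumptions:

```python
import torch
import clip  # pip install git+https://github.com/openai/CLIP.git

device = 'cuda' if torch.cuda.is_available() else 'cpu'
clip_model, clip_preprocess = clip.load('ViT-L/14', device=device)

@torch.no_grad()
def rerank_captions(pil_image, captions):
    """Return candidate captions sorted by CLIP image-text similarity, best first."""
    image = clip_preprocess(pil_image).unsqueeze(0).to(device)
    text = clip.tokenize(captions).to(device)
    img_feat = clip_model.encode_image(image)
    txt_feat = clip_model.encode_text(text)
    img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    txt_feat = txt_feat / txt_feat.norm(dim=-1, keepdim=True)
    sims = (img_feat @ txt_feat.T).squeeze(0)
    order = sims.argsort(descending=True)
    return [(captions[i], sims[i].item()) for i in order]

# Candidates come from BLIP nucleus sampling, e.g. one caption per call:
# candidates = [blip_model.generate(blip_image, sample=True, top_p=0.9,
#                                   max_length=20, min_length=5)[0]
#               for _ in range(16)]
# best_caption, score = rerank_captions(pil_image, candidates)[0]
```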

@LiJunnan1992
Contributor

Hi @christophschuhmann, it would be great if we could cooperate on training larger BLIP models with our code and your data & compute. I am very interested in continuing this discussion.

Thanks for the colab, the captions do look nice!

@christophschuhmann

Awesome! :)

We mostly use Discord for correspondence. My handle is: spirit-from-germany#1488

Here is an invite link to the server we work on:

https://discord.gg/AAwcPAw894

For the image captioning and VQA work, we use the #image-captioning channel.

Let's chat there :)

Btw, here are some VQA results we recently got with a frozen CLIP ViT-L/14 and a frozen GPT-J, with a trained mapping transformer in between:

[Screenshots: example VQA question-answer outputs from the frozen CLIP ViT-L/14 + mapping transformer + frozen GPT-J model]
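
For context, a setup like that typically trains only a small bridging module that maps the frozen CLIP image embedding to a "prefix" of token embeddings fed into the frozen LM (in the spirit of ClipCap). A minimal sketch; all dimensions, the prefix length, and the query-token design are assumptions, not LAION's actual code:

```python
import torch
import torch.nn as nn

class MappingTransformer(nn.Module):
    """Trainable bridge: frozen CLIP image embedding -> prefix for a frozen LM.

    Assumed dims: CLIP ViT-L/14 image embedding is 768-d,
    GPT-J hidden states are 4096-d; prefix_len is a free choice.
    """
    def __init__(self, clip_dim=768, lm_dim=4096, prefix_len=10, depth=4, heads=8):
        super().__init__()
        self.prefix_len = prefix_len
        self.proj_in = nn.Linear(clip_dim, lm_dim)
        # Learned query tokens that will become the LM prefix.
        self.queries = nn.Parameter(torch.randn(prefix_len, lm_dim) * 0.02)
        layer = nn.TransformerEncoderLayer(d_model=lm_dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)

    def forward(self, clip_embed):                       # (B, clip_dim)
        img_tok = self.proj_in(clip_embed).unsqueeze(1)  # (B, 1, lm_dim)
        q = self.queries.unsqueeze(0).expand(clip_embed.size(0), -1, -1)
        # Self-attention over [queries ; image token]; keep the query positions.
        out = self.encoder(torch.cat([q, img_tok], dim=1))
        return out[:, :self.prefix_len]                  # (B, prefix_len, lm_dim)

# Training loop idea: prepend this prefix to the embedded question tokens,
# run the frozen GPT-J, and backprop the answer cross-entropy through the
# mapper only (CLIP and GPT-J parameters stay frozen).
```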
