Skip to content

LAION-Beyond: Visual Concepts unseen in LAION-400M

Notifications You must be signed in to change notification settings

M-HuangX/LAION-Beyond

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

LAION-Beyond

This repository is created for open-source LAION-Beyond dataset. The benchmark consists of two parts: OOV (Out of Vocabulary) and IV (In Vocabulary). The OOV image-text pairs count is 106,052, while the IV image count is 51,330 (IV data does not provide captions and is only used for evaluation). The LAION-Beyond dataset consists of 9 domains: Plants Fungi, Insects Spiders, Animals, Pokemon, FolkArt, Landmark, Attire, Food, and Architecture.

Our pre-print paper and LAION-Beyond dataset will be released soon.

Analysis

Statistics of the class quantities across 9 domains

Statistics of the class across 9 domains

Statistics of the image quantities across 9 domains

Statistics of the image quantities across 9 domains

Image Counts per Category

Image Counts per Category

Zeroshot of Openclip Performance

ZeroshotCLIP Openclip Performance

Different Methods' Performance on LAION-Beyond's OOV images (Ours: FSNL)

Different Methods' Performance on LAION-Beyond

Updates

  • 05.12.2023: Create the LAION-Beyond repo.

About

LAION-Beyond: Visual Concepts unseen in LAION-400M

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published