Skip to content

Latest commit

 

History

History
53 lines (42 loc) · 4.23 KB

GranD.md

File metadata and controls

53 lines (42 loc) · 4.23 KB

GranD - Grounding Anything Dataset 🚀

The Grounding-anything Dataset (GranD) dataset offers densely annotated data, acquired through an automated annotation pipeline that leverages state-of-the-art (SOTA) vision and V-L models. This documentation covers how to download the GranD dataset and a guide to the automated annotation pipeline used to create GranD.

Download GranD 📂

Note: Please note that annotations are being uploaded incrementally and more parts will be available soon.

Preparing the Pretraining Annotations from GranD 🛠️

After downloading the GranD annotations, utilize the scripts below to transform them into GLaMM pretraining data, or to prepare them for your specific tasks.

The above scripts generate annotations in JSON format. To convert these for use in pretraining datasets requiring LMDB format, please use to the following scripts:

GranD Automated Annotation Pipeline

GranD is a comprehensive, multi-purpose image-text dataset offering a range of contextual information, from fine-grained to high-level details. The pipeline contains four distinct levels. The code for the four levels are provided in: GranD

More detailed information: