This repo stores data from the USDA Pomological Watercolor Collection:
The USDA Pomological Watercolor Collection documents fruit and nut varieties developed by growers or introduced by USDA plant explorers around the turn of the 20th century. Technically accurate paintings were used to create lithographs illustrating USDA bulletins, yearbooks, and other series distributed to growers and gardeners across America
The following data is provided in this repo:
Image directory containing a .jpg
for each individual painting.
There are 7584 images in total.
Csv file containing metadata for each painting.
Column | Description | Data Type |
---|---|---|
painting_number |
Painting number as enumerated in collection. | number |
fruit |
Pomological name of the primary fruit depicted in the painting. | text |
authors |
Author(s) of the given watercolor painting. | text |
subjects |
Broader classification of fruit(s) depicted in the painting. (e.g. 'apple') | text |
year |
Year painting was published. | number |
thumbnail_image |
Link to thumbnail jpg of watercolor painting. | text |
image |
Link to jpg image of watercolor painting. | text |
Python script used to scrape the paintings.
Example use:
# call without arguments
$ python get_pomological_data.py
# call with arguments
$ python get_pomological_data.py --start=20 --end=400 --csv_name fruits.csv --verbose 1
Calling without arguments scrapes all of the watercolor painting data and stores
the information to a local csv titled usda_pomological_watercolors.csv
.