Google Open Images

train.tgz 447.89GB
val.tgz 8.66GB
abstract= {Open Images is a dataset of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories.
This is the train and validation part of the dataset. All images are resized to 420 pixels on the shorter side. The name of each saved image corresponds to Google's ImageID, which can be used to look up labels in the Open Images dataset.

train:
all: 9011219
downloaded: 8798643
labeled: 8646180
post-download clean: 8591564

validation:
all: 167056
downloaded: 160957
post-download clean: 159847},
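
The abstract notes that each saved file name corresponds to an ImageID that can be used to look up labels. A minimal Python sketch of that lookup follows. It is an illustration under stated assumptions, not the dataset's official tooling: the `.jpg` extension, the `train/` directory, the `ImageID` and `LabelName` column names, and the sample rows are all hypothetical and should be adjusted to the actual annotation files.

```python
import csv
import io
from collections import defaultdict
from pathlib import Path


def image_id_from_path(path):
    """Return the file name without its extension, taken here as the ImageID."""
    return Path(path).stem


def load_labels(csv_file):
    """Map ImageID -> list of label names.

    Assumes a CSV with 'ImageID' and 'LabelName' columns (an assumption
    about the annotation file layout).
    """
    labels = defaultdict(list)
    for row in csv.DictReader(csv_file):
        labels[row["ImageID"]].append(row["LabelName"])
    return labels


# Hypothetical annotation rows, for illustration only.
SAMPLE_CSV = """ImageID,LabelName
0000aaaa0000aaaa,/m/example1
0000aaaa0000aaaa,/m/example2
1111bbbb1111bbbb,/m/example3
"""

labels = load_labels(io.StringIO(SAMPLE_CSV))

# Hypothetical path to an image extracted from train.tgz.
image_id = image_id_from_path("train/0000aaaa0000aaaa.jpg")
print(image_id, labels.get(image_id, []))
```

With a real annotation file, `io.StringIO(SAMPLE_CSV)` would be replaced by an opened file handle. Images that were downloaded but have no labels simply return an empty list.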
