|  | The Oxford-IIIT Pet Dataset | 2 | 2022-07-29 | 811.09MB | 110,114 | 88+ | 1 | 
                      
          |  | Introduction to Computer Science [CS50x] [Harvard] [2018] | 192 | 2018-01-24 | 9.61GB | 84,932 | 66+ | 5 | 
                      
          |  | Indiana University - Chest X-Rays (XML Reports) | 1 | 2018-11-22 | 1.11MB | 49,833 | 19+ | 0 | 
                      
          |  | Reading Text in the Wild with Convolutional Neural Networks | 1 | 2021-11-12 | 10.68GB | 48,521 | 32 | 5 | 
                      
          |  | Richard Feynman's Lectures on Physics (The Messenger Lectures) | 7 | 2017-07-20 | 1.07GB | 41,596 | 34+ | 1 | 
                      
          |  | Indiana University - Chest X-Rays (PNG Images) | 1 | 2018-11-22 | 1.36GB | 41,128 | 21+ | 1 | 
                      
          |  | VGGFace2: A dataset for recognising faces across pose and age | 14 | 2021-03-07 | 40.25GB | 34,210 | 23+ | 3 | 
                      
          |  | DRIVE: Digital Retinal Images for Vessel Extraction | 2 | 2020-07-11 | 29.34MB | 31,039 | 20+ | 0 | 
                      
          |  | CS231n: Convolutional Neural Networks Spring 2017 | 16 | 2018-09-25 | 2.63GB | 30,522 | 21+ | 2 | 
                      
          |  | Enron Email Dataset | 1 | 2016-08-26 | 443.25MB | 27,597 | 14+ | 3 | 
                      
          |  | NIH Pancreas-CT Dataset | 164 | 2017-09-12 | 4.86GB | 25,980 | 15+ | 1 | 
                      
          |  | 115 paintings from the Hermitage museum, high-resolution, JPEG | 113 | 2021-01-28 | 2.23GB | 24,329 | 25 | 3 | 
                      
          |  | LiTS – Liver Tumor Segmentation Challenge (LiTS17) | 262 | 2018-07-21 | 16.66GB | 21,757 | 17+ | 2 | 
                      
          |  | The Extended Yale Face Database B | 1 | 2014-10-20 | 2.09GB | 20,912 | 13+ | 2 | 
                      
          |  | Vincent van Gogh Paintings | 2036 | 2016-01-24 | 513.76MB | 20,647 | 27+ | 2 | 
                      
          |  | [Coursera] Algorithms: Design and Analysis, Part 1 (Stanford University) (algo) | 532 | 2017-03-05 | 1.95GB | 20,417 | 36+ | 2 | 
                      
          |  | IDRiD (Indian Diabetic Retinopathy Image Dataset) | 3 | 2019-02-06 | 1.01GB | 19,689 | 16+ | 1 | 
                      
          |  | [Coursera] What A Plant Knows (Daniel Chamovitz, Tel Aviv University) | 108 | 2020-06-09 | 538.94MB | 19,599 | 11+ | 2 | 
                      
          |  | Illinois DOC labeled faces dataset | 8 | 2019-12-05 | 6.36GB | 16,618 | 10 | 2 | 
                      
          |  | Viking Merged Color Mosaic | 17 | 2013-12-27 | 276.68MB | 16,062 | 11+ | 1 | 
                      
          |  | THEMIS Day IR Global Mosaic | 217 | 2013-12-27 | 4.39GB | 15,547 | 10+ | 2 | 
                      
          |  | Arizona State University Twitter Data Set | 1 | 2013-12-23 | 354.77MB | 14,930 | 11+ | 0 | 
                      
          |  | NIH Chest X-ray Dataset of 14 Common Thorax Disease Categories | 15 | 2017-10-09 | 45.09GB | 14,506 | 9+ | 2 | 
                      
          |  | AVA: A Large-Scale Database for Aesthetic Visual Analysis | 92 | 2017-07-16 | 33.14GB | 14,446 | 9+ | 1 | 
                      
          |  | Non-contrast head/brain CT CQ500 Dataset | 493 | 2018-10-05 | 28.66GB | 14,348 | 20+ | 4 | 
                      
          |  | Educational Process Mining (EPM): A Learning Analytics Data Set | 1 | 2016-02-11 | 4.93MB | 12,859 | 10+ | 0 | 
                      
          |  | Psychology 1 - General Psychology - Fall 2007 - UC Berkeley | 25 | 2017-03-09 | 3.36GB | 12,736 | 21 | 0 | 
                      
          |  | LUng Nodule Analysis (LUNA16) All Images | 888 | 2018-07-15 | 66.00GB | 12,696 | 12+ | 2 | 
                      
          |  | UCI Machine Learning Datasets 12/2013 | 211 | 2013-12-20 | 16.37GB | 11,464 | 8 | 2 | 
                      
          |  | University of Washington - Pedro Domingos - Machine Learning | 113 | 2018-11-09 | 9.07GB | 11,452 | 25+ | 0 | 
                      
          |  | Caltech CS156 - Machine Learning - Yaser | 54 | 2015-04-24 | 3.36GB | 11,363 | 18+ | 2 | 
                      
          |  | DiaRetDB1 V2.1 - Diabetic Retinopathy Database | 1 | 2019-06-05 | 144.10MB | 10,781 | 9+ | 1 | 
                      
          |  | Malignant lymphoma classification | 1 | 2018-02-19 | 1.44GB | 10,558 | 9+ | 2 | 
                      
          |  | Breast Ultrasound Images Dataset (Dataset BUSI) | 1 | 2021-03-05 | 205.87MB | 10,215 | 14+ | 0 | 
                      
          |  | BU-Web-Client Network Traces | 1 | 2014-05-16 | 13.79MB | 10,189 | 10+ | 1 | 
                      
          |  | MOLA Shaded Relief / Colorized Elevation | 146 | 2013-12-27 | 2.77GB | 10,084 | 10+ | 1 | 
                      
          |  | UCSD Pedestrian Database | 1 | 2016-09-21 | 791.87MB | 9,820 | 3+ | 1 | 
                      
          |  | NLCD2006 Land Cover Change (NLCD2006_landcover_change_pixels_5-4-11_se5.zip) | 1 | 2014-01-21 | 104.43MB | 9,646 | 7+ | 1 | 
                      
          |  | Kaggle Diabetic Retinopathy Detection Training Dataset (DRD) | 3 | 2019-02-06 | 35.00GB | 9,496 | 9+ | 2 | 
                      
          |  | Viking MDIM2.1 Colorized Global Mosaic 232m | 1 | 2014-03-06 | 12.74GB | 9,476 | 5+ | 3 | 
                      
          |  | Twitter Data - NIPS 2012 | 1 | 2014-04-06 | 22.34MB | 9,458 | 11+ | 0 | 
                      
          |  | Ocular Disease Intelligent Recognition ODIR-5K | 3 | 2019-11-25 | 1.30GB | 9,294 | 10+ | 3 | 
                      
          |  | Medical Imaging with Deep Learning Tutorial 2020 - Joseph Paul Cohen | 7 | 2020-07-28 | 76.86MB | 9,228 | 28+ | 0 | 
                      
          |  | Electrical Engineering 123, 001 - Spring 2015 - UC Berkeley | 36 | 2017-03-08 | 6.06GB | 9,190 | 13 | 4 | 
                      
          |  | Statistics 21 - Fall 2009 - UC Berkeley | 25 | 2017-03-09 | 4.27GB | 9,118 | 12 | 0 | 
                      
          |  | US domestic flights from 1990 to 2009 | 1 | 2014-08-10 | 35.40MB | 8,647 | 17+ | 0 | 
                      
          |  | A collection of sport activity files for data analysis and data mining 2016a | 1 | 2016-01-18 | 245.47MB | 8,628 | 6+ | 1 | 
                      
          |  | The Extended Yale Face Database B (Cropped) | 1 | 2014-10-20 | 58.49MB | 8,165 | 7+ | 0 | 
                      
          |  | Labeled Fishes in the Wild | 1 | 2016-02-03 | 444.37MB | 7,378 | 12+ | 0 | 
                      
          |  | [Coursera] Text Mining and Analytics | 148 | 2017-01-22 | 1.06GB | 7,282 | 14 | 0 | 
                      
          |  | Mnih Massachusetts Building Dataset | 303 | 2014-09-19 | 2.07GB | 6,777 | 7+ | 1 | 
                      
          |  | Georgia Tech face database | 1 | 2015-10-29 | 133.19MB | 6,773 | 11+ | 0 | 
                      
          |  | Tom Mitchell - Machine Learning - 2012 | 110 | 2018-11-18 | 5.87GB | 6,600 | 21 | 1 | 
                      
          |  | Analysis of the Cryptocurrency Marketplace | 1 | 2014-02-23 | 1.98MB | 6,560 | 28+ | 0 | 
                      
          |  | 20150112.json.gz | 1 | 2015-01-17 | 3.91GB | 6,513 | 4+ | 1 | 
                      
          |  | 01QZP 2018-2019 Ambient Intelligence | 21 | 2019-06-13 | 4.38GB | 6,410 | 5 | 3 | 
                      
          |  | Public Health 241, 001 - Spring 2011 - UC Berkeley | 38 | 2017-03-10 | 1.87GB | 6,075 | 17 | 1 | 
                      
          |  | Statistics 21 - 001 - Spring 2010 - UC Berkeley | 25 | 2017-03-09 | 5.58GB | 6,014 | 10 | 0 | 
                      
          |  | LBL-CONN-7 Network Traces | 1 | 2014-05-16 | 15.58MB | 5,837 | 10+ | 0 | 
                      
          |  | UCI Folio Leaf Dataset | 1 | 2015-10-12 | 972.47MB | 5,820 | 6+ | 1 | 
                      
          |  | Object-CXR - Automatic detection of foreign objects on chest X-rays | 4 | 2020-07-08 | 13.64GB | 5,511 | 9+ | 2 | 
                      
          |  | Introducing R | 1 | 2014-02-04 | 820.99kB | 5,502 | 16+ | 0 | 
                      
          |  | MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition | 7 | 2019-06-04 | 246.39GB | 5,447 | 13 | 2 | 
                      
          |  | DRIMDB (Diabetic Retinopathy Images Database) Database for Quality Testing of Retinal Images | 1 | 2019-06-05 | 17.07MB | 5,434 | 9+ | 0 | 
                      
          |  | NLCD2006 Land Cover (2011 Edition) nlcd_2006_landcover_2011_edition_2014_03_31.zip | 1 | 2014-10-08 | 1.09GB | 5,430 | 7+ | 0 | 
                      
          |  | Downsampled ImageNet 32x32 | 2 | 2017-06-03 | 4.27GB | 5,246 | 7+ | 1 | 
                      
          |  | Economics 1, 001 - Fall 2011 - UC Berkeley | 24 | 2017-03-09 | 4.19GB | 5,109 | 6 | 2 | 
                      
          |  | 30M Factoid Question-Answer Corpus (30MQA) | 2 | 2018-11-29 | 529.34MB | 5,053 | 8+ | 2 | 
                      
          |  | Multivariable Calculus - Math 53 - Fall 2009 - UC Berkeley | 25 | 2017-03-09 | 5.84GB | 5,013 | 20 | 3 | 
                      
          |  | MPEG-7 Core Experiment CE-Shape-1 [tar.gz] | 1 | 2014-10-08 | 2.27MB | 4,814 | 9+ | 0 | 
                      
          |  | Political Science 179 - Spring 2008 - UC Berkeley | 12 | 2017-03-09 | 1.50GB | 4,789 | 11 | 0 | 
                      
          |  | Nuclear Engineering 101, 001 - Fall 2014 - UC Berkeley | 29 | 2017-03-09 | 4.97GB | 4,712 | 14 | 3 | 
                      
          |  | Integrative Biology 131 - General Human Anatomy Online Course Videos - UCBerkeley | 39 | 2017-03-08 | 8.52GB | 4,587 | 20 | 1 | 
                      
          |  | Physics 10, 001 - Spring 2006 - UC Berkeley | 26 | 2017-03-09 | 4.82GB | 4,307 | 11 | 0 | 
                      
          |  | Visual Object Classes Challenge 2012 Dataset (VOC2012) VOCtrainval_11-May-2012.tar | 1 | 2013-12-19 | 2.00GB | 4,280 | 6+ | 3 | 
                      
          |  | Lerman Digg 2009 Dataset | 2 | 2014-08-15 | 37.55MB | 4,238 | 7+ | 0 | 
                      
          |  | Synthetic Data for Text Localisation in Natural Images | 15 | 2021-11-15 | 73.50GB | 4,053 | 8 | 6 | 
                      
          |  | MNIST Database | 4 | 2014-10-14 | 11.59MB | 4,019 | 11+ | 0 | 
                      
          |  | Structured Web Data Extraction Dataset (SWDE) | 1 | 2015-11-29 | 207.31MB | 3,969 | 8 | 0 | 
                      
          |  | Udacity Self-Driving Car Dataset 2-1 | 1 | 2016-10-10 | 1.64GB | 3,930 | 3 | 1 | 
                      
          |  | INbreast: toward a full-field digital mammographic database | 1 | 2022-08-06 | 2.06GB | 3,842 | 6+ | 1 | 
                      
          |  | International and Area Studies 107, 001 - Spring 2011 - UC Berkeley | 21 | 2017-03-09 | 1.64GB | 3,709 | 9 | 1 | 
                      
          |  | Lerman Twitter 2010 Dataset | 3 | 2014-08-15 | 292.17MB | 3,617 | 8+ | 2 | 
                      
          |  | ImageNet Large Scale Visual Recognition Challenge (V2017) | 1 | 2019-03-06 | 166.02GB | 3,613 | 8+ | 0 | 
                      
          |  | Chest X-Ray Images (Pediatric Pneumonia) | 1 | 2018-12-14 | 1.24GB | 3,532 | 7+ | 2 | 
                      
          |  | Sci-Hub SQL Database (2020-05-30) | 1 | 2020-07-07 | 10.35GB | 3,482 | 7+ | 1 | 
                      
          |  | Chemical & Biomolecular Engineering 179 Process Technology of Solid-State Materials Devices - UC Berkeley | 33 | 2017-03-09 | 3.64GB | 3,447 | 11 | 0 | 
                      
          |  | Downsampled ImageNet 64x64 | 2 | 2017-06-02 | 12.59GB | 3,358 | 5+ | 3 | 
                      
          |  | Chemistry 1A, 002 - Spring 2010 - UC Berkeley | 43 | 2017-03-09 | 9.28GB | 3,211 | 11 | 0 | 
                      
          |  | Crater Detection via Genetic Search Methods to Reduce Image Features | 1 | 2013-12-15 | 19.31MB | 3,125 | 8+ | 1 | 
                      
          |  | Online News Popularity Data Set | 1 | 2016-02-11 | 7.48MB | 3,119 | 5+ | 1 | 
                      
          |  | Effectiveness of Cybersecurity Competitions | 1 | 2013-11-25 | 318.48kB | 3,096 | 9+ | 0 | 
                      
          |  | Introduction to Theory of Computation | 1 | 2014-03-30 | 1.29MB | 3,057 | 7 | 0 | 
                      
          |  | A collection of sport activity datasets with an emphasis on powermeter data | 1 | 2018-06-23 | 919.75MB | 2,946 | 8+ | 0 | 
                      
          |  | NASA Astronomy Picture of the Day Archive (7800 images, 2011) | 1 | 2021-02-22 | 2.82GB | 2,929 | 17 | 0 | 
                      
          |  | DeepLesion (10,594 CT scans with lesions) | 59 | 2019-01-26 | 243.04GB | 2,787 | 7+ | 4 | 
                      
          |  | NYPD 7 Major Felony Incidents | 1 | 2016-02-01 | 13.23MB | 2,756 | 4+ | 0 | 
                      
          |  | Peace and Conflict Studies 164B - Spring 2007 - UC Berkeley | 27 | 2017-03-09 | 5.07GB | 2,725 | 11 | 2 | 
                      
          |  | A collection of sport activity files for data analysis and data mining | 1 | 2015-02-16 | 316.18MB | 2,602 | 4+ | 1 | 
                      
          |  | Bioengineering 200, 001 - Spring 2014 - UC Berkeley | 9 | 2017-03-09 | 3.80GB | 2,570 | 13 | 0 | 
                      
          |  | Boston Hubway Data Visualization Challenge Dataset | 1 | 2015-11-24 | 26.00MB | 2,555 | 5+ | 0 | 
                      
          |  | Genetically Enhanced Feature Selection of Discriminative Planetary Crater Image Features | 1 | 2013-11-23 | 580.96kB | 2,499 | 6+ | 0 | 
                      
          |  | Stanford EE364A - Convex Optimization I - Boyd | 41 | 2015-04-25 | 4.46GB | 2,491 | 10+ | 3 | 
                      
          |  | Object and Concept Recognition for Content-Based Image Retrieval (CBIR) | 1365 | 2014-09-22 | 387.89MB | 2,409 | 4+ | 2 | 
                      
          |  | A collection of IRONMAN, IRONMAN 70.3 and Ultra-triathlon race results | 1 | 2016-09-30 | 54.96MB | 2,403 | 5+ | 0 | 
                      
          |  | BRATS2013 Tumor-NoTumor Dataset (T-NT) | 1 | 2018-11-05 | 65.63MB | 2,361 | 9+ | 0 | 
                      
          |  | Accelerometer-Based Event Detector for Low-Power Applications | 1 | 2014-02-01 | 1.45MB | 2,325 | 3+ | 1 | 
                      
          |  | UMN Sarwat Foursquare Dataset (September 2013) | 3 | 2014-07-02 | 160.75MB | 2,300 | 5+ | 1 | 
                      
          |  | BuzzFeed News transcription of Airbnb NYC data | 1 | 2016-02-01 | 192.92kB | 2,298 | 6+ | 1 | 
                      
          |  | Wikipedia English Official Offline Edition 2014-02-03 | 1 | 2014-04-28 | 10.59GB | 2,263 | 3+ | 1 | 
                      
          |  | MovieLens 20M Dataset | 1 | 2016-12-16 | 198.70MB | 2,244 | 5+ | 0 | 
                      
          |  | Astronomy C12, 001 - Fall 2014 - UC Berkeley | 25 | 2017-03-09 | 11.35GB | 2,181 | 12 | 1 | 
                      
          |  | Chemistry 3B, 002 - Fall 2014 - UC Berkeley | 26 | 2017-03-09 | 4.79GB | 2,115 | 6 | 0 | 
                      
          |  | Crater Dataset | 1 | 2013-12-11 | 32.49MB | 2,084 | 8+ | 0 | 
                      
          |  | Mars Weekend: A Panel and Games at the Museum of Science Boston | 1 | 2013-11-23 | 146.90kB | 2,080 | 3+ | 0 | 
                      
          |  | Environmental Economics and Policy 145 - Fall 2014 - UC Berkeley | 37 | 2017-03-09 | 3.93GB | 2,045 | 11 | 1 | 
                      
          |  | Bernoulli trials based feature selection for crater detection | 1 | 2013-12-11 | 2.38MB | 2,020 | 7+ | 1 | 
                      
          |  | Holistic Recognition of Low Quality License Plates (HDR dataset) | 1 | 2018-08-30 | 65.88MB | 1,992 | 7+ | 0 | 
                      
          |  | RSNA Pneumonia Detection Challenge (JPG files) | 29686 | 2020-03-19 | 3.93GB | 1,912 | 7 | 3 | 
                      
          |  | Ischemic Stroke Lesion Segmentation Challenge 2017 (ISLES2017) | 1382 | 2017-09-13 | 1.40GB | 1,812 | 14+ | 1 | 
                      
          |  | Law 271, Environmental Law and Policy - Fall 2009 - UC Berkeley | 26 | 2017-03-10 | 4.25GB | 1,770 | 10 | 2 | 
                      
          |  | A general method applicable to the search for similarities in the amino acid sequence of two proteins | 1 | 2014-10-29 | 641.72kB | 1,731 | 4+ | 1 | 
                      
          |  | Efficient Accelerometer-based Event Detector in Wireless Sensor Networks | 1 | 2014-02-01 | 516.71kB | 1,700 | 3+ | 1 | 
                      
          |  | ISBI Challenge: Segmentation of neuronal structures in EM stacks | 3 | 2016-07-01 | 23.61MB | 1,699 | 4+ | 0 | 
                      
          |  | New approach for modeling of transiting exoplanets for arbitrary limb-darkening law | 1 | 2014-02-01 | 3.58MB | 1,666 | 3+ | 0 | 
                      
          |  | 1000 Fundus images with 39 categories | 1 | 2019-08-20 | 402.76MB | 1,601 | 6+ | 1 | 
                      
          |  | Management of acute and post-operative pain in chronic kidney disease | 1 | 2014-02-07 | 571.35kB | 1,579 | 3+ | 1 | 
                      
          |  | A Brief Review of Nature-Inspired Algorithms for Optimization | 1 | 2017-01-09 | 161.54kB | 1,552 | 3+ | 0 | 
                      
          |  | New York Taxi Data 2009-2016 in Parquet Format | 800 | 2017-07-01 | 35.08GB | 1,549 | 4 | 1 | 
                      
          |  | Breast Cancer Cell Segmentation | 174 | 2018-02-19 | 159.96MB | 1,541 | 6+ | 2 | 
                      
          |  | Lung CT Segmentation Challenge 2017 (LCTSC) | 9569 | 2019-03-22 | 5.11GB | 1,524 | 10+ | 1 | 
                      
          |  | Public Health 150E, 001 - Spring 2015 - UC Berkeley | 1 | 2017-03-09 | 252.00MB | 1,472 | 14+ | 0 | 
                      
          |  | HMC-QU echocardiography ultrasound recordings | 1 | 2023-01-07 | 2.49GB | 1,469 | 7+ | 2 | 
                      
          |  | US Stock Market End of Day dataset | 1 | 2016-12-24 | 250.71MB | 1,454 | 9 | 1 | 
                      
          |  | MRI Dataset for Hippocampus Segmentation (HFH) (hippseg_2011) | 1 | 2019-08-20 | 598.88MB | 1,314 | 6+ | 2 | 
                      
          |  | comma.ai driving dataset | 6 | 2018-10-25 | 48.28GB | 1,241 | 2+ | 2 | 
                      
          |  | The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions | 6 | 2022-07-29 | 3.20GB | 1,241 | 5+ | 2 | 
                      
          |  | True Marble Global Image Dataset GeoTIFF | 68 | 2016-08-26 | 9.74GB | 1,238 | 4+ | 2 | 
                      
          |  | Udacity Dataset 2-3 Compressed | 1 | 2016-10-12 | 20.96GB | 1,222 | 3 | 2 | 
                      
          |  | SIIM-ACR Pneumothorax Segmentation | 15295 | 2020-07-04 | 2.07GB | 1,194 | 9+ | 1 | 
                      
          |  | Replicated GPT-2 1.5B Parameter Model | 3 | 2019-08-24 | 5.79GB | 1,137 | 4+ | 0 | 
                      
          |  | CMU Graphics Lab Motion Capture Database Converted to FBX | 1 | 2019-05-16 | 1.92GB | 1,112 | 3+ | 1 | 
                      
          |  | 10 years of Dukascopy Forex Tick Data (2008-2019) | 475 | 2021-02-21 | 65.03GB | 1,097 | 5 | 2 | 
                      
          |  | COCO 2017 Resized to 256x256 | 1 | 2021-04-05 | 1.64GB | 1,045 | 5+ | 0 | 
                      
          |  | STructured Analysis of the Retina | 1 | 2022-12-31 | 484.37MB | 1,041 | 9+ | 0 | 
                      
          |  | MIT-BIH Arrhythmia Database | 145 | 2019-05-29 | 93.86MB | 1,035 | 9+ | 1 | 
                      
          |  | A collection of sport activity datasets for data analysis and data mining 2017a | 1 | 2017-03-30 | 789.14MB | 1,003 | 4+ | 0 | 
                      
          |  | Minecraft Skins | 1 | 2019-08-05 | 2.47GB | 985 | 5+ | 1 | 
                      
          |  | MNIST Database (mnist.pkl.gz) | 1 | 2016-10-12 | 16.17MB | 971 | 8+ | 1 | 
                      
          |  | Udacity Didi $100k Challenge Dataset 1 | 1 | 2017-03-23 | 32.80GB | 967 | 2 | 2 | 
                      
          |  | Stanford Drone Dataset | 1 | 2019-04-27 | 71.00GB | 964 | 8+ | 1 | 
                      
          |  | GANGogh training data set | 1 | 2017-11-29 | 37.15GB | 953 | 2+ | 1 | 
                      
          |  | Terrestrial Ecological Systems of the United States (Version 3.0; Updated March 2014) | 1 | 2016-02-04 | 3.99GB | 952 | 1+ | 2 | 
                      
          |  | MICCAI 2015 Challenge on Multimodal Brain Tumor Segmentation (BraTS2015) | 1812 | 2017-09-19 | 5.34GB | 938 | 6+ | 1 | 
                      
          |  | Head-Neck-CT | 43286 | 2019-04-21 | 22.84GB | 813 | 14 | 3 | 
                      
          |  | AOL Search Data 20M web queries (2006) | 3 | 2016-12-17 | 460.41MB | 794 | 5+ | 0 | 
                      
          |  | Small Object Dataset | 1 | 2017-06-06 | 5.86MB | 784 | 6+ | 0 | 
                      
          |  | TB Portal Tuberculosis Chest X-ray dataset for Belarus | 1049 | 2020-10-22 | 12.40GB | 769 | 8+ | 1 | 
                      
          |  | QuantQuote Free Historical Stock Data 2013 | 1 | 2016-06-20 | 36.62MB | 747 | 3+ | 2 | 
                      
          |  | Yelp Restaurant Photo Classification Data | 5 | 2016-05-12 | 14.14GB | 741 | 2+ | 2 | 
                      
          |  | Udacity Self-Driving Car Driving Data 9/29/2016 (dataset.bag.tar.gz) | 1 | 2016-10-10 | 25.95GB | 738 | 2+ | 1 | 
                      
          |  | UC Merced Land Use Dataset | 1 | 2017-10-10 | 332.47MB | 737 | 6+ | 2 | 
                      
          |  | PROSTATEx | 42121 | 2019-04-27 | 4.32GB | 726 | 10 | 1 | 
                      
          |  | Udacity Self Driving Car Dataset 3-1: El Camino | 1 | 2016-10-21 | 29.99GB | 670 | 2 | 2 | 
                      
          |  | CAMUS Cardiac Acquisitions for Multi-structure Ultrasound Segmentation | 7500 | 2023-01-12 | 3.83GB | 653 | 10+ | 1 | 
                      
          |  | Gland Segmentation in Histology Images Challenge (GlaS) Dataset | 1 | 2016-09-21 | 180.90MB | 636 | 8+ | 2 | 
                      
          |  | P. vivax (malaria) infected human blood smears (BBBC041) | 1 | 2019-08-02 | 2.26GB | 604 | 7+ | 2 | 
                      
          |  | Twitch Emotes Images Dataset | 1 | 2019-08-03 | 4.29GB | 588 | 5+ | 1 | 
                      
          |  | Non-Small Cell Lung Cancer CT Scan Dataset (NSCLC-Radiomics-Genomics) | 183 | 2017-09-19 | 4.52GB | 575 | 7+ | 2 | 
                      
          |  | The Cars Overhead With Context (COWC) | 128 | 2016-09-20 | 9.34GB | 565 | 2+ | 2 | 
                      
          |  | Didi Data Release #2 - Round 1 Test Sequence and Training | 1 | 2017-04-04 | 21.93GB | 558 | 2 | 2 | 
                      
          |  | PanNuke: An Open Pan-Cancer Histology Dataset for Nuclei Instance Segmentation and Classification | 3 | 2020-08-14 | 2.08GB | 553 | 8+ | 1 | 
                      
          |  | Sentiment Labelled Sentences Data Set | 1 | 2016-08-26 | 512.21kB | 543 | 6+ | 0 | 
                      
          |  | Udacity Self-Driving Car Dataset 2-2 | 1 | 2016-10-10 | 7.04GB | 525 | 2 | 1 | 
                      
          |  | ISIC2018: Skin Lesion Analysis Towards Melanoma Detection | 9 | 2019-07-24 | 17.08GB | 498 | 7+ | 2 | 
                      
          |  | Animals with Attributes 2 (AwA2) dataset | 2 | 2017-10-23 | 13.92GB | 475 | 5+ | 1 | 
                      
          |  | r/WritingPrompts, Text (2018) | 1 | 2019-06-19 | 87.47MB | 434 | 5 | 0 | 
                      
          |  | musicnet.tar.gz | 1 | 2019-12-03 | 11.10GB | 355 | 2+ | 2 | 
                      
          |  | PMC Open Access Subset | 16 | 2020-05-24 | 84.14GB | 333 | 7+ | 2 | 
                      
          |  | Condensing Steam: Distilling the Diversity of Gamer Behavior | 1 | 2019-01-15 | 18.30GB | 326 | 2+ | 0 | 
                      
          |  | Medical Segmentation Decathlon Datasets | 4380 | 2018-09-20 | 75.91GB | 321 | 8+ | 1 | 
                      
          |  | LNDb CT scan dataset (training) | 240 | 2019-12-16 | 29.21GB | 312 | 7+ | 2 | 
                      
          |  | VGG Cell Dataset from Learning To Count Objects in Images | 1 | 2017-04-02 | 16.34MB | 300 | 8+ | 1 | 
                      
          |  | Columbia University Image Library (COIL-20) | 3 | 2015-11-26 | 19.89MB | 291 | 7+ | 0 | 
                      
          |  | Pre-configured (Mint) linux based virtual machine image | 1 | 2017-01-05 | 3.06GB | 284 | 3+ | 0 | 
                      
          |  | Microsoft Academic Graph - 2016/02/05 | 1 | 2016-12-25 | 28.94GB | 269 | 3+ | 2 | 
                      
          |  | Open Payments Dataset - 2014 Program Year | 1 | 2017-02-26 | 728.44MB | 252 | 6+ | 1 | 
                      
          |  | Electron Microscopy (CA1 hippocampus) Dataset | 5 | 2017-10-24 | 3.87GB | 250 | 6+ | 2 | 
                      
          |  | TotalSegmentator CT Dataset | 1 | 2022-11-17 | 28.40GB | 248 | 9+ | 1 | 
                      
          |  | Inria Aerial Image Labeling Dataset | 5 | 2019-04-27 | 20.96GB | 246 | 4+ | 1 | 
                      
          |  | MoNuSeg Training Data - Multi-organ nuclei segmentation from H&E stained histopathological images | 1 | 2018-09-05 | 142.31MB | 243 | 7+ | 0 | 
                      
          |  | Wikilinks: A Large-scale Cross-Document Coreference Corpus Labeled via Links to Wikipedia (Original Dataset) | 10 | 2017-03-04 | 1.84GB | 241 | 4+ | 2 | 
                      
          |  | Open Payments Dataset - 2015 Program Year | 1 | 2017-02-26 | 584.88MB | 238 | 3+ | 1 | 
                      
          |  | OpenWebText (Gokaslan's distribution, 2019), GPT-2 Tokenized | 395 | 2019-06-01 | 16.02GB | 223 | 2 | 1 | 
                      
          |  | Great Zebra and Giraffe Count ID Dataset | 1 | 2020-07-31 | 10.43GB | 217 | 4 | 2 | 
                      
          |  | UrbanMapper 3D (Digital Surface Model and Digital Terrain Model) Dataset | 3 | 2017-10-14 | 6.62GB | 206 | 5+ | 1 | 
                      
          |  | Leaf counting dataset | 2 | 2020-06-22 | 925.39MB | 205 | 7+ | 3 | 
                      
          |  | Reddit comments/submissions 2005-06 to 2022-06 | 404 | 2022-07-17 | 1.76TB | 194 | 6 | 0 | 
                      
          |  | POLEN23E: image dataset for the Brazilian Savannah pollen types | 1 | 2018-11-09 | 34.56MB | 185 | 10+ | 0 | 
                      
          |  | UCF Google Street View Dataset 2014 | 15 | 2019-04-10 | 46.25GB | 178 | 3+ | 1 | 
                      
          |  | Human acute monocytic leukemia | 1 | 2017-04-28 | 1.61GB | 165 | 8+ | 2 | 
                      
          |  | Labeled Optical Coherence Tomography (OCT) | 1 | 2018-12-15 | 5.79GB | 165 | 8+ | 1 | 
                      
          |  | Corpus of Russian news articles collected from Lenta.Ru | 1 | 2018-07-16 | 1.81GB | 164 | 2+ | 1 | 
                      
          |  | Modified PubMed Dataset used by WSU-IR team at TREC 2015 Clinical Decision Support Track | 1 | 2016-09-24 | 18.53GB | 162 | 2 | 1 | 
                      
          |  | PADCHEST_SJ (Feb 2019 Update) | 60 | 2019-04-07 | 1.13TB | 160 | 5+ | 2 | 
                      
          |  | Whale Shark ID Dataset | 1 | 2020-07-31 | 6.47GB | 149 | 3 | 3 | 
                      
          |  | The PatchCamelyon benchmark dataset (PCAM) | 10 | 2018-11-13 | 8.06GB | 148 | 6+ | 2 | 
                      
          |  | North America roads GIS data | 1 | 2018-07-21 | 8.35GB | 139 | 2+ | 0 | 
                      
          |  | Open Payments Dataset - 2013 Program Year | 1 | 2017-02-26 | 277.98MB | 138 | 7+ | 1 | 
                      
          |  | UT Zappos50K (Version 2.1) | 6 | 2020-10-16 | 887.03MB | 135 | 5+ | 1 | 
                      
          |  | N+1 fish, N+2 fish dataset (test_videos) | 667 | 2017-09-06 | 32.93GB | 127 | 1+ | 2 | 
                      
          |  | VizWiz v1.0 dataset (Answering Visual Questions from Blind People) | 1 | 2018-08-23 | 15.39GB | 125 | 5+ | 1 | 
                      
          |  | Data of the White Matter Hyperintensity (WMH) Segmentation Challenge | 1 | 2022-12-21 | 8.72GB | 97 | 5 | 3 | 
                      
          |  | vgg19_normalized.pkl | 1 | 2016-10-12 | 80.13MB | 96 | 3+ | 0 | 
                      
          |  | ImageClef - IAPR TC-12 Benchmark | 1 | 2018-11-03 | 1.76GB | 87 | 3+ | 3 | 
                      
          |  | TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild | 13 | 2020-04-19 | 1.14TB | 73 | 2 | 1 | 
                      
          |  | Avantes Dual Spectrograph.zip | 1 | 2016-08-30 | 21.48GB | 72 | 2 | 0 | 
                      
          |  | ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events | 25 | 2019-02-05 | 1.65TB | 69 | 1+ | 3 | 
                      
          |  | Ukrainian Open Speech To Text Dataset 4.2 ~1200 hours | 18 | 2021-03-09 | 188.31GB | 6 |  | 1 |