irma.nps.gov-datastore

Folder: irma.nps.gov-datastore (14 files)
zipfiles.txt.zst 10.08kB
steps.txt 0.38kB
README 1.14kB
profiles.tar.zst 45.95MB
pdfs.tar.zst 473.98GB
pages.tar.zst 11.62MB
others.tar.zst 252.37GB
html.tar.zst 48.05MB
get-profile.sh 0.25kB
holdings.tar.zst 7.28MB
extracted-zip.tar.zst 2.18TB
get-holdings.sh 0.80kB
download-page.sh 1.00kB
download-file-types.txt.zst 631.36kB
Type: Dataset
Tags: gov, united states, national park service, irma, nps

Bibtex:
@article{,
title= {irma.nps.gov-datastore},
journal= {},
author= {},
year= {},
url= {},
abstract= {# IRMA NPS DataStore

A mirror of https://irma.nps.gov/DataStore/Search/Quick, made by submitting a search for the empty string and crawling through all results and file downloads.

Data captured on 2025-03-04

This archive contains 3317 pages of search results, amounting to 165806 records
("references" in DataStore lingo).
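
As a quick consistency check on those two counts, the sketch below looks for page sizes that would turn 165806 records into exactly 3317 pages. The page size itself is not stated anywhere in this README, so the result is an inference, not a documented property of DataStore.

```python
import math

TOTAL_RECORDS = 165806  # from the README
TOTAL_PAGES = 3317      # from the README


def pages_needed(records: int, page_size: int) -> int:
    """Number of result pages needed to hold `records` at `page_size` per page."""
    return math.ceil(records / page_size)


# Try every page size from 1 to 200 and keep the ones matching the page count.
candidates = [
    size for size in range(1, 201)
    if pages_needed(TOTAL_RECORDS, size) == TOTAL_PAGES
]

# Records left over for the final page, assuming the first matching page size.
page_size = candidates[0]
last_page_records = TOTAL_RECORDS - (TOTAL_PAGES - 1) * page_size

print(candidates, last_page_records)
```

Under that assumption the counts are consistent with 50 results per page and a short final page.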

pages.tar.zst contains all pages from search results, in JSON format.

profiles.tar.zst contains JSON metadata (description, date published, author)
for each reference.

holdings.tar.zst contains JSON file listings per reference, i.e. file
MIME-types and sizes.

html.tar.zst, pdfs.tar.zst, extracted-zip.tar.zst, and others.tar.zst contain
the actual downloaded files, grouped by file type to improve compressibility.
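
Because the larger archives are impractical to unpack just to see what they hold, it may help to list their contents as a stream. The following is a minimal sketch, not the tooling used to build this mirror: it assumes the `zstd` command-line tool is on the PATH, and the helper names `iter_members` and `list_archive` are hypothetical. The `--long=27` flag mirrors the `--long` setting this README mentions for compression and is likely redundant at that window size.

```python
import subprocess
import tarfile
from typing import BinaryIO, Iterator, List, Tuple


def iter_members(stream: BinaryIO) -> Iterator[Tuple[str, int]]:
    """Yield (name, size) for each regular file in an uncompressed tar stream."""
    # Mode "r|" reads the tar sequentially, so the stream never needs to seek.
    with tarfile.open(fileobj=stream, mode="r|") as tar:
        for member in tar:
            if member.isfile():
                yield member.name, member.size


def list_archive(path: str) -> List[Tuple[str, int]]:
    """List files inside a .tar.zst archive without writing anything to disk."""
    # Assumption: the zstd CLI is installed; -d decompresses, -c writes to stdout.
    cmd = ["zstd", "-d", "-c", "--long=27", path]
    with subprocess.Popen(cmd, stdout=subprocess.PIPE) as proc:
        members = list(iter_members(proc.stdout))
        # Drain any trailing tar padding so zstd can exit cleanly.
        while proc.stdout.read(1 << 20):
            pass
    if proc.returncode != 0:
        raise RuntimeError(f"zstd exited with status {proc.returncode}")
    return members
```

For example, `list_archive("pages.tar.zst")` would return the name and size of every search-result page in that archive. Listing still decompresses the whole stream, so the multi-hundred-gigabyte archives will take a while.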

extracted-zip.tar.zst does not contain the original zip files but rather their
extracted contents, so that they can be recompressed more effectively by
ZStandard. The original zip files totaled 2.2 TiB; the fully extracted data
came to 3.1 TiB.

Detailed code for how the data was scraped is available in steps.txt. Data is
packed in ZStandard-compressed tarballs with -9 --long to reduce torrent
metadata and disk usage.},
keywords= {national park service,irma,nps,gov,united states},
terms= {},
license= {},
superseded= {}
}

