usace.contentdm.oclc.org

usace.contentdm.oclc.org (15 files)
steps.txt 2.41kB
README 0.52kB
pdfs.tar.zst 482.59GB
pages.tar.zst 10.65MB
others.tar.zst 25.37GB
old/steps.txt 1.94kB
old/pages.tar.zst 386.23kB
old/README 0.36kB
old/items.tar.zst 1.93MB
old/item-links.txt.zst 17.63kB
jp2s.tar.zst 113.83GB
items.tar.zst 10.83MB
item-links.txt.zst 2.28MB
download-urls.txt.zst 79.09kB
file-types.txt.zst 202.76kB
Type: Dataset
Tags: usa, gov, united states, usace

Bibtex:
@article{,
title= {usace.contentdm.oclc.org},
journal= {},
author= {},
year= {},
url= {},
abstract= {**U.S. Army Corps of Engineers Digital Library**

An almost complete mirror of https://usace.contentdm.oclc.org/

Data captured from 2025-02-28 to 2025-03-02

Metadata was downloaded in JSON format and is available in pages.tar.zst and
items.tar.zst.

Downloaded files are split by file type into separate .tar.zst archives:
pdfs.tar.zst contains only PDF files, jp2s.tar.zst contains only JPEG 2000
files, and others.tar.zst contains the remaining file types.

download-urls.txt.zst and item-links.txt.zst are intermediate artifacts
from scraping. steps.txt contains the shell scripts used to produce this dataset.},
keywords= {usace,usa,united states,gov},
terms= {},
license= {},
superseded= {}
}
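
Because the larger archives run to hundreds of gigabytes, it may be more practical to list their contents as a stream than to unpack them first. The following is a minimal sketch, not the method used in steps.txt: `iter_tar_members` and `open_zst` are hypothetical helper names, and `open_zst` assumes the third-party `zstandard` package is installed. The internal layout of the archives is not described here, so the sketch only reports member names and sizes.

```python
import io
import tarfile
from typing import BinaryIO, Iterator, Tuple


def iter_tar_members(stream: BinaryIO) -> Iterator[Tuple[str, int]]:
    """Yield (name, size) for each regular file in an uncompressed tar stream.

    Mode "r|" reads strictly forward, so the stream never needs to seek
    and nothing is written to disk.
    """
    with tarfile.open(fileobj=stream, mode="r|") as archive:
        for member in archive:
            if member.isfile():
                yield member.name, member.size


def open_zst(path: str) -> BinaryIO:
    """Return a decompressed byte stream for a .zst file.

    Assumption: the third-party `zstandard` package is available.
    """
    import zstandard  # imported lazily so the tar helper works without it

    return zstandard.ZstdDecompressor().stream_reader(open(path, "rb"))
```

A possible use, starting with one of the small metadata archives, would be `for name, size in iter_tar_members(open_zst("items.tar.zst")): print(name, size)`.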
