usace.contentdm.oclc.org (15 files)
steps.txt |
2.41kB |
README |
0.52kB |
pdfs.tar.zst |
482.59GB |
pages.tar.zst |
10.65MB |
others.tar.zst |
25.37GB |
old/steps.txt |
1.94kB |
old/pages.tar.zst |
386.23kB |
old/README |
0.36kB |
old/items.tar.zst |
1.93MB |
old/item-links.txt.zst |
17.63kB |
jp2s.tar.zst |
113.83GB |
items.tar.zst |
10.83MB |
item-links.txt.zst |
2.28MB |
download-urls.txt.zst |
79.09kB |
file-types.txt.zst |
202.76kB |
Type: Dataset
Bibtex:
Tags:
Bibtex:
@article{,
title= {usace.contentdm.oclc.org},
journal= {},
author= {},
year= {},
url= {},
abstract= {**U.S. Army Corps of Engineers Digital Library**
An almost complete mirror of https://usace.contentdm.oclc.org/
Data captured from 2025-02-28 to 2025-03-02
Metadata is downloaded in JSON format and is available in pages.tar.zst and
items.tar.zst
Downloads are available segmented by filetype in other .tar.zst folders:
pdfs.tar.zst contains only PDF files, jp2s.tar.zst contains only JPEG 2000
files, and so on.
download-urls.txt.zst and item-links.txt.zst are intermediate artifacts
from scraping. steps.txt contains the shell scripts used to produce this dataset.},
keywords= {usace,usa,united states,gov},
terms= {},
license= {},
superseded= {}
}
steps.txt