WebText dataset urls.txt.tar.gz

urls.txt.tar.gz 1.75GB
Type: Dataset
Tags: WebText, Reddit

Bibtex:
@article{,
title= {WebText dataset urls.txt.tar.gz},
journal= {},
author= {},
year= {},
url= {https://github.com/eukaryote31/openwebtext},
abstract= {Collection of URLs hosting content used in the WebText dataset described by OpenAI here: https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf

URLs obtained with the scripts by eukaryote31},
keywords= {WebText, Reddit},
terms= {},
license= {},
superseded= {}
}

10-day statistics (1 download taking more than 30 seconds)

Average Time 35 minutes, 26 seconds
Average Speed 825.38kB/s
Best Time 35 minutes, 26 seconds
Best Speed 825.38kB/s
Worst Time 35 minutes, 26 seconds
Worst Speed 825.38kB/s