WebText dataset urls.txt.tar.gz

urls.txt.tar.gz 1.75GB
Type: Dataset

Bibtex:
@article{,
title= {WebText dataset urls.txt.tar.gz},
journal= {},
author= {},
year= {},
url= {https://github.com/eukaryote31/openwebtext},
abstract= {Collection of URLs hosting content used in the WebText dataset described by OpenAI here: https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf

URLs obtained with the scripts by eukaryote31},
keywords= {WebText, Reddit},
terms= {},
license= {},
superseded= {}
}

Citation:
WebText dataset urls.txt.tar.gz. (2019). [Data set]. Academic Torrents. https://academictorrents.com/details/15f3494b2991e75194d3af72bf7afa5025a7abc3
Hosted by users

Send Feedback