Info hash | 36c39b25657ce1639ccec0a91cf242b42e1f01db |
Last mirror activity | 0:23 ago |
Size | 16.02GB (16,023,403,913 bytes) |
Added | 2019-06-01 17:19:18 |
Views | 1006 |
Hits | 1848 |
ID | 4199 |
Type | multi |
Downloaded | 212 time(s) |
Uploaded by | |
Folder | data-owt |
Num files | 395 files [See full list] |
Mirrors | 5 complete, 0 downloading = 5 mirror(s) total [Log in to see full list] |

![]() |
40.76MB |
![]() |
40.44MB |
![]() |
40.58MB |
![]() |
40.61MB |
![]() |
40.61MB |
![]() |
40.61MB |
![]() |
40.62MB |
![]() |
40.40MB |
![]() |
40.56MB |
![]() |
40.58MB |
![]() |
40.57MB |
![]() |
40.62MB |
![]() |
40.58MB |
![]() |
40.49MB |
![]() |
40.56MB |
![]() |
40.56MB |
![]() |
40.53MB |
![]() |
40.58MB |
![]() |
40.57MB |
![]() |
40.56MB |
![]() |
40.55MB |
![]() |
40.50MB |
![]() |
40.64MB |
![]() |
40.53MB |
![]() |
40.59MB |
![]() |
40.55MB |
![]() |
40.66MB |
![]() |
40.54MB |
![]() |
40.54MB |
![]() |
40.51MB |
![]() |
40.57MB |
![]() |
40.60MB |
![]() |
40.54MB |
![]() |
40.42MB |
![]() |
40.70MB |
![]() |
40.65MB |
![]() |
40.67MB |
![]() |
40.41MB |
![]() |
40.55MB |
![]() |
40.56MB |
![]() |
40.56MB |
![]() |
40.58MB |
![]() |
40.60MB |
![]() |
40.51MB |
![]() |
40.51MB |
![]() |
40.28MB |
![]() |
40.60MB |
![]() |
40.52MB |
![]() |
40.50MB |
|
Type: Dataset
Tags:
Bibtex:
Tags:
Bibtex:
@article{, title= {OpenWebText (Gokaslan's distribution, 2019), GPT-2 Tokenized}, journal= {}, author= {eukaryote31 and Joshua Peterson and Aaron Gokaslan and Vanya Cohen}, year= {}, url= {}, abstract= {Code by eukaryote31 and Joshua Peterson: https://github.com/jcpeterson/openwebtext and https://github.com/eukaryote31/openwebtext Scraped by Aaron Gokaslan and Vanya Cohen: https://skylion007.github.io/OpenWebTextCorpus/ Tokenized by eukaryote31}, keywords= {}, terms= {}, license= {}, superseded= {} }