The Pile An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI

Name: The Pile An 800GB Dataset of Diverse Text for Language Modeling
Creator: EleutherAI
Published: 2021-03-01 01:37:09
License: https://academictorrents.com/nolicensespecified

A DMCA notice has been issued for this torrent

Date	2023-08-24 13:54:33
Submitter Name	Thomas Heldrup, Vesterbrogade 15, 1 floor, 1620 Copenhagen V, Denmark
Submitter Email	Thomas.heldrup@rettighedsalliancen.dk
Provide a description of the content in question:	"The book ""Afrikas Horn"" by Wilbur Smith, published by Lindhardt og Ringhof A/S in Denmark. There are additional 108 works we represent that are infringed on the URL. On the following link you can see an official description of ""Afrikas Horn"" by Wilbur Smith: https://www.lindhardtogringhof.dk/afrikas-horn-3"
How are you authorized to make the request?	Authorised agent
How is the content not covered under the Fair Use Act sections 107 or 108?	The work originates from an illegal filesharing site called bibliotik.me (this explicit from the paper documenting "The Pil" found here: https://arxiv.org/abs/2101.00027. As the origin of the copy of the content is an illegal source the content cannot be claimed to fall under the Fair Use doctrine.
Provide a statement that the complaining party has a good faith belief.	I have good faith that the use of the work in this notice is not authorised by the copyright owner its agent, or the law.

Info hash	0d366035664fdf51cfbe9f733953ba325776e667
Last mirror activity	1058d,01:26:33 ago
Size	772.89GB (772,891,257,239 bytes)
Added	2021-03-01 01:37:09
Views	954
Hits	1112
ID	4618
Type	multi
Downloaded	465 time(s)
Uploaded by	joecohen
Folder	EleutherAI_ThePile_v1
Num files	51 files [See full list]
Mirrors	0 complete, 0 downloading = 0 mirror(s) total [Log in to see full list]

Compute download stats

Related Torrents

Send Feedback

The Pile An 800GB Dataset of Diverse Text for Language Modeling EleutherAI

The Pile An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI