A collection of datasets useful for natural language processing (including annotated text, speech, etc.) that I've uploaded.
Most of the uploaded data was originally sourced from the Linguistic Data Consortium (https://www.ldc.upenn.edu/).
If there are any issues with any of the torrents' data or metadata, or if you have any questions, please feel free to email me at < datatorrents AT computer DOT garden >.
Thank you!