Type: Dataset
Tags: reddit
Bibtex:
@article{, title= {Subreddit comments/submissions 2005-06 to 2023-12}, journal= {}, author= {Watchful1}, year= {}, url= {https://www.reddit.com/r/pushshift/comments/1akrhg3/separate_dump_files_for_the_top_40k_subreddits/}, abstract= {These are the top 40,000 subreddits from Reddit's history, each in its own file. You can use your torrent client to download only the subreddits you're interested in. The data comes from the Pushshift dumps covering 2005-06 to 2023-12, which can be found here https://academictorrents.com/details/7c0645c94321311bb05bd879ddee4d0eba08aaee The files are zstandard-compressed NDJSON. Example Python scripts for parsing the data can be found here https://github.com/Watchful1/PushshiftDumps If you have questions, DM u/Watchful1 on Reddit or reply to this post https://www.reddit.com/r/pushshift/comments/1akrhg3/separate_dump_files_for_the_top_40k_subreddits/}, keywords= {reddit}, terms= {}, license= {}, superseded= {} }
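Because the files are zstandard-compressed NDJSON, they can be streamed line by line without decompressing to disk first. The following is a minimal sketch, not the author's script: the function names `iter_ndjson` and `iter_zst_ndjson` are hypothetical, it relies on the third-party `zstandard` package, and the enlarged `max_window_size` is an assumption that the dumps were compressed with a long window.

```python
import io
import json
from typing import BinaryIO, Iterator


def iter_ndjson(stream: BinaryIO) -> Iterator[dict]:
    """Yield one dict per non-empty line of an already-decompressed NDJSON byte stream."""
    for line in io.TextIOWrapper(stream, encoding="utf-8"):
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


def iter_zst_ndjson(path: str) -> Iterator[dict]:
    """Stream records from a .zst NDJSON file (hypothetical helper)."""
    # Imported lazily so iter_ndjson works without the third-party package.
    import zstandard  # pip install zstandard

    # Assumption: the files use a long compression window, so raise the
    # decoder's window limit above its default.
    dctx = zstandard.ZstdDecompressor(max_window_size=2**31)
    with open(path, "rb") as fh, dctx.stream_reader(fh) as reader:
        yield from iter_ndjson(reader)
```

A call such as `for record in iter_zst_ndjson("some_subreddit_comments.zst"): ...` would then process one record at a time; the file name here is a placeholder, not a guaranteed name from the torrent.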