Reddit comments/submissions 2005-06 to 2022-06
stuck_in_the_matrix and Watchful1

reddit (404 files)
submissions/RS_2022-06.zst 10.05GB
submissions/RS_2022-05.zst 10.36GB
submissions/RS_2022-03.zst 9.54GB
submissions/RS_2022-04.zst 9.60GB
submissions/RS_2022-02.zst 8.35GB
submissions/RS_2021-12.zst 7.99GB
submissions/RS_2022-01.zst 8.83GB
submissions/RS_2021-11.zst 7.62GB
submissions/RS_2021-09.zst 7.31GB
submissions/RS_2021-10.zst 7.50GB
submissions/RS_2021-08.zst 7.81GB
submissions/RS_2021-06.zst 9.46GB
submissions/RS_2021-07.zst 7.78GB
submissions/RS_2021-05.zst 9.74GB
submissions/RS_2021-03.zst 9.18GB
submissions/RS_2021-04.zst 8.97GB
submissions/RS_2021-02.zst 8.37GB
submissions/RS_2020-12.zst 8.13GB
submissions/RS_2021-01.zst 8.70GB
submissions/RS_2020-11.zst 7.57GB
submissions/RS_2020-09.zst 7.37GB
submissions/RS_2020-10.zst 7.68GB
submissions/RS_2020-08.zst 7.59GB
submissions/RS_2020-06.zst 6.90GB
submissions/RS_2020-07.zst 7.30GB
submissions/RS_2020-05.zst 7.06GB
submissions/RS_2020-03.zst 8.14GB
submissions/RS_2020-04.zst 9.16GB
submissions/RS_2020-02.zst 6.89GB
submissions/RS_2019-12.zst 6.50GB
submissions/RS_2020-01.zst 6.96GB
submissions/RS_2019-11.zst 5.94GB
submissions/RS_2019-09.zst 5.76GB
submissions/RS_2019-10.zst 5.84GB
submissions/RS_2019-08.zst 6.51GB
submissions/RS_2019-06.zst 5.76GB
submissions/RS_2019-07.zst 6.25GB
submissions/RS_2019-05.zst 5.77GB
submissions/RS_2019-03.zst 5.62GB
submissions/RS_2019-04.zst 5.59GB
submissions/RS_2019-02.zst 4.54GB
submissions/RS_2018-12.zst 3.94GB
submissions/RS_2019-01.zst 4.69GB
submissions/RS_2018-11.zst 3.61GB
submissions/RS_2018-09.zst 3.29GB
submissions/RS_2018-10.zst 3.56GB
submissions/RS_2018-08.zst 3.31GB
submissions/RS_2018-06.zst 2.89GB
submissions/RS_2018-07.zst 3.23GB
Too many files! Click here to view them all.
Type: Dataset
Tags: reddit

Bibtex:
@article{,
title= {Reddit comments/submissions 2005-06 to 2022-06},
journal= {},
author= {stuck_in_the_matrix and Watchful1},
year= {},
url= {},
abstract= {Reddit comments and submissions from 2005-06 to 2022-06 collected by pushshift which can be found here https://files.pushshift.io/reddit/

These are zstandard compressed ndjson files. Example python scripts for parsing the data can be found here https://github.com/Watchful1/PushshiftDumps},
keywords= {reddit},
terms= {},
license= {},
superseded= {}
}

Hosted by users:

Send Feedback