Download eight years' worth of Reddit comments

Sponsored Links

Mariella Moon
July 11th, 2015
In this article: reddit, social
Download eight years' worth of Reddit comments

If you need (almost) every publicly available Reddit comment for any reason -- hey, maybe you're a researcher or maybe you just love data -- then ready your external HDD, because someone bundled 'em all up nicely. User "Stuck_In_the_Matrix" collected every comment he could from as far back as October 2007, two years since the website was founded, up until May 2015. It took him 14 months and about 20 million API calls to farm around 1.65 billion entries, though approximately 350,000 couldn't be collected due to issues with Reddit's API.

Those comments are saved as plain text, along with their authors' usernames, scores and subreddit locations, among other info. even considered the feat notable enough to preserve for future generations. You can get the compilation right now through the torrent file "Stuck_In_the_Matrix" provided, but take note that all that data totals 150GB when compressed and almost a terabyte uncompressed. In case you're unwilling to invest time in downloading something you haven't seen before, his original Reddit post also comes with a much smaller one-month sampler.

[Image credit: Getty Images]

All products recommended by Engadget are selected by our editorial team, independent of our parent company. Some of our stories include affiliate links. If you buy something through one of these links, we may earn an affiliate commission.
Popular on Engadget