“Every publicly available Reddit comment”

I have every publicly available Reddit comment for research. ~ 1.7 billion comments @ 250 GB compressed. /r/datasets

A massive, torrent-ready data set of Reddit comments. Yes, it includes fields like “gilded” and “controversiality.”

Via my colleague James Nylen (who does not seem to be on Twitter?)
