Reddit-Data-Mining-Pushshift-Notebook

This notebook shows how to extract and analyse different parts of Reddit threads and comments using the Pushshift API.

PRAW (the Python Reddit API Wrapper) is normally pretty good at getting Reddit data, but it has some limitations, including the removal of the subreddit.submissions endpoint.

So, to extract Reddit submissions and their primary data, such as upvote and comment counts, I put together this notebook using Pushshift.
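
As a rough illustration of the kind of request involved, here is a minimal sketch. It is not the notebook's actual code. It assumes the historically public Pushshift submission search URL, the `subreddit`, `size`, `after` and `before` query parameters, and a JSON response with a `data` list whose items carry `id`, `title`, `score` and `num_comments`. The helper names (`build_query`, `summarize`, `fetch_submissions`) are made up for this example, and the endpoint's availability is not guaranteed.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the historically public Pushshift submission search endpoint.
PUSHSHIFT_URL = "https://api.pushshift.io/reddit/search/submission/"


def build_query(subreddit, size=25, after=None, before=None):
    """Build a search URL for one subreddit (hypothetical helper).

    `after` and `before` are assumed to be Unix timestamps.
    """
    params = {"subreddit": subreddit, "size": size}
    if after is not None:
        params["after"] = after
    if before is not None:
        params["before"] = before
    return PUSHSHIFT_URL + "?" + urlencode(params)


def summarize(payload):
    """Keep only the fields discussed above from a decoded JSON response."""
    rows = []
    for item in payload.get("data", []):
        rows.append({
            "id": item.get("id"),
            "title": item.get("title"),
            "score": item.get("score"),                # upvote count
            "num_comments": item.get("num_comments"),  # comment count
        })
    return rows


def fetch_submissions(subreddit, **kwargs):
    """Request submissions over the network and return the summarized rows."""
    with urlopen(build_query(subreddit, **kwargs), timeout=30) as resp:
        return summarize(json.load(resp))
```

Calling `fetch_submissions("python", size=10)` would return a list of small dictionaries, one per submission, which can be loaded into a DataFrame for analysis. Building the URL and parsing the response are kept separate from the network call so that each part can be checked without hitting the API.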

If you still prefer PRAW for extracting submissions, I have written a code template here.

More info on the removal of the subreddit.submissions endpoint.