Skip to content
A basic R package for scraping Reddit data using the pushshift API
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
R
man
.Rbuildignore
.gitattributes
.gitignore
DESCRIPTION
NAMESPACE
README.md
pushshiftR.Rproj

README.md

pushshiftR

This is a very basic R package for fetching Reddit data using the pushshift API. At present, the package should suit general users, but is not a general package.

Installation

devtools::install_github("https://github.com/nathancunn/pushshiftR")

Basic use

To get top-level posts from /r/soccer from January 1st 2019:

getPushshiftData(postType = "comment",
                 size = 1000,
                 after = "1546300800",
                 subreddit = "soccer",
                 nest_level = 1)

Acknowledgments

This package is basically an R implementation of the code here and uses the pushshift API to download Reddit data. If you use this, you might consider donating to them.

You can’t perform that action at this time.