A scraper for Reddit communities with scripts to perform post-scraping text analysis
Python R Mako
Switch branches/tags
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
app
migrate
scripts
.gitignore
.gitmodules
.todo
LICENSE
README.md
alembic.ini
fabfile.py
requirements.R
requirements.txt
settings-sample.py

README.md

redditanalyser

A scraper for Reddit communities (or users) with scripts to perform post-scraping text analysis.

Usage

Requirements

Installation

Clone the repository, then bootstrap the environment:

$ fab bootstrap

Configuration

Copy settings-sample.py in the root directory and rename it settings.py. Then configure the file, as appropriate.

Note: You must set the username before executing the scraper.

Execution

Run the scraper:

$ fab scrape

Run the data analyser:

$ fab analyse

Generate wordclouds:

$ Rscript scripts/wordclouds.R

Testing

Setup bootstrap environment, configure settings, and then run:

$ fab test