Skip to content

The following project is a scraper I wrote in Rust to gather network nodes using the Pushshift API. My project is NOT an official API for Pushshift or Reddit in any way. I played around a bit with Rust prior to starting this project; my code is quite awful.

License

Notifications You must be signed in to change notification settings

joshuamegnauth54/thesis_gamer_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

What is this project?

Hi! I wrote Gamer Scraper to gather data for my thesis. I initially planned to write my scraper in Python. I dithered for weeks by working on small data science projects which I ported to both Python and R.

A crazy idea struck soon after completing my first project: What if I wrote my scraper in Rust---a language I adore despite my rusty paltry skills?

Call my coup uninspired and a further manifestation of dithering. I won't blame you. After a few days of work, however, I managed to scrape together a working, uh, scraper!

The goal of the scraper is to gather posters on a subset of gamer subreddits along with other subs frequented by said users. The final network is a sample of the gamers and their dispersion on Reddit in terms of subs while the transpose shows the connections between the gamers.

Implementation

I implemented a small, incomplete, and likely wrong subset of the Pushshift API for Reddit using reqwests. My program works yet is flawed in many ways. Errors could be handled better as I mostly just consume them. Functionality bleeds into two unrelated modules: the Pushshift implementation and the actual scraper. The function to perform the scraping itself is a terrible mess.

About

The following project is a scraper I wrote in Rust to gather network nodes using the Pushshift API. My project is NOT an official API for Pushshift or Reddit in any way. I played around a bit with Rust prior to starting this project; my code is quite awful.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages