Skip to content

darlastill/reddit-python-anonymous

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

Determining Anonymity with Reddit Data

This program assumes you have acquired Reddit data and have the corpus of all authors to compare with authors in your sample. This program was used with High Performance Computing through a university server.

Getting Started

Want the Reddit data? See this archived Reddit post.

How it Works

  • Anonymity is defined as an author (OP) on Reddit who only uses their username in one subreddit to write posts. This code does not account for comments outside of the subreddit of study.

  • Read in three data files - sample authors, sample commenters, and the corpus of authors outside of your sample.

  • The code creates a numpy array for you to search the corpus for authors and create a binary variable indicating whether an author appears outside of the subreddit of study.

  • Then, save this as a new csv file.

Built With and Versioning

This code and workshop was written using Python 3.7.4 with the Anaconda distribution.

Authors

  • Darla Still

About

How to see if a username only appears in one subreddit or elsewhere on Reddit.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages