huskydj1/s24-eng-sirum

SIRUM: Medication Data Pipeline

Description:

This codebase, the result of a partnership between SIRUM and Harvard's Tech for Social Good, automates the otherwise manual browsing of customer testimonials and complaints about inaccessible medication. For further information about our process and product, please refer to our documentation.

The scripts are designed specifically for GoFundMe pages, because we had more success analyzing GoFundMe than Reddit, Quora, Facebook, or the National Institutes of Health. The scripts can be adapted to other sites by modifying scrapePages.py, specifically the tags passed into the presence_of_element_located function.
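As a rough illustration of that kind of change, the sketch below keeps one CSS selector per site and passes it to Selenium's presence_of_element_located. It is not the code in scrapePages.py: the selector strings, the SITE_SELECTORS table, and the helper names selector_for and wait_for_story are all hypothetical and would need to be replaced with the tags that actually appear on the target site.

```python
# Hypothetical per-site CSS selectors. These are placeholders, not the real
# markup of either site and not the tags used in scrapePages.py.
SITE_SELECTORS = {
    "gofundme": "div.campaign-description",
    "reddit": "div.post-content",
}


def selector_for(site: str) -> str:
    """Return the CSS selector configured for a site (case-insensitive)."""
    try:
        return SITE_SELECTORS[site.lower()]
    except KeyError:
        raise ValueError(f"No selector configured for site: {site!r}") from None


def wait_for_story(driver, site: str, timeout: float = 10.0):
    """Block until the story element is present in the DOM, then return it.

    `driver` is an already-created Selenium WebDriver that has loaded the page.
    """
    # Imported here so the selector table can be inspected without Selenium.
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    locator = (By.CSS_SELECTOR, selector_for(site))
    return WebDriverWait(driver, timeout).until(
        EC.presence_of_element_located(locator)
    )
```

With a table like this, supporting another site would mostly mean adding one entry rather than editing the waiting logic.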

Setup

Because of package requirements, we encourage users to run our code with Python 3.9. Install the dependencies by running the following command in your terminal:

pip install -r requirements.txt

Some of these packages require "constants" to run: variables that must be defined once, such as API keys. You must define these in constants.py. The file includes further instructions for each variable.
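If you would rather not store keys directly in the file, one possible pattern is to have constants.py read them from environment variables. This is only a sketch of that idea, not the layout of the provided constants.py: the helper require_env and the variable name EXAMPLE_API_KEY are hypothetical, and the real variable names are the ones documented in the file itself.

```python
import os


def require_env(name: str) -> str:
    """Return the value of an environment variable, failing loudly if unset."""
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(
            f"Environment variable {name} is not set; define it before running."
        )
    return value


# Hypothetical usage inside constants.py (the variable name is an assumption):
# EXAMPLE_API_KEY = require_env("EXAMPLE_API_KEY")
```

Failing at import time with a clear message tends to be easier to debug than a missing key surfacing later inside a third-party package.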

Scripts

  • Scraping Links
  • Scraping Pages
  • Analyzing Sentiment
  • Extracting Medication-related Terms
  • Producing Geographic Heat-Maps
