Cleaning up survey data
BC_patients.R

Processing and visualizing birth control survey data

This is my code (written in R) to process, clean, analyze, and graph data from a survey on birth control methods.

The survey contained multiple-choice and user-entered responses; the user-entered responses required cleaning to eliminate variations in how the same answer was typed.
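As a rough illustration of that kind of cleanup, here is a minimal sketch in base R. The sample answers, the category names, and the `standardize_method` helper are all hypothetical and are not taken from BC_patients.R; the idea is only to show how typed variants can be collapsed into a few discrete categories.

```r
# Hypothetical free-text answers; the real survey values are not shown in this README.
raw <- c("The Pill", "pill ", "IUD", "iud", "I.U.D.", "condoms", "Condom", NA)

# Hypothetical helper: map typed variants onto a small set of categories.
standardize_method <- function(x) {
  # Lowercase, trim whitespace, and strip punctuation so variants line up.
  key <- gsub("[[:punct:]]", "", tolower(trimws(x)))
  out <- rep("other", length(key))
  out[grepl("pill", key)] <- "pill"
  out[grepl("iud", key)] <- "IUD"
  out[grepl("condom", key)] <- "condom"
  # Keep missing answers as missing rather than labeling them "other".
  out[is.na(x)] <- NA
  out
}

cleaned <- standardize_method(raw)
print(table(cleaned, useNA = "ifany"))
```

Pattern matching on a normalized key keeps the rules short, but it can misclassify answers that mention more than one method, so a real cleanup would probably need a manual review of whatever lands in "other".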

Once the data are cleaned and processed, the script creates graphs and exports the cleaned data.

The data processing includes:

  • filtering out null responses
  • dropping irrelevant columns
  • geolocating the IP addresses of survey participants and adding latitude, longitude, city, and state to the dataframe
  • creating a dataframe of people who agreed to be interviewed
  • counting total answers for various multiple-choice questions
  • standardizing the user-entered responses into discrete categories and counting total answers
  • graphing the answer counts to each question
  • exporting the graphs and dataframes created
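
Several of the steps above can be sketched in base R as follows. This is an illustrative outline under stated assumptions, not the contents of BC_patients.R: the data frame, its column names (`current_method`, `interview_ok`, `browser`), and the output file names are all made up for the example, and the geolocation step is left out because the README does not say which service or package was used.

```r
# Hypothetical survey extract; column names and values are assumptions.
survey <- data.frame(
  respondent_id  = 1:5,
  current_method = c("pill", "IUD", NA, "pill", "condom"),
  interview_ok   = c("Yes", "No", "Yes", "Yes", NA),
  browser        = c("a", "b", "c", "d", "e"),
  stringsAsFactors = FALSE
)

# Filter out null responses to the question of interest.
answered <- survey[!is.na(survey$current_method), ]

# Drop a column that is irrelevant to the analysis.
answered <- answered[, setdiff(names(answered), "browser")]

# Data frame of people who agreed to be interviewed.
agreed <- !is.na(answered$interview_ok) & answered$interview_ok == "Yes"
interviewees <- answered[agreed, ]

# Count total answers for a multiple-choice question.
method_counts <- as.data.frame(
  table(method = answered$current_method),
  stringsAsFactors = FALSE
)

# Export the counts and a bar chart (written to a temporary directory here).
out_dir <- tempdir()
csv_path <- file.path(out_dir, "method_counts.csv")
pdf_path <- file.path(out_dir, "method_counts.pdf")
write.csv(method_counts, csv_path, row.names = FALSE)
pdf(pdf_path)
barplot(setNames(method_counts$Freq, method_counts$method),
        ylab = "Respondents")
invisible(dev.off())
```

The same pattern (filter, subset, `table()`, export) would be repeated for each survey question; in a longer script it may be worth wrapping the count-and-plot step in a function.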

I also made a quick Tableau visualization that maps the survey data. The latitude and longitude of each survey respondent were generated with the R script above (BC_patients.R). Each point represents one survey respondent, color-coded by the age at which they first thought about using birth control. The tooltips display the respondent's current age, city, and birth control method. Take a look here: https://public.tableau.com/profile/princess7577#!/vizhome/BCsurveyresponsesacrossthenation/Sheet1