Cleaning up data extracted from Hacker News salary survey on 3/21/16
data
.gitignore
README.md
basic_analysis.sql
publish.py
stage.py
tables.ddl
transform.py

README.md

This repository cleans up raw data pulled from a Hacker News salary survey conducted on 3/21/16. The latest clean version of the data is available here.

Additional Information

External data sources

Notes on validation

  • No responses were removed specifically for trolling. Fortunately, imposing sane limits on some fields eliminated many of the troll responses anyway.
  • Currency conversion is hard when respondents do not indicate which currency they used. Right now, rows that indicate a currency other than USD are converted to USD, and every other row is assumed to already be in USD. As a result, international salary analysis is sketchy at best.
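
The two validation rules above can be sketched roughly as follows. This is a minimal illustration, not the code in transform.py: the function names, the salary limits, and the exchange rates are all hypothetical placeholders chosen for the example.

```python
from typing import Optional

# Hypothetical "sane limits" for an annual salary in USD (assumed values).
MIN_SALARY_USD = 1_000
MAX_SALARY_USD = 2_000_000

# Made-up exchange rates for illustration only; real rates would come
# from an external data source.
RATES_TO_USD = {"USD": 1.0, "EUR": 1.25, "GBP": 1.5}


def to_usd(amount: float, currency: Optional[str]) -> Optional[float]:
    """Convert amount to USD; a missing or blank currency is assumed to be USD."""
    code = (currency or "").strip().upper() or "USD"
    rate = RATES_TO_USD.get(code)
    if rate is None:
        # Unknown currency code: we cannot convert, so signal that to the caller.
        return None
    return amount * rate


def clean_salary(amount: float, currency: Optional[str] = None) -> Optional[float]:
    """Return the salary in USD, or None if it cannot be converted or is out of range."""
    usd = to_usd(amount, currency)
    if usd is None:
        return None
    if not (MIN_SALARY_USD <= usd <= MAX_SALARY_USD):
        # Out-of-range values are dropped; this is what filters most troll entries.
        return None
    return round(usd, 2)
```

For example, `clean_salary(100000, "eur")` converts using the placeholder EUR rate, while `clean_salary(999_999_999)` is rejected by the upper limit. Note that a row with no currency passes through unchanged, which is exactly the assumption that makes international comparisons unreliable.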