A Python script that loads 2019 H-1B petition data into a local SQLite database
- Python 3
- the sqlalchemy Python module
- download the raw data as an xlsx file from the Department of Labor
- use any online xlsx -> csv converting website (I use Zamzar) to convert the file
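If you prefer not to upload the file to a converter, the conversion can also be done locally. This is a minimal sketch, not part of the project's scripts; it assumes the third-party openpyxl package is installed and that the disclosure data sits on the workbook's first sheet.

```python
# Local alternative to an online xlsx -> csv converter (assumes
# `pip install openpyxl`; not one of this project's scripts).
import csv
import openpyxl

def xlsx_to_csv(xlsx_path, csv_path):
    # read_only mode keeps memory use low for large disclosure files
    wb = openpyxl.load_workbook(xlsx_path, read_only=True)
    ws = wb.active  # assumes the data is on the first sheet
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        for row in ws.iter_rows(values_only=True):
            writer.writerow(row)

# usage (file names as in the steps above):
# xlsx_to_csv("H-1B_Disclosure_Data_FY2019.xlsx",
#             "H-1B_Disclosure_Data_FY2019.csv")
```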
- create a sqlite database
sqlite3 h1b_data.db
- run the python script
# python establish2019H1BDatabas.py <csv file name> <db name>
python establish2019H1BDatabas.py H-1B_Disclosure_Data_FY2019.csv h1b_data.db
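For orientation, here is a hypothetical sketch of what the loading step does: read the converted csv and insert rows into the h1bdata_2019 table. It is not the actual establish2019H1BDatabas.py; the real script uses sqlalchemy, while this sketch uses the stdlib csv and sqlite3 modules, and the column subset is assumed from the example queries below.

```python
# Hypothetical sketch of the csv -> sqlite loading step (the real
# script uses sqlalchemy; column subset assumed from the queries below).
import csv
import sqlite3

COLUMNS = ["EMPLOYER_NAME", "JOB_TITLE", "PREVAILING_WAGE",
           "WORKSITE_CITY", "WORKSITE_STATE"]

def load_csv(csv_name, db_name):
    conn = sqlite3.connect(db_name)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS h1bdata_2019 "
        "(EMPLOYER_NAME TEXT, JOB_TITLE TEXT, PREVAILING_WAGE REAL, "
        "WORKSITE_CITY TEXT, WORKSITE_STATE TEXT)"
    )
    with open(csv_name, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        rows = [tuple(r[c] for c in COLUMNS) for r in reader]
    conn.executemany(
        "INSERT INTO h1bdata_2019 VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    conn.close()

# usage mirrors the command above:
# load_csv("H-1B_Disclosure_Data_FY2019.csv", "h1b_data.db")
```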
- run the salary analytics (median/quartile) script
# python calculateEmployerSalaryStatisitcs.py <number of employers to be calculated> <db name>
python calculateEmployerSalaryStatisitcs.py 2000 h1b_data.db
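The analytics step can be sketched roughly as follows: for the N most frequent employers, compute the median and upper-quartile prevailing wage and store them in employer_salary_stats. This is a hypothetical reconstruction, not the actual calculateEmployerSalaryStatisitcs.py; the table name and QUARTILE_PAY column come from the query below, while MEDIAN_PAY is an assumed column name.

```python
# Hypothetical sketch of the per-employer salary statistics step.
# Table/column names partly assumed (MEDIAN_PAY is a guess; the
# employer_salary_stats table and QUARTILE_PAY come from this README).
import sqlite3
import statistics

def build_salary_stats(db_name, n_employers):
    conn = sqlite3.connect(db_name)
    conn.execute("DROP TABLE IF EXISTS employer_salary_stats")
    conn.execute(
        "CREATE TABLE employer_salary_stats "
        "(EMPLOYER_NAME TEXT, MEDIAN_PAY REAL, QUARTILE_PAY REAL)"
    )
    # the N employers with the most petitions
    employers = conn.execute(
        "SELECT EMPLOYER_NAME FROM h1bdata_2019 "
        "GROUP BY EMPLOYER_NAME ORDER BY COUNT(*) DESC LIMIT ?",
        (n_employers,),
    ).fetchall()
    for (name,) in employers:
        wages = [w for (w,) in conn.execute(
            "SELECT PREVAILING_WAGE FROM h1bdata_2019 "
            "WHERE EMPLOYER_NAME = ?", (name,))]
        if len(wages) < 2:
            # quantiles() needs at least 2 points; fall back to the wage itself
            median = q3 = wages[0]
        else:
            _q1, median, q3 = statistics.quantiles(wages, n=4)
        conn.execute(
            "INSERT INTO employer_salary_stats VALUES (?, ?, ?)",
            (name, median, q3),
        )
    conn.commit()
    conn.close()

# usage mirrors the command above:
# build_salary_stats("h1b_data.db", 2000)
```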
- example queries against the loaded data
-- top 100 employers by petition count, for engineering titles paying above $123,000
SELECT COUNT(*), EMPLOYER_NAME FROM h1bdata_2019
WHERE PREVAILING_WAGE > 123000
AND JOB_TITLE LIKE '%Engineer%'
GROUP BY EMPLOYER_NAME ORDER BY COUNT(*) DESC LIMIT 100;
-- Google petitions with worksites in Cambridge
SELECT PREVAILING_WAGE, EMPLOYER_NAME, JOB_TITLE, WORKSITE_CITY, WORKSITE_STATE
FROM h1bdata_2019
WHERE EMPLOYER_NAME LIKE '%Google%'
AND WORKSITE_CITY = 'Cambridge' LIMIT 100;
-- top 100 employers from the analytics table, ranked by upper-quartile pay
SELECT * FROM employer_salary_stats ORDER BY QUARTILE_PAY DESC LIMIT 100;