The data being archived in this github repository represents projects that were completed during several months of assignments in the Springboard Data Science Intensive Program.
- Working with JSON (See sliderule_dsi_json_exercise.ipynb)
Using data in file 'data/world_bank_projects.json', solve the following;
a) Find the 10 countries with most projects
b) Find the top 10 major project themes (using column 'mjtheme_namecode')
c) In 2. above you will notice that some entries have only the code and the name is missing. Create a dataframe
with the missing names filled in.
- Working with XML (See sliderule_dsi_xml_exercise.ipynb)
Using data in 'data/mondial_database.xml' and refering to https://docs.python.org/2.7/library/xml.etree.elementtree.html, solve the following;
a) 10 countries with the lowest infant mortality rates
b) 10 cities with the largest population
c) 10 ethnic groups with the largest overall populations (sum of best/latest estimates over all countries)
name and country of a) longest river, b) largest lake and c) airport at highest elevation