Skip to content

andre-moore/data_science_intensive

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Springboard Data Science Intensive

The data being archived in this github repository represents projects that were completed during several months of assignments in the Springboard Data Science Intensive Program.

Project 1 - Using Python Pandas to work with JSON and XML files

  1. Working with JSON (See sliderule_dsi_json_exercise.ipynb)

Using data in file 'data/world_bank_projects.json', solve the following;

a) Find the 10 countries with most projects
b) Find the top 10 major project themes (using column 'mjtheme_namecode')
c) In 2. above you will notice that some entries have only the code and the name is missing. Create a dataframe
with the missing names filled in.
  1. Working with XML (See sliderule_dsi_xml_exercise.ipynb)

Using data in 'data/mondial_database.xml' and refering to https://docs.python.org/2.7/library/xml.etree.elementtree.html, solve the following;

a) 10 countries with the lowest infant mortality rates
b) 10 cities with the largest population
c) 10 ethnic groups with the largest overall populations (sum of best/latest estimates over all countries)
name and country of a) longest river, b) largest lake and c) airport at highest elevation

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published