Skip to content
Exploration and charting of world income distribution
Jupyter Notebook
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
DATA
archive
.gitignore
01_global_income_distribution.ipynb
02_income_dist_2011.ipynb
03_export_to_js_2011.ipynb
04_country_names.ipynb
LICENSE
README.md
requirements.txt

README.md

Data Visualization Challenge: Visualize Inequality

This repository contains the code used to generate a submission that won 3rd place in the Data Visualization Challenge organized by the United Nations Development Programme Human Development Reports.

The submission lives at income-inequality.info

Income Inequality

The website files are located in another GitHub repository: Income-Inequality

To launch a live interactive version: Binder

To report errors, ask questions, or offer corrections, please use the Issues section on this or the other repository.

Notes

I tried to document my steps and processing. Please let me know if you'd like me to clarify, elaborate, or better-organize this repository.

There were very few steps I had to perform manually (without running a script), primarily cleaning the .csv file headers for the Pandas library to be able to read the files. Another manual step was dealing with the data for China, which was split into two categories: urban and rural. Only four countries had to be removed from the original dataset before the final export (there was trouble with PPP conversion).

Outline

Files are numbered for convenience and in rough chronological order:

01 - Generates the 1st, 2nd, and 3rd figures on the website

02 - Generates the 4th - 8th figures, export csv with coordinate data for the interactive graph

03 - Process csv from above step into correct format for interactive graph library

04 - Generate the js dictionaries to convert between ISO2, ISO3, and country names on the website

See the archive folder for the initial exploration with 2005 data

For non-programmers who want to try

Coding is likely less scary than you think. Getting everything above to work will just take some research and patience.

The above code requires you install:

  1. Python (version 3.6 or better yet 3.7)
    • this will automatically install pip too
  2. Open your terminal and use pip to install*:
    • jupyterlab (a newer version of Jupyter Notebooks)
    • pandas
    • matplotlib
    • seaborn
  3. Save all the files from this repository
    • if you install git you can just use the command git clone https://github.com/whyboris/Global-Income-Distribution.git in your terminal
  4. Finally navigate to this folder in your terminal and type in jupyter lab
    • Your browser should open the Jupyter Lab interface and you'll be able to open and run the files!

* Using pip to install above is as easy as typing pip install pandas in your terminal. Though if you start coding more-seriously, consider using virtual environments as well.

When unsure about how to do something, just search online, and on stackoverflow.

You can’t perform that action at this time.