## Introduction

The Multidimensional Poverty Index (MPI) data captures the complexity of poverty as people experience it, individually and collectively, each year. It tracks how multidimensional poverty has declined around the world, covering 5 billion people. For this research, my focus is Nigeria.

## Data

The data at my disposal covers: 
* **MPI estimates for 36 states in Nigeria**
* **Proportion of people who are MPI poor and experience deprivations in each of the indicators by State**
* **Percentage contribution of deprivations of each dimension to overall poverty**
* **Proportion of people who experience deprivations in each of the indicators by State**
* **Standard errors and 95% confidence intervals for the MPI and the headcount ratio of states**
* **Sample sizes from each survey that were used to compute the MPI, with a breakdown by region**


In [105]:
#Let's import some of the libraries I am going to use

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

print("Libraries imported")

Libraries imported


## Methodology

With the data at my disposal, I can use the Python pandas library to answer the following questions:

* **1. Which state/region has the highest and lowest MPI?**
* **2. Which dimension of deprivation contributes the highest percentage to overall poverty on average?**
* **3. What is the number (in thousands) of people who are poor and deprived in Health (*Nutrition and Child mortality*), Education (*Years of schooling and School attendance*), and Living standards (*Cooking fuel, Sanitation, Drinking water, Electricity, Housing and Assets*) for all the states/regions indicated?**


In [106]:
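# Path to the wrangled MPI workbook
file_path = 'C:/Users/Joy-PC/Desktop/Redwire/MPI_wrangled.xlsx'

# sheet_name=None loads every sheet into a dict of DataFrames keyed by sheet name
MPI_data = pd.read_excel(file_path, sheet_name=None)

# Stack all sheets into one DataFrame (ignore_index=True drops the sheet names)
mdf = pd.concat(MPI_data, axis=0, ignore_index=True)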
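One thing to note about the concatenation step: `ignore_index=True` discards the sheet names, and sheets with different columns are padded with `NaN`. The sketch below shows one possible alternative that keeps the sheet name as an index level. It uses two small stand-in DataFrames built from a few rows of the outputs in this notebook, not the real workbook, and the level names `sheet` and `row` are my own labels.

```python
import pandas as pd

# Stand-in for the dict returned by pd.read_excel(..., sheet_name=None).
# Only two rows per sheet, copied from the outputs shown in this notebook.
sheets = {
    "MPI StateWise": pd.DataFrame(
        {"State": ["Abia", "Adamawa"], "MPI Region": [0.035475, 0.332866]}
    ),
    "Contribution State": pd.DataFrame(
        {"State": ["Abia", "Adamawa"], "Health": [52.676594, 29.255202]}
    ),
}

# Without ignore_index, the dict keys become the outer level of the index.
combined = pd.concat(sheets, names=["sheet", "row"])

# Rows from a single sheet can then be recovered by its name.
statewise = combined.loc["MPI StateWise"]
print(statewise)
```

Columns that exist only in another sheet (here `Health`) show up as `NaN` for the `MPI StateWise` rows, which is why working with one sheet at a time is usually easier.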

## Analysis
**1. Which state/region has the highest and lowest MPI?**

To answer this, I load the sheet named *MPI StateWise* and use pandas functions to find the state/region with the highest and lowest MPI.



In [107]:
# Select the 'MPI StateWise' sheet and show its first 5 rows
df = MPI_data['MPI StateWise']
df.head()



Unnamed: 0,State,Nigeria MPI,MPI Region,Headcount ratio(H),Intensity of deprivation among the poor\n(A),Vulnerable to poverty (who experience 20-33.32% intensity of deprivations) %,In severe poverty (with intensity higher than 50%),Year of the survey(thousands),Population 2017,Population 2018,Population share by region(%)2018,Population size by \nregion(thousands)2018,Number of MPI poor by regionᵇ(thousands)2018
0,Abia,0.25439,0.035475,9.331366,38.017052,21.410738,0.539488,195874.685,190873.247,195874.685,0.014236,2788.502197,260.205353
1,Adamawa,0.25439,0.332866,62.519079,53.242242,21.872628,35.901564,195874.685,190873.247,195874.685,0.02118,4148.636719,2593.689453
2,Akwa Ibom,0.25439,0.085492,20.796007,41.109884,18.902284,3.699443,195874.685,190873.247,195874.685,0.021538,4218.729492,877.327271
3,Anambra,0.25439,0.034662,8.601175,40.298966,19.751711,1.061069,195874.685,190873.247,195874.685,0.033193,6501.663086,559.219421
4,Bauchi,0.25439,0.45129,74.797773,60.334718,15.834358,55.255628,195874.685,190873.247,195874.685,0.035428,6939.354004,5190.482422


In [108]:
# Find the region with the highest MPI

max_value_column = df['MPI Region'].max()
print(max_value_column)

df.loc[df['MPI Region'].idxmax()]

0.5852675437927246


State                                                                               Kebbi
Nigeria MPI                                                                       0.25439
MPI Region                                                                       0.585268
Headcount ratio(H)                                                                87.3556
Intensity of deprivation among the poor\n(A)                                      66.9983
Vulnerable to poverty (who experience 20-33.32% intensity of deprivations) %      7.98074
In severe poverty (with intensity higher than 50%)                                71.5866
Year of the survey(thousands)                                                      195875
Population 2017                                                                    190873
Population 2018                                                                    195875
Population share by region(%)2018                                               0.0269578
Population

In [109]:
# Find the region with the lowest MPI

min_value_column = df['MPI Region'].min()
print(min_value_column)

df.loc[df['MPI Region'].idxmin()]

0.01613912358880043


State                                                                               Lagos
Nigeria MPI                                                                       0.25439
MPI Region                                                                      0.0161391
Headcount ratio(H)                                                                4.14464
Intensity of deprivation among the poor\n(A)                                      38.9398
Vulnerable to poverty (who experience 20-33.32% intensity of deprivations) %      14.9075
In severe poverty (with intensity higher than 50%)                               0.410629
Year of the survey(thousands)                                                      195875
Population 2017                                                                    190873
Population 2018                                                                    195875
Population share by region(%)2018                                               0.0591928
Population
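The two lookups above can also be sketched with `nlargest` and `nsmallest`, which return the full row in one call. The DataFrame below holds only the five states shown in the `head()` output, so on this sample the extremes are Bauchi and Anambra rather than Kebbi and Lagos. The name `sample` is hypothetical.

```python
import pandas as pd

# Five rows copied from the head() output above (not the full 36-state sheet).
sample = pd.DataFrame(
    {
        "State": ["Abia", "Adamawa", "Akwa Ibom", "Anambra", "Bauchi"],
        "MPI Region": [0.035475, 0.332866, 0.085492, 0.034662, 0.45129],
    }
)

# Row with the highest MPI and row with the lowest MPI.
highest = sample.nlargest(1, "MPI Region")
lowest = sample.nsmallest(1, "MPI Region")

print(highest)
print(lowest)
```

Passing a larger number, such as `sample.nlargest(3, "MPI Region")`, would give a short ranking instead of a single state.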

**2. Which dimension of deprivation contributes the highest percentage to overall poverty on average?**

To answer this, I load the sheet named *Contribution State* and average each dimension's contribution across the states.

In [110]:
MPI_data2 = pd.read_excel(file_path, sheet_name='Contribution State')

MPI_data2.head()

Unnamed: 0,State,Nigeria MPI,MPI Region,Health,Education,Living Standards
0,Abia,0.25439,0.035475,52.676594,10.511436,36.811972
1,Adamawa,0.25439,0.332866,29.255202,32.862926,37.881875
2,Akwa Ibom,0.25439,0.085492,36.595172,11.352739,52.052087
3,Anambra,0.25439,0.034662,36.793262,10.917614,52.289122
4,Bauchi,0.25439,0.45129,28.852713,31.992096,39.155191


In [111]:
# Unweighted mean contribution (%) of each dimension across the states
Avg = MPI_data2[['Health','Education','Living Standards']].mean()
print(Avg)

Health              34.716888
Education           20.481292
Living Standards    44.801820
dtype: float64


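From the above, **Living Standards** contributes the most to overall poverty on average across the 36 states (about 44.8%), ahead of Health (about 34.7%) and Education (about 20.5%). This is a simple, unweighted mean, so every state counts equally regardless of its population.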
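As a complement to the average, the dimension that dominates in each individual state can be sketched with `idxmax`. The example uses only the five rows from the `head()` output, so the counts are illustrative and do not describe all 36 states. The names `contrib`, `dims`, `top_on_average` and `dominant` are my own.

```python
import pandas as pd

# Five rows copied from the 'Contribution State' head() output above.
contrib = pd.DataFrame(
    {
        "State": ["Abia", "Adamawa", "Akwa Ibom", "Anambra", "Bauchi"],
        "Health": [52.676594, 29.255202, 36.595172, 36.793262, 28.852713],
        "Education": [10.511436, 32.862926, 11.352739, 10.917614, 31.992096],
        "Living Standards": [36.811972, 37.881875, 52.052087, 52.289122, 39.155191],
    }
).set_index("State")

dims = ["Health", "Education", "Living Standards"]

# Dimension with the highest mean contribution, picked programmatically.
top_on_average = contrib[dims].mean().idxmax()

# Dimension with the largest contribution within each state.
dominant = contrib[dims].idxmax(axis=1)

print(top_on_average)
print(dominant.value_counts())
```

On this sample, Living Standards leads in four of the five states, while Health leads in Abia, which shows that the average can hide state-level differences.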

**3. What is the number (in thousands) of people who are poor and deprived in Health** (*Nutrition and Child mortality*), **Education** (*Years of schooling and School attendance*), **and Living standards** (*Cooking fuel, Sanitation, Drinking water, Electricity, Housing and Assets*) **for all the states/regions indicated?**

To get this done, I load the sheet named *Censor Headcounts Region*.

In [135]:
Read_file = pd.read_excel(file_path, sheet_name='Censor Headcounts Region')

Poor_deprived = Read_file.set_index('State')
Poor_deprived.head()

State,Nigeria MPI,MPI Region,Nutrition,Child mortality,Years of schooling,School attendance,Cooking fuel,Sanitation,Drinking water,Electricity,Housing,Assets,Population 2017,Population 2018,Population share by region 2018(%),Region Population
Abia,0.25439,0.035475,7.330786,3.881459,1.220067,1.017299,7.546432,6.715836,5.1932,2.155598,0.369768,1.525522,190873.247,195874.685,0.014236,2788.502197
Adamawa,0.25439,0.332866,47.577098,10.851197,31.076899,34.556717,62.406141,24.743183,36.352336,39.930072,43.458691,20.08191,190873.247,195874.685,0.02118,4148.636719
Akwa Ibom,0.25439,0.085492,12.128294,6.643304,1.303928,4.519492,20.172518,14.471917,15.634452,10.662886,12.017204,7.141828,190873.247,195874.685,0.021538,4218.729492
Anambra,0.25439,0.034662,5.513076,2.138858,1.263825,1.006722,8.281516,7.668129,4.874547,5.138004,2.097489,4.564192,190873.247,195874.685,0.033193,6501.663086
Bauchi,0.25439,0.45129,56.537026,21.588659,37.767434,48.858893,74.189174,59.917325,37.224123,56.097889,62.042677,28.595218,190873.247,195874.685,0.035428,6939.354004


In [136]:
# Convert the indicator percentages (Nutrition through Assets) to fractions

indicator_cols = Poor_deprived.columns[2:12]
Poor_deprived[indicator_cols] = Poor_deprived[indicator_cols] / 100

Poor_deprived.head()

State,Nigeria MPI,MPI Region,Nutrition,Child mortality,Years of schooling,School attendance,Cooking fuel,Sanitation,Drinking water,Electricity,Housing,Assets,Population 2017,Population 2018,Population share by region 2018(%),Region Population
Abia,0.25439,0.035475,0.073308,0.038815,0.012201,0.010173,0.075464,0.067158,0.051932,0.021556,0.003698,0.015255,190873.247,195874.685,0.014236,2788.502197
Adamawa,0.25439,0.332866,0.475771,0.108512,0.310769,0.345567,0.624061,0.247432,0.363523,0.399301,0.434587,0.200819,190873.247,195874.685,0.02118,4148.636719
Akwa Ibom,0.25439,0.085492,0.121283,0.066433,0.013039,0.045195,0.201725,0.144719,0.156345,0.106629,0.120172,0.071418,190873.247,195874.685,0.021538,4218.729492
Anambra,0.25439,0.034662,0.055131,0.021389,0.012638,0.010067,0.082815,0.076681,0.048745,0.05138,0.020975,0.045642,190873.247,195874.685,0.033193,6501.663086
Bauchi,0.25439,0.45129,0.56537,0.215887,0.377674,0.488589,0.741892,0.599173,0.372241,0.560979,0.620427,0.285952,190873.247,195874.685,0.035428,6939.354004


In [137]:
# Multiply each indicator's fraction by the 2018 Region Population to estimate the number of people (in thousands) who are poor and deprived in that indicator
Mul = Poor_deprived.loc[:,'Nutrition':'Assets'].multiply(Poor_deprived['Region Population'], axis="index")

Mul.head()

State,Nutrition,Child mortality,Years of schooling,School attendance,Cooking fuel,Sanitation,Drinking water,Electricity,Housing,Assets
Abia,204.41912,108.234578,34.021599,28.367395,210.432431,187.271224,144.812485,60.108894,10.310984,42.539218
Adamawa,1973.800958,450.176741,1289.26765,1433.632637,2589.004062,1026.504774,1508.126376,1656.553647,1802.943222,833.125512
Akwa Ibom,511.659927,280.263035,55.009194,190.665158,851.023987,610.531024,659.575229,449.838306,506.973324,301.294395
Anambra,358.441659,139.061336,82.16967,65.453702,538.436239,498.555887,316.926612,334.055698,136.37164,296.748372
Bauchi,3923.304389,1498.113493,2620.815952,3390.491542,5148.249429,4157.875276,2583.113662,3892.831103,4305.361018,1984.32341
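The conversion and multiplication steps above can be sketched as a single chained expression that leaves the original percentages untouched. The example uses two states and two indicators copied from the outputs in this notebook. The names `headcounts`, `indicators` and `counts` are hypothetical.

```python
import pandas as pd

# Two rows and two indicators copied from the 'Censor Headcounts Region' output.
headcounts = pd.DataFrame(
    {
        "State": ["Abia", "Bauchi"],
        "Nutrition": [7.330786, 56.537026],        # % poor and deprived
        "Child mortality": [3.881459, 21.588659],  # % poor and deprived
        "Region Population": [2788.502197, 6939.354004],  # thousands, 2018
    }
).set_index("State")

indicators = ["Nutrition", "Child mortality"]

# percentage / 100 * population (thousands) = people (thousands), row by row.
counts = (
    headcounts[indicators]
    .div(100)
    .mul(headcounts["Region Population"], axis="index")
)

print(counts.round(1))
```

For Abia, this gives roughly 204.4 thousand people who are poor and deprived in Nutrition, matching the table above. The indicator counts should not simply be summed to get a total per dimension, since a person deprived in more than one indicator would then be counted more than once.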
