# Topic:

# Data description

The dataset was downloaded from the World Health Organisation website (https://www.who.int/). It covers the years 2019, 2015, 2010, and 2000 for 183 countries. The description of each variable is as follows:

### Life expectancy at birth (years)
The average number of years that a newborn could expect to live, if he or she were to pass through life exposed to the sex- and age-specific death rates prevailing at the time of his or her birth, for a specific year, in a given country, territory, or geographic area.
### Population using at least basic drinking-water services (%)
The percentage of population using at least basic drinking water services, that is, the population that drinks water from an improved source, provided collection time is not more than 30 minutes for a round trip. This indicator encompasses both people using basic drinking water services as well as those using safely managed drinking water services. Improved water sources include piped water, boreholes or tubewells, protected dug wells, protected springs, and packaged or delivered water.
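
The definition above combines two conditions: the type of water source and the collection time. A minimal sketch of that rule follows; the function name and the source labels are hypothetical and are not part of the WHO dataset, which only reports the resulting percentage.

```python
# Hypothetical labels for the improved sources listed in the definition above.
IMPROVED_SOURCES = {
    "piped water",
    "borehole",
    "tubewell",
    "protected dug well",
    "protected spring",
    "packaged water",
    "delivered water",
}


def uses_at_least_basic_water(source: str, round_trip_minutes: float) -> bool:
    """Return True if a household meets the 'at least basic' definition:
    an improved source with a collection round trip of at most 30 minutes."""
    return source in IMPROVED_SOURCES and round_trip_minutes <= 30


print(uses_at_least_basic_water("protected spring", 20))  # improved, close enough
print(uses_at_least_basic_water("protected spring", 45))  # improved, but too far
print(uses_at_least_basic_water("unprotected well", 5))   # not an improved source
```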
### Current health expenditure (CHE) as percentage of gross domestic product (GDP) (%)
Level of current health expenditure expressed as a percentage of GDP.

### Alcohol, recorded per capita (15+) consumption (in litres of pure alcohol)
Recorded APC is defined as the recorded amount of alcohol consumed per capita (15+ years) over a calendar year in a country, in litres of pure alcohol. The indicator only takes into account the consumption which is recorded from production, import, export, and sales data often via taxation. Numerator: The amount of recorded alcohol consumed per capita (15+ years) during a calendar year, in litres of pure alcohol. Denominator: Midyear resident population (15+ years) for the same calendar year, UN World Population Prospects, medium variant.
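
The numerator/denominator description above amounts to a simple ratio. The sketch below only illustrates that arithmetic; the function name and the input figures are made up and do not come from the dataset.

```python
def recorded_apc(recorded_litres_pure_alcohol: float,
                 midyear_population_15_plus: float) -> float:
    """Recorded alcohol per capita (15+): litres of pure alcohol recorded
    in a calendar year divided by the midyear population aged 15+."""
    if midyear_population_15_plus <= 0:
        raise ValueError("midyear population must be positive")
    return recorded_litres_pure_alcohol / midyear_population_15_plus


# Illustrative figures only: 44 million litres recorded, 10 million residents aged 15+.
apc_example = recorded_apc(44_000_000, 10_000_000)
print(f"{apc_example:.2f} litres of pure alcohol per person (15+)")
```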
### Estimated number of people (all ages) living with HIV

### General government expenditure on health as a percentage of total government expenditure
Level of general government expenditure on health (GGHE) expressed as a percentage of total government expenditure.
### External resources for health as a percentage of total expenditure on health
External resources for health expressed as a percentage of total expenditure on health.
### Population practising open defecation (%)
The percentage of the population practising open defecation is defined as the proportion of the population who usually don’t use any kind of toilet facility for defecation. Those using unimproved sanitation facilities like pit latrines without slab, open pit, or hanging latrines, are not counted as practising open defecation.
### Population using at least basic sanitation services (%)
The percentage of population using at least basic sanitation services, that is, improved sanitation facilities that are not shared with other households. This indicator encompasses both people using basic sanitation services as well as those using safely managed sanitation services. Improved sanitation facilities include flush/pour flush toilets connected to piped sewer systems, septic tanks or pit latrines; pit latrines with slabs (including ventilated pit latrines), and composting toilets.
### Estimate of current tobacco use prevalence (%) (age-standardized rate)

### Hepatitis B (HepB3) immunization coverage among 1-year-olds (%)
The percentage of one-year-olds who have received three doses of hepatitis B vaccine in a given year.
### Neonates protected at birth against neonatal tetanus (PAB) (%)
Proportion of infants whose mothers had two tetanus toxoid doses during the last pregnancy or had received at least TT2 (3 years protection), TT3 (5 years protection), TT4 (10 years protection) or TT5 (lifetime protection).

“Protection at birth:” For prevention of neonatal and maternal tetanus, WHO recommends giving women a series of five doses of tetanus toxoid (TT). Each dose increases the level and protection against tetanus. It is assumed that a newborn is protected against tetanus at birth if the total of all doses received, including those during the last pregnancy, are as follows:

- 2 doses and last dose was 3 years or less prior to the most recent delivery
- 3 doses and last dose was 5 years or less prior to the most recent delivery
- 4 doses and last dose was 10 years or less prior to the most recent delivery
- 5 or more doses ever.
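
The protection-at-birth rule above can be written as a small decision function. This is a sketch of the rule as stated in the description, not WHO code; the function and parameter names are hypothetical.

```python
def protected_at_birth(total_doses: int, years_since_last_dose: float) -> bool:
    """Apply the dose/recency rule from the PAB description.

    total_doses: all TT doses the mother has received, including those
        during the last pregnancy.
    years_since_last_dose: years between the last dose and the most
        recent delivery.
    """
    if total_doses >= 5:
        return True  # five or more doses ever: lifetime protection
    # Maximum allowed gap (in years) for 2, 3 and 4 doses.
    max_gap_years = {2: 3, 3: 5, 4: 10}
    limit = max_gap_years.get(total_doses)
    # Fewer than two doses never counts as protected.
    return limit is not None and years_since_last_dose <= limit


print(protected_at_birth(2, 3))   # two doses, last one 3 years before delivery
print(protected_at_birth(3, 6))   # three doses, but last one more than 5 years ago
print(protected_at_birth(5, 20))  # five doses ever
```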
### Measles-containing-vaccine second-dose (MCV2) immunization coverage by the nationally recommended age (%)
The percentage of children who have received two doses of measles containing vaccine (MCV2) in a given year, according to the nationally recommended schedule.

In [1]:
import pandas as pd
import matplotlib.pyplot as plt
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.metrics import mean_squared_error, r2_score, mean_absolute_error
from sklearn.model_selection import train_test_split
from statsmodels.tsa.seasonal import seasonal_decompose
from statsmodels.tsa.stattools import adfuller
from scipy import stats
import seaborn as sns 

sns.set_style("darkgrid")

In [2]:
dataset = pd.read_csv('Life expectancy at birth.csv', sep=",")

In [4]:
dataset

Unnamed: 0,Location,Period,Life expectancy at birth (years),Population using at least basic drinking-water services (%),Current health expenditure (CHE) as percentage of gross domestic product (GDP) (%),"Alcohol, recorded per capita (15+) consumption (in litres of pure alcohol)",Estimated number of people (all ages) living with HIV,General government expenditure on health as a percentage of total government expenditure,External resources for health as a percentage of total expenditure on health,Population practising open defecation (%),Population using at least basic sanitation services (%),Estimate of current tobacco use prevalence (%) (age-standardized rate),Hepatitis B (HepB3) immunization coverage among 1-year-olds (%),Neonates protected at birth against neonatal tetanus (PAB) (%),Measles-containing-vaccine second-dose (MCV2) immunization coverage by the nationally recommended age (%)
0,Afghanistan,2019,63.21,,,0.01,11000.0,,,,,,66.0,68.0,39.0
1,Afghanistan,2015,61.65,52.39,10.11,0.00,7600.0,,,14.18,40.71,,65.0,70.0,39.0
2,Afghanistan,2010,59.94,40.52,8.57,0.02,4600.0,14.40,25.54,18.40,34.18,,66.0,79.0,29.0
3,Afghanistan,2000,54.99,21.62,,,1500.0,,,26.02,23.52,,,32.0,
4,Albania,2019,78.00,,,4.40,1400.0,,,,,,99.0,96.0,96.0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
727,Zambia,2000,44.46,30.90,7.15,2.82,770000.0,13.34,12.69,24.50,23.66,,,78.0,
728,Zimbabwe,2019,60.68,,,3.11,1400000.0,,,,,,90.0,87.0,75.0
729,Zimbabwe,2015,58.48,51.03,7.45,3.84,1300000.0,,,25.61,37.57,,87.0,75.0,
730,Zimbabwe,2010,51.49,54.14,10.48,3.38,1200000.0,7.47,,27.12,41.05,15.6,90.0,76.0,
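
The preview shows empty cells for several indicators, and in the rows displayed they cluster in the 2019 period. A sketch of how missingness could be summarised as a share of rows and broken down by period is given below. It rebuilds only the four Afghanistan rows and two columns from the preview so that it runs on its own; the variable names `preview`, `missing_share` and `missing_by_period` are illustrative, and the same calls would apply to `dataset`.

```python
import numpy as np
import pandas as pd

water = "Population using at least basic drinking-water services (%)"
che = "Current health expenditure (CHE) as percentage of gross domestic product (GDP) (%)"

# The four Afghanistan rows from the preview, two indicator columns only.
preview = pd.DataFrame({
    "Location": ["Afghanistan"] * 4,
    "Period": [2019, 2015, 2010, 2000],
    water: [np.nan, 52.39, 40.52, 21.62],
    che: [np.nan, 10.11, 8.57, np.nan],
})

# Share of missing values per column (0.0 to 1.0) instead of raw counts.
missing_share = preview[[water, che]].isnull().mean()
print(missing_share)

# Number of missing values per period, to see whether gaps cluster in one year.
missing_by_period = (
    preview.set_index("Period")[[water, che]]
    .isnull()
    .groupby(level="Period")
    .sum()
)
print(missing_by_period)
```

A per-period breakdown of this kind helps decide whether to drop a whole period for an indicator or to impute individual values.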


In [5]:
dataset.isnull().sum()

Location                                                                                                       0
Period                                                                                                         0
Life expectancy at birth (years)                                                                               0
Population using at least basic drinking-water services (%)                                                  183
Current health expenditure (CHE) as percentage of gross domestic product (GDP) (%)                           201
Alcohol, recorded per capita (15+) consumption (in litres of pure alcohol)                                    12
Estimated number of people (all ages) living with HIV                                                         52
General government expenditure on health as a percentage of total government expenditure                     375
External resources for health as a percentage of total expenditure on health                    