# Project Title: Nigeria COVID-19 Data Analysis Using Python

# Introduction:

In [1]:
# Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus, and it has affected most
# parts of the world. Nigeria, a West African country, recorded its first case on 27th February 2020.
# Nigeria has 36 states plus the Federal Capital Territory (37 administrative units in total) and a fast-growing economy with
# about 200 million citizens. COVID-19 disrupted many activities in the country as it progressed from its first case
# to shutting down major airports, state-wide lockdowns, curfews, and efforts to revive the economy.
# The main aim of this report is to evaluate the negative impact of COVID-19 on Nigeria's economy and how the country
# managed to survive it.

### Analysis Questions

In [2]:
# Generate a plot that shows the Top 10 states in terms of Confirmed Covid cases by Laboratory test.
# Generate a plot that shows the Top 10 states in terms of Discharged Covid cases. Hint - Sort the values.
# Plot the top 10 Death cases.
# Generate a line plot for the total daily confirmed, recovered and death cases in Nigeria.
# Determine the daily infection rate; you can use the pandas diff method to find the derivative of the total cases.
# Generate a line plot for the above.
# Calculate maximum infection rate for a day (Number of new cases).
# Find the date.
# Determine the relationship between the external dataset (overall community vulnerability index) and the NCDC COVID-19 dataset.
# Determine the relationship between the external dataset and the NCDC COVID-19 dataset (Confirmed Cases and Population Density).
# Provide a summary of your observation.
# Determine the effect of the pandemic on the economy. To do this, you will compare the Real GDP value pre-COVID-19
# with the Real GDP in 2020 (the COVID-19 period, especially Q2 2020).
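
The daily infection rate question can be illustrated with a minimal sketch. The cumulative totals below are hypothetical stand-ins, not values from the Johns Hopkins data; the point is only how `diff()`, `max()` and `idxmax()` fit together.

```python
import pandas as pd

# Hypothetical cumulative confirmed-case totals; the real values would come
# from the Nigeria row of the Johns Hopkins confirmed-cases file.
cumulative = pd.Series(
    [100, 130, 190, 190, 310],
    index=pd.to_datetime(
        ["2020-04-01", "2020-04-02", "2020-04-03", "2020-04-04", "2020-04-05"]
    ),
    name="confirmed",
)

# diff() subtracts the previous day's total from each day's total,
# giving the number of new cases per day. The first day has no
# predecessor, so its NaN is filled with 0 here.
daily_new = cumulative.diff().fillna(0).astype(int)

# Largest single-day increase and the date it occurred on.
max_new_cases = daily_new.max()
max_date = daily_new.idxmax()

print(daily_new)
print("Maximum new cases in a day:", max_new_cases)
print("Date of maximum:", max_date.date())
```

With matplotlib installed, `daily_new.plot()` would draw the corresponding line plot.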

# Body:

### Data Overview:

In [3]:
# 1...covidnig.csv from NCDC Website, with 37 Rows and 5 Columns.
# 2...(a) Global Daily Confirmed Cases from Johns Hopkins Repository, with 289 Rows and 1147 Columns.
#   ...(b) Global Daily Recovered Cases from Johns Hopkins Repository, with 274 Rows and 1147 Columns.
#   ...(c) Global Daily Death Cases from Johns Hopkins Repository, with 289 Rows and 1147 Columns.
# 3...External Data from Ustacky Github Repository such as:
#   ...(a) covid_external.csv. It has 32 Rows and 12 Columns.
#   ...(b) Budget data.csv, with 37 Rows and 3 Columns.
#   ...(c) RealGDP.csv, with 7 Rows and 5 Columns.
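
The Johns Hopkins files are wide: one row per country or region, four descriptive columns, then one column per date. A rough sketch of pulling Nigeria's row out as a date-indexed series is shown below. The tiny frame is a stand-in with made-up values, and the variable names are assumptions rather than the notebook's own.

```python
import pandas as pd

# Tiny stand-in for the wide Johns Hopkins layout: four descriptive
# columns followed by one column per date (m/d/yy). Values are made up.
confirmed = pd.DataFrame(
    {
        "Province/State": [None, None],
        "Country/Region": ["Niger", "Nigeria"],
        "Lat": [0.0, 0.0],
        "Long": [0.0, 0.0],
        "1/22/20": [0, 0],
        "1/23/20": [0, 1],
        "1/24/20": [1, 3],
    }
)

# Keep only the Nigeria row, drop the descriptive columns, and take the
# single remaining row as a Series whose index is the date labels.
meta_cols = ["Province/State", "Country/Region", "Lat", "Long"]
nigeria = confirmed[confirmed["Country/Region"] == "Nigeria"]
series = nigeria.drop(columns=meta_cols).iloc[0]

# Parse the m/d/yy column labels into real timestamps.
series.index = pd.to_datetime(series.index, format="%m/%d/%y")
series = series.astype(int).rename("confirmed")
print(series)
```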

### Methods/Approaches:

#### 1...Data Collection

In [4]:
#1...covidnig.csv from NCDC Website
#2...(a) Global Daily Confirmed Cases from Johns Hopkins Repository
#...(b) Global Daily Recovered Cases from Johns Hopkins Repository
#...(c) Global Daily Death Cases from Johns Hopkins Repository
#3...External Data such as:
#...(a) covid_external.csv
#...(b) Budget data.csv
#...(c) RealGDP.csv

#### 2...Reading The Datasets

In [5]:
# import pandas as pd
# data = pd.read_csv('covidnig.csv')
# confirmed = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv')
# recovered = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_recovered_global.csv')
# deaths = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_deaths_global.csv')
# covid_external = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/covid_external.csv')
# BG = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/Budget%20data.csv')
# RG = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/RealGDP.csv')
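
For readers without network access, the same `pd.read_csv` call can be tried on in-memory text. The sketch below uses invented column names and values, and it assumes the scraped figures may carry thousands separators, which is why `thousands=","` is passed.

```python
import io
import pandas as pd

# Stand-in CSV text so the example runs offline; with the real file this
# would be pd.read_csv('covidnig.csv') or one of the URLs above.
# Column names and values are illustrative assumptions.
csv_text = """State,Confirmed,Discharged,Deaths
Lagos,"1,200","1,000",20
Kano,300,250,5
"""

# thousands=',' lets pandas parse figures such as "1,200" as integers
# instead of leaving them as strings.
data = pd.read_csv(io.StringIO(csv_text), thousands=",")

print(data.shape)   # (rows, columns)
print(data.dtypes)
```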

#### 3...Viewing The Datasets

In [6]:
# data = pd.read_csv('covidnig.csv')
# confirmed = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv')
# recovered = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_recovered_global.csv')
# deaths = pd.read_csv('https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_deaths_global.csv')
# covid_external = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/covid_external.csv')
# BG = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/Budget%20data.csv')
# RG = pd.read_csv('https://raw.githubusercontent.com/Ustacky-dev/Nigeria-COVID-19-Data-Analysis-Using-Python/main/RealGDP.csv')
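
Viewing a loaded frame is usually done with `head()`, `tail()` and the `columns` attribute. A minimal sketch on a small stand-in frame (names and values are illustrative only):

```python
import pandas as pd

# Small stand-in frame; names and values are illustrative only.
data = pd.DataFrame(
    {
        "State": ["Lagos", "FCT", "Kano", "Kogi"],
        "Confirmed": [1200, 600, 300, 5],
    }
)

first_rows = data.head(2)           # first two rows
last_rows = data.tail(1)            # last row
column_names = list(data.columns)   # column labels as a plain list

print(first_rows)
print(last_rows)
print(column_names)
```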

#### 4...Data Cleaning and Preparation

In [7]:
# 1...Checking the number of rows and columns using the .shape attribute.
# 2...Checking the statistical summary of the dataframe using the pandas method .describe().
# 3...Checking for missing values using the pandas methods .isnull().sum().
# 4...Changing column names where necessary.
# 5...Checking data information using the pandas method .info().
# 6...Dropping columns that were not needed using .drop().
# 7...Renaming columns.
# 8...Dealing with datetime features.
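
The cleaning steps listed above could look roughly like the following. This is a sketch on a small stand-in frame: the column names, the `Unused` column and the values are assumptions for illustration, not the notebook's actual data.

```python
import pandas as pd

# Stand-in frame; column names and values are illustrative assumptions.
df = pd.DataFrame(
    {
        "States Affected": ["Lagos", "Kano", "Kogi"],
        "No. of Cases (Lab Confirmed)": [1200, 300, None],
        "Unused": ["x", "y", "z"],
        "Date": ["2020-04-01", "2020-04-02", "2020-04-03"],
    }
)

print(df.shape)              # step 1: (rows, columns); shape is an attribute
print(df.describe())         # step 2: summary statistics for numeric columns
missing = df.isnull().sum()  # step 3: count of missing values per column
df.info()                    # step 5: dtypes and non-null counts

# Step 6: drop columns that are not needed.
df = df.drop(columns=["Unused"])

# Steps 4 and 7: rename columns to shorter names.
df = df.rename(
    columns={
        "States Affected": "state",
        "No. of Cases (Lab Confirmed)": "confirmed",
    }
)

# Step 8: convert date strings to datetime64 values.
df["Date"] = pd.to_datetime(df["Date"])

print(missing)
print(df.dtypes)
```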

### Analysis:

### Results:

In [8]:
# Lagos State had the highest number of laboratory-confirmed cases of COVID-19,
# while Kogi State recorded the lowest number of laboratory-confirmed cases.
# Lagos State had the highest number of people discharged from COVID-19 isolation centers.
# Kogi State had the lowest number of people discharged.
# Lagos State recorded the highest number of COVID-19 deaths.
# Kogi State recorded the lowest number of COVID-19 deaths.
# The rate of daily confirmed COVID-19 cases was much higher in March 2023.
# The rate of daily recovered cases rose from July 2021 to August 2021.
# The daily death cases from November 2022 to March 2023 remained unchanged; there was no increase.
# The daily infection rate increased within 2 days from over 2000 to over 6000.
# The spike in the infection rate occurred between '2023-02-28' and '2023-03-01'.
# A negative relationship exists between the Overall Community Vulnerability Index (CCVI) and the laboratory-confirmed cases:
# as the overall CCVI decreases, laboratory-confirmed cases rise.
# The initial state budgets were revised and reduced due to the effects of COVID-19, and this had adverse effects on the
# economy.
# There was a positive relationship between population and laboratory-confirmed cases: as population
# increases (denser population), more people were infected with the COVID-19 virus.
# The Real GDP value in Q1 2020 was much higher, but in Q2, Q3 and Q4 2020 the Real GDP value fell.
# In fact, according to this analysis, there was no Real GDP value for Q4 2020. This could be attributed to the spread of the
# virus in some parts of the country, which had a greater negative effect on the Nigerian economy.
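
Two of the results above, the state rankings and the CCVI relationship, could be checked along the following lines. This is a sketch with made-up per-state figures and hypothetical column names (`state`, `confirmed`, `ccvi`). It uses `nlargest` for the ranking and the Pearson coefficient from `Series.corr` for the relationship, which may differ from what the notebook actually computed.

```python
import pandas as pd

# Illustrative per-state figures; names and numbers are made up.
states = pd.DataFrame(
    {
        "state": ["A", "B", "C", "D", "E"],
        "confirmed": [500, 50, 300, 120, 20],
        "ccvi": [0.1, 0.8, 0.2, 0.5, 0.9],
    }
)

# Ranking: nlargest sorts by the given column and keeps the top n rows.
# n=3 is used here only because the sample is small; the report uses 10.
top = states.nlargest(3, "confirmed")

# Pearson correlation between the vulnerability index and confirmed cases.
# A value below zero means cases tend to fall as the index rises.
r = states["ccvi"].corr(states["confirmed"])

print(top)
print("Pearson r:", round(r, 3))
```

A negative `r` supports the direction of the relationship only; it says nothing about cause.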

In [9]:
# Conclusion:

# The emergence of coronavirus disease led to the shutdown of major airports and seaports, state-wide lockdowns, and curfews
# in Nigeria. This affected the economy negatively, as reflected in the analysis. A country whose Gross Domestic Product (GDP)
# falls for one reason or another experiences an economic meltdown.