<a href="https://colab.research.google.com/github/sidbannet/COVID-19_analysis/blob/develop/COVID_19_tracker.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Author Bio

[LinkedIn profile](https://www.linkedin.com/in/sidban)

[Resume](https://docs.google.com/document/d/1uVc9le7LM2WMmGM4ub9w2uI1FY7I63h7SBxNGyIItWc/edit?usp=sharing)

[GitHub](https://github.com/sidbannet?tab=repositories)

---
I develop **high-performance computation models** to understand *turbulent flow*, *multi-phase flow* and *combustion flames*. I apply **data science** to accelerate design innovations in *propulsion* devices.

I received a **PhD** from the **University of Wisconsin - Madison** in 2011 with a major in **Mechanical and Chemical Engineering** and a distributed minor in *Mathematics*, *Statistics* and *Computer Science*.

I received recognition for my work in clean propulsion innovation from the [United States Department of Energy](https://www.energy.gov/eere/vehicles/vehicle-technologies-office) and [Dr. Steven Chu](https://en.wikipedia.org/wiki/Steven_Chu).

# About COVID-19 tracker


---
This is a 2019 Novel Coronavirus visual dashboard operated by [Siddhartha Banerjee](https://www.linkedin.com/in/sidban), using data published by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). That data effort is also supported by the ESRI Living Atlas Team and the Johns Hopkins University Applied Physics Lab (JHU APL).

**Data Sources**:

*   World Health Organization (WHO): https://www.who.int/
*   DXY.cn. Pneumonia. 2020: http://3g.dxy.cn/newh5/view/pneumonia
*   BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
*   National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml
*   China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm
*   Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html
*   Macau Government: https://www.ssm.gov.mo/portal/
*   Taiwan CDC: https://sites.google.com/cdc.gov.tw/2019ncov/taiwan?authuser=0
*   US CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html
*   Government of Canada: https://www.canada.ca/en/public-health/services/diseases/coronavirus.html
*   Australia Government Department of Health: https://www.health.gov.au/news/coronavirus-update-at-a-glance
*   European Centre for Disease Prevention and Control (ECDC): https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases
*   Ministry of Health Singapore (MOH): https://www.moh.gov.sg/covid-19
*   Italy Ministry of Health: http://www.salute.gov.it/nuovocoronavirus
*   1Point3Acres: https://coronavirus.1point3acres.com/en
*   WorldoMeters: https://www.worldometers.info/coronavirus/

Additional Information about the Visual Dashboard: https://systems.jhu.edu/research/public-health/ncov/

**Contact Me**: 
sidban@uwalumni.com


# Clone the Git repository

---

*   Clone code and data repository
*   Set up environment variables

In [0]:
# Clone repository
!git clone https://github.com/sidbannet/COVID-19_analysis.git
%cd ./COVID-19_analysis
!git submodule update --remote
!git checkout develop

# Load packages
---
*   Analysis tools
*   Plotting tools

In [0]:
# Import necessary modules
from tools import collection as cll
import plotly.tools as tls
from plotly.offline import iplot
import plotly.express as px

# Parse data from the database
---

*   Set up classes containing the data and methods to parse data
*   Parse the data



In [0]:
# Setup data class and parse the database
d = cll.DataClass()
try:
  d.parse()
except FileNotFoundError:
  # Fall back to the pre-compiled CSV files when the raw database is not found
  compiled_case_dir = r'compiled_data'
  d.conf = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'confirmed_cases', index_col=0)
  d.conf_us = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'confirmed_cases_US', index_col=0)
  d.dead = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'death_cases', index_col=0)
  d.dead_us = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'death_cases_US', index_col=0)
  d.recov = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'recovered_cases', index_col=0)
  d.recov_us = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'recovered_cases_US', index_col=0)
  d.df_global = cll.pd.read_csv(compiled_case_dir + cll.os.sep + 'compiled_data.csv', index_col=0)

Optionally, plot the rate of increase in COVID-19 cases against the total number of cases to check the trends.

In [0]:
#@title
# Setup figure objects
d.__window__ = 5
fig, ax = d.plots()
[axes.legend() for axes in ax.flat]
fig.set_size_inches(w=24, h=12)
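
The comparison of rate of increase against total cases can be sketched independently of `d.plots()`, whose implementation is not shown in this notebook. The following is a minimal sketch, assuming the cumulative confirmed counts are held in a pandas Series and assuming the 5-day `__window__` acts as a rolling-mean window over daily new cases. The function name `rate_vs_total` and the sample numbers are hypothetical.

```python
import pandas as pd


def rate_vs_total(cumulative: pd.Series, window: int = 5) -> pd.DataFrame:
    """Pair the total case count with a smoothed daily increase.

    Hypothetical helper: `window` is assumed to play the role of
    `d.__window__`, i.e. the number of days in the rolling mean.
    """
    # Daily new cases are the day-over-day difference of the cumulative count
    daily_new = cumulative.diff()
    # Smooth reporting noise with a rolling mean over `window` days
    smoothed = daily_new.rolling(window=window).mean()
    out = pd.DataFrame({"total": cumulative, "daily_increase": smoothed})
    # Drop the leading rows where the rolling window is not yet full
    return out.dropna()


# Hypothetical cumulative counts, for illustration only
cases = pd.Series([100, 150, 225, 340, 510, 765, 1150, 1725])
trend = rate_vs_total(cases, window=5)
```

When `daily_increase` is plotted against `total` on log-log axes, exponential growth shows up as a straight line, because the daily increase is then proportional to the total.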

# Plot COVID-19 trends
---
Plot COVID-19 time series data on a per-country and per-state basis, showing the
*   Number of confirmed cases
*   Number of deaths from COVID
*   Number of recoveries from COVID

These variables are plotted on a log scale against days since the initial outbreak to highlight the exponential growth of the pandemic.

In [0]:
# Make some meaningful timeseries plots
fig, ax = d.plots_timeseries(n_outbreak=500, n_filter_country=10000, n_filter_state=5000)
fig.set_size_inches(w=18, h=6)
_ = [axes.set_ylim([10, 50000]) for axes in ax[:, 1].flat]
_ = ax[0, 0].set_xlim([0, 40])

In [0]:
# Convert and plot in plotly
plotly_fig = tls.mpl_to_plotly(fig) 
iplot(plotly_fig)
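
Plotting against "days since initial outbreak" requires shifting each region's series so that day zero is the day it crossed a case threshold. The following is a minimal sketch of that alignment, assuming `n_outbreak` is the case count that marks day zero and assuming one observation per day. The function `days_since_outbreak` and the sample data are hypothetical and are not taken from `d.plots_timeseries`.

```python
import pandas as pd


def days_since_outbreak(cumulative: pd.Series, n_outbreak: int = 500) -> pd.Series:
    """Re-index a daily cumulative series by days since it reached n_outbreak.

    Hypothetical helper; assumes the counts never decrease.
    """
    # Keep only the days on or after the threshold was reached
    reached = cumulative[cumulative >= n_outbreak]
    # Replace calendar dates with 0, 1, 2, ... days since the outbreak
    return reached.reset_index(drop=True)


# Hypothetical daily cumulative counts for two regions
dates = pd.date_range("2020-03-01", periods=6, freq="D")
country_a = pd.Series([120, 300, 520, 900, 1600, 2800], index=dates)
country_b = pd.Series([400, 650, 1000, 1500, 2300, 3400], index=dates)

# Regions that crossed the threshold later have fewer rows, padded with NaN
aligned = pd.DataFrame({
    "A": days_since_outbreak(country_a),
    "B": days_since_outbreak(country_b),
})
```

With the series aligned this way, a call such as `aligned.plot(logy=True)` puts all regions on a common time axis, so curves with the same growth rate appear parallel.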

# Global spread of COVID-19

In the `geoscatter` animation below, the bubble size represents the reported number of COVID-19 cases. The bubble color represents the daily growth rate in the number of cases. Each animation frame corresponds to a date, starting from Jan 22, 2020.

In [0]:
fig = px.scatter_geo(d.df_global, locations="iso_alpha", color="rate",
                     color_continuous_scale='rainbow', range_color=[1.0, 2.5],
                     hover_name="country", size="confirmed",
                     animation_frame="date",
                     projection="natural earth")
fig.show()
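
How the `rate` column in `d.df_global` is computed is not shown in this notebook. The following is a minimal sketch of one plausible definition, assuming the rate is the ratio of a day's confirmed count to the previous day's count for the same country, which would be consistent with the `range_color=[1.0, 2.5]` setting. The function `add_growth_rate` and the sample frame are hypothetical.

```python
import pandas as pd


def add_growth_rate(df: pd.DataFrame) -> pd.DataFrame:
    """Add a day-over-day growth ratio per country.

    Hypothetical helper; assumes columns named `country`, `date`
    and `confirmed`, as used in the scatter_geo call.
    """
    out = df.sort_values(["country", "date"]).copy()
    # Previous day's count within each country (NaN on the first day)
    previous = out.groupby("country")["confirmed"].shift(1)
    # A ratio of 1.0 means no growth; 2.0 means cases doubled in a day
    out["rate"] = out["confirmed"] / previous
    return out


# Hypothetical long-format data, for illustration only
sample = pd.DataFrame({
    "country": ["X", "X", "X", "Y", "Y", "Y"],
    "date": ["2020-01-22", "2020-01-23", "2020-01-24"] * 2,
    "confirmed": [10, 15, 30, 100, 110, 121],
})
with_rate = add_growth_rate(sample)
```

The first day of each country has no previous value, so its rate is `NaN`, and a previous-day count of zero yields an infinite ratio. Both cases would need to be filled or filtered before the column is passed to `color`.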