As a Master's student, I am conducting an exploratory analysis of COVID-19 italian data to serve as the foundation of my Master's Thesis. The thesis aims to investigate the impact of the pandemic on businesses' bankruptcies in the Emilia-Romagna region.
In this analysis, I explore the available COVID-19 data from the official GitHub page of the Italian Civil Protection Department. The Civil Protection Department is a government agency in Italy responsible for handling emergency situations and protecting the population from natural disasters, accidents, and other hazards.
The analysis includes data cleaning, data manipulation, and visualization to aid in further analysis and enhance the research with informative and helpful figures. Given that the Emilia-Romagna region is the subject of my thesis, special attention will be devoted to it: in this repository you will find filtered datasets to lay the groundwork for further research into the virus's economic consequences in the region.
The COVID-19 data used in this project is publicly available from the Italian Government's GitHub repository under CC-BY-4.0 licence. Population data1 and shapefiles2 are provided by the Italian National Institute of Statistics ISTAT. A shapefile that is coherent with NUTS2 classification was retrieved from OnData Association Github repository.
To run the analysis, you must have Python 3.x and the required libraries installed. The required libraries are listed and imported in the setup.ipynb
notebook.
The dpc-covid19
repository contains the following folders:
scripts
that contain Jupyter notebook filesconf
, if necessary, that contains configuration files used in scripts or Jupyter notebook filesdata
, that contains input files in the.csv
format along with geographic filesresults
that contains output files, usually in the.csv
formatfigures
that contains plot filesreference
that contains any possibly referenced resource.
References, Inspiration, Code Snippets, etc.