In [1]:
import pandas as pd

from IPython.display import display, HTML
display(HTML("<style>.container { width:100% !important; }</style>"))

## Maringá Dataset [[1-3]](#Sources) (```maringa_data.csv```)

In [2]:
df_maringa = pd.read_csv('data/maringa_data.csv',
                         parse_dates = ['date'],
                         dtype = {
                             'cases': float, 
                             'deaths': int, 
                             'deaths_0_19': int, 
                             'deaths_20_29': int,
                             'deaths_30_39': int, 
                             'deaths_40_49': int, 
                             'deaths_50_59': int, 
                             'deaths_over_59': int,
                             'vaccine_first_dose': float, 
                             'vaccine_second_dose': float, 
                             'vaccine_third_dose': float,
                             'occupancy_infirmary': float, 
                             'occupancy_icu': float, 
                             'occupancy_infirmary_covid': float,
                             'occupancy_icu_covid': float, 
                             'occupancy_percentage_infirmary': float,
                             'occupancy_percentage_icu': float, 
                             'occupancy_percentage_infirmary_covid': float,
                             'occupancy_percentage_icu_covid': float, 
                             'total_beds_infirmary': float,
                             'total_beds_icu': float, 
                             'total_beds_infirmary_covid': float,
                             'total_beds_icu_covid': float,
                             'rt': float, 
                             'rt_25': float, 
                             'rt_975': float, 
                             'mobility_recreation': float, 
                             'mobility_grocery': float,
                             'mobility_parks': float, 
                             'mobility_transit': float, 
                             'mobility_workplace': float,
                             'mobility_residence': float
                         })
df_maringa.head(5)

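|  | date | cases | deaths | deaths_0_19 | deaths_20_29 | deaths_30_39 | deaths_40_49 | deaths_50_59 | deaths_over_59 | vaccine_first_dose | ... | total_beds_icu_covid | rt | rt_25 | rt_975 | mobility_recreation | mobility_grocery | mobility_parks | mobility_transit | mobility_workplace | mobility_residence |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2020-03-18 | 1.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | NaN | NaN | NaN | NaN | -25.0 | 14.0 | -57.0 | -2.0 | 2.0 | 5.0 |
| 1 | 2020-03-19 | 1.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | NaN | NaN | NaN | NaN | -25.0 | 7.0 | -51.0 | -3.0 | -5.0 | 7.0 |
| 2 | 2020-03-20 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | NaN | NaN | NaN | NaN | -75.0 | -25.0 | -68.0 | -34.0 | -31.0 | 20.0 |
| 3 | 2020-03-21 | 1.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | NaN | NaN | NaN | NaN | -85.0 | -35.0 | -81.0 | -54.0 | -47.0 | 22.0 |
| 4 | 2020-03-22 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | NaN | NaN | NaN | NaN | -89.0 | -87.0 | -89.0 | -65.0 | -48.0 | 22.0 |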


**Columns**

<span style="line-height: 1.75em;">
    
- **date** (```datetime```, ```YYYY-mm-dd```): Date.

- **cases** (```float```): Daily cases of COVID-19.

- **deaths** (```int```): Daily deaths by COVID-19.

- **deaths_0_19** (```int```): Daily deaths by COVID-19 for ages between 0 and 19 years.

- **deaths_20_29** (```int```): Daily deaths by COVID-19 for ages between 20 and 29 years.

- **deaths_30_39** (```int```): Daily deaths by COVID-19 for ages between 30 and 39 years.

- **deaths_40_49** (```int```): Daily deaths by COVID-19 for ages between 40 and 49 years.

- **deaths_50_59** (```int```): Daily deaths by COVID-19 for ages between 50 and 59 years.

- **deaths_over_59** (```int```): Daily deaths by COVID-19 for ages above 59 years.
  
- **vaccine_first_dose** (```float```): Number of administered first doses of the COVID-19 vaccine.

- **vaccine_second_dose** (```float```): Number of administered second doses of the COVID-19 vaccine.

- **vaccine_third_dose** (```float```): Number of administered third doses of the COVID-19 vaccine.

- **occupancy_infirmary** (```float```): Number of occupied infirmary beds.

- **occupancy_icu** (```float```): Number of occupied ICU beds.

- **occupancy_infirmary_covid** (```float```): Number of occupied infirmary beds (exclusive for COVID-19 patients).

- **occupancy_icu_covid** (```float```): Number of occupied ICU beds (exclusive for COVID-19 patients).

- **occupancy_percentage_infirmary** (```float```): Percentage of occupied infirmary beds.

- **occupancy_percentage_icu** (```float```): Percentage of occupied ICU beds.

- **occupancy_percentage_infirmary_covid** (```float```): Percentage of occupied infirmary beds (exclusive for COVID-19 patients).

- **occupancy_percentage_icu_covid** (```float```): Percentage of occupied ICU beds (exclusive for COVID-19 patients).

- **total_beds_infirmary** (```float```): Total number of infirmary beds.

- **total_beds_icu** (```float```): Total number of ICU beds.

- **total_beds_infirmary_covid** (```float```): Total number of infirmary beds (exclusive for COVID-19 patients).

- **total_beds_icu_covid** (```float```): Total number of ICU beds (exclusive for COVID-19 patients). 

- **rt** (```float```): Instantaneous reproductive number.

- **rt_25** (```float```): Lower bound of the confidence interval for the instantaneous reproductive number (2.5th percentile).

- **rt_975** (```float```): Upper bound of the confidence interval for the instantaneous reproductive number (97.5th percentile).

- **mobility_recreation** (```float```): Changes in mobility for retail & recreation locations.*

- **mobility_grocery** (```float```): Changes in mobility for grocery & pharmacy locations.*

- **mobility_parks** (```float```): Changes in mobility for parks.*

- **mobility_transit** (```float```): Changes in mobility for transit stations.*

- **mobility_workplace** (```float```): Changes in mobility for workplaces.*

- **mobility_residence** (```float```): Changes in mobility for residential locations.*
    
</span>
*Percentage change in the number of visits and length of stay relative to the baseline, defined as the median value for the corresponding day of the week during the five-week period from 3 January 2020 to 6 February 2020.
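
As a small illustration of working with these columns, the sketch below smooths daily cases with a 7-day rolling mean and recomputes an ICU occupancy percentage from the bed counts. The toy values and the names `toy`, `cases_7d`, and `icu_pct_check` are assumptions for illustration only; the source does not state how the dataset's percentage columns were derived.

```python
import pandas as pd

# Toy frame using column names documented above (values are made up).
toy = pd.DataFrame({
    "date": pd.date_range("2021-01-01", periods=8, freq="D"),
    "cases": [10.0, 12.0, 8.0, 15.0, 20.0, 18.0, 22.0, 24.0],
    "occupancy_icu": [40.0, 42.0, 45.0, 44.0, 46.0, 48.0, 50.0, 49.0],
    "total_beds_icu": [50.0] * 8,
})

# 7-day rolling mean of daily cases; the first six rows are NaN.
toy["cases_7d"] = toy["cases"].rolling(window=7).mean()

# Occupancy as a percentage of available beds (hypothetical check column).
toy["icu_pct_check"] = 100 * toy["occupancy_icu"] / toy["total_beds_icu"]
```

On the real data, the same two lines could be applied to `df_maringa` directly, keeping in mind that several columns contain missing values in the early rows.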

## Maringá Excess Deaths Dataset [[2]](#Sources) (```maringa_excess_deaths_data.csv```)

In [3]:
df_excess_deaths_maringa = pd.read_csv('data/maringa_excess_deaths_data.csv',
                                       parse_dates = ['date'],
                                       dtype = {
                                           'death_excess': int
                                       })
df_excess_deaths_maringa.head(5)

|  | date | death_excess |
|---|---|---|
| 0 | 2020-03-01 | 139 |
| 1 | 2020-04-01 | 94 |
| 2 | 2020-05-01 | 20 |
| 3 | 2020-06-01 | -1 |
| 4 | 2020-07-01 | 28 |


**Columns**

<span style="line-height: 1.75em;">
    
- **date** (```datetime```, ```YYYY-mm-dd```): Date.

- **death_excess** (```int```): Monthly number of excess deaths in Maringá in 2020, 2021, and 2022, using the monthly deaths in 2019 as the reference.
    
</span>
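
A minimal sketch of how such an excess could be computed, assuming each month is compared with the same calendar month of 2019 (the source only says that the 2019 monthly deaths are the reference). The `monthly` frame, its `deaths_all_causes` column, and all numbers are hypothetical.

```python
import pandas as pd

# Hypothetical monthly death totals (made-up numbers, not from the dataset).
monthly = pd.DataFrame({
    "date": pd.to_datetime(["2019-03-01", "2019-04-01", "2020-03-01", "2020-04-01"]),
    "deaths_all_causes": [200, 210, 230, 205],
})

# 2019 totals, indexed by calendar month, serve as the reference.
baseline = (
    monthly[monthly["date"].dt.year == 2019]
    .assign(month=lambda d: d["date"].dt.month)
    .set_index("month")["deaths_all_causes"]
)

# Excess = observed deaths minus the 2019 value for the same calendar month.
later = monthly[monthly["date"].dt.year > 2019].copy()
later["death_excess"] = (
    later["deaths_all_causes"] - later["date"].dt.month.map(baseline)
)
```

A negative value, as in the toy April row, means fewer deaths were registered than in the reference month.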

## Maringá Decrees Dataset [[4]](#Sources) (```maringa_decree_data.csv```)

In [4]:
df_decrees_maringa = pd.read_csv('data/maringa_decree_data.csv',
                                 parse_dates = ['date'],
                                 dtype = {
                                     'decree_summary': str,
                                     'flexibilization': bool
                                 })
df_decrees_maringa.head(5)

|  | date | decree_summary | flexibilization |
|---|---|---|---|
| 0 | 2020-03-16 | "Decreto 436/2020. Um decreto meio efetivo, me... | False |
| 1 | 2020-03-18 | "Decreto 445/2020. Situação de emergência e fe... | False |
| 2 | 2020-03-20 | "Decreto 461/2020. Suspensão do transporte int... | False |
| 3 | 2020-03-23 | "Decretos 462 e 464/2020. Toque de recolher, s... | False |
| 4 | 2020-04-01 | "Decreto 489/2020. Liberação de padarias, açou... | True |


**Columns**

<span style="line-height: 1.75em;">
 
- **date** (```datetime```, ```YYYY-mm-dd```): Date.

- **decree_summary** (```str```): Summary of a decree implementing or lifting a non-pharmaceutical intervention (NPI). All summaries are written in Portuguese.

- **flexibilization** (```bool```): Whether the decree lifts (True) or implements (False) NPIs.
    
</span>
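
For illustration, the `flexibilization` flag can be used to separate lifting decrees from implementing ones. The sketch below uses a toy frame with placeholder summaries; the names `decrees`, `lifting`, and `n_lifting_by_month` are assumptions, not part of the dataset.

```python
import pandas as pd

# Toy rows mirroring the documented columns (summaries are placeholders).
decrees = pd.DataFrame({
    "date": pd.to_datetime(["2020-03-16", "2020-03-18", "2020-04-01"]),
    "decree_summary": ["placeholder A", "placeholder B", "placeholder C"],
    "flexibilization": [False, False, True],
})

# Decrees that lifted restrictions.
lifting = decrees[decrees["flexibilization"]]

# Number of lifting decrees per calendar month (True counts as 1 when summed).
n_lifting_by_month = (
    decrees.groupby(decrees["date"].dt.to_period("M"))["flexibilization"].sum()
)
```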

## Maringá NPIs Dataset [[4]](#Sources)  (```maringa_npi_data.csv```)

In [5]:
df_npi_intensity = pd.read_csv('data/maringa_npi_data.csv',
                               parse_dates=['date'],
                               index_col='date'
                              )
df_npi_intensity.head(5)

| date | Public events | Private gatherings | Religious gatherings | Open public areas | Parks | Shopping malls | Gyms | Cinemas | Theaters | Nightclubs | ... | Bars | Restaurants | Most nonessential businesses | Schools | Universities | Curfew (0h - 5h) | Curfew (23h - 5h) | Curfew (22h - 5h) | Curfew (21h - 5h) | Curfew (20h - 5h) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2020-02-24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2020-02-25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2020-02-26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2020-02-27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2020-02-28 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |


<span style="line-height: 1.75em;">
    
- **Index** (```datetime```, ```YYYY-mm-dd```): Date.

- **Columns** (```int```): Non-pharmaceutical intervention (NPI) type.
    
</span>
The values in this dataframe indicate the absence (0) or the restriction intensity (1 to 5) of an NPI; higher values mean stricter restrictions.
The NPIs 'Open public areas', 'Parks', 'Shopping malls', 'Gyms', 'Cinemas', 'Theaters', 'Nightclubs', 'Clubs and associations', 'Bars', 'Restaurants', 'Most nonessential businesses', 'Schools', and 'Universities' refer to restrictions on people staying at these locations.
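
As a sketch of how the intensity encoding can be queried, the example below counts the NPIs in force on each day and lists those active on one date. It uses a toy table with three of the NPI columns and made-up values; `npi`, `active_count`, and `active_on_day` are illustrative names.

```python
import pandas as pd

# Toy intensity table with a date index and three NPI columns (values made up).
npi = pd.DataFrame(
    {"Parks": [0, 3, 5], "Bars": [0, 0, 4], "Schools": [0, 5, 5]},
    index=pd.to_datetime(["2020-03-01", "2020-03-20", "2020-03-23"]).rename("date"),
)

# Number of NPIs in force on each day (any non-zero intensity).
active_count = (npi > 0).sum(axis=1)

# Names of the NPIs in force on a given date.
day = npi.loc[pd.Timestamp("2020-03-20")]
active_on_day = day[day > 0].index.tolist()
```

Comparing against 0 treats every intensity from 1 to 5 as "in force"; a stricter threshold (for example `npi >= 3`) would select only the more intense restrictions.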

## Brazilian States NPIs Dataset [[5]](#Sources) (```states_npi_data.csv```)

In [6]:
df_npi_states = pd.read_csv('data/states_npi_data.csv')
df_npi_states.head(5)


|  | CountryName | CountryCode | RegionName | RegionCode | Jurisdiction | Date | C1_School closing | C1_Flag | C1_Notes | C2_Workplace closing | ... | StringencyIndex | StringencyIndexForDisplay | StringencyLegacyIndex | StringencyLegacyIndexForDisplay | GovernmentResponseIndex | GovernmentResponseIndexForDisplay | ContainmentHealthIndex | ContainmentHealthIndexForDisplay | EconomicSupportIndex | EconomicSupportIndexForDisplay |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Brazil | BRA | Acre | BR_AC | STATE_TOTAL | 20200101 | 0.0 | NaN | NaN | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | Brazil | BRA | Acre | BR_AC | STATE_TOTAL | 20200102 | 0.0 | NaN | NaN | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | Brazil | BRA | Acre | BR_AC | STATE_TOTAL | 20200103 | 0.0 | NaN | NaN | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | Brazil | BRA | Acre | BR_AC | STATE_TOTAL | 20200104 | 0.0 | NaN | NaN | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | Brazil | BRA | Acre | BR_AC | STATE_TOTAL | 20200105 | 0.0 | NaN | NaN | 0.0 | ... | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |


Check the [Oxford Covid-19 Government Response Tracker (OxCGRT) documentation](https://github.com/OxCGRT/covid-policy-tracker/blob/master/documentation/codebook.md) for details.
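
Unlike the other files, this one stores `Date` as an integer in `YYYYMMDD` form. The sketch below shows one way to convert it and select a single state by `RegionCode`. It runs on a toy frame with a handful of the columns shown above; the values and the name `acre` are assumptions for illustration.

```python
import pandas as pd

# Toy rows using a few of the column names shown above (values made up).
states = pd.DataFrame({
    "RegionName": ["Acre", "Acre", "Paraná"],
    "RegionCode": ["BR_AC", "BR_AC", "BR_PR"],
    "Date": [20200101, 20200102, 20200101],
    "StringencyIndex": [0.0, 0.0, 0.0],
})

# Convert the integer YYYYMMDD dates to datetimes.
states["Date"] = pd.to_datetime(states["Date"].astype(str), format="%Y%m%d")

# Select one state by its region code and index it by date.
acre = states[states["RegionCode"] == "BR_AC"].set_index("Date")
```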

## Brazil and Cities Dataset [[2-3, 6-8]](#Sources) (```country_and_cities_data.csv```)

In [7]:
df_country_and_cities_data = pd.read_csv('data/country_and_cities_data.csv',
                                         parse_dates = ['date'])
df_country_and_cities_data.head(5)

|  | date | cases_br | deaths_br | deaths_br_0_19 | deaths_br_20_29 | deaths_br_30_39 | deaths_br_40_49 | deaths_br_50_59 | deaths_br_over_59 | vaccine_first_dose_br | ... | deaths_sao_jose_do_rio_preto | rt_sao_jose_do_rio_preto | rt_25_sao_jose_do_rio_preto | rt_975_sao_jose_do_rio_preto | mobility_recreation_sao_jose_do_rio_preto | mobility_grocery_sao_jose_do_rio_preto | mobility_parks_sao_jose_do_rio_preto | mobility_transit_sao_jose_do_rio_preto | mobility_workplace_sao_jose_do_rio_preto | mobility_residence_sao_jose_do_rio_preto |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2020-02-26 | 1.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | 0 | NaN | NaN | NaN | -17.0 | -8.0 | -20.0 | -6.0 | -5.0 | 6.0 |
| 1 | 2020-02-27 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | 0 | NaN | NaN | NaN | -4.0 | 7.0 | -4.0 | 14.0 | 10.0 | 0.0 |
| 2 | 2020-02-28 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | NaN | ... | 0 | NaN | NaN | NaN | -4.0 | 7.0 | -4.0 | 6.0 | 12.0 | 0.0 |
| 3 | 2020-02-29 | 1.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | 0 | NaN | NaN | NaN | -1.0 | 7.0 | 4.0 | 2.0 | 8.0 | 2.0 |
| 4 | 2020-03-01 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | ... | 0 | NaN | NaN | NaN | -1.0 | 7.0 | -11.0 | 4.0 | 3.0 | 2.0 |


**Columns**

<span style="line-height: 1.75em;">
    
- **date** (```datetime```, ```YYYY-mm-dd```): Date.

- **cases_br** (```float```): Daily cases of COVID-19 in Brazil.

- **deaths_br** (```int```): Daily deaths by COVID-19 in Brazil.

- **deaths_br_0_19** (```int```): Daily deaths by COVID-19 for ages between 0 and 19 years in Brazil.

- **deaths_br_20_29** (```int```): Daily deaths by COVID-19 for ages between 20 and 29 years in Brazil.

- **deaths_br_30_39** (```int```): Daily deaths by COVID-19 for ages between 30 and 39 years in Brazil.

- **deaths_br_40_49** (```int```): Daily deaths by COVID-19 for ages between 40 and 49 years in Brazil.

- **deaths_br_50_59** (```int```): Daily deaths by COVID-19 for ages between 50 and 59 years in Brazil.

- **deaths_br_over_59** (```int```): Daily deaths by COVID-19 for ages above 59 years in Brazil.

- **vaccine_first_dose_br** (```float```): Number of administered first doses of the COVID-19 vaccine in Brazil.

- **rt_br** (```float```): Instantaneous reproductive number in Brazil.

- **rt_25_br** (```float```): Lower bound of the confidence interval for the instantaneous reproductive number (2.5th percentile) in Brazil.

- **rt_975_br** (```float```): Upper bound of the confidence interval for the instantaneous reproductive number (97.5th percentile) in Brazil.

- **cases_city** (```float```): Daily cases of COVID-19 in the city.

- **deaths_city** (```int```): Daily deaths by COVID-19 in the city.

- **rt_city** (```float```): Instantaneous reproductive number in the city.

- **rt_25_city** (```float```): Lower bound of the confidence interval for the instantaneous reproductive number (2.5th percentile) in the city.

- **rt_975_city** (```float```): Upper bound of the confidence interval for the instantaneous reproductive number (97.5th percentile) in the city.

- **mobility_recreation_city** (```float```): Changes in mobility for retail & recreation locations in the city.*

- **mobility_grocery_city** (```float```): Changes in mobility for grocery & pharmacy locations in the city.*

- **mobility_parks_city** (```float```): Changes in mobility for parks in the city.*

- **mobility_transit_city** (```float```): Changes in mobility for transit stations in the city.*

- **mobility_workplace_city** (```float```): Changes in mobility for workplaces in the city.*

- **mobility_residence_city** (```float```): Changes in mobility for residential locations in the city.*
    
</span>
*Percentage change in the number of visits and length of stay relative to the baseline, defined as the median value for the corresponding day of the week during the five-week period from 3 January 2020 to 6 February 2020.

Here, 'city' in the column names is a placeholder that must be replaced with one of the Brazilian municipalities in our study: campina_grande (Campina Grande - PB), piracicaba (Piracicaba - SP), rio_branco (Rio Branco - AC), santos (Santos - SP), or sao_jose_do_rio_preto (São José do Rio Preto - SP).
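
The placeholder substitution can be sketched as a small helper that builds the column names for one municipality. The helper `city_columns`, the `templates` list, and the toy frame are hypothetical and only illustrate the naming scheme described above.

```python
import pandas as pd

# Column templates from the list above; the trailing "city" is the placeholder.
templates = ["cases_city", "deaths_city", "rt_city"]

def city_columns(city):
    """Replace the trailing 'city' placeholder with a municipality key."""
    return [t[: -len("city")] + city for t in templates]

# Toy frame with made-up values for a single municipality.
toy = pd.DataFrame({
    "date": ["2020-03-01"],
    "cases_santos": [1.0],
    "deaths_santos": [0],
    "rt_santos": [float("nan")],
})

# Keep the date plus the columns belonging to Santos.
santos = toy[["date"] + city_columns("santos")]
```

The same call with another key, such as `city_columns("rio_branco")`, yields the corresponding column names for that municipality.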

## Sources 

<a id='Sources'></a>
[1] Notifica Saúde – Boletins informativos da pandemia de coronavírus em Maringá. Available: https://www.maringa.pr.gov.br/notificasaude (2023). Accessed: 10 May 2023.

[2] Portal da Transparência – Registro Civil (Transparency Portal – Civil Registry). Available: https://transparencia.registrocivil.org.br/painel-registral/especial-covid (2023). Accessed: 10 May 2023.

[3] Google. Community Mobility Reports. Available: https://www.google.com/covid19/mobility/index.html (2023). Accessed: 10 May 2023.

[4] Prefeitura de Maringá. Legislação Municipal. Available: http://www.maringa.pr.gov.br/portais/cgm/legislacao/categoria (2023). Accessed: 10 May 2023.

[5] Oxford Covid-19 Government Response Tracker (OxCGRT). Available: https://github.com/OxCGRT/covid-policy-tracker (2022). Accessed: 10 May 2023.

[6] Roser, M., Ritchie, H., Ortiz-Ospina, E. & Rodés-Guirao, L. World Population Growth. Available: https://ourworldindata.org/world-population-growth (2013).

[7] Brasil.io – Boletins informativos e casos do coronavírus por município por dia. Available: https://brasil.io/dataset/covid19/caso (2023). Accessed: 10 May 2023.

[8] MonitoraCovid-19 – Fundação Oswaldo Cruz (Fiocruz). Available: https://bigdata-covid19.icict.fiocruz.br/ (2023). Accessed: 10 May 2023.