-
Clone the repo (create a GitHub personal access token first, as password authentication is no longer supported).
git clone https://<token>@github.com/Data-Analytics-Project-2021/eda
-
Initialise submodules
git submodule init
-
Fetch the data from the submodules to your local system (this will take a very long time, as it clones about 2GB of data from the submodules, including COVID-19; decide whether you need it before running this step)
git submodule update
-
You may wish to recursively clone this repository in a single step by passing the --recurse-submodules option to git clone
git clone --recurse-submodules https://<token>@github.com/Data-Analytics-Project-2021/eda
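To check whether the submodules were actually initialised and fetched, one option is to read the output of `git submodule status`, which prefixes each line with `-` (not initialised), `+` (checked-out commit differs from the index), `U` (merge conflicts) or a space (up to date). The following is a minimal sketch, not part of this repository; the function names and the sample paths `data/a`, `data/b`, `data/c` are hypothetical, and it assumes submodule paths contain no spaces.

```python
import subprocess
from typing import Dict

# Prefix characters documented for `git submodule status`.
STATES = {
    "-": "not initialised",
    "+": "checked-out commit differs from the index",
    "U": "merge conflicts",
    " ": "up to date",
}


def parse_submodule_status(output: str) -> Dict[str, str]:
    """Map each submodule path to a readable state (hypothetical helper)."""
    result = {}
    for line in output.splitlines():
        if not line.strip():
            continue
        prefix, rest = line[0], line[1:]
        parts = rest.split()
        if len(parts) < 2:
            continue  # not a "<sha> <path>" line; skip it
        path = parts[1]  # parts[0] is the commit hash
        result[path] = STATES.get(prefix, "unknown")
    return result


def submodule_status(repo_dir: str = ".") -> Dict[str, str]:
    """Run `git submodule status` in repo_dir and parse its output."""
    completed = subprocess.run(
        ["git", "submodule", "status"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_submodule_status(completed.stdout)
```

Calling `submodule_status()` from the cloned `eda` directory should report "not initialised" for any submodule that still needs `git submodule init` and `git submodule update`.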
- Data sourcing and level 1 cleaning is performed in aggregate_cases_vaccines.ipynb
- After sourcing and cleaning, the four files obtained are
  - india_cases_<last_source_date_dd-mm-yyyy>.csv - state-wise daily COVID-19 cases in India
  - india_vaccines_<last_source_date_dd-mm-yyyy>.csv - state-wise daily COVID-19 vaccinations in India
  - usa_cases_<last_source_date_dd-mm-yyyy>.csv - state-wise daily COVID-19 cases in USA
  - usa_vaccines_<last_source_date_dd-mm-yyyy>.csv - state-wise daily COVID-19 vaccinations in USA
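Because the date suffix is in dd-mm-yyyy form, file names do not sort chronologically as plain strings (for example, `15-11-2021` sorts after `01-12-2021`). If several runs have left multiple files behind, the sketch below picks the most recent file per country and dataset by parsing the date. It is an illustration only: `latest_outputs` and `PATTERN` are hypothetical names, and the pattern assumes the file names follow the four templates listed above exactly.

```python
import re
from datetime import datetime
from pathlib import Path
from typing import Dict, Iterable, Tuple

# Assumed naming scheme: <country>_<kind>_<dd-mm-yyyy>.csv
PATTERN = re.compile(r"^(india|usa)_(cases|vaccines)_(\d{2}-\d{2}-\d{4})\.csv$")


def latest_outputs(names: Iterable[str]) -> Dict[Tuple[str, str], str]:
    """Return the newest file name for each (country, kind) pair."""
    best = {}
    for name in names:
        match = PATTERN.match(Path(name).name)
        if not match:
            continue  # ignore files that do not follow the naming scheme
        country, kind, stamp = match.groups()
        try:
            day = datetime.strptime(stamp, "%d-%m-%Y").date()
        except ValueError:
            continue  # digits matched but do not form a real date
        key = (country, kind)
        if key not in best or day > best[key][0]:
            best[key] = (day, name)
    return {key: name for key, (_, name) in best.items()}
```

For example, `latest_outputs(str(p) for p in Path(".").glob("*.csv"))` would return a dictionary keyed by pairs such as `("india", "cases")`, whose values can then be passed to a CSV reader.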