COVID-19-Forecasting

Introduction

This project grew out of the COVID19 Global Forecasting Kaggle competition and uses data science to forecast the spread of the Coronavirus around the world. A pandemic is a heavy topic for everyone. I wanted to contribute my data science knowledge to help uncover patterns in the spread of the Coronavirus and the important features that affect it. Hopefully my findings can help some regions take the right actions.

The techniques I am planning to use for forecasting are:

ARIMA
Seq2Seq + LSTM (Deep Learning)
XGBoost (Machine Learning)
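
As a rough illustration of the first technique in the list, here is a minimal sketch of the idea behind an ARIMA(1,1,0) model: difference the cumulative series once (the "I" part), fit a first-order autoregression to the differences (the "AR" part), then roll the forecast forward and add the differences back up. It is not the code from the notebooks. The function names (`difference`, `fit_ar1`, `forecast`) and the input series are made up for illustration, the numbers are synthetic rather than real case counts, and the coefficients are estimated with plain least squares rather than the maximum-likelihood fitting a full library such as statsmodels would use.

```python
def difference(series):
    """First difference: turns cumulative counts into day-to-day changes."""
    return [b - a for a, b in zip(series, series[1:])]


def fit_ar1(values):
    """Least-squares fit of x[t] = c + phi * x[t-1] (hypothetical helper)."""
    x, y = values[:-1], values[1:]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x)
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    phi = cov_xy / var_x
    c = mean_y - phi * mean_x
    return c, phi


def forecast(series, steps):
    """Forecast `steps` future values of the original (undifferenced) series."""
    diffs = difference(series)
    c, phi = fit_ar1(diffs)
    level, last_diff = series[-1], diffs[-1]
    predictions = []
    for _ in range(steps):
        last_diff = c + phi * last_diff   # AR(1) step on the differences
        level += last_diff                # integrate back to the original scale
        predictions.append(level)
    return predictions


# Synthetic series (NOT real data): its differences follow d[t] = 1 + 0.5 * d[t-1].
synthetic = [0.0, 1.0, 2.5, 4.25, 6.125, 8.0625]
c, phi = fit_ar1(difference(synthetic))
predictions = forecast(synthetic, steps=2)
print(c, phi, predictions)
```

Because the synthetic differences follow an exact AR(1) rule, the fit recovers that rule. Real case counts are far noisier, so choosing the model order and validating it on held-out dates matter much more there.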

Main Files

covid19 - EDA.ipynb - Notebook performing exploratory data analysis on global confirmed cases and deaths before June 10th
covid19 - ARIMA.ipynb - Notebook applying ARIMA models to forecast global confirmed cases and deaths

Data Sources

(Note: all of these data files are available in the data folder of this GitHub repository)

Kaggle: COVID19 Global Forecasting (Week 5)

train.csv
test.csv
submission.csv

JHU CSSE COVID-19 Dataset

time_series_covid19_confirmed_global.csv
time_series_covid19_deaths_global.csv
time_series_covid19_recovered_global.csv
time_series_covid19_confirmed_US.csv
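
The global JHU CSSE time series files are stored in a wide layout: one row per region, four identifying columns (`Province/State`, `Country/Region`, `Lat`, `Long`), then one column per date holding cumulative counts. Most forecasting code wants one row per region and day instead. The sketch below shows one way to do that reshaping with the standard library only. The sample rows, the place names, and the helper names `wide_to_long` and `daily_new` are invented for illustration; they are not taken from the data folder or the notebooks.

```python
import csv
import io
from datetime import datetime

# Tiny made-up sample in the wide layout of the global files (NOT real data).
SAMPLE = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20,1/24/20
,Exampleland,10.0,20.0,1,3,6
North,Otherland,30.0,40.0,0,2,2
"""

ID_COLUMNS = ("Province/State", "Country/Region", "Lat", "Long")


def wide_to_long(text):
    """Return one (country, province, date, cumulative) tuple per region and day."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        for column, value in record.items():
            if column in ID_COLUMNS:
                continue
            # Date columns look like month/day/two-digit-year, e.g. 1/22/20.
            day = datetime.strptime(column, "%m/%d/%y").date()
            rows.append(
                (record["Country/Region"], record["Province/State"], day, int(value))
            )
    return rows


def daily_new(cumulative):
    """Convert a cumulative series into daily increments."""
    return [cumulative[0]] + [b - a for a, b in zip(cumulative, cumulative[1:])]


long_rows = wide_to_long(SAMPLE)
exampleland = [row[3] for row in long_rows if row[0] == "Exampleland"]
new_cases = daily_new(exampleland)
print(long_rows[0])
print(new_cases)
```

To run this on a real file, read its contents and pass the text to `wide_to_long`. The US file carries additional identifying columns, so `ID_COLUMNS` would need to be extended before reusing the helper there.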

Olliang/COVID-19-Forecasting
