Skip to content

bchan01/covid-19-data-analysis

Repository files navigation

COVID-19 Data Visualization

Jupyter Notebook: https://nbviewer.jupyter.org/github/bchan01/covid-19-data-analysis/blob/master/data-analysis.ipynb?flush_cache=true

Basic visualization of COVID-19 cases in the state of Pennsylvania. The dataset is obtained from PA Department of Health COVID-19 Website through Web Scraping

map


Install Dependencies

pip install -r requirements.txt
  • Web Scraping

    • beautifulsoup4
    • lxml
  • Data Analysis

    • panda
    • matplotlib
    • numpy
    • folium
  • Jupyter Widgets: (https://ipywidgets.readthedocs.io/en/latest/index.html)

    • Option 1: Conda
      • conda install -c conda-forge ipywidgets
    • Option 2: pip
      • pip install ipywidgets
      • jupyter nbextension enable --py widgetsnbextension

Data Source

  • Perform Web Scraping to obtain daily data from PA Department of Health:
    * https://www.health.pa.gov/topics/disease/coronavirus/Pages/Archives.aspx
  • Dataset collected:
    • Negative and Positive Cases and Deaths
    • Breakdown of numner of cases by county
  • How to Extract Data
    • Run: python data_extractor_pa.py
    • CSV file produced:
      • pa_summary.csv
      • pa_county.csv
      • pa_county_geolocation.csv

Visualization

  • Jupyter NoteBook

    • Dependencies: run data_extractor_pa.py to get up-to-date pa_summary.csv and pa_county.csv dataset
    • jupyter notebook
    • open data-analysis.ipynb
  • Streamlit Interactive Visualization App (COMINT SOON!!)

About

Analyze COVID-19 from various data source

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published