Skip to content

MarieDujardin2000/AM10_Group_3

Repository files navigation

AM10 Data visualisation - Final Group Project – Poster Presentation - Study Group 3

Dan Thompson, Xi Chen, Alvaro Romero, Mahah Sadique, Guangbin Yu and Marie Dujardin

What is your topic?

In our final group project for the Data Visualization course, we aim to explore, analyze, and tell a compelling story using various datasets related to COVID-19 in South Korea. Our goal is to create an original and informative project that sheds light on the pandemic's impact in the region.

What issues or questions are you addressing?

Some research questions that we will address are for example:

  1. How did government policies impact the spread and control of COVID-19 in South Korea? We aim to visualize policy changes over time and correlate them with infection rates.

  2. What were the major infection sources and clusters, and how did they contribute to the pandemic's spread?

  3. What was the public's awareness and interest in COVID-19 during different phases of the pandemic? We will create a keyword trend analysis to understand the public sentiment.

  4. How did demographic factors like age and gender influence the pandemic's impact? We will analyze patient data to answer this question.

  5. What was the mobility pattern of the population, and how did it correlate with infection rates? We will visualize floating population data alongside infection statistics.

What is the source of the data you will be using?

We plan to leverage the datasets available at https://www.kaggle.com/datasets/kimjihoo/coronavirusdataset for this purpose. There are 11 datasets available that we plan on using.

What statistical techniques do you think you may be using?

In addition to data visualization techniques, our project will involve preprocessing steps, including merging the different datasets. This integration will enable us to explore relationships between variables effectively.

Furthermore, we plan to employ regression analysis, for example to explain the factors influencing the incidence of COVID-19 in each region. We can do this by using the dataset "COVID-19 Infection Cases Data", which has information about the infection cases in South Korea, including the province and city. We will merge this dataset with the "Location and Statistical Data" , which has information on educational institutions, elderly population ratios, nursing home counts, and more. Using statistical regression models, we will gain a deeper understanding of the pandemic's impact on various regions in South Korea.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages