info201b-2022-spring/final-projects-randyr02

Final Project Proposal

Group member names:

  • Kaia Truong, Randy Ros, Vince Qian, Aliya Ali

Domain of interest: Healthcare

  • Why are you interested in this field/domain?

    • Our group is interested in this domain because the healthcare industry around the world has changed drastically since COVID-19, and the field has received far more attention as a result. We would like to examine how the industry works and how stakeholders such as patients and staff come together to drive the system.
  • What other examples of data-driven projects have you found related to this domain (share at least 3)?

    • Differences in the healthcare system across regions of the U.S.

    • Healthcare costs in the U.S.

    • Healthcare investment and the length of hospital stays

  • What data-driven questions do you hope to answer about this domain (share at least 3)?

    • Is there a correlation between healthcare cost and length of hospital stay?

    • How does healthcare investment in minority groups differ from investment in the majority?

    • Is there a relationship between healthcare cost and nutrition?
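
As a rough illustration of how the first question could be checked, the sketch below computes a Pearson correlation coefficient between two numeric columns. The function name `pearson_r` and the sample values are hypothetical and are not taken from any of the datasets; the real analysis would use the cost and length-of-stay columns once the data is loaded.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (proportional to the covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Hypothetical values for illustration only, not real hospital records.
stay_days = [1, 2, 3, 4, 5]
total_cost = [1200, 1900, 3100, 3900, 5200]
r = pearson_r(stay_days, total_cost)
print(f"r = {r:.3f}")
```

A value of `r` near 1 or -1 would suggest a strong linear relationship, while a value near 0 would suggest little linear relationship. Correlation alone would not show that longer stays cause higher costs.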

Finding Data

  • https://www.kaggle.com/datasets/ravichaubey1506/healthcare-cost

    • The data comes from a nationwide survey of hospital costs conducted by the US Agency for Healthcare and consists of hospital records of inpatient samples. The given data is restricted to the state of Wisconsin and covers patients aged 0-17 years.

    • There are 6 columns and 500 rows.

    • This dataset can be used to assess healthcare cost in the U.S. and may help us analyze healthcare investment in minority groups.

  • https://www.kaggle.com/datasets/maheshdadhich/us-healthcare-data

    • The dataset was collected by NHANES (National Health and Nutrition Examination Survey), which combines interviews and physical examinations. 10,000 individuals are surveyed to represent the U.S. population.

    • We will be using Nutrition_US.csv in this collection. This dataset contains 52 columns and 8790 rows.

    • This dataset can help answer the healthcare cost and nutrition correlation question.

  • https://www.kaggle.com/datasets/babyoda/healthcare-investments-and-length-of-hospital-stay

    • The dataset covers OECD countries for which all data for 1990-2018 are available in the database.

    • There are 6 columns and 518 rows.

    • This dataset can help us learn more about healthcare investment and length of hospital stay, and further analyze healthcare cost.
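
The row and column counts listed for each dataset can be double-checked after download. The sketch below is one minimal way to do that with Python's standard `csv` module; the helper name `csv_shape` and the file name in the usage comment are assumptions for illustration, and it assumes each file has a single header row.

```python
import csv


def csv_shape(path):
    """Return (data_rows, columns) for a CSV file whose first row is a header."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, None)
        if header is None:
            # Empty file: no header and no data.
            return (0, 0)
        # Count non-empty data rows after the header.
        n_rows = sum(1 for row in reader if row)
    return (n_rows, len(header))


# Example usage (hypothetical local file name):
# rows, cols = csv_shape("healthcare_investment.csv")
# print(rows, cols)
```

If the counts differ from the ones listed above, the file may contain blank lines or extra header rows, or it may have been updated since the proposal was written.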

About

final-projects-randyr02 created by GitHub Classroom
