Add more MEPS versions #90

Open
kristentaytok opened this issue Aug 4, 2022 · 0 comments

Problem Statement

MEPS is an annual survey with data from 1996 through 2020 (the latest year currently available), and utilization patterns change over time (e.g., as new drugs become available). For our data challenge submission, we loaded only MEPS 2018 data. Loading MEPS data from other years and enabling our tool to create a distribution for each MEPS year would improve the data quality of the MDT. It would also let the Synthea module select the distribution based on the medication order year (e.g., when Synthea creates a med order for 2019, it can pull the MDT distribution built from MEPS 2019).

Criteria for Success

  • Load additional MEPS versions into the MDT database
  • Add a MEPS year column to the distribution calculation partitions & MDT file outputs
  • Find Synthea's date (actually an attribute called time, which appears to be in milliseconds since the 1970 Unix epoch) in its transition table, and use it to configure Synthea to pick the MDT/MEPS output for the year matching the Synthea date
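
As a rough illustration of the last two criteria, here is a minimal Python sketch of the logic. It is not the MDT's actual implementation. It assumes the Synthea time value is a Unix-epoch timestamp in milliseconds, and the function names, the record keys (`meps_year`, `medication`), and the fallback rule for years with no loaded MEPS version are all hypothetical choices made for this example.

```python
from collections import Counter, defaultdict
from datetime import datetime, timezone


def year_from_epoch_ms(time_ms):
    """Convert a millisecond epoch timestamp (assumed Synthea format) to a UTC year."""
    return datetime.fromtimestamp(time_ms / 1000, tz=timezone.utc).year


def select_meps_year(order_year, available_years):
    """Pick the MEPS year to use for a medication order.

    Assumed fallback policy: use the latest loaded year that is not after
    the order year; if the order predates every loaded year, use the earliest.
    """
    if not available_years:
        raise ValueError("no MEPS years have been loaded")
    not_later = [y for y in available_years if y <= order_year]
    return max(not_later) if not_later else min(available_years)


def distributions_by_year(records):
    """Compute medication proportions separately for each MEPS year.

    `records` is an iterable of dicts with hypothetical keys
    'meps_year' and 'medication'.
    """
    counts = defaultdict(Counter)
    for record in records:
        counts[record["meps_year"]][record["medication"]] += 1
    result = {}
    for year, counter in counts.items():
        total = sum(counter.values())
        result[year] = {med: n / total for med, n in counter.items()}
    return result
```

With MEPS 2018 through 2020 loaded, an order dated in 2019 would map to the 2019 distribution, while an order dated in 2025 would fall back to 2020 under the assumed policy. Whether falling back to the nearest year is the right behavior is a design decision this issue leaves open.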

Additional Information

MEPS data files:
https://meps.ahrq.gov/data_stats/download_data_files_results.jsp?cboDataYear=All&cboDataTypeY=2%2CHousehold+Event+File&buttonYearandDataType=Search&cboPufNumber=All&SearchTitle=Prescribed+Medicines
https://meps.ahrq.gov/mepsweb/data_stats/download_data_files_results.jsp?cboDataYear=All&cboDataTypeY=1%2CHousehold+Full+Year+File&buttonYearandDataType=Search&cboPufNumber=All&SearchTitle=Population+Characteristics
Current versions/code used in MDT database: https://github.com/coderxio/medication-diversification/blob/main/src/mdt/database.py

When we built and submitted the MDT for the data challenge, 2018 was the latest version available, and the file formats (.dat) were difficult to work with in Python. Shortly after submission, MEPS added 2019 data and started making CSV and XLS file formats available (last time I checked, MEPS had only added these formats for 2018 forward). So the final solution may need to account for these differences, depending on which files we load and which formats are available for them.
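
One way to handle the format difference is to decide the format per year before loading anything. The sketch below is only an illustration: the 2018 cutoff comes from the recollection above and should be verified against the MEPS download pages, and `FIRST_CSV_YEAR`, `preferred_format`, and `plan_loads` are hypothetical names that do not exist in `database.py`.

```python
# Assumed cutoff: CSV is believed to be available for 2018 forward only.
FIRST_CSV_YEAR = 2018


def preferred_format(year):
    """Return the file format to request for a given MEPS year."""
    return "csv" if year >= FIRST_CSV_YEAR else "dat"


def plan_loads(years):
    """Pair each requested MEPS year with the format to load it from."""
    return [(year, preferred_format(year)) for year in sorted(years)]
```

For example, `plan_loads([2019, 2016])` pairs 2016 with the .dat format and 2019 with CSV, so the loader can dispatch to the matching parser for each year.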
