Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Descriptive info on the example data needed #12

Closed
xiaohy9 opened this issue Jun 17, 2021 · 3 comments
Closed

Descriptive info on the example data needed #12

xiaohy9 opened this issue Jun 17, 2021 · 3 comments

Comments

@xiaohy9
Copy link

xiaohy9 commented Jun 17, 2021

I found your PyHealth package a valuable resource. I am trying the test_sequence_data.ipynb notebook with example dataset. While the csv files in /datasets/mimic/y_data/ folder seems to be clear because the column names are self-explanatory, but not the ones in /datasets/mimic/x_data/ folder, which has no column names. I’ve read the readme files and online documentation, couldn’t find anything. Can you help me on this?

BTW, it would help a lot if you could add some minimal description on the data, data processing or training steps in the notebook. That would help the users a lot, because they don’t have to spend a lot of time finding the info everywhere.

@yzhao062
Copy link
Collaborator

if my memory is correct...the data is generated by https://raw.githubusercontent.com/yzhao062/PyHealth/master/examples/data_generation/dataloader_mimic_demo_parallel.py

So X includes the features below:
var_list = ['Capillary refill rate',
'Diastolic blood pressure',
'Fraction inspired oxygen',
'Glascow coma scale eye opening',
'Glascow coma scale motor response',
'Glascow coma scale total',
'Glascow coma scale verbal response',
'Glucose',
'Heart Rate',
'Height',
'Mean blood pressure',
'Oxygen saturation',
'Respiratory rate',
'Systolic blood pressure',
'Temperature',
'Weight',
'pH']

with some transformation and conversion. Sorry, we have not maintained the project since then so the support will be limited.

@xiaohy9
Copy link
Author

xiaohy9 commented Jun 18, 2021

thanks for the quick response.
Features in the files are columns, rows are time points? Are these time points evenly distributed, and what’s the time intervals? Thanks.

zzachw added a commit that referenced this issue Oct 22, 2022
merge ad hoc version v0 to main
@zzachw
Copy link
Collaborator

zzachw commented Nov 16, 2022

Thanks for your interest in PyHealth. We have released a brand-new version of PyHealth. It includes more EHR datasets, health-related tasks, and state-of-the-art models. Welcome to try it out!

@zzachw zzachw closed this as completed Nov 16, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants