Refactor nb/nhanes survival model #3395
A few suggested changes as above
Hi @CloseChoice - I was just going through the PR backlog and saw this one. Are you happy to take another look at the touchups above? Or if you'd prefer, I can finish it off if you're stuck into other stuff at the moment.
Thanks for coming back to this. Will have a look sometime this week.
@connortann I added the requested changes. The dataset seems to contain more columns now, but I limited the confusion matrix to only the 12 with the highest SHAP values.
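For reference, restricting a plot to the 12 highest-ranked features could look roughly like the sketch below. It is not the notebook's actual code: `top_k_features` is a hypothetical helper, and ranking by mean absolute SHAP value is an assumption about what "highest" means here. It uses plain Python lists so it runs without any dependencies.

```python
def top_k_features(shap_rows, feature_names, k=12):
    """Return the k feature names with the largest mean |SHAP| value.

    shap_rows: list of rows, one per sample, each holding one SHAP value
    per feature (assumed layout, same column order as feature_names).
    """
    if not shap_rows:
        raise ValueError("shap_rows must contain at least one row")

    totals = [0.0] * len(feature_names)
    for row in shap_rows:
        if len(row) != len(feature_names):
            raise ValueError("each row needs one value per feature")
        for j, value in enumerate(row):
            totals[j] += abs(value)

    # Mean absolute contribution of each feature across all samples.
    mean_abs = [total / len(shap_rows) for total in totals]

    # Feature indices sorted from most to least important.
    order = sorted(range(len(feature_names)),
                   key=lambda j: mean_abs[j], reverse=True)
    return [feature_names[j] for j in order[:k]]


# Toy data (made up for illustration, not taken from NHANES).
rows = [[0.1, -2.0, 0.5],
        [-0.2, 1.0, 0.7]]
names = ["age", "sex", "bmi"]
print(top_k_features(rows, names, k=2))
```

The returned names can then be used to subset the columns before plotting, so the figure stays readable even if the dataset gains more columns later.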
The "Exact Explainer" notebook was changed; is that intentional? It now has an exception:
Other than that, looks great! Thank you for making the changes I suggested above.
A minor observation: I notice the dependence plot of Red Blood Cells looks quite different. I wonder if the source data changed at some point: there seem to be values of red_blood_cells_unacceptable, which might indicate "bad" measurements that were originally dropped from the dataset? I think the change is fine though; I don't think we need to reproduce the original images exactly.
Good catch. I removed the changes in the exact notebook but didn't look into the changed plot. I would suspect that it has to do with changes in the underlying data but wouldn't spend more time on that. Thanks for the review ;)
Overview
Works towards #3036
Description of the changes proposed in this pull request:
Note that there are a couple more warnings now, since I use xgboost 2.1.2, which deprecates the file format we are using. But this is an issue with a linked PR and will be fixed once this PR is merged.
Checklist
[ ] Unit tests added (if fixing a bug or adding a new feature)