Adding dataset GBSG2 to lifelines datasets #355

Merged
merged 2 commits into from
Nov 19, 2017

Conversation

klintan
Contributor

@klintan klintan commented Nov 18, 2017

The pec library in R (and some other libraries) ships GBSG2 as a sample dataset. It would be awesome to have it in lifelines so that we can compare R code with lifelines code and check that the results are similar.

I've looked through pec and the other libraries, and all of them seem to be GPL. I also googled a lot to find the license for the actual data, but couldn't find any direct reference to it. If for some reason you think this is not open data (even though it's available in several R packages), feel free to ignore this pull request and just leave a comment. In that case I will remove it myself and just keep it locally.

If it's of interest, take a quick look at the code to make sure everything looks OK :)

@CamDavidsonPilon
Owner

CamDavidsonPilon commented Nov 18, 2017

The original source looks to be

W. Sauerbrei and P. Royston (1999). Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials. Journal of the Royal Statistical Society, Series A, Volume 162(1), 71-94.

Can you add that to the doc string?

Clearly you have some R-code you would like to compare to lifelines - do you have future plans to add code for that purpose?

@klintan
Contributor Author

klintan commented Nov 18, 2017

I added that line to the docstring of the load_gbsg2() function. Did you want me to add it somewhere else as well (or perhaps you missed that one)?

Yeah, sort of. I'm trying to implement timeROC and integrated AUC in Python, so I might be able to open a pull request for those metrics if I'm successful, if that is of interest?

@CamDavidsonPilon
Owner

ah, you are correct 👍

Yeah, sort of. I'm trying to implement timeROC and integrated AUC in Python, so I might be able to open a pull request for those metrics if I'm successful, if that is of interest?

Yes, this is of interest. Looks cool.
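
For readers unfamiliar with the metric discussed above, here is a minimal sketch of a time-dependent (cumulative/dynamic) AUC in plain Python. It is not the timeROC algorithm from R and not part of lifelines: the function name `naive_cumulative_dynamic_auc` is hypothetical, the inputs are toy values rather than GBSG2 records, and subjects censored at or before the evaluation time are simply dropped, whereas timeROC corrects for censoring with inverse-probability weights.

```python
def naive_cumulative_dynamic_auc(times, events, scores, t):
    """Naive AUC at time t (hypothetical helper, no censoring correction).

    Cases are subjects with an observed event at or before t; controls are
    subjects still under observation after t. Subjects censored at or
    before t are dropped, which is a simplifying assumption.
    """
    cases = [s for T, e, s in zip(times, events, scores) if T <= t and e == 1]
    controls = [s for T, s in zip(times, scores) if T > t]
    if not cases or not controls:
        raise ValueError("need at least one case and one control at time t")

    # Count case/control pairs where the case has the higher risk score;
    # tied scores count as half a concordant pair.
    concordant = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                concordant += 1.0
            elif case_score == control_score:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


# Toy inputs for illustration only (not real data).
times = [2, 3, 5, 7, 8, 10]                 # follow-up times
events = [1, 0, 1, 1, 0, 0]                 # 1 = event observed, 0 = censored
scores = [0.9, 0.4, 0.7, 0.15, 0.2, 0.1]    # higher score = higher predicted risk

auc_at_6 = naive_cumulative_dynamic_auc(times, events, scores, 6)
auc_at_7 = naive_cumulative_dynamic_auc(times, events, scores, 7)
print(auc_at_6, auc_at_7)
```

An integrated AUC would then summarise such values over a grid of time points, for example as a weighted average; the choice of weights is left open here.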

@CamDavidsonPilon CamDavidsonPilon merged commit a52abc4 into CamDavidsonPilon:master Nov 19, 2017