Adding dataset GBSG2 to lifelines datasets #355
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
PEC library in R (and some other libraries) use GBSG2 as a sample dataset included in the packages. It would be awesome to have it in lifelines so that we can compare R-code and lifeline code so the results are similar.
I've looked through the PEC and other libraries and all of the seem to be GPL, and googled a lot to find the license for the actual data, but couldn't find any direct reference to it. If you for some reason think this is not open data (even though its available in several R packages) feel free to ignore this pull request and just leave comment. If so I will remove it myself as well and just keep it locally.
If its interesting, take a quick look on the code so everything looks ok :)