allow multiple npz key names #90

ngreenwald · 2020-05-23T00:31:37Z

Some of the old NPZ files were generated with (raw, annotated) keys. The newer NPZs have been standardized to to (X, y). This PR modifies the toolbox to accept either input so that manual renaming isn't necessary

geneva-miller

Is this the only util function that may need to load data? Is X ever loaded? This looks good to me, I just want to make sure we're not missing other cases where the array name matters.

ngreenwald · 2020-05-28T22:35:11Z

Yes, currently this is the only function that loads NPZ files, and currently we don't load the X data from the corrected NPZs, only the y.

caliban_toolbox/utils/io_utils.py

ngreenwald · 2020-06-04T20:02:07Z

Any idea why coveralls keeps seeing random fluctuations in coverage for the data_loader?

MekWarrior · 2020-06-05T18:30:13Z

The tests for the data_loader chooses randomly from a list of possible data types/specs (e.g. "all", "None", "HEK293", etc" at multiple levels of the ontology). It's likely that this is the source of the stochasticity - something that could be addressed with better tests.

willgraf

👍

allow y or annotated labels

b4c48b8

ngreenwald requested review from MekWarrior and geneva-miller May 23, 2020 00:31

geneva-miller reviewed May 28, 2020

View reviewed changes

ngreenwald requested a review from geneva-miller June 4, 2020 01:00

geneva-miller approved these changes Jun 4, 2020

View reviewed changes

ngreenwald requested a review from willgraf June 4, 2020 17:48

willgraf requested changes Jun 4, 2020

View reviewed changes

caliban_toolbox/utils/io_utils.py Outdated Show resolved Hide resolved

simlify name checking

f1aff41

ngreenwald requested a review from willgraf June 4, 2020 20:01

willgraf approved these changes Jun 5, 2020

View reviewed changes

MekWarrior merged commit 9aae72a into master Jun 6, 2020

MekWarrior deleted the npz_key_name branch June 6, 2020 06:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

allow multiple npz key names #90

allow multiple npz key names #90

ngreenwald commented May 23, 2020

geneva-miller left a comment

ngreenwald commented May 28, 2020

ngreenwald commented Jun 4, 2020

MekWarrior commented Jun 5, 2020

willgraf left a comment

allow multiple npz key names #90

allow multiple npz key names #90

Conversation

ngreenwald commented May 23, 2020

geneva-miller left a comment

Choose a reason for hiding this comment

ngreenwald commented May 28, 2020

ngreenwald commented Jun 4, 2020

MekWarrior commented Jun 5, 2020

willgraf left a comment

Choose a reason for hiding this comment