Added option to optionally compute channel statistics #66

lossyrob · 2017-07-07T17:09:14Z

If you want to use the generators for tasks that do not need to normalize or un-normalize the images, the requirement to compute channel statistics is burdensome (it means downloading either the correct precomputed channel statistics or the imagery). This PR adds an option that skips computing the channel statistics when the task does not need them.
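As a rough illustration of the kind of option described above, here is a minimal sketch. The class name `ImageGenerator`, the `compute_channel_stats` flag, and the pixel-list image representation are all assumptions made for this example; they are not taken from the PR's diff.

```python
from statistics import mean, pstdev


class ImageGenerator:
    """Hypothetical generator; names are illustrative, not the project's API."""

    def __init__(self, images, compute_channel_stats=True):
        # Each image is a list of pixels; each pixel is a tuple of channel values.
        self.images = images
        # Only pay for the statistics pass when the task needs normalization.
        self.channel_stats = (
            self._compute_channel_stats() if compute_channel_stats else None
        )

    def _compute_channel_stats(self):
        # Pool the pixels of every image, then regroup the values by channel.
        pixels = [pixel for image in self.images for pixel in image]
        channels = list(zip(*pixels))
        means = [mean(channel) for channel in channels]
        stds = [pstdev(channel) for channel in channels]
        return means, stds

    def normalize(self, image):
        # Fail clearly if normalization is requested without statistics.
        if self.channel_stats is None:
            raise ValueError("channel statistics were not computed")
        means, stds = self.channel_stats
        return [
            tuple((v - m) / s for v, m, s in zip(pixel, means, stds))
            for pixel in image
        ]
```

With the flag off, constructing the generator never touches the pixel data, so a task that only needs file listings or raw images would skip the statistics pass entirely.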

lewfish · 2017-07-07T18:45:29Z

I can see that there are times when we don't actually need the images to be normalized, or to even iterate over images, but why is it a burden to download the channel_stats file if that happens automatically anyway and is a tiny file?

lossyrob · 2017-07-07T18:57:56Z

It's unclear what the codepath is, but when I don't set this option, my current notebook tries to calculate the channel statistics from the images.

I described downloading the file as a burden before I realized the data generator does that automatically. If the download happened in a separate step, then it wouldn't have to download the image TIFFs either. I think this speaks a bit to the fact that the data generator holds a lot of responsibility, and if it were split out into a file index generator and something that actually loaded and normalized the images, I could use the former class only and avoid any unnecessary downloads.
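The split suggested in this comment could look roughly like the sketch below. `FileIndex`, `ImageLoader`, and the `fetch` and `normalize` callables are hypothetical names chosen for illustration; nothing here reflects the project's actual classes.

```python
class FileIndex:
    """Hypothetical index: knows which files exist, performs no I/O."""

    def __init__(self, file_names):
        self.file_names = list(file_names)

    def __iter__(self):
        return iter(self.file_names)

    def __len__(self):
        return len(self.file_names)


class ImageLoader:
    """Hypothetical loader: fetches and optionally normalizes indexed files."""

    def __init__(self, index, fetch, normalize=None):
        self.index = index
        # `fetch` is an assumed callable that downloads/reads one file by name.
        self.fetch = fetch
        # `normalize` is an assumed callable applied to each loaded image.
        self.normalize = normalize

    def __iter__(self):
        for name in self.index:
            image = self.fetch(name)
            if self.normalize is not None:
                image = self.normalize(image)
            yield name, image
```

A task that only needs the list of files would iterate over `FileIndex` alone, so no download or statistics computation is triggered; only code that wraps the index in an `ImageLoader` pays for fetching.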

lewfish · 2017-07-07T21:33:03Z

> I think this speaks a bit to the fact that the data generator holds a lot of responsibility, and if it were split out into a file index generator and something that actually loaded and normalized the images, I could use the former class only and avoid any unnecessary downloads.

True, it could use refactoring. So, since it happens automatically, do you still need the functionality in this PR?

lossyrob · 2017-07-07T22:25:55Z

The problem was with setting the proper environment variables so that the Jupyter notebook could use the right S3 path. Also, when downloads fail, the code continues as if they had succeeded.

These parts of the code need general refactoring, like you mentioned; this would be a band-aid where more work is needed, so I'll close this out.
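One way to address the silent download failure mentioned above is to check the result explicitly and raise. The sketch below is only an illustration: `ensure_local`, `DownloadError`, and the `download` callable are hypothetical and do not correspond to the project's code or to any S3 client API.

```python
import os


class DownloadError(RuntimeError):
    """Raised when a required file could not be made available locally."""


def ensure_local(local_path, download):
    """Return local_path, downloading it first if it is missing.

    `download` is an assumed callable that tries to write `local_path`.
    """
    if os.path.exists(local_path):
        # Already present, so no download is attempted.
        return local_path
    try:
        download(local_path)
    except Exception as exc:
        # Surface the failure instead of continuing as if it had succeeded.
        raise DownloadError(f"download failed for {local_path}") from exc
    if not os.path.exists(local_path):
        # Covers downloaders that return normally without writing the file.
        raise DownloadError(f"{local_path} is missing after download")
    return local_path
```

Checking for the file after the call guards against a downloader that returns normally without producing anything, which matches the "continues as if it had succeeded" symptom.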

Added option to optionally compute channel statistics

3f19bbb

lossyrob requested a review from lewfish July 7, 2017 17:09

lossyrob closed this Jul 7, 2017

lossyrob mentioned this pull request Jul 7, 2017

Bad behavior when generators try to download files on local machines #68

Closed

lewfish deleted the re/avoid-unneeded-channel-stats branch July 25, 2017 14:18

lewfish restored the re/avoid-unneeded-channel-stats branch July 25, 2017 14:19

lewfish deleted the re/avoid-unneeded-channel-stats branch August 30, 2017 17:07
