This repository has been archived by the owner on May 7, 2020. It is now read-only.

Add "preload" option to Polyps912 dataset #3

Merged: 4 commits into fvisin:master on Jul 4, 2017

Conversation

@lamblin (Contributor) commented May 26, 2017

The total amount of memory is < 1 GB, I think, so preloading should be reasonable. It is still off by default.
I'm not sure what to add as a test.
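
For readers unfamiliar with the feature, the general idea behind such a flag is sketched below. This is illustrative only: the class, attribute, and method names are invented for the example and are not the PR's actual implementation.

# Standalone sketch of the idea behind a `preload` flag
# (names invented for illustration; not the PR's code).
import os

from skimage import io


class PreloadableDataset(object):
    def __init__(self, image_path, filenames, preload=False):
        self.image_path = image_path
        self.filenames = filenames
        self.preload = preload
        if self.preload:
            # Read every image once, up front, and keep it in memory.
            self._cache = {name: self._read(name) for name in self.filenames}

    def _read(self, img_name):
        # Same kind of call as in the PR: read one .bmp image from disk.
        return io.imread(os.path.join(self.image_path, img_name + '.bmp'))

    def load_sequence(self, sequence):
        # Serve images from memory when preloaded, otherwise hit the disk.
        if self.preload:
            return [self._cache[name] for name in sequence]
        return [self._read(name) for name in sequence]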

@fvisin (Owner) left a comment

Thank you for the PR, I like the idea of having a preload option to keep the dataset in memory.

# Build a filename -> index map and read each image from disk once.
image_name_to_idx = {}
for idx, img_name in enumerate(self.filenames):
    image_name_to_idx[img_name] = idx
    img = io.imread(os.path.join(self.image_path, img_name + ".bmp"))
@fvisin (Owner) commented on this snippet:

Can you move the code that loads images and masks (L119-L127) into a separate function _load_image(image_batch, mask_batch, filename_batch, img_name, prefix=None) and call that function from here and from load_sequence? This way the common code doesn't get duplicated.

As a more general step, I think it might be worthwhile to do the same for all the datasets and have a preload flag in parallel_loader.py, but I don't know if you feel like going that far with this PR or not.
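
For illustration, the suggested helper could take roughly the following shape. It is a sketch only: the io.imread call, the ".bmp" extension, and the argument names come from the snippets in this PR, while the mask handling and the use of prefix as a directory are assumptions.

# Sketch of the suggested shared helper (mask path and `prefix` semantics
# are assumptions, not the PR's actual code).
import os

from skimage import io


def _load_image(image_batch, mask_batch, filename_batch, img_name, prefix=None):
    prefix = prefix if prefix is not None else '.'
    # Load one image and its segmentation mask from disk.
    img = io.imread(os.path.join(prefix, 'images', img_name + '.bmp'))
    mask = io.imread(os.path.join(prefix, 'masks', img_name + '.bmp'))
    # Add the pair (and its filename) to the running minibatch lists.
    image_batch.append(img)
    mask_batch.append(mask)
    filename_batch.append(img_name)
    return image_batch, mask_batch, filename_batch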

@lamblin (Contributor, Author) replied:

Sure, I'll refactor the loading code for that dataset.
I'm not familiar enough with the rest of the code to have a good sense of what can be refactored in parallel_loader.py, and what needs to be done in each dataset, and unfortunately I will not have time to dive into that soon.

@lamblin (Contributor, Author) commented May 31, 2017

Updated.

@fvisin (Owner) left a comment

Apart from a small thing, LGTM. Once you are done, can you please run the test at the end of the file (I know, unfortunately it's not a very formal test) just to double-check that it still works? Once that's out of the way we can merge!

Thank you again for the PR!

# Add to minibatch
image_batch.append(img)
mask_batch.append(mask)
filename_batch.append(img_name)
@fvisin (Owner) commented on this snippet:

You will still need to append img_name to filename_batch.

@lamblin (Contributor, Author) commented Jun 20, 2017

Sorry about the delay, I'll update.

@lamblin (Contributor, Author) commented Jun 20, 2017

After the last fixes, running the test:

$ python dataset_loaders/images/polyps912.py
N classes: 3
Void label: [2]
Train n_images: 547, batch_size: 10, n_batches: 55
Validation n_images: 183, batch_size: 1, n_batches: 183
Test n_images: 182, batch_size: 1, n_batches: 182
Minibatch 0 time: 4.83353710175 (4.83353710175)
Minibatch 1 time: 0.475756883621 (5.30929398537)
Minibatch 2 time: 0.469623088837 (5.7789170742)
[...]
Minibatch 54 time: 0.298352003098 (29.8576231003)

If I change the test to use preload=True:

Minibatch 0 time: 0.1208589077 (0.1208589077)
Minibatch 1 time: 0.115332841873 (0.244853019714)
Minibatch 2 time: 0.114579200745 (0.359432220459)
[...]
Minibatch 54 time: 0.0719630718231 (5.77815103531)

Should I also commit that switch to test with preload=True? Or even test both in the same function?
I guess I could time preloading as well.

@fvisin (Owner) commented Jun 22, 2017

Thanks for updating the PR!

> Should I also commit that switch to test with preload=True?

Sure, why not? Thanks.

> I guess I could time preloading as well.

The best test here would be to wrap the _load_image function in the test with some code to make sure it's called only len(self.filenames) times. If you have some time to give it a try it would be great, but if not just commit the switch and we can merge.

Thanks for the PR! :)
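
A call-counting wrapper along those lines might look roughly like the sketch below. Everything here is an assumption made for illustration: the Polyps912 class name and constructor arguments, the nbatches/next() iteration API, and the placement of _load_image as a module-level function.

# Sketch of a call-counting check for _load_image (all names assumed).
import functools

import dataset_loaders.images.polyps912 as polyps912


def count_calls(fn):
    # Wrap fn so that every call increments wrapper.n_calls.
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        wrapper.n_calls += 1
        return fn(*args, **kwargs)
    wrapper.n_calls = 0
    return wrapper


original = polyps912._load_image
polyps912._load_image = count_calls(original)
try:
    train = polyps912.Polyps912(which_set='train', preload=True)
    for _ in range(train.nbatches):
        train.next()
    # With preload=True, each file should be read exactly once.
    assert polyps912._load_image.n_calls == len(train.filenames)
finally:
    polyps912._load_image = original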

@fvisin force-pushed the master branch 2 times, most recently from c6a8d70 to ff0bbfe on June 22, 2017 18:30
@lamblin (Contributor, Author) commented Jun 22, 2017

I added call counts for _load_image, and they seem to be correct.
I tried locally with max_epochs = 2 as well.

@fvisin merged commit 29451cb into fvisin:master on Jul 4, 2017
@fvisin (Owner) commented Jul 4, 2017

Thank you for the test, everything LGTM: merged!
