Intake integration? #86

jsignell · 2018-12-11T23:15:05Z

I was just starting to work on a PIL plugin for intake to support the handling of image stacks. I was hoping to make something that takes as input: paths, file objects, url, or s3 at a minimum. The output I think would be xarray dask arrays.

Do you think this project is sturdy enough to be built on top of in that way or is it moving too quickly?

jsignell · 2018-12-12T17:35:19Z

Closing this since I just found out about dask.array.image.imread and plan to use that instead.

jakirkham · 2019-01-06T23:30:48Z

Sorry to be slow here, @jsignell.

My guess is you are running into issue ( soft-matter/pims#310 ). Though would be great if you could confirm.

The imread implementation here is a bit more efficient when it comes to reading data on the filesystem. This is because Dask needs to know the shape and type information of each image and then it actually needs to load each image later when the computation occurs. IME this was slow because Dask was loading the full image into memory to get this metadata, which we avoid here. Though would imagine this is not great when it comes to making HTTP requests either. Expect what we would really want is the ability to cache the content retrieved from those URLs to make things a bit more performant. I could be wrong about this though. Would be curious to hear your thoughts.

jsignell · 2019-01-15T14:59:55Z

Sorry if I was unclear. I hadn't run into any issues yet; I just wanted to check the status of this project. In the case I am looking at performance isn't that important. The main thing that we are interested in is maintaining the image labeling that we get from the file names. I ended up writing a new wrapper around scikit-image.io.imread so that we can pass OpenFile objects into the imread. This allows the reading of files from lots of different sources.

manugarri · 2019-04-12T14:13:38Z

@jsignell i am experiencing a somewhat similar issue (reading images in bulk from an s3 bucket to perform image classification), would you mind sharing your wrapper?

jsignell closed this as completed Dec 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intake integration? #86

Intake integration? #86

jsignell commented Dec 11, 2018

jsignell commented Dec 12, 2018

jakirkham commented Jan 6, 2019

jsignell commented Jan 15, 2019

manugarri commented Apr 12, 2019

Intake integration? #86

Intake integration? #86

Comments

jsignell commented Dec 11, 2018

jsignell commented Dec 12, 2018

jakirkham commented Jan 6, 2019

jsignell commented Jan 15, 2019

manugarri commented Apr 12, 2019