reader vs converter and performance #12

LucaMarconato · 2023-01-12T18:15:16Z

@giovp we need to test whether we can use readers (like the one in your new PR) or whether, for some large datasets, we need to use converters. Dask can represent the data lazily in both cases, but operations on data loaded through a reader may not be performant.

For example, say you want to read a .ome.tiff file or rotate an image. With dask-image you can get a Dask array that lazily represents either of them in memory, construct a SpatialData object and, say, view it in napari. But the visualization will be very slow. What helps here is to first save the object to .zarr and then reinitialize the array so that it reads from disk. The visualization will then be performant.
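To make the difference concrete, here is a minimal toy model in plain Python. It is not Dask, dask-image or spatialdata code, and every name in it (`LazyRotatedImage`, `StoredImage`, `save_tiles`, `demo`, the tile size) is hypothetical. It only sketches the suspected mechanism: a lazy transformation is re-evaluated each time a viewer requests a tile, whereas after writing the tiles to a chunked on-disk store, each request becomes a plain read.

```python
import json
import tempfile
from pathlib import Path

TILE = 4  # tile edge length in pixels (arbitrary for this sketch)


class LazyRotatedImage:
    """Toy lazy image: a 90-degree clockwise rotation evaluated per tile, on demand."""

    def __init__(self, source):
        self.source = source    # square image as a list of rows
        self.compute_calls = 0  # how many times a tile had to be computed

    def tile(self, row, col):
        self.compute_calls += 1
        n = len(self.source)
        rows = range(row * TILE, (row + 1) * TILE)
        cols = range(col * TILE, (col + 1) * TILE)
        # Clockwise rotation: out[i][j] = source[n - 1 - j][i]
        return [[self.source[n - 1 - j][i] for j in cols] for i in rows]


class StoredImage:
    """Toy on-disk image: each tile is one file, loosely like a chunked store."""

    def __init__(self, directory):
        self.directory = Path(directory)

    def tile(self, row, col):
        # No transformation here, just a read of the precomputed tile.
        return json.loads((self.directory / f"{row}.{col}.json").read_text())


def save_tiles(lazy, n_tiles, directory):
    """Compute every tile once, write it to disk, and return a disk-backed image."""
    for r in range(n_tiles):
        for c in range(n_tiles):
            path = Path(directory) / f"{r}.{c}.json"
            path.write_text(json.dumps(lazy.tile(r, c)))
    return StoredImage(directory)


def demo():
    source = [[r * 8 + c for c in range(8)] for r in range(8)]
    lazy = LazyRotatedImage(source)

    # A viewer redrawing the same tile ten times recomputes it ten times.
    for _ in range(10):
        first_tile = lazy.tile(0, 0)
    lazy_views = lazy.compute_calls

    with tempfile.TemporaryDirectory() as tmp:
        stored = save_tiles(lazy, 2, tmp)  # one computation per tile
        after_save = lazy.compute_calls
        for _ in range(10):
            stored_tile = stored.tile(0, 0)  # reads only, no recomputation
        after_stored_views = lazy.compute_calls

    return {
        "lazy_views": lazy_views,
        "after_save": after_save,
        "after_stored_views": after_stored_views,
        "tiles_match": stored_tile == first_tile,
    }
```

In this sketch the one-off cost of saving (one computation per tile) is paid once, and afterwards repeated viewing no longer touches the transformation. Whether real Dask graphs behave this way for a given reader is exactly what would need to be measured.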


LucaMarconato · 2023-01-30T09:15:39Z

Now, with scverse/spatialdata#117, when we save to disk the SpatialData object is re-read, which addresses the performance problem. We just have to make the user aware that the data should be saved for better performance.

LucaMarconato closed this as completed Jan 30, 2023

LucaMarconato mentioned this issue Feb 16, 2023

Io/hot fix scverse/spatialdata#138

Merged
