Xarray and scaling #69

brisvag · 2021-03-04T10:55:44Z

Migrated some stuff to xarray, so we now have named dimensions and dimension coordinates!
It's mostly useless (for now) for the user interface, but it makes some code MUCH easier (see coordinate reordering stuff in PointBlock).

Another change from this PR that is instead very visible from the user side: scaling based on pixel size! Now everything is automagic for mrc and star files.

Smaller things: cleans up some old code and introduces a custom ParseError that makes debugging IO operations a bit easier.

…le and automatic scaling of particles and images. Fixed some empty datablocks bugs.

brisvag · 2021-03-04T11:00:08Z

Closes #19 and fixes #67 .

alisterburt

Great stuff, will have a proper play later but happy for you to merge when you see fit!

alisterburt · 2021-03-04T12:15:21Z

peepingtom/datablocks/multiblocks/orientedpointblock.py

-        return f'{self.positions.data.shape}'
+    @property
+    def n(self):
+        return self.positions.n


so n is the name we will use for the 'row' dimension across the codebase? just checking so we're on the same page

Yes! I'm not set on it, if you have better ideas. I think it's convenient to have a quick-to-type, easily remembered thing that will work across the codebase, more or less

alisterburt · 2021-03-04T12:20:35Z

peepingtom/datablocks/multiblocks/particleblock.py

@@ -16,21 +16,13 @@ class ParticleBlock(OrientedPointBlock):
        'class_plot': ClassPlotDepictor,


ClassPlot is for plotting particles in different colours according to their class, right?

In general I think names should be a little more descriptive e.g. ClassificationResultsDepictor

I've also been thinking about removing depictor from the names of these classes, I don't think we need it, what's your feeling on that?

Yes, we can change this to clearer names. I would keep Depictor in it though, to be consistent (and since we can't remove Depictor from ParticleDepictor without being very ambiguous).

alisterburt · 2021-03-04T12:28:13Z

peepingtom/datablocks/simpleblocks/orientationblock.py

-        return data
+        dims = ['x', 'y', 'z']
+        return xr.DataArray(data, dims=['n', 'spatial1', 'spatial2'],
+                            coords=(range(len(data)), dims[:data.shape[1]], dims[:data.shape[1]]))


can you explain this coords thing? I had a quick look at the xarray docs and it's not immediately obvious to me what's going on here

coords is the xarray equivalent of index in pandas. So while an image has:

dims = ('z', 'y', 'x') coords = None # because there is no qualitative distinction between pixel 1 and pixel 2

A point block has:

dims = ('n', 'spatial') # where spatial stands for "spatial coordinates" coords = (range(0, npoints), ['x', 'y', 'z'])

I would have preferred coords over spatial, but xarray allows for attribute access to coordinates, and having it named the same as the coords attribute it would block it off (data.spatial now works).

Just to clarify: here in OrientationBlock I called them spatial1 and spatial2 for lack of a better term. Ultimately it was just a trick to automatically see if it should use xy or xyz for the unit vector and similar. I don't think it's an elegant solution, but it's a bit less clunky than what we had. Do you have better ideas?

alisterburt · 2021-03-04T12:33:32Z

peepingtom/io_/read/star/reader_functions.py

@@ -49,7 +52,7 @@ def extract_data(df, mode='3.1', name_regex=None):

        coords = df_volume[coord_headings[dim]].to_numpy(dtype=float)
        shifts = df_volume.get(shift_headings[dim][mode], pd.Series([0.0])).to_numpy()
-        px_size = df_volume.get(pixel_size_headings, pd.Series([1.0])).to_numpy()
+        px_size = df_volume.get(pixel_size_headings[mode], pd.Series([1.0])).to_numpy()


do rln 3.0 files have anything which tells you pixel size in the file? I didn't think they did

Yup, it's just in the table as rlnPixelSize rather than being in the optics as rlnImagePixelSize. Not sure why the change. There's also a rlnDetectorPixelSize or something like that, but I could not find a difference from the simple rlnPixelSize in any of my files. I guess if things break it will pop up and be clear then!

alisterburt · 2021-03-04T12:35:35Z

peepingtom/io_/write/star/star.py

-                                            axes=convention['axes'],
-                                            intrinsic=convention['intrinsic'],
-                                            right_handed_rotation=convention['right_handed_rotation'])
+                                            axes='zyz',


little bit worries about the lack of separation between star files and the conventions for angles in them but okay to leave it for now as long as everything is working!

I'm changing this file quite a bit, so we can come back to this when it's fixed. But are there cases in which relion uses different conventions?

brisvag · 2021-03-05T09:19:13Z

A thing to keep in mind: some stuff (ndim functionality, reshaping nonsense such as in TransformBlock) broke due to the use of xarray. I fixed some stuff, but others I will leave as is, since they are "legacy" code and should be either rewritten completely or moved to different places.

For now, I will remove the test where necessary (only TransformBlock has this issue).

…nd other places

codecov · 2021-03-05T09:49:18Z

Codecov Report

Merging #69 (84d49ec) into develop (bb34142) will increase coverage by 1.48%.
The diff coverage is 71.02%.

@@             Coverage Diff             @@
##           develop      #69      +/-   ##
===========================================
+ Coverage    68.51%   70.00%   +1.48%     
===========================================
  Files           85       99      +14     
  Lines         1639     1960     +321     
===========================================
+ Hits          1123     1372     +249     
- Misses         516      588      +72

Impacted Files	Coverage Δ
peepingtom/__main__.py	`0.00% <ø> (ø)`
peepingtom/gui/__init__.py	`0.00% <ø> (ø)`
peepingtom/gui/gui.py	`0.00% <ø> (ø)`
peepingtom/gui/particle_selector.py	`0.00% <0.00%> (ø)`
peepingtom/io_/utils/angle_conversion.py	`18.18% <18.18%> (ø)`
peepingtom/io_/read/read.py	`18.29% <22.72%> (-1.19%)`	⬇️
peepingtom/depictors/viewer.py	`28.12% <28.12%> (ø)`
peepingtom/depictors/pyqtgraph/plotdepictor.py	`35.00% <35.00%> (ø)`
...eepingtom/depictors/pyqtgraph/classplotdepictor.py	`38.46% <38.46%> (ø)`
...ingtom/depictors/pyqtgraph/propertyplotdepictor.py	`44.44% <44.44%> (ø)`
... and 124 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2dc2201...84d49ec. Read the comment docs.

brisvag added 3 commits March 4, 2021 10:42

removed old stuff

522857d

started migration to xarray where useful. Added parsing for pixel sca…

51968d2

…le and automatic scaling of particles and images. Fixed some empty datablocks bugs.

oopsie

aa7328c

brisvag changed the title ~~Xarray~~ Xarray and scaling Mar 4, 2021

alisterburt approved these changes Mar 4, 2021

View reviewed changes

brisvag added 5 commits March 5, 2021 10:36

transformblock is broken, for now

b8ebf5f

a few more xarray migrations, fixed consequences of xarray in tests a…

2f120d2

…nd other places

workaround to scaling bugs

226066a

automatic scaling and io updates

0bc05b0

added xarray to reqs

84d49ec

This was referenced Mar 5, 2021

Autoscaling issues #71

Closed

Transformblock #72

Closed

brisvag merged commit a958761 into brisvag:develop Mar 5, 2021

brisvag mentioned this pull request Mar 11, 2021

Rescaling data #19

Closed

brisvag mentioned this pull request May 25, 2021

Empty particleblocks (and probably other blocks) break things #67

Closed

brisvag deleted the xarray branch June 3, 2021 16:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xarray and scaling #69

Xarray and scaling #69

brisvag commented Mar 4, 2021

brisvag commented Mar 4, 2021

alisterburt left a comment

alisterburt Mar 4, 2021

brisvag Mar 4, 2021

alisterburt Mar 4, 2021

brisvag Mar 4, 2021

alisterburt Mar 4, 2021

brisvag Mar 4, 2021

brisvag Mar 4, 2021

alisterburt Mar 4, 2021

brisvag Mar 4, 2021

alisterburt Mar 4, 2021

brisvag Mar 4, 2021

brisvag commented Mar 5, 2021 •

edited

codecov bot commented Mar 5, 2021

		@@ -16,21 +16,13 @@ class ParticleBlock(OrientedPointBlock):
		'class_plot': ClassPlotDepictor,

Xarray and scaling #69

Xarray and scaling #69

Conversation

brisvag commented Mar 4, 2021

brisvag commented Mar 4, 2021

alisterburt left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brisvag commented Mar 5, 2021 • edited

codecov bot commented Mar 5, 2021

Codecov Report

brisvag commented Mar 5, 2021 •

edited