BF: deterministic order of slice_time deduction, warning if multiple match #647

yarikoptic · 2018-07-12T17:09:02Z

Initially hit while trying to make dcmstack python3 compatible: moloney/dcmstack#61
What I found that dcmstack test fails due to slice_order not matching original one (4) on some runs under Python 3. But the issue is generic -- reliance on order of keys in dict, and ignoring possible multiple "matches". It was triggered by dcmstack since its test image just has 2 slices so multiple slice orders would match.

I've looked into the logic in nibabel, and my proposed solution is just a solution:

I think ideally Recoder class which stores those mappings starts to use OrderedDict to maintain order in which choices are specified, and then code should explore in that order. I was scared to touch Recorder class since it relies on self.__dict__ assignments
so I just sorted choices for the slice orders based on their expanded names (since probably original order is gone by then) and in reverse so sequentials go first
I issue a warning now that multiple choices match. I am not sure how good that is, i.e. may be the warning is not really "due" in some cases? but imho it is also bad to just choose "some" without a clear idea which one is the "correct one"

I would be interested to hear what others think about this issue, and how likely it affected anyone's real data

…match

coveralls · 2018-07-12T17:51:34Z

Coverage increased (+0.005%) to 91.794% when pulling d756751 on yarikoptic:bf-sliceorder into 2a127cc on nipy:master.

yarikoptic · 2018-07-12T19:01:18Z

hm, appveyor failures on python 3.5 seems to not be related

matthew-brett · 2018-07-12T19:13:58Z

Would you mind adding a test that triggers the failure?

codecov-io · 2018-07-13T02:04:01Z

Codecov Report

Merging #647 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #647      +/-   ##
==========================================
+ Coverage   88.81%   88.82%   +<.01%     
==========================================
  Files          92       92              
  Lines       11278    11285       +7     
  Branches     1848     1850       +2     
==========================================
+ Hits        10017    10024       +7     
  Misses        926      926              
  Partials      335      335

Impacted Files	Coverage Δ
nibabel/volumeutils.py	`92.78% <100%> (+0.02%)`	⬆️
nibabel/nifti1.py	`91.22% <100%> (+0.06%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2a127cc...d756751. Read the comment docs.

…ultiple available

yarikoptic · 2018-07-13T02:05:36Z

added some basic test for guaranteeing that it is always the same, see 244c8cf (although I didn't check if it would be triggered without the fix, I assume it might/would though ;-))

yarikoptic · 2018-07-16T18:28:52Z

@matthew-brett @effigies - but what do you think in general, either my solution would suffice or it should be more thorough, e.g. relying on the original order of those labels in the structure? we better fix it once instead of making output inconsistent across versions if we keep fixing it up incrementally ;-)

effigies · 2018-07-17T20:58:36Z

My inclination would be to set a canonical ordering, rather than depend on sorting by name, and insertion order is the most intuitive way to impose this order. This preference is due to the general move in Python to ordered dictionaries, as well as lexicographical order being an artificial (and potentially annoying) constraint.

Not a strongly-held opinion though. Happy to be argued out.

effigies · 2018-07-17T15:21:39Z

nibabel/tests/test_nifti1.py

+        hdr2.set_dim_info(slice=2)
+        hdr2.set_slice_duration(0.1)
+        hdr2.set_data_shape((1, 1, 2))
+        hdr2.set_slice_times([0.1, 0])  # will generate warning that multiple match


Want to check that the warning is generated? e.g.

with clear_and_catch_warnings() as w: hdr2.set_slice_times([0.1, 0]) assert len(w) == 1

yarikoptic · 2018-07-17T21:06:46Z

Thank you @effigies ! Do you see an easy generic way to make those Recorders to use ordered dict without duplicating storage etc?

effigies · 2018-07-17T21:22:50Z

I see a couple options:

Change the default map_maker to OrderedDict:

nibabel/nibabel/volumeutils.py

Line 81 in 2a127cc

def __init__(self, codes, fields=('code',), map_maker=dict):

Change the specific map_maker for slice_order_codes:

nibabel/nibabel/nifti1.py

Lines 143 to 150 in 2a127cc

    
           slice_order_codes = Recoder((  # code, label 
        
               (0, 'unknown'), 
        
               (1, 'sequential increasing', 'seq inc'), 
        
               (2, 'sequential decreasing', 'seq dec'), 
        
               (3, 'alternating increasing', 'alt inc'), 
        
               (4, 'alternating decreasing', 'alt dec'), 
        
               (5, 'alternating increasing 2', 'alt inc 2'), 
        
               (6, 'alternating decreasing 2', 'alt dec 2')), fields=('code', 'label'))

Becomes:

slice_order_codes = Recoder((  # code, label
    (0, 'unknown'),
    (1, 'sequential increasing', 'seq inc'),
    (2, 'sequential decreasing', 'seq dec'),
    (3, 'alternating increasing', 'alt inc'),
    (4, 'alternating decreasing', 'alt dec'),
    (5, 'alternating increasing 2', 'alt inc 2'),
    (6, 'alternating decreasing 2', 'alt dec 2')),
    fields=('code', 'label'), map_maker=OrderedDict)

Not sure if this resolves your duplicated storage concern, as I'm not quite sure what that concern is. These seem like pretty small dictionaries, anyway.

This should allow for consistent ordering of items in the Recoder

yarikoptic · 2018-07-18T19:22:04Z

Thank you @effigies ! went 1. route, had to add OrderedSet construct, seems to work and IMHO should be "better" ;-) Hopefully noone relied on doctesting output to be a set ;-)

effigies · 2018-07-19T18:00:43Z

Pre-release tests are seg-faulting, but otherwise this LGTM.

effigies · 2018-07-20T03:09:55Z

May be getting bitten by OpenMathLib/OpenBLAS#1641. May be worth reporting upstream, probably numpy, but I haven't dug any further yet.

effigies · 2018-07-20T14:21:54Z

It seems that in numpy/numpy#11551 there were OpenBLAS-based build issues, but it was decided that this wasn't something that could be fixed in numpy, is that right @matthew-brett?

I'm inclined to go ahead and merge, and accept failing --pre tests for now.

effigies · 2018-07-21T13:35:16Z

Okay, looks like the broken wheels were rebuilt. I'm 👍 for merge.

@matthew-brett, any further comments?

effigies · 2018-07-23T14:08:25Z

Thanks @yarikoptic.

BF: deterministic order of slice_time deduction, warning if multiple …

347eef9

…match

yarikoptic mentioned this pull request Jul 12, 2018

Python3 compatibility -- almost there (replace for #47) moloney/dcmstack#61

Merged

DOC: minor typo

7ba14b1

yarikoptic added the bug label Jul 12, 2018

ENH(TST): basic test for consistent choice of slice_code in case of m…

244c8cf

…ultiple available

yarikoptic force-pushed the bf-sliceorder branch from ae2f946 to 244c8cf Compare July 13, 2018 02:05

effigies reviewed Jul 17, 2018

View reviewed changes

yarikoptic added 2 commits July 18, 2018 15:20

RF+NF: Use OrderedDict and OrderedSet in Recoder

748ce00

This should allow for consistent ordering of items in the Recoder

ENH(TST): test that we issue a warning upon multiple matches

a1400bd

TEST: Ensure warning isn't suppressed

d756751

effigies merged commit bb8c6fa into nipy:master Jul 23, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BF: deterministic order of slice_time deduction, warning if multiple match #647

BF: deterministic order of slice_time deduction, warning if multiple match #647

yarikoptic commented Jul 12, 2018

coveralls commented Jul 12, 2018 •

edited

Loading

yarikoptic commented Jul 12, 2018

matthew-brett commented Jul 12, 2018

codecov-io commented Jul 13, 2018 •

edited

Loading

yarikoptic commented Jul 13, 2018 •

edited

Loading

yarikoptic commented Jul 16, 2018

effigies commented Jul 17, 2018

effigies Jul 17, 2018

yarikoptic commented Jul 17, 2018

effigies commented Jul 17, 2018 •

edited

Loading

yarikoptic commented Jul 18, 2018

effigies commented Jul 19, 2018

effigies commented Jul 20, 2018

effigies commented Jul 20, 2018

effigies commented Jul 21, 2018

effigies commented Jul 23, 2018

BF: deterministic order of slice_time deduction, warning if multiple match #647

BF: deterministic order of slice_time deduction, warning if multiple match #647

Conversation

yarikoptic commented Jul 12, 2018

coveralls commented Jul 12, 2018 • edited Loading

yarikoptic commented Jul 12, 2018

matthew-brett commented Jul 12, 2018

codecov-io commented Jul 13, 2018 • edited Loading

Codecov Report

yarikoptic commented Jul 13, 2018 • edited Loading

yarikoptic commented Jul 16, 2018

effigies commented Jul 17, 2018

effigies Jul 17, 2018

Choose a reason for hiding this comment

yarikoptic commented Jul 17, 2018

effigies commented Jul 17, 2018 • edited Loading

yarikoptic commented Jul 18, 2018

effigies commented Jul 19, 2018

effigies commented Jul 20, 2018

effigies commented Jul 20, 2018

effigies commented Jul 21, 2018

effigies commented Jul 23, 2018

coveralls commented Jul 12, 2018 •

edited

Loading

codecov-io commented Jul 13, 2018 •

edited

Loading

yarikoptic commented Jul 13, 2018 •

edited

Loading

effigies commented Jul 17, 2018 •

edited

Loading