RecoBundles - recognition of bundles #1443

Garyfallidis · 2018-03-05T03:20:13Z

This PR implements the super 🔥🔥🔥🔥 paper:

Garyfallidis et al. Recognition of white matter bundles using local and global streamline-based registration and clustering NeuroImage, 2017

https://www.sciencedirect.com/science/article/pii/S1053811917305839

This PR is ready for reviews.

I would like that first we merge #1379 so I can move from using directly the ArraySequence to using the Streamlines class. #1379 only needs a small coverage boost.

Some comments on the methods of this PR.

RecoBundles extracts bundles from a given tractogram using model bundles from another tractogram.
Therefore it is important to make sure that the two tractograms are registered first. So, what we usually do is apply the remarkably robust streamline registration first (see function whole_brain_slr) and then start RecoBundles. This strategy can be very useful with clinical datasets where image-based registration is uncertain. Of course you are happy to use other methods for registration too.

https://www.ncbi.nlm.nih.gov/pubmed/25987367

After this PR is merged will be working on providing an easy to use workflow and with access to an atlas of many bundles. And build the tutorial. Nonetheless, be happy to start using this and do please cite us gracefully.

Remark; If the two tractograms are already registered well with nonrigid registration make sure you disable the local_slr parameter as it will be probably an overkill.

Oh and Welcome to the Matrix! :) This PR brings infinite possibilities in the game. More cool additions and enhancements coming soon.

@frheault after this is merged make a PR for the centroid-based metric directly to the DIPY master. I am talking about your previous work here https://github.com/Garyfallidis/dipy/pull/50/files

skoudoro

Travis complain but I can not reproduce this error on windows... all tests pass

skoudoro · 2018-03-08T15:58:08Z

dipy/align/bundlemin.pyx

+
+    Parameters
+    -----------
+    static : array


The variable name static does not match with input parameter stat

skoudoro · 2018-03-08T15:58:21Z

dipy/align/bundlemin.pyx

+    -----------
+    static : array
+        Static streamlines
+    moving : array


The variable name moving does not match with input parameter mov

skoudoro · 2018-03-08T16:01:34Z

dipy/align/bundlemin.pyx

+    Notes
+    -----
+    The difference with ``_bundle_minimum_distance`` is that we sum the
+    minimum values only for the static. Therefore, this is assymetric.


typo: asymmetric

codecov-io · 2018-03-09T00:00:52Z

Codecov Report

Merging #1443 into master will increase coverage by 0.03%.
The diff coverage is 88.18%.

@@            Coverage Diff             @@
##           master    #1443      +/-   ##
==========================================
+ Coverage   87.49%   87.52%   +0.03%     
==========================================
  Files         241      244       +3     
  Lines       30789    31189     +400     
  Branches     3322     3426     +104     
==========================================
+ Hits        26939    27299     +360     
- Misses       3074     3079       +5     
- Partials      776      811      +35

Impacted Files	Coverage Δ
dipy/tracking/streamline.py	`96.03% <ø> (+0.2%)`	⬆️
dipy/tracking/utils.py	`88.57% <ø> (ø)`	⬆️
dipy/segment/clustering.py	`94.63% <100%> (+0.97%)`	⬆️
dipy/align/tests/test_streamlinear.py	`97.74% <100%> (ø)`	⬆️
dipy/align/streamlinear.py	`86.85% <75.69%> (-9.24%)`	⬇️
dipy/segment/bundles.py	`90.21% <90.21%> (ø)`
dipy/align/tests/test_whole_brain_slr.py	`93.75% <93.75%> (ø)`
dipy/segment/tests/test_qbx.py	`97.12% <96.15%> (+0.19%)`	⬆️
dipy/segment/tests/test_rb.py	`96.87% <96.87%> (ø)`
dipy/core/gradients.py	`92.1% <0%> (-5.27%)`	⬇️
... and 54 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f6af802...72fcf7f. Read the comment docs.

…how we set set_printoptions back to default

Garyfallidis · 2018-03-14T14:51:40Z

@kesshijordan it would be great if you can have a look at this PR. Let me know what you think. @MarcCote please have a look at the qbx_merge function.

kesshijordan · 2018-03-14T17:59:50Z

Thanks, @Garyfallidis ! I've been looking forward to this. I'm working on pyAFQ (@arokem) and think this approach would be a great way to clean up the results.

I made some atlases (calculated centroids from quickbundles and streamline registered a series of manually segmented healthy control tracks) that would probably work well for this.

Garyfallidis · 2018-03-14T18:22:16Z

Great strategy! Cool @kesshijordan, give it a go and let me know if you have questions. Happy to help :)

Garyfallidis · 2018-03-30T01:36:34Z

@kesshijordan and @BramshQamar please provide your review. It would be great to have this merged for this release.

MarcCote

Adde minor suggestions to qbx_merge. Otherwise, looks good to me.

MarcCote · 2018-04-05T01:24:24Z

dipy/segment/clustering.py

+    len_s = len(streamlines)
+    if select_randomly is None:
+        select_randomly = len_s
+    indices = np.random.choice(len_s, min(select_randomly, len_s),


Personally, I prefer using a dedicated random generator instead of relying on np.random. It gives you more control for reproducibility.

I would add a new optional argument rng=None to the function and do

indices = np.arange(len(streamlines)) if select_randomly: rng = np.random.RandomState(None) if rng is None else rng rng.shuffle(indices)

Just a note. I personally almost always replace the pattern x = y if x is None else x by x = x or y

rng = rng or np.random.RandomState(None)

I know that this could be a trap because False is not the same as None but some programmers prefer the or trick.

MarcCote · 2018-04-05T01:26:22Z

dipy/segment/clustering.py

+
+    final_level = len(thresholds)
+
+    qbx_ordering_final = np.random.choice(


You could reuse the rng from above here instead of relying on the global RandomState.

MarcCote · 2018-04-05T01:31:33Z

dipy/segment/clustering.py

+
+        print(' Duration %0.3f sec. \n' % (time() - t1, ))
+
+    return merged_cluster_map


Missing a newline.

kesshijordan · 2018-04-05T17:00:37Z

@Garyfallidis I think it's worth having a basic tutorial with a quick demo/guide for parameter tuning in the same release as Recobundles. I would imagine that when it's released people will give it a quick try to see if it works on their data as well as the examples in the publication. You want them to know which knobs to turn when they do; the tuning isn't necessarily intuitive (it made a huge difference in how well it worked on my data when I got feedback from you on tuning). It would help make a good first impression.

BramshQamar · 2018-04-05T21:30:15Z

dipy/align/streamlinear.py

+
+# In essence whole_brain_slr can be thought as a combination of
+# SLF and QuickBundles
+whole_brain_slr = slr_with_qb


whole_brain_slr and slr_with_qb basically have same implementation?

Yes, the same function can be used for brain tractograms and other tractograms. In the paper I called it whole_brain_slr but then realized that the function can be more generic. Adding some more documentation to explain.

nilgoyette · 2018-04-06T13:20:47Z

dipy/segment/clustering.py

+    len_s = len(streamlines)
+    if select_randomly is None:
+        select_randomly = len_s
+    indices = np.random.choice(len_s, min(select_randomly, len_s),


Just a note. I personally almost always replace the pattern x = y if x is None else x by x = x or y

rng = rng or np.random.RandomState(None)

I know that this could be a trap because False is not the same as None but some programmers prefer the or trick.

nilgoyette · 2018-04-06T13:21:49Z

dipy/align/bundlemin.pyx

+        double inf = np.finfo('f8').max
+        double dist=0
+        double * min_j
+        # double * min_i


Could you please clean all the commented code in this function?

nilgoyette · 2018-04-06T13:25:37Z

dipy/segment/tests/test_rb.py

+
+
+def test_rb_no_neighb():
+    # what if neighbors are found? No recognition


Is this a typo? What if NO neighbors are found?

nilgoyette · 2018-04-06T13:37:28Z

dipy/align/streamlinear.py


 MAX_DIST = 1e10
 LOG_MAX_DIST = np.log(MAX_DIST)

+DEFAULT_BOUNDS = [(-45, 45), (-45, 45), (-45, 45),
+                  (-30, 30), (-30, 30), (-30, 30),


Are you sure it's 45, 30, and not 30, 45? I ask because I have seen another version with translation (-30, 30) and rotation (-45, 45) and I believe it makes more sense that way.

EDIT: Oh, I see that you have another set of bounds below :) Can you explain why it's different from this set that you jsut defined?

nilgoyette · 2018-04-06T14:17:27Z

dipy/segment/bundles.py

+            clustering.
+        """
+
+        self.reduction_thr = reduction_thr


I have been working with this codebase (a really similar one from François Rheault) in the last weeks and please, for the love of gods, don't do that! Stop putting mutable stuff in self like a madman. It's so much harder to debug! I would know, I did it! I took me days to debug instead of hours.

I'm not talking only about reduction_thr. It wouldn't be a real problem if it was only for this variable. The problem happen because you have reduction_thr, model_bundle, pruned_streamlines, nb_pruned_streamlines, transf_streamlines, transf_matrix, labels, model_clust_thr, model_cluster_map, model_centroids, nb_model_centroids, centroid_matrix, neighb_streamlines, neighb_clusters, neighb_centroids, neighb_indices, nb_neighb_streamlines, slr_bmd, slr_iterations, slr_initial_matrix, slr_final_matrix, slr_xopt, rtransf_cluster_map, rtransf_centroids, nb_rtransf_centroids, pruning_matrix, pruned_indices and labeled_streamlines that are not in self at creation and are modified at random places and times. I may or may not have missed some mutable variables, so there may be more.

Some of them are only used in one function. Some of them are used in many functions but it could be easily be avoided. Some of them are simply the len of another. Maybe some of them are never used, I didn't check.

I will not write an essay here about why a programmer should never do that, I'll simply say that it's super hard to follow and to debug this kind of code. By the way, sorry for the "tone" of this review, I swear I did it for the good of DiPy.

What an old-fashioned / C++ way of thinking :P ... just kidding! I totally agree with you that would definitively clarify the code.

What is the alternative that you are proposing @nilgoyette ? Also let's all try to be polite to each other. The tone was not necessary. Please be specific.

Nothing fancy: return value(s) from functions, add arguments, try to have 0 mutable member.

As a generul rule, try to only have immutable objects. self should never be modified after construction. I know it's not Python way of thinking and it's maybe "too much", so I wouldn't care if 1 or 2 members are modified but you took it to another level :)

nilgoyette · 2018-04-11T15:26:35Z

dipy/align/bundlemin.pyx

+            openmp.omp_init_lock(&lock)
+
+        min_j = <double *> malloc(static_size * sizeof(double))
+        # min_i = <double *> malloc(moving_size * sizeof(double))


Because you only keep the min of the first loop, you don't need an array. A simple min and sum var will do.

nilgoyette · 2018-04-11T15:28:29Z

dipy/align/bundlemin.pyx

+    Notes
+    -----
+    The difference with ``_bundle_minimum_distance`` is that we sum the
+    minimum values only for the static. Therefore, this is asymetric.


Can you please explain more what are the consequences, if any? This function is a little less heavy than _bundle_minimum_distance, ok, but what are we losing? When can we use it?

BramshQamar · 2018-04-19T17:29:16Z

dipy/segment/bundles.py

+        # return recognized bundle in original streamlines, labels of
+        # recognized bundle and transformed recognized bundle
+        return self.streamlines[self.labels], self.labels, \
+            self.pruned_streamlines


It will be good to have detailed documentation about difference between self.streamlines[self.labels] and self.pruned_streamlines returned by recognize function.

nilgoyette · 2018-04-23T12:43:59Z

dipy/segment/bundles.py

+            self.cluster_map.refdata = self.streamlines
+            self.centroids = self.cluster_map.centroids
+            self.nb_centroids = len(self.centroids)
+            self.indices = [cluster.indices for cluster in self.cluster_map]


Thank you for the update, it's incredibly better. You can go further by removing self.clust_thr and self.indices (never used), and self.nb_centroids (only used in a print, so simply use len). I think I won't complain after that :)

I am removing self.clust_thr but the others are useful to stay there. I have reduced number of attributes incredibly there is no need to move directly and made _cluster_streamlines a "private" function. So, you should be very happy. Some utility attributes are useful.

nilgoyette · 2018-04-23T12:45:43Z

dipy/segment/bundles.py

+
+    def _cluster_streamlines(self, clust_thr=15, nb_pts=20):
+
+        np.random.seed(42)


Shouldn't this be a parameter? It would be nice if the user could simply choose a seed to have reproducable results. It's probably more complex than this simple line though?

Yes, needs a proper RandomState there. Fixing ...

nilgoyette · 2018-04-23T12:56:14Z

dipy/segment/bundles.py

+        centroid_matrix[centroid_matrix > reduction_thr] = np.inf
+
+        mins = np.min(centroid_matrix, axis=0)
+        close_clusters_indices = list(np.where(mins != np.inf)[0])


I don't get this strategy. Why not forget the infinity and simply use your reduction_thr directly in the np.where?

Multiple ways to go to Rome.

nilgoyette · 2018-04-23T12:57:21Z

dipy/segment/bundles.py

+
+        # TODO this can be speeded up by using directly the centroids
+        static = select_random_set_of_streamlines(model_bundle,
+                                                  select_model)


This can fit in 80 characters.

nilgoyette · 2018-04-23T12:58:31Z

dipy/segment/bundles.py

+
+        mins = np.min(pruning_matrix, axis=0)
+        pruned_indices = [rtransf_cluster_map[i].indices
+                          for i in np.where(mins != np.inf)[0]]


nilgoyette · 2018-04-23T12:59:53Z

dipy/segment/bundles.py

+        pruned_streamlines = [transf_streamlines[i]
+                              for i in pruned_indices]
+
+        pruned_indices = pruned_indices


That might be useless.

Good thanks.

Garyfallidis · 2018-04-24T21:12:11Z

@kesshijordan I will have to make a tutorial on another PR. Here is why. We are currently cleaning and preparing an atlas to be used. And it takes time. I want you to look at this atlas when you visit us next week. Ideally I want to make sure that we have a tutorial that can run with multiple input atlases. But we can start with one first.

Garyfallidis · 2018-04-26T09:59:34Z

Thank you all @kesshijordan @MarcCote @BramshQamar @nilgoyette and @MarcCote for your reviews. There were all very useful. All your comments have been address. Apologies also for the initial coding style with the many unnecessary attributes. But all this is fixed now. We are going to merge this PR today so we can move with the release in preparation for the big DIPY workshop next week. During https://brainhack.sice.indiana.edu. More updates and tutorials and atlases will be available in other PRs.

RecoBundles - recognition of bundles

Garyfallidis requested a review from BramshQamar March 5, 2018 21:18

skoudoro reviewed Mar 8, 2018

View reviewed changes

Garyfallidis changed the title ~~WIP: RecoBundles - automatic recognition of bundles~~ RecoBundles - automatic recognition of bundles Mar 11, 2018

Garyfallidis changed the title ~~RecoBundles - automatic recognition of bundles~~ RecoBundles - recognition of bundles Mar 11, 2018

Garyfallidis added this to Needs a review in Dipy 0.14.0 Mar 12, 2018

Garyfallidis added 24 commits March 12, 2018 17:12

NF: Adding Recobundles

1ab7c1b

TEST: added tests for RecoBundles

0b26b68

RF: moving qbx_merge to segment.clustering module

397b7fa

TEST: started testing for whole_brain_slr

0de3bd2

TEST: added new whole brain SLR test

0a52442

Still debugging this

5767fca

TEST: Whole brain SLR

cf23547

TEST: tests for whole brain SLR are passing in my machine

855d250

Adding alternative name SLR with QB combined

7cbf477

TEST: updated test_matrix44

a13227d

Possible fix of doctest issue

75bbffb

Hopefully removing docstring error -- this seemed to be related with …

8e6cec6

…how we set set_printoptions back to default

DOC: typo fix

6788ee0

Added check for the optimizer not providing number of iterations

b41ee03

Pep8

ba6042d

DOC: Improving docstrings in RB

189cde4

Minor update

a9e4530

DOC: improving docstrings in bundles.py

b2c63b8

DOC: improving documentation in whole_brain_slr

541cda2

DOC: SLR with QuickBundles docstring improvements

30a86cd

Trailspace

208b7be

TEST: examine clustermap input

3779467

TEST: testing cases where there are truly no neighbors around

c973d8c

TEST: increasing coverage in streamlinear.py and bundles.py

2bd31ff

RF: ArraySequence to Streamlines

d26b31a

Garyfallidis force-pushed the rb_rb branch from 81c625f to d26b31a Compare March 12, 2018 22:22

MarcCote reviewed Apr 5, 2018

View reviewed changes

BramshQamar reviewed Apr 5, 2018

View reviewed changes

nilgoyette reviewed Apr 6, 2018

View reviewed changes

nilgoyette reviewed Apr 11, 2018

View reviewed changes

BramshQamar reviewed Apr 19, 2018

View reviewed changes

Garyfallidis added 10 commits April 21, 2018 22:07

Updated qbx_with_merge with RandomState

2a90fa8

Renamed qbx_with_merge to qbx_and_merge

6a4dfd2

Removed vtk call

0007b04

RF: cleaning of bundlemin.pyx

4dce8ca

Changed default parameters for bounds

98452d5

RF: RecoBundles refactored to reduce number of attributes

a624cf7

RF: different corrections from refactoring

bd66929

RF: refactoring code seems working again

883d55c

TEST: minor update

0cb1eca

Corrected number of return objects if no neighbors are found

593b2fa

nilgoyette reviewed Apr 23, 2018

View reviewed changes

Garyfallidis added 2 commits April 24, 2018 16:54

Added RandomState inside RecoBundles

18dc049

Minor issue corrected

72fcf7f

skoudoro merged commit 26f99c5 into dipy:master Apr 28, 2018

Dipy 0.14.0 automation moved this from Needs a review to Done Apr 28, 2018

ShreyasFadnavis pushed a commit to ShreyasFadnavis/dipy that referenced this pull request Sep 20, 2018

Merge pull request dipy#1443 from Garyfallidis/rb_rb

eca9d43

RecoBundles - recognition of bundles


		final_level = len(thresholds)

		qbx_ordering_final = np.random.choice(


		print(' Duration %0.3f sec. \n' % (time() - t1, ))

		return merged_cluster_map



		def test_rb_no_neighb():
		# what if neighbors are found? No recognition


		def _cluster_streamlines(self, clust_thr=15, nb_pts=20):

		np.random.seed(42)

RecoBundles - recognition of bundles #1443

RecoBundles - recognition of bundles #1443

Conversation

Garyfallidis commented Mar 5, 2018 • edited

skoudoro left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov-io commented Mar 9, 2018 • edited

Codecov Report

Garyfallidis commented Mar 14, 2018

kesshijordan commented Mar 14, 2018

Garyfallidis commented Mar 14, 2018

Garyfallidis commented Mar 30, 2018

MarcCote left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kesshijordan commented Apr 5, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MarcCote Apr 6, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Garyfallidis commented Apr 24, 2018

Garyfallidis commented Apr 26, 2018 • edited

Garyfallidis commented Mar 5, 2018 •

edited

codecov-io commented Mar 9, 2018 •

edited

kesshijordan commented Apr 5, 2018 •

edited

MarcCote Apr 6, 2018 •

edited

Garyfallidis commented Apr 26, 2018 •

edited