Cosym: choose cluster containing the most identity reindexing ops #1514

rjgildea · 2020-12-07T22:05:02Z

This can be important if the lattice symmetry is only very approximately related by pseudosymmetry to the true symmetry. If all potential reindexing ops are genuine indexing ambiguities, it doesn't matter which one is chosen; if they are not, choosing the wrong one will distort the true unit cell. In such cases it is likely that the input datasets were already indexed consistently, so default to choosing the cluster that contains the most identity reindexing ops.
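
A minimal sketch of the selection rule described above, in plain Python. The data layout (a mapping from dataset id to the reindexing op that dataset would receive in each cluster) and the names `pick_cluster` and `example_ops` are assumptions for illustration only, not the code in this PR.

```python
IDENTITY = "x,y,z"  # identity reindexing op in xyz notation


def pick_cluster(reindexing_ops):
    """Return the cluster id whose reindexing ops contain the most identities.

    reindexing_ops: {dataset_id: {cluster_id: op_string}} (hypothetical layout)
    """
    identity_counts = {}
    for ops_by_cluster in reindexing_ops.values():
        for cluster_id, op in ops_by_cluster.items():
            identity_counts.setdefault(cluster_id, 0)
            if op == IDENTITY:
                identity_counts[cluster_id] += 1
    return max(identity_counts, key=identity_counts.get)


# Three datasets, two candidate clusters (made-up ops).
example_ops = {
    0: {0: "x,y,z", 1: "-x,-y,z"},
    1: {0: "x,y,z", 1: "-x,-y,z"},
    2: {0: "-x,-y,z", 1: "x,y,z"},
}
chosen = pick_cluster(example_ops)
```

Here cluster 0 leaves two of the three datasets untouched, so it is preferred over cluster 1, which would reindex two of them.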

codecov · 2020-12-09T16:57:22Z

Codecov Report

Merging #1514 (6159374) into master (b36efe5) will decrease coverage by 0.00%.
The diff coverage is 96.19%.

@@            Coverage Diff             @@
##           master    #1514      +/-   ##
==========================================
- Coverage   65.62%   65.61%   -0.01%     
==========================================
  Files         614      614              
  Lines       68965    68958       -7     
  Branches     9529     9523       -6     
==========================================
- Hits        45257    45250       -7     
- Misses      21866    21868       +2     
+ Partials     1842     1840       -2

graeme-winter

Popped a couple of comments inline, mostly asking for small improvements to the documentation of the code, as there are some sgtbx-type rabbit holes in this... I assume that the code does what the commit messages say, and overall the spirit of the change set makes a lot of sense. I do wonder if this is a case for not squash merging, however, as the PR is structured with bug fix and tidying commits kept separate. Maybe squash down to 2 commits: bugfix+news and cleaning+test changes?

algorithms/symmetry/cosym/target.py

graeme-winter · 2020-12-10T15:17:12Z

algorithms/symmetry/cosym/__init__.py

@@ -175,10 +175,12 @@ def _map_space_group_to_input_cell(intensities, space_group):
                sg_best = sg_primitive.change_basis(cb_op_best_primitive.inverse())
                # best_subgroup above is the bravais type, so create thin copy here with the
                # user-input space group instead
+                best_subsym = best_subsym.customized_copy(


I think this stuff could do with some kind of annotation somewhere... I am certain you have gained some understanding as a side effect of these changes, which should probably be documented inline, i.e. the why of best_subsym.change_basis(cb_op_inp_best.inverse()) etc.

I ask this as someone who may have to debug this one day :-)

This is just to obtain the subsym in the input setting rather than the "best" setting -> cb_op_inp_best.inverse()

graeme-winter · 2020-12-10T15:18:56Z

algorithms/symmetry/cosym/__init__.py

@@ -370,28 +372,6 @@ def _analyse_symmetry(self):
        )
        self.params.cluster.n_clusters = len(cosets.partitions)

-    def _space_group_for_dataset(self, dataset_id, sym_ops):


You're not wrong that retaining this adds confusion, as I was trying to work out how things still work with all this stuff removed...

algorithms/symmetry/cosym/__init__.py

graeme-winter · 2020-12-10T15:23:21Z

command_line/cosym.py

@@ -177,9 +178,17 @@ def run(self):
        self.cosym_analysis.run()

        reindexing_ops = {}
+        sym_op_counts = {


Is behaviour well defined in the unlikely event of a tie?
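
On the tie question, a small sketch of how Python's built-ins behave, which is not necessarily what this PR relies on. `max` returns the first maximal item it encounters, and dicts iterate in insertion order, so a tie is resolved deterministically but depends on the order in which clusters were inserted. The counts and the explicit tie-break rule below are made up for illustration.

```python
# Hypothetical identity-op counts per cluster id; clusters 2 and 0 tie.
identity_counts = {2: 3, 1: 1, 0: 3}

# max() keeps the first maximal item it meets, and dicts iterate in
# insertion order (Python 3.7+), so the earliest-inserted cluster wins.
first_wins = max(identity_counts, key=identity_counts.get)

# Making the tie-break explicit: prefer the higher count, then the lower id.
explicit = max(identity_counts, key=lambda cid: (identity_counts[cid], -cid))
```

With an explicit key the result no longer depends on insertion order, which may be easier to reason about when debugging.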

graeme-winter · 2020-12-10T15:24:00Z

command_line/cosym.py

@@ -177,9 +178,17 @@ def run(self):
        self.cosym_analysis.run()

        reindexing_ops = {}
+        sym_op_counts = {
+            cluster_id: collections.Counter(


Wait, what is this syntax?

Learning opportunity for YT

Oh, dictionary creation from iterator...

Possibly re-arranging the code slightly (e.g. a dummy variable to move everything onto a single line) would make it look less magic?
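
For readers who hit the same syntax: the construct is a dict comprehension whose values are `collections.Counter` objects built from generator expressions. The sketch below uses made-up data and hypothetical names (`reindexing_ops`, `n_clusters`, `ops_in_cluster`) to show the compact form next to an equivalent version with a named intermediate, along the lines suggested above.

```python
import collections

# Hypothetical input: {dataset_id: {cluster_id: op_string}}
reindexing_ops = {
    0: {0: "x,y,z", 1: "-x,-y,z"},
    1: {0: "x,y,z", 1: "x,y,z"},
    2: {0: "-x,-y,z", 1: "x,y,z"},
}
n_clusters = 2

# Compact form: a dict comprehension whose values are Counters built from
# generator expressions.
compact = {
    cluster_id: collections.Counter(
        ops[cluster_id] for ops in reindexing_ops.values()
    )
    for cluster_id in range(n_clusters)
}

# Same result, spelled out with a named intermediate list.
spelled_out = {}
for cluster_id in range(n_clusters):
    ops_in_cluster = [ops[cluster_id] for ops in reindexing_ops.values()]
    spelled_out[cluster_id] = collections.Counter(ops_in_cluster)
```

Both forms produce one `Counter` per cluster, keyed by reindexing op, so `compact[0]["x,y,z"]` gives the number of datasets left unchanged in cluster 0.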

newsfragments/1514.bugfix

* Deprecate target.get_sym_ops(), use target.sym_ops instead
* Remove CosymAnalysis.space_groups - this has been superseded by the pointless-style symmetry analysis, and retaining this likely just causes added complexity or confusion.
* Tidy up CosymAnalysis._reindexing_ops_for_dataset method
* Call cosym run() directly instead of via procrunner
* Set n_clusters param after clustering - n_clusters may have been Auto, in which case we now know how many clusters were found by clustering
* Cleaner to use 'if i_cluster in reindexing_ops: break'; add some docstrings to the method.
* Minimal test for CosymAnalysis._reindexing_ops_for_dataset()

rjgildea requested a review from graeme-winter December 7, 2020 22:05

rjgildea force-pushed the cosym_reindexing_ops branch from abdde55 to c4275c8 Compare December 9, 2020 10:21

graeme-winter approved these changes Dec 10, 2020

rjgildea added 2 commits December 11, 2020 09:58

rjgildea force-pushed the cosym_reindexing_ops branch from 0ed77e8 to 6159374 Compare December 11, 2020 09:58

rjgildea merged commit fcfd7b4 into master Dec 11, 2020

rjgildea deleted the cosym_reindexing_ops branch December 11, 2020 13:05
