Fix boolean operator in selections to enable using them with `within` #467

Luthaf · 2020-08-31T11:40:16Z

Fix #371 using GLModel.selectedAtoms to get the set of matching atom for each individual sub-selection, and atom.serial to uniquely identify atoms when merging the sub-selections. Let me know if there is a better way to implement this (maybe directly inside of selectedAtoms?)

I am not sure how I can add tests for this. The linked issue refers to files in tests/auto, and these seems to do visual comparison of the results. My issue is that these tests don't pass on my browser (Firefox 80 on macOS 10.14.6, Intel integrated GPU), so I don't know how to make sure the test I add will work. I manually checked that this works fine for my simple use case (adding the central atom to the within selection), but I'm happy to add more tests.

Regarding code style, I used let instead of var to get proper variable scoping, which should be universally supported nowadays (https://caniuse.com/#feat=let). Let me know if that's an issue, I can try to make it work with var instead.

dkoes

Can you please redo this without introducing a bunch of whitespace changes? In addition to breaking the formatting, I can't find where the actual code change is.

Reference test images are generated using Chrome on Ubuntu Linux, but even using the same software, images can sometimes differ with different graphics drivers, which is why the image comparisons can be sorted by degree of change so they can be visually assessed.

A test case would be really helpful (include an image generated with Chrome if possible) so I can see what the goal is.

Thanks!

Luthaf · 2020-08-31T14:09:40Z

Can you please redo this without introducing a bunch of whitespace changes? In addition to breaking the formatting, I can't find where the actual code change is.

Done. I had separated the automated whitespace changes in its own commit to allow commit-by-commit review, so removing them was easy =)

I'll try to generate images this week when I get to a Linux machine.

dkoes · 2020-08-31T14:40:45Z

3Dmol/glmodel.js

@@ -1791,26 +1791,50 @@ $3Dmol.GLModel = (function() {
                        }
                    }else{//"or" and "and"
                        if(key == "and"){
-                            var and=sel[key];//array format


So I think the goal here is to use selectedAtoms recursively which should handle within etc. statements unlike atomIsSelected. This seems like a good idea.

Using index instead of serial should be more robust (index is suppose to be the position in the atoms array, whereas serial can be set in the PDB file).

Since we are keying on index, the implementation would be more efficient if they were all put in a Set and then the intersection/union taken. This would also support refactoring or/and to eliminate some duplicate code (since the only difference is the set operation). Using set lookup is O(1) vs the current O(n) search.

It would also be a good idea to memoize the result of the selectedAtoms call for even better efficiency.

Using index instead of serial should be more robust (index is suppose to be the position in the atoms array, whereas serial can be set in the PDB file).

Is index part of AtomSpec too? If so I can add it to #468.

I agree on the memoization & use of Set, I'll change this to use them.

I'd rather leave it an internal property.

Luthaf · 2020-09-02T10:52:07Z

3Dmol/glmodel.js

-                                    break;
+                    } else { //"or" and "and"
+                        // cache the evaluation of the selection for better performance
+                        if (sel[key].__cached_results === undefined) {


It would also be a good idea to memoize the result of the selectedAtoms call for even better efficiency.

This is how I've implemented it, caching the full evaluation of the and/or part of the selection. My only concern here is that the selection could change without the code realizing with something like

const selection = {and: [{serial: 4}, {resn: "ALA"}]}; model.selectedAtoms(selection); // the selection changes, but we don't know it! selection.and[0].serial = 67; model.selectedAtoms(selection);

I can see two ways to solve this: add the hash of the selection with the cached result (something like md5(JSON.stringify(sel)), and check that this did not change since last evaluation; or clean/remove __cached_results when starting a new evaluation of the selection.

The second solution require that there is a single entry point for selection evaluation, which could be GLModel.selectedAtoms. Currently, it looks like users could also use atomIsSelected as an entry point, so I could make this a private function.

As a side note, it looks like atomIsSelected do not accept the same selection elements as selectedAtoms (stuff like within/byres is not supported). Should this function be made private anyway?

Rather than hashing, my preference would be to copy the initial selection object. At little more efficient computationally, a little less efficient memory-wise. We would need to write a helper function that recursively traverses the selection object copying fields as needed (no need to copy if a cache already exists). It could also precompute the cached selections. Then instead of atomIsSelected computing the selections, it will check for the existence of a cache, if it exists it uses it, otherwise it does what it does now. This keeps backwards compatibility but makes the common path more efficient.

What do you think?

Where would the copy of the selection object take place? In selectedAtoms or in atomIsSelected? I guess selectedAtoms would make the most sense here.

Luthaf · 2020-09-09T22:35:36Z

Thanks for merging this! I'll do a follow-up PR to deal with the caching/copying of the selection.

Where do you think the copy of the selection object take place? In selectedAtoms or in atomIsSelected? I guess selectedAtoms would make the most sense here.

dkoes · 2020-09-09T22:58:11Z

Yes, selectedAtoms would be the best place.

dkoes requested changes Aug 31, 2020

View reviewed changes

Luthaf force-pushed the boolean-ops-selections branch from 17ece6c to 6f71d20 Compare August 31, 2020 14:08

dkoes requested changes Aug 31, 2020

View reviewed changes

Luthaf force-pushed the boolean-ops-selections branch from 6f71d20 to aef48df Compare September 2, 2020 10:43

Luthaf commented Sep 2, 2020

View reviewed changes

Luthaf force-pushed the boolean-ops-selections branch from aef48df to 318ce03 Compare September 2, 2020 15:25

Allow to use within with and/or in selections

a96af5b

Luthaf force-pushed the boolean-ops-selections branch from 318ce03 to a96af5b Compare September 8, 2020 12:10

Add test for complex selection including and/or

213f6a1

dkoes approved these changes Sep 9, 2020

View reviewed changes

dkoes merged commit 6dffe55 into 3dmol:master Sep 9, 2020

Luthaf deleted the boolean-ops-selections branch September 9, 2020 22:35

Luthaf mentioned this pull request Sep 11, 2020

Make a copy of the selection before evaluating them #474

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix boolean operator in selections to enable using them with `within` #467

Fix boolean operator in selections to enable using them with `within` #467

Luthaf commented Aug 31, 2020

dkoes left a comment

Luthaf commented Aug 31, 2020

dkoes Aug 31, 2020

Luthaf Aug 31, 2020

dkoes Aug 31, 2020

Luthaf Sep 2, 2020

dkoes Sep 3, 2020

Luthaf Sep 8, 2020

Luthaf commented Sep 9, 2020

dkoes commented Sep 9, 2020

Fix boolean operator in selections to enable using them with within #467

Fix boolean operator in selections to enable using them with within #467

Conversation

Luthaf commented Aug 31, 2020

dkoes left a comment

Choose a reason for hiding this comment

Luthaf commented Aug 31, 2020

dkoes Aug 31, 2020

Choose a reason for hiding this comment

Luthaf Aug 31, 2020

Choose a reason for hiding this comment

dkoes Aug 31, 2020

Choose a reason for hiding this comment

Luthaf Sep 2, 2020

Choose a reason for hiding this comment

dkoes Sep 3, 2020

Choose a reason for hiding this comment

Luthaf Sep 8, 2020

Choose a reason for hiding this comment

Luthaf commented Sep 9, 2020

dkoes commented Sep 9, 2020

Fix boolean operator in selections to enable using them with `within` #467

Fix boolean operator in selections to enable using them with `within` #467