Optimisation of flexible side chains #73

RMeli · 2019-07-29T12:33:07Z

GSoC 2019

Goal

Implement optimisation of flexible side chains using the CNN scoring function.

Tasks:

Split ligand and receptor movable atoms in the correct channels
Get combined gradient of the ligand and receptor flexible atoms

Changes

Automatically freeze receptor (--cnn_freeze_receptor) when minimising with CNN scoring function (--cnn_scoring --minimize)
Split ligand and receptor atoms in the correct channels
- CNNScorer::ligand_coords and CNNScorer::ligand_smtypes attributes to store ligand coordinates and smina atom types
- CNNScorer::receptor_coords and CNNScorer::receptor_smtypes attributes to store receptor coordinates and smina atom types
- CNNScorer::setReceptor and CNNScorer::setLigand private members to automatically fill CNNScorer::ligand_coords, CNNScorer::ligand_smtypes, CNNScorer::receptor_coords and and CNNScorer::receptor_smtypes from model::atoms, model::coorsd and model::grid_atoms
- New and unified MolGRidDayaLayer::setReceptor and MolGRidDayaLayer::setLigand API, accepting ligand/receptor coordinates and smina atom types
Get combined ligand and flexible residues gradient
- New CNNScorer::setGradient private member correctly extracting ligand (and flexible residues) gradient from MolGridDataLayer.
Print the list of flexible residues after gnina banner
- New FlexInfo::printFlex function

breaking flexible residues

gninasrc/lib/cnn_scorer.cpp

caffe/src/caffe/layers/molgrid_data_layer.cpp

dkoes

What I'm envisioning is a clean break between molgridlayer and gnina where gnina says to molgrid "these are the receptor atoms", "these are the ligand atoms" and molgrid doesn't need to worry if what's flexible or not - all molgrid needs are the types and coordinates. This means you need to splice together rigid and flexible atoms (and then separate them) but seems like a much cleaner interface to me.

If a class variable is used to assemble/dissassemble the atoms, this should be relatively efficient (no need to allocate/deallocate memory, just copying a few numbers).

gninasrc/lib/cnn_scorer.cpp

RMeli · 2019-08-02T16:39:13Z

@dkoes, thanks for the code review and the comments.

This means you need to splice together rigid and flexible atoms (and then separate them) but seems like a much cleaner interface to me.

Isn't this similar to what I'm doing (after @Jsunseri code review)?

CNNScorer now has the following new attributes:

atomv ligand_atoms, receptor_atoms;
vecv ligand_coords, receptor_coords;
std::size_t num_flex_atoms;

which explicitly differentiate between ligand and receptor atoms (both flexible and not; flexible ones are just put at the beginning and tracked with CNNScorer::num_flex_atoms) and are passed to MolGridDataLayer.

The functions setLigand and setReceptor are used to to extract rigid and flexible atoms (of both ligand and receptor) from the model (m.get_movable_atoms(), m.get_fixed_atoms()) and populate the new attributes. The same functions are also used to update the flexible atoms coordinates on subsequent calls, but allocation should be performed only once (and the atomv are untouched after the first call).

I think the current implementation is very similar to what you describe (since there are almost no changes to MolGridDataLater apart from a API change where receptor coordinates are passed explicitly) but if my understanding is wrong and you have something different in mind could you provide some more details? Thanks!

gninasrc/lib/cnn_scorer.cpp

RMeli · 2019-08-04T18:06:32Z

A possible improvement I see at the moment is that CNNScorer::setReceptor and CNNScorer::setLigand are also used to update the coordinates from model->coords to the internal CNNScorer->ligand_coords and CNNScorer->receptor_coords (for subsequent calls of the CNNScorer::score function). Would it be cleaner to pass the model to the CNNScorer constructor, call CNNScorer::setReceptor and CNNScorer::setLigand only once and update the coordinates via a different utility function (private member of CNNScorer)? Or the same CNNScorer needs to be used with different models?

RMeli · 2019-08-05T15:12:46Z

Updated PR description to reflect latest changes.

dkoes

At first glance, isn't obvious to me why you aren't clearing a vector

gninasrc/lib/cnn_scorer.cpp

RMeli added 30 commits July 9, 2019 15:34

residue serialization

4581a9e

move flexible residues after banner

57d7124

restored old order

abf44c2

print flexible residues after gnina banner

c48680c

added cnn option for flexible residues

46f33d4

activate receptor gradient for flexible residues

006502d

comments

4b81b27

moved default constructor close to copy constructor

a622205

small test for optimisation with default model

d133b97

enable flexible minimisation for test

fb17042

simplified test receptor

69c64b8

ignored Testing/Temporary

0dfdc8d

named comments instead of TODOs

1f0d6ac

preparation for flexres gradient only

82561fd

split ligand and flexible receptor in correct channels

6504e7e

fixed slicing

a9778ca

ligand/receptor extraction cleanup

c5e7db1

breaking flexible residues

ignore test output

a130f50

Merge branch 'gsoc19/dev' into gsoc19/flex

5b0663b

add 1w4o ligand and full receptor

cf25dd0

fix receptor position and orientation automatically

9c54141

remove automatic freeze from test

22e25c3

changed assertions to CHECK_EQ

51bb8f7

clearer variable names

3834ef4

Merge remote-tracking branch 'upstream/master' into gsoc19/flex

2549caa

test cleanup

4f6f29c

test name

6a37192

get receptor cleanup

ad8538f

const correctness

edc8d19

added flexible receptor setup function

4f14e4e

RMeli added 5 commits July 31, 2019 17:20

comments

89ad0da

removed useless distinction with flexopt

d492a16

minimize PR diff

f0c7858

remove withespace-only change

4a914db

changed assignment method

d0b08e0

RMeli commented Jul 31, 2019

View reviewed changes

gninasrc/lib/cnn_scorer.cpp Show resolved Hide resolved

dkoes reviewed Aug 2, 2019

View reviewed changes

caffe/src/caffe/layers/molgrid_data_layer.cpp Outdated Show resolved Hide resolved

dkoes requested changes Aug 2, 2019

View reviewed changes

gninasrc/lib/cnn_scorer.cpp Outdated Show resolved Hide resolved

RMeli commented Aug 2, 2019

View reviewed changes

gninasrc/lib/cnn_scorer.cpp Outdated Show resolved Hide resolved

Merge remote-tracking branch 'upstream/master'

d5c3fad

RMeli added 8 commits August 5, 2019 10:05

getReceptor optimisation

c04564e

setLigand refactoring

4a416d9

setReceptor refactoring

107654d

removed waters from test systems

a38f4d5

getGradient utility function

45413b0

removed flexopt flag

8c3b09b

autoremove test output

7d9153e

added check in test

40d57b4

dkoes requested changes Aug 5, 2019

View reviewed changes

gninasrc/lib/cnn_scorer.cpp Outdated Show resolved Hide resolved

gninasrc/lib/cnn_scorer.cpp Show resolved Hide resolved

RMeli and others added 6 commits August 5, 2019 21:24

moved resize at more appropriate place

c70104e

more checks on gradient

e0b0861

more checks

615fde9

Merge remote-tracking branch 'upstream/master'

8eeb8a7

Merge branch 'master' into gsoc19/flex

b658685

fix logic for flexres string and actually skip invalid specifier

42bc72c

dkoes approved these changes Aug 21, 2019

View reviewed changes

dkoes merged commit 86c3c10 into gnina:master Aug 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimisation of flexible side chains #73

Optimisation of flexible side chains #73

RMeli commented Jul 29, 2019 •

edited

dkoes left a comment

RMeli commented Aug 2, 2019

RMeli commented Aug 4, 2019

RMeli commented Aug 5, 2019

dkoes left a comment

Optimisation of flexible side chains #73

Optimisation of flexible side chains #73

Conversation

RMeli commented Jul 29, 2019 • edited

GSoC 2019

Goal

Changes

dkoes left a comment

Choose a reason for hiding this comment

RMeli commented Aug 2, 2019

RMeli commented Aug 4, 2019

RMeli commented Aug 5, 2019

dkoes left a comment

Choose a reason for hiding this comment

RMeli commented Jul 29, 2019 •

edited