Update atom densities feature calculation #101

CunliangGeng · 2019-09-16T10:58:54Z

To solve issue #92.
The new code calculates atom densities based on elements. The default is C, N, O and S. Thus both backbone and sidechain are used for feature calculation.

NOTE: Please merge this RP #101 after merging the RP #98

coveralls · 2019-09-16T18:50:24Z

Pull Request Test Coverage Report for Build 570

22 of 25 (88.0%) changed or added relevant lines in 5 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.2%) to 74.874%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
deeprank/tools/pdb2sql.py	12	13	92.31%
deeprank/generate/GridTools.py	5	7	71.43%

Totals
Change from base Build 566:	0.2%
Covered Lines:	2226
Relevant Lines:	2973

💛 - Coveralls

NicoRenaud

Looks pretty nice, maybe we can try to use Mendeleev for all the vdw radius

NicoRenaud · 2019-09-17T10:02:52Z

deeprank/config/chemicals.py

+
+# atom vdw radius
+# https://en.wikipedia.org/wiki/Van_der_Waals_radius


Could we use something else than Wikipedia ?
The python package Mendeleev has vdw radius for all atoms I think

We need vdw_radius for only five chemical elements, maybe too heavy to import a package to do that.
Changed the reference to a book.

NicoRenaud · 2019-09-17T10:04:28Z

deeprank/generate/DataGenerator.py

@@ -802,7 +803,7 @@ def map_features(self, grid_info={},
        >>> grid_info = {
        >>>     'number_of_points' : [30,30,30],
        >>>     'resolution' : [1.,1.,1.],
-        >>>     'atomic_densities' : {'CA':3.5,'N':3.5,'O':3.5,'C':3.5},
+        >>>     'atomic_densities' : {'C':1.7, 'N':1.55, 'O':1.52, 'S':1.8},


Why half of the radius ? Did I use the diameter ? There also we could use Mendeleev ...

The Mendeleev has the same VDW values as what I'm using.

NicoRenaud · 2019-09-17T10:06:16Z

deeprank/generate/GridTools.py

@@ -35,8 +35,8 @@ def __init__(self, molgrp,
            number_of_points(int, optional): number of points we want in
                each direction of the grid.
            resolution(float, optional): distance(in Angs) between two points.
-            atomic_densities(dict, optional): dictionary of atom types with
-                their vdw radius, e.g. {'CA':1.7, 'C':1.7, 'N':1.55, 'O':1.52}
+            atomic_densities(dict, optional): dictionary of element types with


atomType is the default name in the PDB file format no ? I have nothing against elementtype but I don't get why changing it

The old code calculates atomic densities based on atom types, e.g. CA, CB, C, N and O. It is not a problem for protein backbone which has four atom types (CA, C, N and O). But it will be a problem when considering side chain that has about 40 atom types, which means we have to provide a dictionary for all these atom types with their VDW radius as input if using the old code. Different force fields may use slightly different atom types, which make the situation worse.

To solve this problem, the new code calculates atomic densities based on chemical element types. The 20 amino acids have only five element types (C, N, O, S, H).

So we have to change the name to distinguish the difference.

sounds good

NicoRenaud · 2019-09-17T10:07:44Z

deeprank/tools/pdb2sql.py

@@ -113,7 +113,8 @@ def _create_sql(self):
                    'y': 'REAL',
                    'z': 'REAL',
                    'occ': 'REAL',
-                    'temp': 'REAL'
+                    'temp': 'REAL',
+                    'element': 'TEXT'


I don't know what this element is but we should also use it in pdb2sql then ... and also use pdb2sql in deeprank :)

The element here is chemical element.
OK, will update pdb2sql and use it in deeprank.

using pdb2sql will maybe not work out of the box as some functionalities ma be slightly different ... so be careful there

NicoRenaud · 2019-09-17T15:53:35Z

If you fix the conflicts it's all good for me and we can merge

CunliangGeng added 10 commits September 11, 2019 16:28

add element column and get_element func

e54334b

change atom density to element level

3b43448

add atom vaw radius

ff241d6

change PDB line length from 73 to 78

57c0d42

update chemicals

dc55174

Update atom density values

47e9527

Update atom density values

10705c4

Update mapfly atom density feature

92f7b2c

Update test_generate.py

1952531

fix typo

2a593ca

CunliangGeng requested a review from NicoRenaud September 16, 2019 10:59

CunliangGeng added 2 commits September 16, 2019 13:27

remove trailing whitespace

3343085

fix syntax error

ec2a6eb

NicoRenaud reviewed Sep 17, 2019

View reviewed changes

CunliangGeng added 2 commits September 17, 2019 13:21

Update chemicals.py

0700e47

Update chemicals.py

4bd1a58

Merge branch 'development' into issue92_atomdensity

fb37bed

CunliangGeng merged commit e53f60b into development Sep 18, 2019

CunliangGeng deleted the issue92_atomdensity branch September 18, 2019 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update atom densities feature calculation #101

Update atom densities feature calculation #101

CunliangGeng commented Sep 16, 2019 •

edited

Loading

coveralls commented Sep 16, 2019 •

edited

Loading

NicoRenaud left a comment

NicoRenaud Sep 17, 2019

CunliangGeng Sep 17, 2019 •

edited

Loading

NicoRenaud Sep 17, 2019

CunliangGeng Sep 17, 2019

NicoRenaud Sep 17, 2019

CunliangGeng Sep 17, 2019

NicoRenaud Sep 17, 2019

NicoRenaud Sep 17, 2019

CunliangGeng Sep 17, 2019 •

edited

Loading

NicoRenaud Sep 17, 2019

NicoRenaud commented Sep 17, 2019


		# atom vdw radius
		# https://en.wikipedia.org/wiki/Van_der_Waals_radius

Update atom densities feature calculation #101

Update atom densities feature calculation #101

Conversation

CunliangGeng commented Sep 16, 2019 • edited Loading

coveralls commented Sep 16, 2019 • edited Loading

Pull Request Test Coverage Report for Build 570

💛 - Coveralls

NicoRenaud left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

CunliangGeng Sep 17, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

CunliangGeng Sep 17, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

NicoRenaud commented Sep 17, 2019

CunliangGeng commented Sep 16, 2019 •

edited

Loading

coveralls commented Sep 16, 2019 •

edited

Loading

CunliangGeng Sep 17, 2019 •

edited

Loading

CunliangGeng Sep 17, 2019 •

edited

Loading