debug deeprank for models with no feature value #242

manonreau · 2021-11-09T19:18:02Z

Compute grid_shape only if it is not provided as an input
Compute feature_mean if self.clip_features == True
feature_mean was previously computed when self.normalize_features == True; this is no longer required.
Format the logger message in the compute_norm function
Allow feature clipping in the _clip_feature function only if values exist for that feature
Add a condition so that feature values are not transformed when they correspond to an empty vector in the mapping process
Set save_hit_rate to False by default, since it calls the IRMSD target, which users may not have
Make plots optional in the test() function, since users in principle have no target information for the test set, except in benchmark conditions
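
For readers following the clipping change, here is a minimal sketch of the guarded clip, not the actual `_clip_feature` code. The function name `clip_feature`, the default `w=5.0`, and the symmetric lower bound `mean - w * std` are assumptions made for illustration; only the `maxv` line and the `minv != maxv` guard appear in the diff.

```python
import numpy as np

def clip_feature(feature, mean, std, w=5.0):
    """Clip each channel to mean +/- w*std, skipping channels with an empty range.

    Hypothetical helper: names and the default w are assumptions.
    """
    feature = np.array(feature, dtype=float)  # np.array copies, so the input is untouched
    for ic in range(feature.shape[0]):
        minv = mean[ic] - w * std[ic]  # assumed symmetric counterpart of maxv
        maxv = mean[ic] + w * std[ic]
        # A zero std (e.g. a feature with no values) gives minv == maxv: skip it
        if minv != maxv:
            feature[ic] = np.clip(feature[ic], minv, maxv)
    return feature

feature = np.array([[0.0, 10.0, -10.0], [0.0, 0.0, 0.0]])
mean = np.array([0.0, 0.0])
std = np.array([1.0, 0.0])
clipped = clip_feature(feature, mean, std, w=5.0)
print(clipped)
```

The first channel is clipped to the range [-5, 5]; the second, whose standard deviation is zero, is left as it is.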

coveralls · 2021-11-11T08:20:30Z

Pull Request Test Coverage Report for Build 1468853462

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 77.12%

Totals
Change from base Build 1094124503:	0.0%
Covered Lines:	1628
Relevant Lines:	2111

💛 - Coveralls

NicoRenaud

I have left a few comments but overall it looks good :) Thanks !

deeprank/learn/DataSet.py

deeprank/learn/NeuralNet.py

NicoRenaud · 2021-11-15T10:34:46Z

deeprank/learn/DataSet.py

+                maxv = self.feature_mean[ic] + w * self.feature_std[ic]
+                if minv != maxv: 
+                    feature[ic] = np.clip(feature[ic], minv, maxv)
+                    #feature[ic] = self._mad_based_outliers(feature[ic],minv,maxv)


Why is that line commented out? Should we simply remove it?

No idea, this line was already commented out in the code.

compute the norm values if `self.clip_features` exists

Remove the benchmarking mode: the prediction/benchmarking mode can be detected by simply checking for the presence of target values in a given input data set when `plot == True`. Added the required checks. Note that the hitrate function is only adapted for IRMSD input and should be modified.

manonreau · 2021-11-15T12:28:16Z

I have left a few comments but overall it looks good :) Thanks !

I modified the NeuralNet.py class to skip the input data set in the plotting function if no target is provided. This way no benchmarking mode is required.

Modify Hitrate so that it can handle any type of target values (not limited to irmsd anymore)
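
A rough sketch of what a target-agnostic hit rate could look like, for illustration only. The function below is not the DeepRank implementation: `hit_rate`, the `lower_is_hit` flag, and the convention that lower scores rank first are all assumptions. Only the `hit_cutoff` name comes from this PR.

```python
def hit_rate(scores, targets, hit_cutoff, lower_is_hit=True):
    """Cumulative fraction of hits recovered when walking down the ranking.

    scores: predicted values; lower is assumed to mean better ranked.
    targets: ground-truth values of any target type (IRMSD or other).
    lower_is_hit: hypothetical flag; True if a target <= hit_cutoff is a hit.
    """
    ranked = sorted(zip(scores, targets), key=lambda pair: pair[0])
    hits = [(t <= hit_cutoff) if lower_is_hit else (t >= hit_cutoff)
            for _, t in ranked]
    total = sum(hits)
    if total == 0:
        # No hit in the set: return a flat curve instead of dividing by zero
        return [0.0] * len(hits)
    found, curve = 0, []
    for is_hit in hits:
        found += is_hit
        curve.append(found / total)
    return curve

curve = hit_rate(scores=[0.2, 0.9, 0.1, 0.5],
                 targets=[3.0, 8.0, 6.0, 2.0],
                 hit_cutoff=4.0)
print(curve)
```

Whether a hit lies below or above the cutoff depends on the target (for IRMSD, lower is better), which is why the direction is left as a parameter in this sketch.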

deeprank/learn/DataSet.py

deeprank/learn/NeuralNet.py

CunliangGeng · 2021-11-15T20:31:58Z

deeprank/learn/NeuralNet.py

-
-                targ = self.data[l]['targets'].flatten()
+                try:
+                    targ = self.data[l]['targets']


Just to confirm: the targets data does not need to be flattened, right? I see the old code uses targ = self.data[l]['targets'].flatten()

Good catch! It must be flattened.
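
One way to see why the flatten matters is the small sketch below. It assumes the stored targets carry a trailing axis of length 1, which is not confirmed in this thread; the array values are made up.

```python
import numpy as np

targets = np.array([[0.1], [0.4], [0.9]])  # shape (3, 1): assumed per-sample column
outputs = np.array([0.2, 0.4, 0.7])        # shape (3,)

# Without flatten, broadcasting pairs every target with every output
bad_diff = targets - outputs
# With flatten, the subtraction is element-wise as intended
good_diff = targets.flatten() - outputs

print(bad_diff.shape, good_diff.shape)
```

Under that assumption, the unflattened version silently produces a 3 x 3 array instead of raising an error, so any downstream plot or metric would be computed on the wrong pairs.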

CunliangGeng · 2021-11-15T20:37:23Z

deeprank/learn/NeuralNet.py

        for fname, mol in data['mol']:

            f5 = h5py.File(fname, 'r')
-            irmsd.append(f5[mol + '/targets/IRMSD'][()])
+            targets.append(f5[mol + f'/targets/self.data_set.select_target'][()])


The braces are missing, so the attribute is never interpolated: need to change f'/targets/self.data_set.select_target' to f'/targets/{self.data_set.select_target}'
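
The difference can be checked in isolation. In the sketch below, `DataSetStub`, `NetStub`, the `"DOCKQ"` value and the molecule name are hypothetical stand-ins; only the f-string behaviour is the point.

```python
class DataSetStub:
    """Hypothetical stand-in for self.data_set; only select_target matters."""
    select_target = "DOCKQ"

class NetStub:
    data_set = DataSetStub()

    def broken_path(self, mol):
        # No braces: the text after /targets/ is kept literally
        return mol + f'/targets/self.data_set.select_target'

    def fixed_path(self, mol):
        # Braces make the f-string evaluate the attribute
        return mol + f'/targets/{self.data_set.select_target}'

net = NetStub()
print(net.broken_path("1AK4_100"))
print(net.fixed_path("1AK4_100"))
```

With the broken form, the HDF5 lookup would ask for a dataset literally named `self.data_set.select_target`, which presumably does not exist in the file.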

deeprank/learn/NeuralNet.py

CunliangGeng

Thanks @manonreau, the changes look good. I left a few comments for you to check.

1. Change field self.grid_shape to self._grid_shape
2. Update DataSet class docstring for grid_info
3. Update get_grid_shape method

1. Rename target_thr to hit_cutoff
2. Update hit_cutoff docstring
3. Replace print with logger
4. Shorten long lines

CunliangGeng · 2021-11-16T21:29:00Z

Hi @manonreau I pushed two commits (since it's not easy to comment all the details), take a look please :-)

manonreau

Thanks @CunliangGeng, everything looks fine, I will merge that PR once you approve it

deeprank/learn/DataSet.py

manonreau · 2021-11-16T08:42:49Z

deeprank/learn/NeuralNet.py

-
-                targ = self.data[l]['targets'].flatten()
+                try:
+                    targ = self.data[l]['targets']


Good catch! It must be flattened.

CunliangGeng

Thanks @manonreau, I approve now.

CunliangGeng · 2021-11-17T08:59:21Z

BTW, please also close the related issues after merging the PR :-)

manonreau added 2 commits November 9, 2021 20:12

debug deeprank for models with no feature value

f3621dc

Update DataSet.py

ad067bb

manonreau requested review from CunliangGeng and NicoRenaud and removed request for CunliangGeng November 9, 2021 19:23

manonreau mentioned this pull request Nov 10, 2021

Update DataSet.py #237

Closed

Update DataSet.py

89a7600

manonreau added 2 commits November 11, 2021 16:42

adapt code for prediction

bce81ec

Merge branch 'debug' of https://github.com/DeepRank/deeprank into debug

52b5561

NicoRenaud reviewed Nov 15, 2021


CunliangGeng self-assigned this Nov 15, 2021

manonreau added 5 commits November 15, 2021 12:43

Update DataSet.py

355a0ba

compute the norm values if `self.clip_features` exists

Create DataSet.py

e3f6a79

compute the norm values if `self.clip_features` exists

Create DataSet.py

5997158

compute the norm values if `self.clip_features` exists

Update DataSet.py

00c02b4

manonreau added 4 commits November 15, 2021 13:45

Update hitrate

9459efc

Modify Hitrate so that it can handle any type of target values (not limited to irmsd anymore)

Update DataSet.py

f60d3eb

Update DataSet.py

9e20f85

Update NeuralNet.py

754060b

CunliangGeng reviewed Nov 15, 2021

deeprank/learn/DataSet.py (outdated)

CunliangGeng reviewed Nov 15, 2021

deeprank/learn/NeuralNet.py (outdated)

CunliangGeng reviewed Nov 15, 2021

deeprank/learn/NeuralNet.py (outdated)

CunliangGeng reviewed Nov 15, 2021

deeprank/learn/NeuralNet.py (outdated)

CunliangGeng suggested changes Nov 15, 2021

manonreau added 2 commits November 16, 2021 09:36

Update DataSet.py

ba3f2cb

Update NeuralNet.py

7d972fd

manonreau requested review from CunliangGeng and NicoRenaud November 16, 2021 09:41

manonreau and others added 10 commits November 16, 2021 11:40

Update DataSet.py

d8e7dc5

Update NeuralNet.py

78444a8

Update NeuralNet.py

f1b3824

Update NeuralNet.py

3558609

Update test_learn.py

70ec167

Update DataSet.py

7e674c9

Update NeuralNet.py

257c4ef

Update test_learn.py

5135b33

Update grid shape related field and method

b99a433

1. Change field self.grid_shape to self._grid_shape
2. Update DataSet class docstring for grid_info
3. Update get_grid_shape method

Rename target_thr to hit_cutoff

de92fc6

1. Rename target_thr to hit_cutoff
2. Update hit_cutoff docstring
3. Replace print with logger
4. Shorten long lines

manonreau commented Nov 16, 2021


CunliangGeng mentioned this pull request Nov 17, 2021

Update doc for grid_shape #243

Open

2 tasks

CunliangGeng approved these changes Nov 17, 2021


manonreau mentioned this pull request Nov 17, 2021

run deeprank on data without target values (test mode) #213

Closed

manonreau merged commit 8dc4df9 into master Nov 17, 2021
