Integrate get_molnet_dataset with Splitter #209

mottodora · 2018-06-30T06:48:14Z

Please merge after #201 and #202.

…-integration

codecov-io · 2018-06-30T06:56:22Z

Codecov Report

Merging #209 into master will decrease coverage by 0.73%.
The diff coverage is 14.03%.

@@            Coverage Diff             @@
##           master     #209      +/-   ##
==========================================
- Coverage   82.32%   81.59%   -0.74%     
==========================================
  Files         106      106              
  Lines        4974     5026      +52     
==========================================
+ Hits         4095     4101       +6     
- Misses        879      925      +46

corochann · 2018-07-04T10:00:45Z

chainer_chemistry/datasets/molnet/molnet.py

                       frac_test=.1, seed=777, return_smiles=False,
-                       target_index=None):
+                       target_index=None, task_index=0, **kwargs):


Can you add a docstring? Or do you want to do that in another PR?

corochann · 2018-07-04T10:02:23Z

chainer_chemistry/datasets/molnet/molnet.py

+        if isinstance(split, str):
+            splitter = split_method_dict[split]()
+        elif isinstance(split, BaseSplitter):
+            splitter = split


I think adding an else: raise TypeError('message') branch would be friendlier.
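A minimal sketch of the suggested else branch follows. It assumes split_method_dict maps names to splitter classes, as the hunk implies; the class bodies and the resolve_splitter helper are hypothetical stand-ins for illustration, not the library's code. The message reports the actual type that was passed.

```python
class BaseSplitter:
    """Hypothetical stand-in for the splitter base class."""


class RandomSplitter(BaseSplitter):
    """Hypothetical stand-in; the real class implements the split."""


class ScaffoldSplitter(BaseSplitter):
    """Hypothetical stand-in; the real class implements the split."""


# Assumed shape of the mapping used in the hunk: name -> splitter class.
split_method_dict = {
    'random': RandomSplitter,
    'scaffold': ScaffoldSplitter,
}


def resolve_splitter(split):
    """Return a splitter for a method name or an existing splitter instance.

    resolve_splitter is a hypothetical helper that isolates the dispatch.
    """
    if isinstance(split, str):
        splitter = split_method_dict[split]()
    elif isinstance(split, BaseSplitter):
        splitter = split
    else:
        # Report the type that was actually passed, not only the expected one.
        raise TypeError(
            'split must be str or BaseSplitter, got {}'.format(
                type(split).__name__))
    return splitter
```

With this branch, passing something like an int fails immediately with a message naming int, instead of failing later when splitter is used without having been assigned.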

corochann · 2018-07-04T10:02:28Z

chainer_chemistry/datasets/molnet/molnet.py

+        if isinstance(splitter, ScaffoldSplitter):
+            get_smiles = True
+        else:
+            get_smiles = return_smiles


I think adding an else: raise TypeError('message') branch would be friendlier.

I think TypeError is not necessary for this line.

I want to fix the message to show the "actual" type.

corochann · 2018-07-04T10:04:19Z

tests/datasets_tests/molnet_tests/test_molnet.py

@@ -7,8 +7,9 @@
 from chainer_chemistry.datasets import NumpyTupleDataset
 from chainer_chemistry.datasets import molnet

-expect_bbbp_lengths = [1631, 203, 205]
+expect_bbbp_lengths = [1633, 203, 203]


Do you know why this happens? Is it the RDKit version?
If it is an RDKit version issue, I want to fix a recommended RDKit version for each Chainer Chemistry version.

It seems that this change is caused by the difference between the scaffold split and the random split, but it is unexpected.

I think this is not "unexpected", since you implemented StratifiedSplitter so that the remainder goes to the "train" data, right?
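The arithmetic behind the two sets of lengths can be checked with a short sketch. It assumes fractions of 0.8/0.1/0.1 and truncation with int(); only frac_test=.1 appears in the signature above, so the other fractions, the rounding rule, and both helper names are assumptions made for illustration, not the splitters' actual code.

```python
def lengths_remainder_to_test(n, frac_train=0.8, frac_valid=0.1):
    """Truncate the train and valid sizes; the leftover samples go to test."""
    n_train = int(frac_train * n)
    n_valid = int(frac_valid * n)
    return [n_train, n_valid, n - n_train - n_valid]


def lengths_remainder_to_train(n, frac_valid=0.1, frac_test=0.1):
    """Truncate the valid and test sizes; the leftover samples go to train."""
    n_valid = int(frac_valid * n)
    n_test = int(frac_test * n)
    return [n - n_valid - n_test, n_valid, n_test]


# Total number of samples implied by the old expected lengths.
n_bbbp = 1631 + 203 + 205  # 2039

print(lengths_remainder_to_test(n_bbbp))   # [1631, 203, 205]
print(lengths_remainder_to_train(n_bbbp))  # [1633, 203, 203]
```

Under these assumptions both expected lists sum to 2039, and the only difference is which subset absorbs the samples left over after truncation.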

mottodora · 2018-07-06T11:47:46Z

Fixed.

corochann

Ok

corochann · 2018-07-10T09:23:57Z

I want to fix some of the documentation and messages, but I think it is faster to make a PR myself than to ask you to fix them.
So let me merge this.

mottodora added 7 commits June 30, 2018 14:10

Merge remote-tracking branch 'origin/fix-molnet' into molnet-splitter…

758c7f4

…-integration

fix conflict

acfa9ed

fix

95ae659

Merge branch 'scaffold-splitter' into molnet-splitter-integration

4d004af

integrate get_molnet_dataset with Splitter

c6a9cc7

test qm7 dataset

9975b48

change molnet_config

458aed8

mottodora changed the title ~~Integrate get_molnet_dataset with Splitter c6a9cc7~~ Integrate get_molnet_dataset with Splitter Jun 30, 2018

Merge scaffold splitter branch

fd85908

mottodora added this to the 0.4.0 milestone Jul 3, 2018

merge master

95b6c4b

corochann reviewed Jul 4, 2018

fix

2555be5

mottodora mentioned this pull request Jul 6, 2018

Add get_molnet_raw_dataframe function #216

Merged

corochann approved these changes Jul 10, 2018

corochann merged commit 758e379 into chainer:master Jul 10, 2018

corochann mentioned this pull request Jul 10, 2018

fix document & message #218

Merged
