Make cluster gcn reproducible #937

kjun9 · 2020-02-25T01:23:16Z

This updates cluster gcn's node generator to use the random_state utility that's now available, and adds some tests to make sure the model is reproducible when we set the global stellargraph seed.

Part of #749

codeclimate · 2020-02-25T01:23:50Z

stellargraph/mapper/mini_batch_node_generators.py

    """

-    def __init__(self, G, clusters=1, q=1, lam=0.1, name=None):
+    def __init__(self, G, clusters=1, q=1, lam=0.1, name=None, seed=None):


Refactor this function to reduce its Cognitive Complexity from 17 to the 15 allowed.

codeclimate · 2020-02-25T01:23:53Z

Code Climate has analyzed commit d078921 and detected 0 issues on this pull request.

View more on Code Climate.

stellar-graph-bot · 2020-02-25T01:38:25Z

Codecov Report

Merging #937 into develop will increase coverage by 0.4%.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           develop    #937     +/-   ##
=========================================
+ Coverage     83.9%   84.3%   +0.4%     
=========================================
  Files           53      53             
  Lines         5101    5103      +2     
=========================================
+ Hits          4281    4301     +20     
+ Misses         820     802     -18

Impacted Files	Coverage Δ
stellargraph/mapper/mini_batch_node_generators.py	`97.1% <0.0%> (+13.3%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 76df745...30686c1. Read the comment docs.

codecov-io · 2020-02-25T01:39:14Z

Codecov Report

Merging #937 into develop will increase coverage by 0.5%.
The diff coverage is 85.9%.

@@            Coverage Diff            @@
##           develop    #937     +/-   ##
=========================================
+ Coverage     84.3%   84.8%   +0.5%     
=========================================
  Files           53      58      +5     
  Lines         5103    4958    -145     
=========================================
- Hits          4301    4206     -95     
+ Misses         802     752     -50

Impacted Files	Coverage Δ
stellargraph/layer/graphsage.py	`79.4% <ø> (ø)`	⬆️
stellargraph/utils/ensemble.py	`0% <0%> (-85.5%)`	⬇️
stellargraph/utils/loss.py	`0% <0%> (ø)`	⬆️
stellargraph/datasets/datasets.py	`70.8% <100%> (+20.8%)`	⬆️
stellargraph/core/graph.py	`98.6% <100%> (ø)`	⬆️
stellargraph/mapper/sequences.py	`92.1% <100%> (ø)`	⬆️
...rgraph/utils/saliency_maps/integrated_gradients.py	`100% <100%> (+4.3%)`	⬆️
...argraph/interpretability/saliency_maps/__init__.py	`100% <100%> (ø)`
...ph/utils/saliency_maps/integrated_gradients_gat.py	`100% <100%> (+10.3%)`	⬆️
stellargraph/version.py	`100% <100%> (ø)`	⬆️
... and 19 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 30686c1...d078921. Read the comment docs.

PantelisElinas · 2020-03-01T23:01:45Z

tests/reproducibility/test_cluster_gcn.py

+    targets = np.random.rand(petersen_graph.number_of_nodes(), target_size)
+    assert_reproducible(
+        lambda: cluster_gcn_nai(petersen_graph, targets, 4, 2, shuffle=shuffle)
+    )


I feel that we need a test to also make sure that if the seed is different, then the models are indeed different. Otherwise, we have not established a causal link between the random seed value and initial model weights.

PantelisElinas

Hi @kjun9

You might want to have a look at my suggestion for another test, otherwise this is good to merge.

P.

Make cluster gcn nai reproducible

30686c1

codeclimate bot reviewed Feb 25, 2020

View reviewed changes

kjun9 mentioned this pull request Feb 25, 2020

Test all algorithms with randomness for reproducibility #749

Open

14 tasks

Merge branch 'develop' into feature/749-cluster-gcn

6c83d36

kjun9 marked this pull request as ready for review February 27, 2020 23:41

kjun9 requested a review from PantelisElinas February 27, 2020 23:42

Remove unused optional parameters

d078921

PantelisElinas reviewed Mar 1, 2020

View reviewed changes

PantelisElinas approved these changes Mar 1, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make cluster gcn reproducible #937

Make cluster gcn reproducible #937

kjun9 commented Feb 25, 2020 •

edited

codeclimate bot Feb 25, 2020

codeclimate bot commented Feb 25, 2020 •

edited

stellar-graph-bot commented Feb 25, 2020

codecov-io commented Feb 25, 2020 •

edited

PantelisElinas Mar 1, 2020

PantelisElinas left a comment

Make cluster gcn reproducible #937

Are you sure you want to change the base?

Make cluster gcn reproducible #937

Conversation

kjun9 commented Feb 25, 2020 • edited

codeclimate bot Feb 25, 2020

Choose a reason for hiding this comment

codeclimate bot commented Feb 25, 2020 • edited

stellar-graph-bot commented Feb 25, 2020

Codecov Report

codecov-io commented Feb 25, 2020 • edited

Codecov Report

PantelisElinas Mar 1, 2020

Choose a reason for hiding this comment

PantelisElinas left a comment

Choose a reason for hiding this comment

kjun9 commented Feb 25, 2020 •

edited

codeclimate bot commented Feb 25, 2020 •

edited

codecov-io commented Feb 25, 2020 •

edited