[Example][Refactor] Refactor GCN example #4160

chang-l · 2022-06-22T23:44:21Z

Description

Referring to #4186, this PR refactors the GCN example. Only single-GPU training is implemented.

Please note that we have two GCN implementations: one using the DGL module (train.py) and one using a customized module (gcn_mp.py). To keep the file diff readable for review, I have not renamed gcn_mp.py. This needs to be addressed before merge.

Checklist

Please feel free to remove inapplicable items for your PR.

The PR title starts with [$CATEGORY] (such as [NN], [Model], [Doc], [Feature])
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change
Related issue is referred to in this PR
If the PR is for a new model/paper, I've updated the example index here.

Changes

Align to our golden example
Two separate self-contained files for two implementations

Tests

Included in the updated README file.

dgl-bot · 2022-06-22T23:44:54Z

To trigger regression tests:

@dgl-bot run [instance-type] [which tests] [compare-with-branch];
For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

dgl-bot · 2022-06-23T00:19:53Z

Commit ID: 680f120

Build ID: 1

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-06-30T19:43:43Z

Commit ID: 15c922c

Build ID: 2

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-06-30T20:11:44Z

Commit ID: cfd7b15

Build ID: 3

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

chang-l · 2022-06-30T21:54:56Z

The CI test keeps failing because I removed the gcn.py file, which contains the GCN module definition. Please let me know if you would prefer to keep the GCN module in a separate file. @jermainewang @mufeili

dgl-bot · 2022-06-30T22:28:28Z

Commit ID: a38b0a6

Build ID: 4

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-06-30T23:52:54Z

Commit ID: 7179662

Build ID: 5

Status: ❌ CI test failed in Stage [Torch CPU (Win64) Example test].

Report path: link

Full logs path: link

jermainewang · 2022-07-01T02:12:27Z

Apart from the inline comments, I feel we could further simplify the example:

The example currently shows how to write a custom graph convolution layer using update_all, built-in functions and UDFs. I think we could just call dgl.nn.GraphConv.
We can then have only one training script.

@mufeili what do you think?
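
For context, here is a minimal dependency-free sketch of the computation a graph convolution layer with symmetric normalization performs. As far as I understand, this matches the behavior of dgl.nn.GraphConv with its default norm='both'. The function name gcn_layer and the plain-list data layout are hypothetical, chosen only for illustration. This is not DGL's implementation, and bias and activation are omitted.

```python
import math


def gcn_layer(edges, num_nodes, feats, weight):
    """Compute D_in^{-1/2} * A * D_out^{-1/2} * X * W on plain Python lists.

    edges  : list of directed (src, dst) pairs
    feats  : num_nodes x in_dim node features (X)
    weight : in_dim x out_dim weight matrix (W)
    """
    out_deg = [0] * num_nodes
    in_deg = [0] * num_nodes
    for src, dst in edges:
        out_deg[src] += 1
        in_deg[dst] += 1

    in_dim = len(feats[0])
    out_dim = len(weight[0])

    # Message passing: each source sends its features scaled by
    # out_deg^{-1/2}; each destination sums the incoming messages.
    agg = [[0.0] * in_dim for _ in range(num_nodes)]
    for src, dst in edges:
        scale = 1.0 / math.sqrt(out_deg[src])
        for k in range(in_dim):
            agg[dst][k] += scale * feats[src][k]

    # Scale by in_deg^{-1/2} on the destination side, then apply W.
    out = []
    for v in range(num_nodes):
        scale = 1.0 / math.sqrt(in_deg[v]) if in_deg[v] > 0 else 0.0
        out.append([
            scale * sum(agg[v][k] * weight[k][j] for k in range(in_dim))
            for j in range(out_dim)
        ])
    return out


# Two nodes joined by an undirected edge, plus one self-loop per node.
edges = [(0, 1), (1, 0), (0, 0), (1, 1)]
out = gcn_layer(edges, num_nodes=2, feats=[[1.0], [3.0]], weight=[[2.0]])
```

Writing this by hand with update_all and UDFs mainly shows the message-passing API; for an example whose goal is training GCN, the built-in module covers the same computation.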

mufeili · 2022-07-01T05:49:56Z

> Apart from the inline comments, I feel we could further simplify the example:
>
> The example currently shows how to write a custom graph convolution layer using update_all, built-in functions and UDFs. I think we could just call dgl.nn.GraphConv.
>
> We can then have only one training script.
>
> @mufeili what do you think?

I agree.

dgl-bot · 2022-07-01T17:42:32Z

Commit ID: 65fccf4abf6875a8ef1bb8979d8da691424f90f9

Build ID: 7

Status: ❌ CI test failed in Stage [C++ CPU].

Report path: link

Full logs path: link

dgl-bot · 2022-07-01T17:50:59Z

Commit ID: 0298e8370271f8efb78cfc894d9aa32f0976c7af

Build ID: 6

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-07-01T18:15:20Z

Commit ID: 6095d0d8da7ec2b7a71b7476347a1c37582a4e7a

Build ID: 8

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

mufeili · 2022-07-04T07:33:16Z

examples/pytorch/gcn/train.py

+                        help="Dataset name ('cora', 'citeseer', 'pubmed').")
+    args = parser.parse_args()
+    print(f'Training with DGL intrinsic graph convolution module.')
+


You might want to first create a transform object and then pass it to the dataset classes below. Also, you need to compose RemoveSelfLoop and AddSelfLoop, since some datasets already contain self-loops for some of the nodes.

Yes, I can do that (create a transform object, transform = AddSelfLoop(), and pass it to the dataset class).
If I understand the doc correctly (https://docs.dgl.ai/generated/dgl.transforms.AddSelfLoop.html), AddSelfLoop with the default allow_duplicate=False already removes existing self-loops before adding new ones, so it covers what RemoveSelfLoop would do.

I see. Thanks.
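
To illustrate the point discussed above, here is a minimal sketch of the self-loop semantics on a plain edge list. The helper add_self_loops is hypothetical and written only for this illustration. It mirrors the documented allow_duplicate behavior of AddSelfLoop as described in the linked doc, and it is not DGL's implementation.

```python
def add_self_loops(edges, num_nodes, allow_duplicate=False):
    """Return a new edge list with one self-loop (v, v) appended per node.

    With allow_duplicate=False, existing self-loops are dropped first,
    so every node ends up with exactly one self-loop.
    """
    if not allow_duplicate:
        edges = [(src, dst) for src, dst in edges if src != dst]
    return list(edges) + [(v, v) for v in range(num_nodes)]


# Node 1 already has a self-loop in the raw edge list.
raw_edges = [(0, 1), (1, 1), (1, 2)]

deduped = add_self_loops(raw_edges, num_nodes=3)
# deduped == [(0, 1), (1, 2), (0, 0), (1, 1), (2, 2)]

duplicated = add_self_loops(raw_edges, num_nodes=3, allow_duplicate=True)
# duplicated contains (1, 1) twice
```

With the default setting, a separate self-loop removal step before the addition is redundant, which is why a single transform is enough here.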

mufeili · 2022-07-04T07:52:54Z

Overall a great job, I've left some minor comments.

yaox12 · 2022-07-05T04:18:03Z

@chang-l Can you add your name in CONTRIBUTORS.md?

mufeili

I'm good.

dgl-bot · 2022-07-05T05:02:09Z

Commit ID: 69c424d4b9d8495c886311317ad84fc0fb11a0bd

Build ID: 9

Status: ❌ CI test failed in Stage [Torch GPU Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-07-05T05:20:48Z

Commit ID: 45df38f

Build ID: 10

Status: ❌ CI test failed in Stage [Torch CPU (Win64) Example test].

Report path: link

Full logs path: link

chang-l · 2022-07-05T05:57:11Z

@yaox12 Thanks for the reminder. I just added it.

dgl-bot · 2022-07-05T07:17:55Z

Commit ID: be5d1eb

Build ID: 11

Status: ❌ CI test failed in Stage [Torch CPU (Win64) Example test].

Report path: link

Full logs path: link

dgl-bot · 2022-07-05T07:56:36Z

Commit ID: e9060ed

Build ID: 12

Status: ✅ CI test succeeded

Report path: link

Full logs path: link

* Refactor GCN example
* Refactor GCN based on graphsage
* Readme update
* Minor update
* update
* Remove user-defined GCN implementation
* README update
* Update
* Update CONTRIBUTORS.md
* update task_example_test

Co-authored-by: Xin Yao <xiny@nvidia.com>

BarclayII requested a review from mufeili June 27, 2022 06:45

jermainewang added the "Release Candidate" label (candidate PRs for the upcoming release) Jun 29, 2022

chang-l added 2 commits June 30, 2022 10:18

Refactor GCN example

416c141

Refactor GCN based on graphsage

15c922c

chang-l force-pushed the gcn-example-refactor branch from 680f120 to 15c922c June 30, 2022 19:08

Readme update

cfd7b15

Minor update

a38b0a6

update

7179662

jermainewang requested changes Jul 1, 2022

examples/pytorch/gcn/README.md (outdated)

examples/pytorch/gcn/gcn_mp.py (outdated)

examples/pytorch/gcn/gcn_mp.py (outdated)

examples/pytorch/gcn/gcn_mp.py (outdated)

chang-l added 2 commits July 1, 2022 10:14

Remove user-defined GCN implementation

9bdfe6d

README update

8d4a1f8

chang-l force-pushed the gcn-example-refactor branch from 43157d6 to 8d4a1f8 July 1, 2022 17:26