
Contributing more pooling and convolution layers #49

Closed
LeviBorodenko opened this issue Apr 20, 2020 · 9 comments


@LeviBorodenko
Contributor

LeviBorodenko commented Apr 20, 2020

Hey,
I have implementations of the following graph neural network components:

Convolutions:

Pooling:

I would love to add these to your framework; however, I am a bit lost about all the requirements my implementations need to fulfil in order to integrate seamlessly. Is there a guide for the tests, features, or properties that layers need to have?

Best,
Levi

@danielegrattarola
Owner

Hi,

I'd be happy to add your code to the framework. I should probably improve the contribution guidelines; here's more or less what it would be good to have:

  • Message-passing layers should go in their own files under layers.convolutional;
  • Make sure that your layer has a preprocess() static method if there is any pre-processing to be done to the adjacency matrix;
  • Make sure that all functions needed for Keras are implemented (your layer should extend GraphConv);
  • SortPool should be added as a class in layers/pooling/globalpool.py;
  • See the documentation in the other layers for how to format docstrings (it's important that the format is the same so that the docs can be built automatically);
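For context on the `preprocess()` point above: a common thing such a method does is the symmetric GCN-style normalization of the adjacency matrix. Here is a minimal numpy sketch of that idea, not Spektral's actual implementation:

```python
import numpy as np

def preprocess(A):
    # Symmetric normalization: D^{-1/2} (A + I) D^{-1/2},
    # where D is the degree matrix of A + I (self-loops added).
    A_hat = A + np.eye(A.shape[0])
    deg = A_hat.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

# Two nodes joined by an edge: every entry of the normalized matrix is 0.5.
A = np.array([[0.0, 1.0], [1.0, 0.0]])
A_norm = preprocess(A)
```

Keeping this logic in a static method lets the framework apply it to the adjacency matrix once, before training, instead of inside every forward pass.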

Tests are a bit different. It's better if you copy-paste what I did for other methods, or just open a PR and I'll add the test cases myself.

I've never had big PRs so it's better if we work together on this one :D
Just open a PR and we'll go from there.

Cheers,
Daniele

@LeviBorodenko
Contributor Author

Okay, wonderful!
I will give myself a week to familiarise myself with your codebase and will open a PR soon thereafter :)

Best,
Levi

danielegrattarola added a commit that referenced this issue Apr 27, 2020
 ENH (issue #49): Added SortPool layer to spectral.layers.pooling.globalpool
@LeviBorodenko
Contributor Author

Is there any place where I can put utility layers used for convolutional layers without them cluttering the convolutional layer docs?

@danielegrattarola
Owner

Sure! If it's a proper Layer you can put it in layers/base.py, whereas for utility functions you can modify/create files in layers/ops/.

danielegrattarola added a commit that referenced this issue Apr 29, 2020
 ENH (issue #49): Added DiffusionConvolution, tests and a script to easily build docs locally.
@LeviBorodenko
Contributor Author

Alright, I believe all my implementations have now been merged into the development branch and this issue can be closed.

Now that I feel comfortable with this framework, I'd really like to contribute more. Is there any to-do list I can consult to help with further development?

@danielegrattarola
Owner

Thanks again for contributing! I will be merging the new layers into master soon and pushing them to PyPI as well.
I would say that any additional convolutional or pooling layers would be a good contribution. I have thought about creating a general abstraction for message-passing layers for a while now, I will take this chance to start working on it over the following days. This should make future contributions easier.

I am also starting a TODO list in Projects to keep track of what is missing in the library.

Cheers,
Daniele

@danielegrattarola
Owner

FYI I have just pushed to develop a MessagePassing class that should make it easier to implement single-mode layers.
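As background on what such a class abstracts: message passing in its simplest form aggregates each node's neighbours' features through the adjacency matrix and applies a learned transformation. A hedged numpy sketch of that single-mode idea (this is illustrative, not the Spektral `MessagePassing` API):

```python
import numpy as np

def message_pass(A, X, W):
    # One propagation step: sum neighbour features via the adjacency
    # matrix, then apply a linear transformation with weights W.
    return A @ X @ W

A = np.array([[0.0, 1.0], [1.0, 0.0]])  # two nodes joined by an edge
X = np.array([[1.0, 0.0], [0.0, 1.0]])  # one-hot node features
W = np.eye(2)                           # identity weights, for illustration
out = message_pass(A, X, W)
# After one step, each node holds its neighbour's features.
```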

@LeviBorodenko
Contributor Author

Very nice!

Out of curiosity, is disjoint mode supposed to be an internal data-storage format, mainly served by Spektral's data loaders etc., or do we expect many users to have their own data in that format? Apart from TU, I am not aware of many datasets using that format.

Furthermore, I expect that many users will have batch-mode signal and adjacency data with a ragged node dimension. It may be worth providing methods for working with graph datasets that have a ragged node dimension.
To that end, I think either looking into tf.RaggedTensor or providing a function that creates batches of graphs of the same size might be useful.

Best,
Levi

@danielegrattarola
Owner

Disjoint mode solves exactly the problem of having to work with ragged tensors. I have looked into tf.ragged several times since I started this project, but there's always some problem that makes it impossible to move the whole framework to a ragged representation of graphs.

This way of representing graphs is not limited to the TU data; it is a fairly common representation (also sometimes called an "edge list"... it's all the same, really). The new OGB datasets are represented like this, for instance, and so are many network datasets found online.

I think it's pretty convenient, and I don't see a clear advantage of batch mode over disjoint, especially because it is impossible to store batch data as sparse matrices.
Other frameworks, like PyTorch Geometric, also use disjoint mode for basically all of their layers.

So let's say that you have two graphs with a different number of nodes, stored in lists of matrices:

A_list = [A_1, A_2]
X_list = [X_1, X_2]

You can convert them to batch mode (all graphs have the same size):

from spektral.utils.data import numpy_to_batch
A_batch, X_batch = numpy_to_batch(A_list, X_list)
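Conceptually, batch mode zero-pads every graph to the size of the largest one. A minimal numpy sketch of that padding (not Spektral's actual implementation):

```python
import numpy as np

def to_batch(A_list, X_list):
    # Zero-pad each adjacency and node-feature matrix to the size of
    # the largest graph, then stack them along a new batch dimension.
    n_max = max(A.shape[0] for A in A_list)
    n_feat = X_list[0].shape[1]
    A_batch = np.zeros((len(A_list), n_max, n_max))
    X_batch = np.zeros((len(X_list), n_max, n_feat))
    for i, (A, X) in enumerate(zip(A_list, X_list)):
        n = A.shape[0]
        A_batch[i, :n, :n] = A
        X_batch[i, :n] = X
    return A_batch, X_batch

# A 2-node graph and a 3-node graph become a (2, 3, 3) adjacency batch.
A_list = [np.ones((2, 2)), np.ones((3, 3))]
X_list = [np.ones((2, 4)), np.ones((3, 4))]
A_batch, X_batch = to_batch(A_list, X_list)
```

The padding is what makes batch mode incompatible with sparse storage: the zeros are materialized.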

Or you can convert them to disjoint mode:

from spektral.utils.data import numpy_to_disjoint
A_disjoint, X_disjoint, I_disjoint = numpy_to_disjoint(A_list, X_list)

Note that I_disjoint is only needed for pooling; it tells you which nodes belong to which sub-graph.
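Under the hood, disjoint mode amounts to stacking the node features, building a block-diagonal adjacency matrix, and recording a graph index per node. A hedged numpy sketch of that construction (illustrative only, not Spektral's implementation):

```python
import numpy as np

def to_disjoint(A_list, X_list):
    # Stack node features, place each adjacency matrix on the diagonal
    # of one big matrix, and record which graph each node belongs to.
    X = np.vstack(X_list)
    n_tot = sum(A.shape[0] for A in A_list)
    A = np.zeros((n_tot, n_tot))
    I = np.zeros(n_tot, dtype=int)
    start = 0
    for g, A_g in enumerate(A_list):
        n = A_g.shape[0]
        A[start:start + n, start:start + n] = A_g
        I[start:start + n] = g
        start += n
    return A, X, I

# A 2-node and a 3-node graph yield a 5x5 block-diagonal adjacency
# and the index vector I = [0, 0, 1, 1, 1].
A_list = [np.ones((2, 2)), np.ones((3, 3))]
X_list = [np.ones((2, 4)), np.ones((3, 4))]
A, X, I = to_disjoint(A_list, X_list)
```

Because the off-diagonal blocks are all zero, a convolution on the big matrix never mixes nodes from different graphs, which is why only pooling needs I.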

I hope that answers your doubts.

Cheers,
D

danielegrattarola pushed a commit that referenced this issue Feb 9, 2021
…alpool

- Added the global pooling layer "SortPool" that keeps the top \(k\) most
 relevant nodes as described by [Zhang et al.](https://www.cse.wustl.edu/~muhan/papers/AAAI_2018_DGCNN.pdf).
 For now it only supports single and batch mode.
- Covered it in the standard pooling tests (tests/test_layers/test_pooling.py) without
 adding any specific tests.
- Added .cache to .gitignore -- it's an irrelevant folder that pytest sometimes
 creates when it fails a test.
- Added a reference to SortPool in README.md.
danielegrattarola added a commit that referenced this issue Feb 9, 2021
 ENH (issue #49): Added SortPool layer to spectral.layers.pooling.globalpool
danielegrattarola pushed a commit that referenced this issue Feb 9, 2021
…sily build docs locally.

- Added DiffusionConvolution to `layers.convolutional`. Also covered it in tests and made sure the docs look good.
- Improved docs for SortPool and covered it in globalpool tests (removed from pool tests).
- Added both SortPool and DiffusionConvolution to `autogen.py` so they are contained in the docs.
- Added `docs/local_build.sh`, a script that installs the current Spektral build from the local branch, builds the docs, and serves them on a local webserver, for convenience.
danielegrattarola added a commit that referenced this issue Feb 9, 2021
 ENH (issue #49): Added DiffusionConvolution, tests and a script to easily build docs locally.