Feature/1044 sortpooling layer #1210
Conversation
…g tensor padding for k larger than the input graph size.
…uncation of output tensor.
… output tensor is checked at runtime for padding or truncation.
Code Climate has analyzed commit 3e7168d and detected 0 issues on this pull request.
(Moving my review from #1212.)
stellargraph/layer/sort_pooling.py
Outdated
""" | ||
outputs = tf.map_fn( | ||
lambda x: tf.gather( | ||
x, tf.argsort(x[..., -1], axis=0, direction="DESCENDING") |
The paper talks about using index -2 (and -3, -4, ...) to break ties. This doesn't seem to be doing so; is that important?
(from https://github.com/stellargraph/stellargraph/pull/1212/files#r403892314)
Reply:
This is addressed in #1210.
My reply:
It might be good to have a comment, since it slightly mismatches the formal description.
I'll just quote myself from above which also quotes from the paper,
Note that sorting is performed using only the last column of the input tensor as stated in [1], "For convenience, we set the last graph convolution to have one channel and only used this single channel for sorting."
I will update the docstring to mention the above. Whether it makes a difference or not, I do not know since the original paper does not show results for sorting with other than the last column. I suspect that given enough GCN layers, the approximation is good enough and that there are few if any ties to break that, in practice, makes little difference to sort beyond the one column.
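To make the single-column sorting concrete, here is a NumPy sketch of the reordering that the `tf.gather`/`tf.argsort` snippet performs (illustrative only; the layer itself operates on TensorFlow tensors):

```python
import numpy as np

# NumPy sketch of the per-graph sorting step: rows are reordered by the
# LAST feature column, descending, matching the single-channel sorting
# described in [1]. The function name is illustrative, not from the PR.
def sort_by_last_column(x):
    # Negate the scores so that an ascending, stable argsort yields a
    # descending order over the last column.
    order = np.argsort(-x[:, -1], kind="stable")
    return x[order]

x = np.array([[3, 4, 0],
              [1, 2, -1],
              [5, 0, 1]])
sorted_x = sort_by_last_column(x)
# rows reordered so the last column reads 1, 0, -1
```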
stellargraph/layer/sort_pooling.py
Outdated
""" | ||
outputs = tf.map_fn( | ||
lambda x: tf.gather( | ||
x, tf.argsort(x[..., -1], axis=0, direction="DESCENDING") |
Also, I think this currently doesn't handle padding perfectly; any padding values may be sorted higher than some real elements, and this may lead to peculiar behaviour as graphs are grouped together differently.
For a small example of what I'm thinking, suppose k=2 and batch_size=2, and we've got three graphs G1, G2 and G3, with 2, 2, 3 nodes respectively. If the network gives G1's nodes sorting scores 1 and -1, for a batch G1, G2, the final output will include both of those nodes and be as expected. However, for a batch G1, G3, I think the x tensor here may look like:
[
[..., 1],
[..., -1],
[..., 0] # padding
]
(For ease of identifying the elements I'm assuming that there's no biases/they cancel out for the all-zero paddings.)
And thus the chosen elements will be the higher node and the padding, i.e. [[..., 1], [..., 0]].
Should this take a mask argument and obey it (somehow)?
(from https://github.com/stellargraph/stellargraph/pull/1212/files#r403898355)
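The failure mode described above can be reproduced with a small NumPy sketch (illustrative values; the real layer uses TensorFlow ops):

```python
import numpy as np

# Sketch of the padding problem: a zero padding row has sorting score 0,
# which outranks a real node with score -1, so naive top-k selection
# keeps the padding row instead of the real node.
x = np.array([[1.0, 1.0],     # real node, sorting score 1
              [-1.0, -1.0],   # real node, sorting score -1
              [0.0, 0.0]])    # padding row, sorting score 0
k = 2
order = np.argsort(-x[:, -1], kind="stable")  # descending by last column
top_k = x[order][:k]
# top_k keeps the score-1 node and the PADDING row, not both real nodes
```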
Hm, I think in this case we should also force any padded values (padded from the generator) to be zero - otherwise, if there are biases, the nodes padded by SortPooling will be zero while the nodes padded from the generator won't be.
I did not consider the case of an activation function with support outside [0, 1]. I will add the mask as an argument to correctly sort only the relevant part of the tensors.
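One possible way a mask could be obeyed, sketched in NumPy (an assumption about the approach, not the actual implementation): replace the sorting scores of masked-out rows with -inf so that padding always sorts last.

```python
import numpy as np

# Illustrative sketch: masked-out (padding) rows get a -inf sorting
# score, so they can never outrank real nodes during top-k selection.
x = np.array([[1.0, 1.0],
              [-1.0, -1.0],
              [0.0, 0.0]])            # last row is padding
mask = np.array([True, True, False])  # False marks the padding row
scores = np.where(mask, x[:, -1], -np.inf)
order = np.argsort(-scores, kind="stable")  # descending; -inf sorts last
top2 = x[order][:2]
# now both real nodes are kept, padding is excluded
```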
@kieranricardo I'm not sure I understand your concern. The generator pads with 0s already.
I have updated to use a mask to deal with padding plus some smaller things like the option to flatten the output tensor as in the paper. Have another look! P.
Are you planning to add this to docs/api.txt as part of the DGCNN pull request, instead of in this one?
tests/layer/test_sort_pooling.py
Outdated
def test_flatten_output():
    data = np.array([[3, 4, 0], [1, 2, -1], [5, 0, 1]], dtype=int).reshape((1, 3, 3))
Is it worth using a batch size > 1 here, to ensure that we don't accidentally refactor the code to do something like outputs = tf.reshape(outputs, [1, -1]) instead of ... [tf.shape(outputs)[0], -1] ...?
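A quick NumPy sketch of why the leading dimension must be read from the tensor itself rather than hard-coded (illustrative; tf.reshape behaves analogously):

```python
import numpy as np

# With batch_size > 1, hard-coding a leading 1 in the reshape would
# merge all graphs into a single row; reading the batch dimension from
# the tensor's own shape keeps one flattened row per graph.
outputs = np.arange(12).reshape((2, 3, 2))        # batch of 2 graphs, k=3, 2 features
flat = outputs.reshape((outputs.shape[0], -1))    # analogous to tf.shape(outputs)[0]
# flat has shape (2, 6): one flattened row per graph in the batch
```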
Updated accordingly.
def test_mask():
    data = np.array([[3, 4, 0], [1, 2, -1], [5, 0, 1]], dtype=int).reshape((1, 3, 3))
    mask = np.array([[True, True, True]])
If this test is testing the mask, should this be something like:
mask = np.array([[True, True, False]]) instead of mask = np.array([[True, True, True]])? Otherwise this first case seems the same as test_sorting_truncation.
This is testing the case that the mask allows all rows to be sorted and then truncated correctly because k=2. In line 91, the last row [5, 0, 1] is correctly moved to the first row followed by [3, 4, 0] while the last element is truncated from the result. If the mask is changed as suggested, then the result will be [3, 4, 0] followed by [1, 2, -1]. Generally, I want to make sure here that if all mask values are True, sorting still works across all rows.
There is a test for mask=[True, True, False] after this one.
Makes sense, I guess I am just a bit disconcerted by the complete redundancy with the test_sorting_truncation test above, since that test does exactly the same process just with slightly different input values. My question was whether this was a typo for intending to test the combination of a non-trivial mask and truncation (and the next one is the combination of a non-trivial mask + padding), since all the other tests in this file test various combinations with a trivial mask.
On second thought, I had a look at the structure of docs/api.txt and it might be best to add it together with DGCNN.
Awesome!
This is an implementation of the SortPooling layer introduced in [1]. Note that sorting is performed using only the last column of the input tensor as stated in [1], "For convenience, we set the last graph convolution to have one channel and only used this single channel for sorting."
A related ticket (implementing DGCNN as proposed in [1]) is #1195