Hi @juho-lee,

First of all, thanks for making this code publicly available. It's very useful.
One question, though. I am looking at your implementation of Zaheer et al.'s network ("Deep Sets"). In that paper, the model has the form rho(sum_x phi(x)), where the sum runs over each element of the set (I believe you call this a set pooling method in your paper).
In your DeepSet class, we instead have a succession of Linear -> ReLU -> Linear -> ReLU layers that operate on the entire input set, which is then pooled at the end.
Could you explain a little about why these are equivalent?
Hi,
Linear layers act on individual elements (they only transform the last dimension), so stacking them is equivalent to applying the same network phi(x) to each element of the set independently. Linear layers in PyTorch also support batched inputs, so the same holds for batched sets, i.e. tensors of shape (batch_size, num_elements, dim).
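To make this concrete, here is a minimal sketch (not the exact DeepSet class from this repo; the phi/rho modules and dimensions are illustrative) showing that applying stacked Linear/ReLU layers to a (batch_size, num_elements, dim) tensor is the same as applying phi to each element separately, and that sum-pooling afterwards gives rho(sum_x phi(x)):

```python
import torch
import torch.nn as nn

dim_in, dim_hidden, dim_out = 4, 8, 3

# Element-wise encoder phi and set-level decoder rho (illustrative sizes).
phi = nn.Sequential(nn.Linear(dim_in, dim_hidden), nn.ReLU(),
                    nn.Linear(dim_hidden, dim_hidden), nn.ReLU())
rho = nn.Linear(dim_hidden, dim_out)

X = torch.randn(2, 5, dim_in)  # batch_size=2, num_elements=5, dim=4

# nn.Linear only transforms the last dimension, so phi(X) applies the same
# network to every element of every set independently.
batched = phi(X)
looped = torch.stack([torch.stack([phi(X[b, i]) for i in range(X.size(1))])
                      for b in range(X.size(0))])
print(torch.allclose(batched, looped))  # True

# Sum-pooling over the element axis, then applying rho: rho(sum_x phi(x)).
out = rho(phi(X).sum(dim=1))
print(out.shape)  # torch.Size([2, 3])
```

So the pooling at the end is exactly the sum over the set, and everything before it plays the role of phi applied per element.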