
Number of nodes at each layer #13

Closed
tamthuc1995 opened this issue Nov 3, 2017 · 4 comments
@tamthuc1995

tamthuc1995 commented Nov 3, 2017

In the unsupervised model, I notice that it uses the second layer's num_samples as the number of neighbor nodes for the first layer. This leads to the sample shapes for the next aggregate function becoming:

  • batch_size x [1, 10, 25*10] (which differs from the paper, where the first layer samples 25 neighbor nodes and the second layer samples 10)

These lines:

Thank you, I learned a lot from the code

@bkj
Contributor

bkj commented Nov 3, 2017

If I understand your question correctly, the way it works is:

  • sample a "center" node, n0
  • sample 10 1-hop neighbors, n1
  • sample 25 1-hop neighbors of each n1, yielding 250 2-hop neighbors of n0

Does that make sense?
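The three steps above can be sketched in a few lines of Python. This is a minimal illustration of layer-wise neighbor sampling, not the repository's actual code; the adjacency dict and the `sample_neighbors` / `sample_layers` helpers are hypothetical names invented for this sketch.

```python
import random

def sample_neighbors(adj, nodes, num_samples):
    """For each node, sample num_samples of its neighbors (with replacement)."""
    return [random.choice(adj[n]) for n in nodes for _ in range(num_samples)]

def sample_layers(adj, center, fanouts):
    """Layer-wise sampling: fanouts[0] 1-hop neighbors of the center node,
    then fanouts[1] neighbors of each of those, and so on."""
    layers = [[center]]
    for k in fanouts:
        layers.append(sample_neighbors(adj, layers[-1], k))
    return layers

# Toy graph: 6 nodes, each connected to every other node.
adj = {i: [j for j in range(6) if j != i] for i in range(6)}

# Sample 10 1-hop neighbors, then 25 neighbors of each of those.
layers = sample_layers(adj, center=0, fanouts=[10, 25])
print([len(l) for l in layers])  # [1, 10, 250]
```

Per batch element this yields exactly the [1, 10, 25*10] shapes mentioned above: the 250 nodes in the last layer are the 2-hop neighbors of the center node.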

@tamthuc1995
Author

tamthuc1995 commented Nov 4, 2017

Yes, and it seems to me that this differs from what I read in the paper.
In addition, I'd also like to ask about those parameters' effects on the model when max_degree is ~1000-2000.
Last thing: do you think a different neighbor-selection method would improve the model? Since not all nodes contribute equally to their neighbors, maybe by "adding features of edges".

@williamleif
Owner

Hi,

Yes, the sampling can be a bit counter-intuitive, and we tried to clear this up in the newest arXiv version. The description by @bkj is correct. Hopefully the new arXiv minibatch description is clearer, as it directly matches @bkj's description. Apologies for the confusion!

In general, we (and other researchers) have found that increasing the neighborhood sample size improves performance but that the performance quickly plateaus by around 25-50 neighbors. This general trend holds for a number of datasets and has been replicated in a couple ICLR submissions.

Of course, in a network with extremely high average degree or an atypical degree distribution things might be different, but right now this is an empirical question, and so far we have found that the model can achieve near-optimal performance with <50 sampled neighbors across all datasets tested. In future work, we hope to characterize this tradeoff more formally with regard to graph properties such as density and clustering.

@tamthuc1995
Author

Thank you for your answers. I'm really fascinated by your work and hope to see the new arXiv version.
