Mismatch in Number of Nodes for Friendster Dataset #9

snigdhas1612 · 2024-05-30T15:03:15Z

While running the run_allocate script for the Friendster dataset, the output indicates a discrepancy in the number of nodes. The script reports 124 million nodes, whereas the actual number of nodes should be 65 million, as documented on the corresponding dataset website

Here's the relevant output from the run_allocate script:

Graph(num_nodes=124836180, num_edges=1806067135, ndata_schemes={} edata_schemes={})

Initially I inspected the dataset to observe that they had listed nodes ranging from 1 to 124M, but the unique count of these nodes was only 65M. Therefore, I renumbered the nodes. Even after the pre-processing step, run_allocate still outputs 124M as num_nodes.

I would require some help in verifying the functional correctness of the code in this case. Any pointers would be appreciated.

The text was updated successfully, but these errors were encountered:

initzhang · 2024-08-23T03:12:29Z

Hi @snigdhas1612 , the problem should be the isolated node, maybe you can have a look at this thread: dmlc/dgl#3967

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mismatch in Number of Nodes for Friendster Dataset #9

Mismatch in Number of Nodes for Friendster Dataset #9

snigdhas1612 commented May 30, 2024

initzhang commented Aug 23, 2024 •

edited

Loading

Mismatch in Number of Nodes for Friendster Dataset #9

Mismatch in Number of Nodes for Friendster Dataset #9

Comments

snigdhas1612 commented May 30, 2024

initzhang commented Aug 23, 2024 • edited Loading

initzhang commented Aug 23, 2024 •

edited

Loading