
About dataset #9

Closed
Syzseisus opened this issue May 31, 2023 · 3 comments
Comments

@Syzseisus

Hi. First of all, I really appreciate your wonderful work and code.

I'm opening this issue to ask a question:

How did you define the node features of each dataset?

Sincerely,
Wooseong Cho.

@GRAPH-0
Owner

GRAPH-0 commented Jun 1, 2023

Hi @Syzseisus .
Thanks for your interest!
We extract node structural features using node2vec and node semantic features using Transformer-XL.
Refer to the paper:
[screenshot of the relevant section of the paper]

@Syzseisus
Author

Thank you for your quick response.

I've already reviewed the provided resources.
However, I'd like a more detailed understanding of the configuration settings for the
node2vec and Transformer-XL models.
Could you please share the specific parameters?

Alternatively, for FB15k, I would appreciate it if you could confirm whether the order of nodes in the pickle file provided on Google Drive is the same as in dgl.
If you are unsure, please let me know the order,
or you can check it by following these steps:

  1. For the order of nodes used in dgl, you can access the raw tgz file from here
    and find the order in the entities.dict file.
  2. However, please note that dgl uses a hash-like ID as the node entity name.
    To find the real names of the nodes, you can refer to the entities2wikidata.json file.
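The two steps above can be sketched in a few lines of Python. The file names come from this thread, but their exact layouts are assumptions here: entities.dict is taken to be `id<TAB>mid` per line, and entities2wikidata.json to map each MID to an object with a "label" field; the sample contents are hypothetical stand-ins. Adjust to the real files.

```python
# Sketch: map dgl node ids to human-readable names (file formats assumed).
import io
import json

# Hypothetical stand-ins for the real entities.dict / entities2wikidata.json.
entities_dict = io.StringIO("0\t/m/027rn\n1\t/m/06cx9\n")
entities2wikidata = json.loads(
    '{"/m/027rn": {"label": "Dominican Republic"},'
    ' "/m/06cx9": {"label": "republic"}}'
)

# Step 1: recover the node order, i.e. node id -> MID.
id_to_mid = {}
for line in entities_dict:
    node_id, mid = line.rstrip("\n").split("\t")
    id_to_mid[int(node_id)] = mid

# Step 2: resolve each MID to its real name via entities2wikidata.json.
id_to_name = {i: entities2wikidata[mid]["label"] for i, mid in id_to_mid.items()}
print(id_to_name[0])  # -> Dominican Republic
```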

Thank you for your assistance.

Sincerely,
Wooseong Cho.

@GRAPH-0
Copy link
Owner

GRAPH-0 commented Jun 4, 2023

Hi,

  1. For node embedding, you can refer to issues #3 (Ask for the code of word embedding) and #6 (Ask for the problem of Transformer-XL to get text embedding).

The node2vec settings (using torch_geometric.nn.Node2Vec):

  • FB and tmdb:

import torch
from torch_geometric.nn import Node2Vec

model = Node2Vec(data.edge_index, embedding_dim=64, walk_length=20,
                 context_size=10, walks_per_node=10, num_negative_samples=1,
                 p=1, q=1, sparse=True).cuda()
loader = model.loader(batch_size=128, shuffle=True, num_workers=4)
optimizer = torch.optim.SparseAdam(list(model.parameters()), lr=0.01)

  • imdb (same as above, but with embedding_dim=128):

model = Node2Vec(data.edge_index, embedding_dim=128, walk_length=20,
                 context_size=10, walks_per_node=10, num_negative_samples=1,
                 p=1, q=1, sparse=True).cuda()
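For readers unfamiliar with these parameters, here is a toy, dependency-free illustration of what walk_length and context_size control (this is not the authors' code, and the tiny graph is made up): with p = q = 1 the walk reduces to a plain uniform random walk, walk_length is the number of nodes per walk, and context_size is the window from which (center, context) skip-gram training pairs are drawn.

```python
# Toy illustration of node2vec's walk_length / context_size (p = q = 1).
import random

random.seed(0)

# Hypothetical tiny graph as an adjacency list.
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}

def uniform_walk(start, walk_length):
    # With p = q = 1 the biased walk degenerates to a uniform random walk.
    walk = [start]
    for _ in range(walk_length - 1):
        walk.append(random.choice(adj[walk[-1]]))
    return walk

def context_pairs(walk, context_size):
    # Slide a window of `context_size` nodes over the walk; pair the first
    # node of each window with the rest, mirroring how skip-gram pairs are
    # formed from walk windows.
    pairs = []
    for i in range(len(walk) - context_size + 1):
        window = walk[i:i + context_size]
        pairs.extend((window[0], w) for w in window[1:])
    return pairs

walk = uniform_walk(0, walk_length=20)
pairs = context_pairs(walk, context_size=10)
print(len(walk), len(pairs))  # 20 walk steps, (20 - 10 + 1) * 9 = 99 pairs
```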
  2. Sorry, I failed to open the tgz file from dgl to check the order. But you can get the correspondence between node id and MID from datasets/FB15k/fb15k_description.tsv. I hope this helps!
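The order check itself is simple once both files are in hand. A minimal sketch, assuming fb15k_description.tsv has the MID as its first tab-separated field (with line order giving the node id) and entities.dict has `id<TAB>mid` lines; both layouts are guesses from this thread, and the sample contents below are hypothetical stand-ins:

```python
# Sketch: confirm the MID order from fb15k_description.tsv matches dgl's
# entities.dict order (file layouts assumed; adjust to the real files).

def mid_order(tsv_lines):
    # Take the first tab-separated field of each non-empty line as the MID.
    return [line.split("\t", 1)[0] for line in tsv_lines if line.strip()]

# Hypothetical stand-ins for the two files' contents.
description_tsv = ["/m/027rn\tDominican Republic ...", "/m/06cx9\trepublic ..."]
entities_dict = ["0\t/m/027rn", "1\t/m/06cx9"]

order_a = mid_order(description_tsv)
order_b = [line.split("\t", 1)[1] for line in entities_dict]

print("same order:", order_a == order_b)  # -> same order: True
```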

Sincerely,
Han

@GRAPH-0 GRAPH-0 closed this as completed Jun 13, 2023