Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About Data Preprocessing #15

Closed
empty-id opened this issue Dec 20, 2020 · 2 comments
Closed

About Data Preprocessing #15

empty-id opened this issue Dec 20, 2020 · 2 comments

Comments

@empty-id
Copy link

empty-id commented Dec 20, 2020

Hi, I have a question about the preprocessing in DBLP.

As I see that in the preprocess_DBLP.ipynb, you use data/raw/DBLP/DBLP4057_GAT_with_idx.mat from HAN.

Where could I find that file? And how to promise the corresponding index of that matrix is same with your data accordingly?

@cynricfu
Copy link
Owner

cynricfu commented Dec 23, 2020

DBLP4057_GAT_with_idx.mat is obtained from HAN's official repo. Specifically, this Baidu Wangpan link with access code 6b3h.

As of the index correspondence, I have verified it by comparing the labels from the DBLP4057_GAT_with_idx.mat file and the labels from the author_label.txt file. They are the same.

@empty-id
Copy link
Author

Yeah, I get it. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants