Questions about data sets? #1

Closed
linzhi123 opened this issue Jan 9, 2019 · 8 comments

@linzhi123

Hello, how did you create the datasets in the data folder?

@PetarV-
Owner

PetarV- commented Jan 9, 2019

Hello,

I'm not sure I understand your question. We use standard benchmark datasets for all experiments reported in our paper. For example, Cora, Citeseer and Pubmed can all be found in Thomas Kipf's GCN repository:
https://github.com/tkipf/gcn/tree/master/gcn/data

Thanks,
Petar

@linzhi123
Author

Thanks

@linzhi123
Author

I mean how are the files in these datasets made?
@PetarV-

@PetarV-
Owner

PetarV- commented Jan 10, 2019

Perhaps this description can help?
https://github.com/kimiyoung/planetoid/blob/master/README.md

I'm sorry I cannot be of much more help than that; I didn't take part in preparing the files.
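
To make the file layout concrete, here is a minimal sketch of how two of the Planetoid-style files could be written and read back. It assumes, based on my reading of the linked README and the GCN loader, that `ind.<name>.graph` is a pickled `defaultdict` mapping each node index to its neighbor indices, and that `ind.<name>.test.index` is a plain-text file with one test node index per line. The function names, the `toy` dataset name, and the edge list are hypothetical; the feature and label files (sparse matrices and one-hot arrays) are not covered, and this is not the code the original authors used to prepare the files.

```python
import pickle
import tempfile
from collections import defaultdict
from pathlib import Path


def write_planetoid_graph(edges, test_ids, out_dir, name="toy"):
    """Write ind.<name>.graph and ind.<name>.test.index (assumed layout)."""
    # Build an undirected adjacency list: node index -> list of neighbors.
    graph = defaultdict(list)
    for u, v in edges:
        graph[u].append(v)
        graph[v].append(u)

    out_dir = Path(out_dir)
    # The graph file is assumed to be a pickled defaultdict.
    with open(out_dir / f"ind.{name}.graph", "wb") as f:
        pickle.dump(graph, f)
    # The test index file is assumed to be plain text, one index per line.
    with open(out_dir / f"ind.{name}.test.index", "w") as f:
        f.write("\n".join(str(i) for i in test_ids) + "\n")


def read_planetoid_graph(in_dir, name="toy"):
    """Read back the two files written above."""
    in_dir = Path(in_dir)
    with open(in_dir / f"ind.{name}.graph", "rb") as f:
        # encoding="latin1" lets Python 3 read pickles written by Python 2,
        # and is harmless for pickles written by Python 3.
        graph = pickle.load(f, encoding="latin1")
    with open(in_dir / f"ind.{name}.test.index") as f:
        test_ids = [int(line) for line in f if line.strip()]
    return graph, test_ids


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        write_planetoid_graph([(0, 1), (1, 2)], test_ids=[2], out_dir=tmp)
        graph, test_ids = read_planetoid_graph(tmp)
        print(dict(graph), test_ids)
```

The same read function should work on the real `ind.cora.graph` and `ind.cora.test.index` files if the assumed layout holds, by passing the data directory and `name="cora"`.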

@linzhi123
Author

Ok, thank you for your answer.

PetarV- closed this as completed Jan 10, 2019

@svjan5

svjan5 commented Jan 16, 2019

Hi Petar,
Can you provide Reddit and PPI datasets in the format used in the code?

Thanks

@PetarV-
Owner

PetarV- commented Jan 18, 2019

Hello,

For PPI, the preprocessing code found in:

https://github.com/PetarV-/GAT/blob/master/utils/process_ppi.py

should be enough to get you started.

For Reddit, we were unable to get the PyTorch version of GraphSAGE to cooperate, and thus we used the TensorFlow version:

https://github.com/williamleif/GraphSAGE

as a starting point, and modified it to support DGI and load Reddit. Currently there are no plans to release this modified codebase.

Thanks,
Petar

@svjan5

svjan5 commented Jan 18, 2019

Thanks a lot!
