Questions about data sets? #1
Comments
Hello, I'm not sure I understand your question. We use standard benchmark datasets for all experiments reported in our paper. For example, Cora, Citeseer and Pubmed can all be found in Thomas Kipf's GCN repository: Thanks,
Thanks
I mean how are the files in these datasets made?
Perhaps this description can help? I'm sorry I cannot be of much more help than that; I didn't take part in preparing the files.
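For anyone else wondering about the file layout: as far as I can tell, the `ind.<dataset>.graph` files in the GCN repository are pickled dictionaries that map each node index to a list of neighbour indices, and they were written with Python 2, so Python 3 needs `encoding="latin1"` to read them. The following is only a minimal sketch under that assumption. The function names are hypothetical (they are not part of the GCN or DGI code), and dropping self-loops and duplicate edges is a choice made for this illustration, not necessarily what the original loaders do.

```python
import pickle


def load_graph_dict(path):
    """Read a pickled {node: [neighbours]} dictionary from disk.

    encoding="latin1" is needed when the pickle was written by Python 2.
    """
    with open(path, "rb") as f:
        return pickle.load(f, encoding="latin1")


def to_edge_set(graph):
    """Return undirected edges as (u, v) tuples with u < v.

    Self-loops and duplicate entries are dropped (an assumption of this sketch).
    """
    edges = set()
    for u, neighbours in graph.items():
        for v in neighbours:
            if u != v:
                edges.add((min(u, v), max(u, v)))
    return edges


def to_dense_adjacency(graph, num_nodes):
    """Build a symmetric 0/1 adjacency matrix as a list of lists."""
    adj = [[0] * num_nodes for _ in range(num_nodes)]
    for u, v in to_edge_set(graph):
        adj[u][v] = 1
        adj[v][u] = 1
    return adj


if __name__ == "__main__":
    # Toy graph in the assumed layout: node 2 lists itself, which is ignored.
    toy = {0: [1, 2], 1: [0], 2: [0, 2]}
    for row in to_dense_adjacency(toy, 3):
        print(row)
```

Going the other way, a file in this layout could be produced by building such a dictionary from an edge list and writing it with `pickle.dump`. For graphs of realistic size a sparse matrix would be preferable to the dense list of lists used here.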
Ok, thank you for your answer.
Hi Petar, Thanks
Hello, For PPI, the preprocessing code found in: https://github.com/PetarV-/GAT/blob/master/utils/process_ppi.py should be enough to get you started. For Reddit, we were unable to get the PyTorch version of GraphSAGE to cooperate, and thus we used the TensorFlow version: https://github.com/williamleif/GraphSAGE as a starting point, and modified it to support DGI and load Reddit. Currently there are no plans to release this modified codebase. Thanks,
Thanks a lot!
Hello, how do you make the data set under the data folder?