Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the format of the data set? #14

Closed
yangxia605 opened this issue Sep 25, 2020 · 2 comments
Closed

About the format of the data set? #14

yangxia605 opened this issue Sep 25, 2020 · 2 comments

Comments

@yangxia605
Copy link

hi @acbull
my case is that i should use a sample of the training data sets to build a graph but not all of the training data sets.
that is to say: Use the internal elements of the sample to construct a heterogeneous graph.
I was wonder if i can use GPT-GNN to fine-tuning it and then Do a classification task in the downstream ?

thanks!!!

@acbull
Copy link
Owner

acbull commented Sep 26, 2020

Sure, pre-training is very suitable for your case.

About data format, you can refer to

class Graph():
to check our data structure. Or you can use your own data structure, and adapt our code to your setting.

To construct such a graph, you can refer to prepreocess_*.py as a reference.

@yangxia605
Copy link
Author

thank you!!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants