Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Datasets Features Names #1

Closed
msharara1998 opened this issue Jan 30, 2023 · 3 comments
Closed

Datasets Features Names #1

msharara1998 opened this issue Jan 30, 2023 · 3 comments

Comments

@msharara1998
Copy link

Hi,
Thank you for your effort on this project.
I would like to know how can we retrieve the names of the columns (features names) in the dataset, as it is provided as a torch tensor with only numerical values. Similarly, the accounts' human/bot labels are provided as a binary vector, without the account name or IDs.

@GraphDetec
Copy link
Owner

MGTAB is a processed data set for GNNs, similar to Cora, Citeseer, and Pubmed. The features and labels of the account correspond one to one in order. Details of the features are shown in Table 10 of the paper (MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark [arXiv.2301.01123]). To protect user privacy, account name and IDs are no longer provided.

@msharara1998
Copy link
Author

Okay I'll check it, thank you.

@Keepingshtum
Copy link

Hello, I was trying to hunt down "Table 10" but it appears to be missing in the arXiv PDF - I only see tables numbered up to "6" - I did see a table about the "Details of user feature representations" - Table A5, but I'm still not sure where to check what the other features (21-788) are.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants