Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

On the inconsistency of sampling methods between training stage and inference stage #10

Open
Chenrj24 opened this issue Oct 9, 2023 · 1 comment

Comments

@Chenrj24
Copy link

Chenrj24 commented Oct 9, 2023

Hello, author. I noticed that you used oversampling when sampling fraud nodes in the training phase, requiring you to select nodes that are consistent with the fraud node label. This procedure requires the type label of the node. However, it is not possible to obtain the node type label in the inference stage, so it is impossible to oversample the fraudulent nodes, which leads to the inconsistency between the sampling method and the training stage, and the model in the training stage is more ideal. Does this not fit the logic of machine learning? Causing the trained model to be unreliable?

@AnonymousDataCodeHub
Copy link

Hello, author. I noticed that you used oversampling when sampling fraud nodes in the training phase, requiring you to select nodes that are consistent with the fraud node label. This procedure requires the type label of the node. However, it is not possible to obtain the node type label in the inference stage, so it is impossible to oversample the fraudulent nodes, which leads to the inconsistency between the sampling method and the training stage, and the model in the training stage is more ideal. Does this not fit the logic of machine learning? Causing the trained model to be unreliable?

I have the same question, so I don't think it is a fair comparision in the paper when tesing in inference stage.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants