Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About evaluation result in Table 1 #35

Closed
fyyakaxyy opened this issue Oct 7, 2023 · 2 comments
Closed

About evaluation result in Table 1 #35

fyyakaxyy opened this issue Oct 7, 2023 · 2 comments

Comments

@fyyakaxyy
Copy link

Hello, thank you for such a great project. I have a question when reading Table 1 of the paper:
Are the evaluation data on APE and AVE shown in Table 1 the average of multiple evaluation results? If so I would like to ask how many experiments you did?
Because I changed the random number and trained the model multiple times, the results obtained each time were not as good as shown in the paper. Even if I averaged multiple experiments, I did not achieve the effect shown in the paper.

@Mathux
Copy link
Owner

Mathux commented Dec 10, 2023

Hello @fyyakaxyy,

In Table 1, it is not an average of multiple evaluation results. It corresponds to only one random generation. I actually made only one experiment. In Table 2, I am generating it 10 times and do the average (avg) or take the best.

The training of such models are not 100% deterministic so it may be normal that the results differ.

@Mathux Mathux closed this as completed Dec 10, 2023
@fyyakaxyy
Copy link
Author

Hello @fyyakaxyy,

In Table 1, it is not an average of multiple evaluation results. It corresponds to only one random generation. I actually made only one experiment. In Table 2, I am generating it 10 times and do the average (avg) or take the best.

The training of such models are not 100% deterministic so it may be normal that the results differ.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants