
the evaluation on the PCBA dataset seems wrong #92

Closed
Noisyntrain opened this issue Mar 1, 2022 · 4 comments

Comments

@Noisyntrain

Hi authors,
Thank you for your great work! I noticed that the result of "take the mean of the 4 cards' APs" differs from the result of "gather all predictions and labels from the different cards and evaluate once", and the latter tends to be lower than the former. It seems that Graphormer uses the former method during evaluation. May I know whether you have validation and test results for the Graphormer model evaluated on the whole dataset at once? Thank you!
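For context, here is a minimal, self-contained sketch of what I mean (not Graphormer code; synthetic data and scikit-learn's `average_precision_score`). Because AP depends on the global ranking of scores, the mean of per-card APs generally differs from the AP computed once over the gathered predictions:

```python
# Minimal sketch (synthetic data, assumed 4-way sharding): why averaging
# per-card AP differs from computing one AP over all gathered results.
import numpy as np
from sklearn.metrics import average_precision_score

rng = np.random.default_rng(0)
labels = rng.integers(0, 2, size=400)                             # binary targets
scores = 0.5 * rng.random(400) + 0.5 * labels * rng.random(400)   # noisy scores

# Split the validation set across 4 "cards", as a distributed sampler would.
shards = [(labels[i::4], scores[i::4]) for i in range(4)]

mean_of_card_aps = np.mean([average_precision_score(y, s) for y, s in shards])
global_ap = average_precision_score(labels, scores)

print(f"mean of 4 per-card APs:   {mean_of_card_aps:.4f}")
print(f"AP over gathered results: {global_ap:.4f}")  # generally not equal
```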

Noisyntrain changed the title from "About the PCBA Dataset ap evaluation" to "the evaluation on the PCBA dataset seems wrong" on Mar 1, 2022
@zhengsx
Collaborator

zhengsx commented Mar 1, 2022

Thanks for using Graphormer.

In v1: the average precision is correctly calculated by gathering all results from the different cards. Please kindly use this for Graphormer-v1 on PCBA.

In v2: we haven't prepared a PCBA script in the examples yet, since the architecture has been modified; we plan to release the PCBA example after we search for the optimal configuration and hyper-parameters.

If this feature is urgent for you, please kindly click the thumbs-up reaction on this issue, and we will raise its priority.

@Noisyntrain
Author

Hi zhengsx, thank you for your reply. I'm a little confused now, since in the `validation_epoch_end` function of model.py the code reads
`self.log('valid_ap', loss, sync_dist=True)`. Does this mean that each card first calculates its own AP, and the mean of the 4 cards' APs is then taken as the final result?
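To illustrate what I'm asking, a hedged sketch of both patterns side by side (this is not the actual Graphormer source; it assumes PyTorch Lightning's pre-2.0 `validation_epoch_end` hook and that `validation_step` returns dicts with hypothetical `preds`/`target` keys):

```python
import torch
import pytorch_lightning as pl
from sklearn.metrics import average_precision_score

class Model(pl.LightningModule):
    # ... model definition and validation_step omitted ...

    def validation_epoch_end(self, outputs):
        preds = torch.cat([o["preds"] for o in outputs])    # this card's predictions
        target = torch.cat([o["target"] for o in outputs])  # this card's labels

        # Pattern in question: each card computes its own AP, and
        # sync_dist=True then averages that scalar across the cards.
        ap_local = average_precision_score(target.cpu().numpy(), preds.cpu().numpy())
        self.log("valid_ap", ap_local, sync_dist=True)

        # Alternative: gather predictions and labels from every card first,
        # then compute a single AP over the full validation set.
        all_preds = self.all_gather(preds).reshape(-1)
        all_target = self.all_gather(target).reshape(-1)
        ap_global = average_precision_score(
            all_target.cpu().numpy(), all_preds.cpu().numpy()
        )
        self.log("valid_ap_global", ap_global, rank_zero_only=True)
```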

@zhengsx
Collaborator

zhengsx commented Mar 1, 2022

Yes. So when we validate and test results on this dataset, we use only 1 GPU to avoid this potential issue. The logging output during training only serves as a monitor for the program.

@Noisyntrain
Author

Ok, I got it, thank you!
