-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
issue with ImageMol performance #16
Comments
Hi, Soo Thank you first for your attention to our papers. In some cases, AUC and accuracy may conflict. This is because the accuracy is calculated based on the default cutoff value (such as 0.5), while the AUC is calculated based on all possible cutoff values, which should be more robust. I am not sure whether your sample has class imbalance. If so, you can try to increase the weight of the minority class. You can also provide me with the predicted probabilities and corresponding ground truth pickle or numpy files so that I can do better analysis. |
Thank you for the prompt reply. Here are the ground truth and the predicted probabilities for the test data: df_pro_imagemol_classification.csv Also, do you have any idea why the predicted values for the regression model for the test data are the same? Here is the predicted values for the regression model: I really appreciate your help. Best Regards |
Hi, Soo The following figure is the AUC curve I drew based on the provided
so, I use 0.001 as classification threshhold:
I can get 0.7685688405797102 accuracy.
Anyway, in cases of extremely imbalanced samples, I recommend reporting the AUC metric since it is a more comprehensive metric and better suited to imbalanced data. In addition, I'm not sure that why the predicted values for the regression model for the test data are the same. But I'm guessing that your regression labels may have a large gap, causing the model to collapse during training. I suggest you can use some normalization method on the labels. |
Thank you for the detailed response. I really appreciate your help.
Thank you |
Sorry for the late reply.
|
Hello, Thanks a lot |
Hello,
training_class.out.txt
train_reg.out.txt
I am reaching out to seek help regarding the ImageMol package. I have used ImageMol to predict the antibiotic activity of compounds both for continuous and binary data. However, I am facing some issues and I am hoping you or one member of your group could help me in this regard.
For classification, the log file shows that the AUC is 0.725, however, I noticed all the probabilities are close to zero and therefore all the y_pred are zero. I was wondering if you have any suggestions to solve this problem. I am not sure why all the y_pred are zero with this high AUC.
For regression, all the y_score have the same value.
I really appreciate your help in this matter.
For your reference, I am attaching the log files for both classification and regression. Could you see if there is anything unusual in the log files?
Best Regards
Soo
The text was updated successfully, but these errors were encountered: