-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Low-frequency labels hard to predict #9
Comments
Hi Muhan, Thanks for taking interest in the work. Best, |
Hi Edward, Thank you for your reply. Sorry for the confusions in my question. Some quick answers to your questions are as follows: -Yes, I did use MIMIC-III. To clarify my question, I saw that my optimization now focused more on high-frequency labels, yet hardly learned to predict extremely rare labels. In fact, I got an overall accuracy@20 = 0.71 (ensemble all labels in one group) which is even higher than your GRAM+ results (0.6267 for [80-100], thus the overall accuracy@20 must be lower than 0.6267). Since I used your code, but got higher results than yours, I was wondering whether my understanding of the grouping scheme or the accuracy calculations, etc. might be wrong, which led to different calculations of the results. Or should I use different validation criterion like selecting the epoch with the highest [0-20] validation accuracy instead of the validation cost? Hope my issues do not bother you much. Of course, I'd like to set up a phone call anytime at your convenience if things are too complicated to explain here. You may send me your times to muhan@wustl.edu. Many thanks, |
Hi,Muhan: Many thanks, |
Hello Dr. Choi,
Thanks for the nice work. I generated the CCS single-level labels as the target, and used your code to predict them. All hyperparameters are set according to the appendix. I group labels to five groups according to their frequencies (first rank all labels by their frequencies, and then equally divide them into five groups). But my results have some differences from those in the paper. I got [0, 0.01835, 0.0811, 0.3042, 0.8263] accuracies for the five groups, respectively. I noticed that I got higher accuracies for high-frequency labels, but cannot match the paper's accuracies for labels with frequency percentile [0-60]. Is there anything I have done wrongly? Furthermore, I found the frequency of labels in the first group (rarest) is only 0.16% out of all labels' frequencies (163/96677). I am wondering is this the correct way to divide to five groups?
Thanks,
Muhan
The text was updated successfully, but these errors were encountered: