Information Content varies with version #156
Hello @huhrichard,

Thank you for your interest in GOATOOLS and for taking the time to write to us.

Yes, you are correct: the information content values have changed. The change is in the calculation of the "aspect counts" (total counts). Our original code incorrectly calculated the aspect counts by counting the same thing multiple times, resulting in large aspect values. This error caused the information content to be scaled lower than it should be. After adding more tests, which now compare our calculations to those found in other open-source code, we found and fixed the error.

Our new semantic similarity test DAGs and annotations are found here:

We are now comparing our results to those described by Yang[1] and implemented in a

[1] Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty
[2] GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology

Thank you again for being alert and asking about the changes. Thank you for using GOATOOLS.
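For readers who want to see where the aspect count enters the calculation, here is a minimal sketch of a Resnik-style information content, IC(t) = -log(count(t) / aspect_total), on a toy DAG. It is not the GOATOOLS implementation: the DAG, the annotations, the helper names (`ancestors`, `term_counts`, `information_content`), the use of the natural log, and taking the root's count as the aspect total are all assumptions made for illustration. The point it shows is that each gene is counted once per term even when a term can be reached by more than one path, and that the aspect total is the denominator of every frequency, so any change to it rescales every IC value in that namespace.

```python
import math
from collections import Counter

# Hypothetical toy DAG: term -> set of direct parents.
PARENTS = {
    "root": set(),
    "A": {"root"},
    "B": {"root"},
    "C": {"A", "B"},  # reachable from the root by two paths
}

# Hypothetical annotations: gene -> directly annotated terms.
ANNOTS = {
    "gene1": {"C"},
    "gene2": {"A"},
    "gene3": {"B"},
    "gene4": {"C", "A"},
}


def ancestors(term):
    """Return the term plus all of its ancestors, each exactly once."""
    seen = set()
    stack = [term]
    while stack:
        cur = stack.pop()
        if cur not in seen:
            seen.add(cur)
            stack.extend(PARENTS[cur])
    return seen


def term_counts(annots):
    """Count, for every term, the genes annotated to it or to a descendant."""
    counts = Counter()
    for terms in annots.values():
        covered = set()
        for term in terms:
            covered |= ancestors(term)
        # A set guarantees each gene adds at most 1 to any term.
        counts.update(covered)
    return counts


def information_content(term, counts, total):
    """Resnik-style IC: negative natural log of the term's frequency."""
    return -math.log(counts[term] / total)


counts = term_counts(ANNOTS)
total = counts["root"]  # assumption: aspect total is the count at the root
ic = {term: information_content(term, counts, total) for term in PARENTS}

for term in sorted(ic):
    print(term, counts[term], round(ic[term], 3))
```

Collecting the ancestors in a set before counting is the design choice that matters here: summing over paths or over terms instead would count the same gene more than once.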
FYI: This fix was implemented with this hash on goatools/semantic.py. Here is the diff: 839ad71#diff-688e6e2aa684f60dd85887a80cf6c258
Previously I used goatools version 0.99, where the information content (IC) for a list of GO terms was very low (nearly all < 2). After updating to version 1.02, the IC is now around 2-4. Is there any reason for this? I used the same script and the same obo file to compute the IC.
Thanks