Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove amino acid acronyms #36

Merged
merged 3 commits into from
Jul 30, 2020
Merged

Remove amino acid acronyms #36

merged 3 commits into from
Jul 30, 2020

Conversation

bgyori
Copy link
Contributor

@bgyori bgyori commented Jul 30, 2020

This PR removes all synonyms from the PubChem resource file which represent a pair of amino acids as an acronym (259 of them in total). These two-letter combinations appear very commonly in text but virtually never represent a pair of amino acids, resulting in a lot of incorrect groundings.

@MihaiSurdeanu MihaiSurdeanu merged commit 11abd90 into master Jul 30, 2020
@MihaiSurdeanu MihaiSurdeanu deleted the remove_aa_acronyms branch July 30, 2020 03:31
@MihaiSurdeanu
Copy link
Contributor

Thanks @bgyori!
Can you please the CHANGES file, so we have a log of the modifications?
Also, let me know when I should release.

@bgyori
Copy link
Contributor Author

bgyori commented Jul 31, 2020

Thanks, I'm planning to make a couple more small changes and will update the CHANGES file along with those. A release isn't strictly necessary or urgent for the time being since we can build Reach with unreleased versions of bioresources.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants