Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds a shortest_name_length value to the synonyms file #159

Closed
wants to merge 2 commits into from

Conversation

gaurav
Copy link
Collaborator

@gaurav gaurav commented Jun 27, 2023

This restores the previously used sorting method as per TranslatorSRI/NameResolution#65 (comment).

WIP

@gaurav gaurav mentioned this pull request Jun 27, 2023
@gaurav
Copy link
Collaborator Author

gaurav commented Jul 24, 2023

This code has now been moved into PR #166, and so is not needed here. Closing.

@gaurav gaurav closed this Jul 24, 2023
gaurav added a commit that referenced this pull request Jul 25, 2023
Something in the NCBIGene synonym generation code has resulted in just about every NCBIGene being assigned the synonym `-` -- this is because `gene_info.gz` uses `-` to indicate an empty value, but our current code assumes that is a synonym. Fixed by adding a helper function to replace `-` with the empty string, and explicitly ignoring any synonym that is an empty string.

In addition to that fix, this PR:
1. Replaces the `ftp://` URL used to download UniProtKB with an `https://` URL to speed up the download a little.
2. Cleans up some code around synonym list generation.
3. Restores the `shortest_name_length` calculation that we used previously to sort NameRes results (from PR #159).
4. Increases the space stored in Babel file systems (from PR #162).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant