Large number of duplicated terms #1645
Comments
Hi @d0choa, I believe this is due to the gradual replacement of Orphanet terms with Mondo terms. Each of these duplicates will eventually become an obsoleted Orphanet term with a "replaced by" link to the corresponding Mondo term. I will try to prioritise the removal of some of these in time for the July (18th) release.
Further replacement of Orphanet terms with Mondo terms #1645
The Orphanet terms have now been obsoleted and replaced with Mondo terms, which should fix this duplication as of the July release. Please let me know if it persists.
I have checked the latest release (3.44.0) and we no longer have Orphanet/MONDO duplication. Thanks @zoependlington! However, there are still 49 examples with an identical name after converting them to lowercase. Some of them, like
3 of these have already been fixed in #1698. Many others remain genuine duplications (e.g.
I will add mappings for the following:
The rest have either been taken care of (the measurement terms pointed out above) or are phenotype vs disease cases.
Added mappings for duplicated EFO Mondo terms for #1645
These mappings have now been added, so the only remaining duplicates should be between disease and phenotype terms. Please let me know if that isn't the case.
At least in 3.42 and 3.43, there are a large number of duplicated terms in EFO, mostly affecting rare diseases.
Just by lower-casing the names and looking for exact matches, there are 3036 duplicated terms (v3.42). Some of them are explained by the disease vs phenotype conundrum, but the vast majority correspond to a MONDO vs Orphanet duplication.
Some examples:
Hemophilia Orphanet:448 - hemophilia MONDO:0018660
Fragile X syndrome Orphanet:908 - fragile X syndrome MONDO:0010383
Apert syndrome Orphanet:87 - apert syndrome MONDO:0007041
...
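For reference, the check described above can be sketched roughly as follows. This is only an illustration, assuming the terms are already available as (ID, label) pairs; the function name `find_duplicate_labels` and the in-memory list are hypothetical, and this is not the script that produced the counts reported here.

```python
from collections import defaultdict


def find_duplicate_labels(terms):
    """Group term IDs by lower-cased label and keep labels shared by 2+ IDs.

    `terms` is an iterable of (term_id, label) pairs. How these pairs are
    extracted from an EFO release is left out; that step is an assumption.
    """
    ids_by_label = defaultdict(list)
    for term_id, label in terms:
        ids_by_label[label.lower()].append(term_id)
    return {label: ids for label, ids in ids_by_label.items() if len(ids) > 1}


# The three pairs quoted in this issue, plus one made-up unique term.
terms = [
    ("Orphanet:448", "Hemophilia"),
    ("MONDO:0018660", "hemophilia"),
    ("Orphanet:908", "Fragile X syndrome"),
    ("MONDO:0010383", "fragile X syndrome"),
    ("Orphanet:87", "Apert syndrome"),
    ("MONDO:0007041", "apert syndrome"),
    ("EX:0000001", "some unique term"),  # hypothetical, not a real EFO term
]

duplicates = find_duplicate_labels(terms)
for label, ids in sorted(duplicates.items()):
    print(f"{label}: {', '.join(ids)}")
```

Exact matching on the lower-cased name will also flag legitimate disease vs phenotype pairs, so the output is a list of candidates to review rather than a list of confirmed errors.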