Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Information 'note' for organism-gene terms: has_gene_template #161

nataled opened this issue Sep 27, 2019 · 0 comments


Copy link

commented Sep 27, 2019

The sentence at the top of organism-gene term pages that indicates what the page is describing (all products of geneG in organismO) currently relies on the parent term's PRO-short-label for the name of geneG. This should instead be changed to use the name given in the has_gene_template line. This will increase the number of terms with such a message by over 30,000. It could also be possible to use the current mechanism ONLY if there is no has_gene_template line for the term of interest and ONLY if the parent term is gene level. Fewer than 400 terms would be covered by that addition, so might be okay to skip that and make it only rely on the has_gene_template line.

Caveat: There are some entries that can be encoded by multiple genes. is one example. For these, we have two choices:

  1. Suppress the message in such cases, or
  2. List them out.

For the example above, the statement currently is "This page represents a class of proteins encompassing all the protein products of the VCY1 gene in human." but it would become "This page represents a class of proteins encompassing all the protein products of the VCY and VCY1B genes in human."

The term with the largest number of genes (37) is PR:P62593. The distribution is:
1 98293
2 248
3 30
4 10
5 10
6 3
7 7
8 2
9 1
10 2
12 3
13 1
14 2
16 2
17 1
18 2
19 1
24 3
37 1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
None yet
2 participants
You can’t perform that action at this time.