Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Case sensitive gene symbols #8

Closed
GoogleCodeExporter opened this issue May 15, 2015 · 6 comments
Closed

Case sensitive gene symbols #8

GoogleCodeExporter opened this issue May 15, 2015 · 6 comments

Comments

@GoogleCodeExporter
Copy link

When performing querying symbols seem to be case sensitive. For instance, I can 
query neighborhood of "TP53" but "tp53" says "unknown symbol". If all symbols 
are upper case, can't we just convert all input to uppercase?

Original issue reported on code.google.com by ugurdogr...@gmail.com on 28 Jun 2012 at 12:02

@GoogleCodeExporter
Copy link
Author

Human gene symbols are written in all uppercase letters but for instance, only 
the first letter is capitalized for mouse genes. If we accept all kinds of 
input and convert them into uppercase letters, this may result in misleading 
query results. For example, user can query "Per1" believing that the result 
will be based on the mouse gene whereas we will be returning human gene's 
result. To prevent confusions like these, in my opininon it is safer to 
continue with the current state. 

Original comment by mervecak...@gmail.com on 3 Jul 2012 at 9:56

@GoogleCodeExporter
Copy link
Author

Can others think of a way to improve the current way we take input? I know 
Arman implemented some validation mechanism for gene sets for the portal, 
perhaps he has something to say?

Original comment by ugurdogr...@gmail.com on 4 Jul 2012 at 7:31

@GoogleCodeExporter
Copy link
Author

Original comment by ugurdogr...@gmail.com on 4 Jul 2012 at 7:32

@GoogleCodeExporter
Copy link
Author

Not all human gene symbols are upper case. For instance 
[http://www.genenames.org/data/hgnc_data.php?hgnc_id=28678 C10orf107].

Original comment by ozgunba...@gmail.com on 5 Jul 2012 at 8:00

@GoogleCodeExporter
Copy link
Author

We are somewhat lucky on the portal side, since it is only and only human 
centric. So the gene name matching does not have to deal with the upper/lower 
case issues at all -- we convert all names to upper case, both on the query and 
the database side and match accordingly. Maybe we can make the same assumption 
since the gene->id mapping comes already from HGNC?

Original comment by arm...@gmail.com on 5 Jul 2012 at 8:26

@GoogleCodeExporter
Copy link
Author

Original comment by mervecak...@gmail.com on 30 Jul 2012 at 2:12

  • Changed state: WontFix

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant