-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Figure 1C? [2C?] #1831
Comments
@Antonialock this is what I did, you will need to follow similar for human.
cerevisiae I figured that, largely if the SGD curators had annotated BP root node with ND that any mappings from other sources would be to fairly high level terms. I got the list which had an ND BP root node manual and subtracted it from the 'unslimmed'- This gave me a smaller list to evaluate (119)
|
SGD total 5915 slimmed 4900(~83%) unslimmed 794+221(1015) Note, it is slightly different from |
I will rerun pombe and cerevisia tomorrow.
|
|
@Antonialock you mentioned that I hadn't done the instructions but they are above? |
This is the current list from GO:0140053 |
What slimmin tools are you using? I keep getting an error message from http://go.princeton.edu/cgi-bin/GOTermMapper maybe I'm doing something wrong? I enter the slim terms (above + multicelllar specific terms) I use the GOA_human_GAF downloaded from here: http://geneontology.org/page/download-go-annotations |
I don't think this will work because the file has Uniprot IDs... Therefore you will need to select a data option for goa_human_hgnc (this will recognise the HGNC IDs. This will seem like you are using the hgnc slim, but you aren't because you over-ride that in the advanced options. It's very confusing.... This will then use the current contents of the GO database mapped to HGNC ID set.... |
It looks like it ignores IEA and IBA annotations e.g. this gene doesn't slim is that as expected? |
you can select the evidence codes included, are they all selected? |
I can't see that human gene in the GO database...that's probably why. I didn't say this would be straightforward... you need to contact GO helpdesk for that one... |
actually you can't select evidence for the slimmer, I'm thinking of the enrichment tool. It's probably because the slimmer tool isn't aware of IBA? do you have an example of a missing IEA (this gene only seems to have IBA). If so, you will need to mail gotools and tell them to include IBA and any other codes.... |
It has |
oh I see, in amigo it only has IBA. So why does entrez show IEAs? |
Mail gotools and check which evidences (they will probably get back to you today) |
@Antonialock an alternative is to try the QuickGO slimmer. I'm pretty sure from memory that it does because Jane and I used this when we were building the generic slim. |
Well unfortunately the QuickGO slimming tool is broken. I sent them a message "Hi. I'm trying to use the slimming tool but am having multiple problems I uploaded my own set of BP terms to use as the slimming set. |
Note to self The number of human genes that we want to include is 19674 The list can be retrieved using this search: Removing the "existence:uncertain" drops the number of genes down from 20245 |
Ah I tried again and it "worked" using the QuickGO filter for human gene products. Unfortunately it looks like rubbish.... I think I worked out it is because it doesn't include "regulates" |
if you get me the cerevisiae number I can plug them in |
see comment within your comment for clarification of the human annotation numbers |
what are the 31 missing in cerevisiae @ValWood ? For now I rounded known to make to 100 |
Looks brilliant! I will check the pombe and cerevisiae numbers. |
the most recent numbers above were: SGD total 5915 slimmed 4900(~83%) unslimmed 794+221(1015) I will check them using your final slim so we use the same slim for everything. |
did you definitely use my slim with terms added? I'm sure I had slimmed things which are now not slimming? |
So I need my list + your additions for human? |
I used the list on this page as a base https://curation.pombase.org/pombase-trac/wiki/GOslims GO:0140053 GO:0000278 GO:0006810 GO:0007010 GO:0006412 GO:0007031 GO:0030437 GO:0023052 GO:0006520 GO:0032200 GO:0016074 GO:0005975 GO:0070647 GO:0007059 GO:0030163 GO:0055086 GO:0006351 GO:0006260 GO:0071554 GO:1901990 GO:0140013 GO:0065003 GO:0071941 GO:0006355 GO:0006399 GO:0042254 GO:0006457 GO:0006486 GO:0016071 GO:0007005 GO:0006310 GO:1901135 GO:0000747 GO:0006913 GO:0006091 GO:0006914 GO:0098754 GO:0016192 GO:0051186 GO:0007163 GO:0061024 GO:0006629 GO:0006281 GO:0000910 GO:0051604 GO:0007155 GO:0055085 GO:0006766 GO:0006325 GO:0016073 GO:0006915 GO:0006790 GO:0055065 GO:0140056 |
my slim list is shown above (posted 9 days ago) |
but it excludes some of the terms in my extended slim. Can you just send me your "additional" terms (otherwise i need to complare them one by one). (I want to only report a single slim in the paper so I need to just add the additioanal terms you used to my extended slim...just to ensure that nothing looks odd). |
I used the list above, and some terms I used were missing. Sorry this is getting confusing...just send me list you added to my original list.... |
GO:0022414 |
that is the exact list I was using |
I took your list, and added to it |
and removed zero annotations, e.g. flocculation? I guess some spore term, |
but if you take your exact list (which I thought I was using? but maybe not) and subtract mine, you'll see the difference? |
I wanted to use your list, but when I used it some things weren't slimming for cerevisiae and pombe. I know I needed to add some back (cell wall stuff , flocculation etc, but I wasn't sure exactly which ones you removed..... |
I'm confused. I used the slim terms you provided, then when double checking human the following don't slim? keratinization 163 |
please just send me the list of IDs that you added to my list.....that is all I need, nothing else. |
In a file plain text....no control characters or anything..... |
see the text file above? It's plain text Val. I don't understand what your problem is. Just copy and paste the IDs into the tool. GO:0032502 developmental process is in my list so that catches e.g. keratinization |
Please just send me the terms you added. I don't know what the problem is but when I paste your list into my list it doesn't work. I only want the terms you added. |
I emailed you |
Your list is pretty damn good ! We are on the same page with what to include/exclude. Most of the stuff that isn't mapped is non-specific. I enriched the "drop out" and I think we should add a few terms, but I'll open a new ticket for outstanding tasks and close this one.... |
Mock up.
I will update the pombe and the cerevisiae data.
Antonia will prepare human data and the figure
The text was updated successfully, but these errors were encountered: