-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update EcoCyc with new resource information #1961
Comments
@pgaudet While I don't think it's a deal breaker, it's always desirable if all files are made available as |
@pgaudet As of today, we can no longer run pipelines with the current metadata as go-site/metadata/datasets/ecocyc.yaml Line 19 in b71d70d
I'm assuming that we need to update to #1959, or at least just the source, but I wanted to know if we could just take the whole thing. |
@lmoore207 do you have any information we may need at this time? |
Hi Suzi,
I didn’t get the notification about this issue, so thanks for letting me know. I’ve included Markus on this reply as he has been working with Pascale about the GO terms from EcoCyc. I don’t know if he is on GitHub, but I’m hoping he can provide more information regarding this.
Regards, Lisa
From: suzialeksander ***@***.***>
Date: Thursday, February 16, 2023 at 3:21 PM
To: geneontology/go-site ***@***.***>
Cc: Lisa Moore ***@***.***>, Mention ***@***.***>
Subject: [EXTERNAL] Re: [geneontology/go-site] Update EcoCyc with new resource information (Issue #1961)
@lmoore207<https://urldefense.us/v3/__https:/github.com/lmoore207__;!!Nv3xtKNH_4uope0!0sjew1_Uzyynr0oyxpk87jt6k_OK5g93ZjcF55HiKHB2O9RutAtSJJKJtEbGVQ$> do you have any information? Specifically- is there an active replacement for this GAF at EcoCyc, and if so can you provide a link?
—
Reply to this email directly, view it on GitHub<https://urldefense.us/v3/__https:/github.com/geneontology/go-site/issues/1961*issuecomment-1433873006__;Iw!!Nv3xtKNH_4uope0!0sjew1_Uzyynr0oyxpk87jt6k_OK5g93ZjcF55HiKHB2O9RutAtSJJK1M9a0uw$>, or unsubscribe<https://urldefense.us/v3/__https:/github.com/notifications/unsubscribe-auth/AEVRCZE2GDFUAKWDAKDMT5LWX2Y5VANCNFSM6AAAAAAU5JP37Y__;!!Nv3xtKNH_4uope0!0sjew1_Uzyynr0oyxpk87jt6k_OK5g93ZjcF55HiKHB2O9RutAtSJJJlmqHx8g$>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
OK, thanks for the feedback. I gzipped the ecocyc.gpi file. Available in same directory. I'll try to remember for the future. |
Hi Suzi and Lisa, From: suzialeksander @.> Date: Thursday, February 16, 2023 at 3:21 PM To: geneontology/go-site @.> Cc: Lisa Moore @.>, Mention @.> Subject: [EXTERNAL] Re: [geneontology/go-site] Update EcoCyc with new resource information (Issue #1961) @lmoore207<https://urldefense.us/v3/https:/github.com/lmoore207;!!Nv3xtKNH_4uope0!0sjew1_Uzyynr0oyxpk87jt6k_OK5g93ZjcF55HiKHB2O9RutAtSJJKJtEbGVQ$> do you have any information? Specifically- is there an active replacement for this GAF at EcoCyc, and if so can you provide a link? Suzi, yes, the plan is that the EcoCyc group will directly submit the ecocyc.gaf . One change is that the older versions from EcoliWiki seem to have converted to UniProt IDs, whereas now, EcoCyc IDs (for proteins, etc.) will be in the file. The file also includes things like RNAs, which seem to have been missing in previous files. The current experimental version of the ecocyc.gaf, for release 26.5, can be found at: |
In discussion with @pgaudet , to temporarily have ecoli data so that pipelines can tick over again, while we sort out identifiers, we've decided to use |
@pgaudet Ugh, naturally the switch to the new file rather upset some of the watchdogs we have out looking for weirdness (i.e. ecocyc as a resource suddenly disappeared), so we lost a couple of |
After talking to @pgaudet, as a temporary fix to upstream format issues, I've fixed locally
and will aim at that for the short term. |
@pgaudet Are the recent changes the final ones for this issue? |
not quite. the downloads page is not displaying a file type in the File field: [ecocyc.gaf] () @kltm has fixed this. |
Talking to @pgaudet , the current state is "good". Closing. |
I'm just documenting the history here for posterity/transparency
I assume there are processes in place at uniprot to pull manual annotations from EcoCyc (but it looks like there are no new manual annotations from EcoCyc from 2023 or 2024) |
It looks like there is nothing in place to pull the latest annotations from EcoCyc into GOA Annotations to cydC: This was recently shown to be a heme transporter
This is not in And hence not in GO Central annotations |
Do you know when this annotation was made? We can also @alexsign when was the last EcoCyc load; this is in the GOA loads, see https://ftp.ebi.ac.uk/pub/contrib/goa/GO.annotation_sources Also, we would need to know how often EcoCyc produces their GAF, but usually discrepancies are due to lags in the different release cycles. |
It was in the December 08, 2023 release
We can ask them to create a GAF with every release. Note we have no manual annotations from 2023 or 2024 I think this is just a communication issue. I don't know if @alexsign uses a github-tracked metadata file like we do to indicate the URL source for loading into Protein2GO/QuickGO but I expect this needs to be changed or we need to reinform the EcoCyc team that we are in fact using this. |
@cmungall I do use https://github.com/geneontology/go-site/blob/master/metadata/datasets/ for many import pipelines, but EcoCyc is not one of them. I use GAF and GPI data at the URLs bellow. |
@alexsign it looks like we should follow up with the EcoCyc team. Those URLs don't look like permanent URLs as they point to a user's home directory. I'm guessing that this was set up as a one-off and not updated. @k--r and @lmoore207 -- can you shine any light on this? Are these the URLs we should be using? |
Pinged Markus and Lisa by mail. |
It looks like:
Thanks everyone! |
Update metadata and upstream resource sources for EcoCyc.
Tagging @pgaudet
The text was updated successfully, but these errors were encountered: