-
Notifications
You must be signed in to change notification settings - Fork 252
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kegg annotation #99
Comments
> bitr_kegg("K00844", "kegg", "Path", "ko")
kegg Path
1 K00844 ko00010
2 K00844 ko00051
3 K00844 ko00052
4 K00844 ko00500
5 K00844 ko00520
6 K00844 ko00521
7 K00844 ko00524
8 K00844 ko01100
9 K00844 ko01110
10 K00844 ko01120
11 K00844 ko01130
12 K00844 ko01200
13 K00844 ko04066
14 K00844 ko04910
15 K00844 ko04930
16 K00844 ko04973
17 K00844 ko05230 |
Yeah,sorry,It's really the K number,Since I want to obtain the pathway according the K number,such like this ,did have any methods to achieve it ? Thanks |
just write a > bitr_kegg("K00799", "kegg", "Path", "ko") -> x
> ko2name(x$Path) -> y
> merge(x, y, by.x='Path', by.y='ko')
Path kegg name
1 ko00480 K00799 Glutathione metabolism
2 ko00980 K00799 Metabolism of xenobiotics by cytochrome P450
3 ko00982 K00799 Drug metabolism - cytochrome P450
4 ko01524 K00799 Platinum drug resistance
5 ko05204 K00799 Chemical carcinogenesis
6 ko05418 K00799 Fluid shear stress and atherosclerosis |
well,unfortunately,It appears some error while run the |
see the Prerequisites session, https://github.com/GuangchuangYu/clusterProfiler/issues/new. |
BTW: you can use |
Thanks,it's works well ,the software is so good !!! |
Dear GuangChuang: |
BIG-BIG-GREAT THANK YOU!!!! |
Hi @GuangchuangYu I am working with a non-model organism. First I used KAAS to annotate the genome with K numbers, then I got a list of genes vs K numbers. In order to do KEGG pathway analysis, I need to translate K values to ko numbers. In this case, should I use enrichKEGG or enricher? If I use enricher, I need to translate all K numbers to pathways first, and eventually get a list of pathways2genes as the TERM2GENE, right? If I use enrichKEGG, according to your reply I can set organism='ko'. How this can be achieved?There is no way for me to input the gene vs K number list right? I appreciate it if you can clarify this. Thank you so much |
Hi! You solved your problem? I'm doing a kegg enrichment analysis, also with a non-model organism. I used the enrichKEGG( ) function but a get this error message: ca_kegg <- enrichKEGG(ca_list, organism = 'ko', keyType = 'kegg', universe = BBRB_KEGG, pAdjustMethod = "BH") In this case ca_list is my list of DE gene ID's and BBRB_KEGG is a dataframe of two columns with gene ID's and KEGG annotations that I get with Trinotate. How could I solve this problem and what means that "gene can be mapped"? |
Hi, I guess you used K number instead of ko number. I am not familiar with Trinotate but can you check the output from Trinotate? There should be another column with ko number (koxxxxx). Use that number instead of Kxxxxx |
Actually I'm using Ko number (Ko:xxxx) but I removed the prefix "KO:" of the KEGG terms, that's why it looks like that. |
Actually the enrichKEGG with organism='ko' never worked in my case. So I switched to the enricher function (Set everything manually).
Gene_list is my genes of interest. background is equivalent to your BBRB_KEGG but with ko numbers as the first column, kegg2name is a dataframe with 2 columns mapping ko numbers to the corresponding descriptions (This can be skipped if you want to get the enriched ko number rather than the textual descriptions). |
Ohh I can see. There's a way to get the description for every Ko number? |
Just a reminder, I feel that you are still using the K numbers instead of the KEGG pathways. KEGG KO (ko:Kxxxxx) is just the enzyme in the pathway. Normally you get one such KO per gene. Here we actually want to use the pathway id (koxxxxx (without ':') or mapxxxxx, the 'xxxxx' in ko and map are the same. One KEGG KO can be mapped to zero or multiple pathways. So you are supposed to get zero or multiple koxxxxx or mapxxxxx per gene. I used eggnog for annotation so I get both KO and pathway columns, do check your annotation to see if you get such pathway ids (koxxxxx, or mapxxxxx), this is what you want. I am not sure if your 'Ko:xxxx' is KEGG KO or pathway. If you got multiple terms per gene then you can directly use that since I assume thats already pathway ids. If you only got one such term per gene, more possibly it's just the K number. Once u get the pathway id, install KEGG.db package, you can get a list of all pathway numbers to names using KEGGPATHID2NAME. The pathway numbers are the xxxxxx in your pathway ids (koxxxxxx or mapxxxxx), NOT KEGG KO ids (ko:Kxxxxx). If you only got the K numbers (Kxxxxx) map it to the pathways using the method described previously in this post by @GuangchuangYu Hope this is helpful. It did take me a long time to figure all these out... For more info on how KO and pathways are in different formats, check: |
I can imagine it, this is a bit confusing. I'll check that, thank you very much for all the info. |
Thank you so much for taking the time to give me all this information, was very helpful. My analysis is already done! I Will share the information in case that other person have the same problem! n_n |
@Stepmata Glad to hear that :) |
Hi. I'm working with a non-model specie and Trinotate. Please, could you share me your solution about the setting KEGG Trinotate output to use with enricher? |
Hi! To use enricher function with my KEGG annotation I firts get the patways ID (ko number) mapping all my KEGG terms (k number) to KEGG data base using bitr_kegg function. Once a I had the pathways ID I get the pathways name using ko2name function. This two functions are from KEGG.db R package. |
Hi!!. My analysis is already done. Thank you so much for your help and your time!!. 👍 |
That's nice!! Your welcome!! n_n |
Hi i did the ORA analysis from my organism data with the k numbers and it worked but, it reported back human diseases pathways and i'm working with Physcomitrella (moss). I wanted to know if there's any possibility that i could get the species-specific IDs for the ORA analysis from the K numbers or other way do obtain them. Thanks. |
Hi, I have difficulties to generate KEEG into the Trinotate annotation file? What software did you use to generate the kegg annotation? Can you help for this? Many thanks! |
Hi, I used Trinotate to generate all my KEGG annotations. What kind of problem do you have running Trinotate? |
Thanks for the quick reply. Trinotate does not have kegg annotation by default. So I assume you generate the kegg file by yourself. So what kind of software you run to have this file. Sorry for the silly question. |
Well, I got the k number (kegg term) in the Trinotate output file, then I
used that information to made the mapping and get the ko number (pathway
ID).
El mar., 27 de abr. de 2021 3:29 PM, tobytaogla ***@***.***>
escribió:
… Hi. I'm working with a non-model specie and Trinotate. Please, could you
share me your solution about the setting KEGG Trinotate output to use with
enricher?
Hi! To use enricher function with my KEGG annotation I firts get the
patways ID (ko number) mapping all my KEGG terms (k number) to KEGG data
base using bitr_kegg function. Once a I had the pathways ID I get the
pathways name using ko2name function. This two functions are from KEGG.db R
package.
Now to run enricher a made two dataframes of two columns, one dataframe
that I called "term2gene" with ko numbers in first column and annotated
genes ID in the second one. The other dataframe that I called "term2name"
had ko numbers in first column and pathways name in the second one.
Also to apply enricher you have to create a vector with all your
differentially expressed genes ID, and that's all, you need all this
information to run your KEGG enrichment test! n_n
Hi!!. My analysis is already done. Thank you so much for your help and
your time!!. +1
Hi, I have difficulties to generate KEEG into the Trinotate annotation
file? What software did you use to generate the kegg annotation? Can you
help for this? Many thanks!
Hi, I used Trinotate to generate all my KEGG annotations. What kind of
problem do you have running Trinotate?
Thanks for the quick reply. Trinotate does not have kegg annotation by
default. So I assume you generate the kegg file by yourself. So what kind
of software you run to have this file. Sorry for the silly question.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#99 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AO3NFLLMZTC45EWTGTOJVMDTK4UD7ANCNFSM4DWXSTHA>
.
|
Dear author:
Since the clusterProfiler is a very useful tools for GO and Kegg annotation.At present I want to use it to enrich for kegg result while only have the KO number ,So I want to convert the KO number to the pathway function,Is there have any function or methods in the software can convert it?any help will be appreciated
Thanks
The text was updated successfully, but these errors were encountered: