How to export the viral DNA gene sequence? #116

wfgui · 2024-08-02T06:03:15Z

Hi,
In the example above I can see the protein FASTA file GCF_009025895.1_virus_proteins.faa. I want to calculate gene abundance using the viral gene sequences. Can we output the corresponding nucleotide sequences?

Thanks!

apcamargo · 2024-08-02T08:29:16Z

There is currently no option to do this, but I could implement it as a feature in the future. In the meantime, you can obtain the nucleotide sequences of the CDSs by extracting them from the genomes using the gene coordinates.
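A minimal sketch of that workaround in plain Python. It assumes the gene coordinates are 1-based and inclusive and that the strand is encoded as 1 or -1; check both against your own gene table before relying on it. The function name `extract_cds` and the toy contig are illustrative, not part of geNomad.

```python
# Complement table used to reverse-complement minus-strand genes.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def extract_cds(contig_seq: str, start: int, end: int, strand: int) -> str:
    """Return the nucleotide sequence of one gene.

    Assumes 1-based, inclusive coordinates and strand given as 1 or -1.
    """
    subseq = contig_seq[start - 1:end]
    if strand == -1:
        # Minus-strand genes are read from the reverse complement.
        subseq = subseq.translate(_COMPLEMENT)[::-1]
    return subseq


# Toy contig, not real data.
contig = "AAATGCCCTAAGG"
forward_gene = extract_cds(contig, 3, 11, 1)
reverse_gene = extract_cds(contig, 3, 11, -1)
print(forward_gene)
print(reverse_gene)
```

In practice you would call this once per row of the gene table, passing the sequence of the contig that each gene belongs to.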

wfgui · 2024-08-14T08:26:39Z

I also have a seemingly simple question: can I reformat the taxonomy output, for example by converting it to k__; p__; c__; o__; f__; g__; s__?

Thanks!

apcamargo · 2024-08-19T20:10:48Z

You can use taxopy for that. geNomad's taxdump is inside the database directory, and you can find the TaxIds in the <prefix>_annotate/<prefix>_taxonomy.tsv file.

For instance:

import taxopy

taxdb = taxopy.TaxDb(
    nodes_dmp="genomad_db/nodes.dmp",
    names_dmp="genomad_db/names.dmp",
    keep_files=True
)
taxon = taxopy.Taxon(5797, taxdb)
for rank, name in reversed(taxon.ranked_name_lineage):
    if name != "root":
        print(f"{rank}__{name}")

realm__Duplodnaviria
kingdom__Heunggongvirae
phylum__Uroviricota
class__Caudoviricetes
order__Crassvirales
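To get the single-letter format asked about earlier, one option is to map rank names to prefixes and join the result. This is only a sketch: `RANK_PREFIXES` and `format_lineage` are hypothetical names, and the choice to leave missing ranks empty and to drop ranks without a prefix (such as realm) is an assumption about the desired output.

```python
# Single-letter prefixes for the ranks in the requested format.
RANK_PREFIXES = {
    "kingdom": "k", "phylum": "p", "class": "c", "order": "o",
    "family": "f", "genus": "g", "species": "s",
}


def format_lineage(ranked_lineage):
    """Build a 'k__...; p__...' string from (rank, name) pairs.

    Ranks missing from the lineage are left empty; ranks that are not in
    RANK_PREFIXES (such as realm) are dropped.
    """
    names = {rank: name for rank, name in ranked_lineage}
    return "; ".join(
        f"{prefix}__{names.get(rank, '')}"
        for rank, prefix in RANK_PREFIXES.items()
    )


# The lineage printed in the example above, as (rank, name) pairs.
lineage = [
    ("realm", "Duplodnaviria"),
    ("kingdom", "Heunggongvirae"),
    ("phylum", "Uroviricota"),
    ("class", "Caudoviricetes"),
    ("order", "Crassvirales"),
]
formatted = format_lineage(lineage)
print(formatted)
```

Since the taxopy example iterates over `taxon.ranked_name_lineage` as (rank, name) pairs, that value should be usable as the argument here.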

wfgui · 2024-08-23T05:59:19Z

What's the difference between "Unclassified" and "Viruses;;;;;;"?

apcamargo · 2024-08-23T06:12:49Z

"Unclassified" means that the genes in the sequence had no matches to markers with taxonomy information. "Viruses" means that the classification is uncertain at a high rank.
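The two cases can be told apart programmatically. The sketch below assumes the lineage is a semicolon-separated string in which unassigned ranks are left empty, as in "Viruses;;;;;;". The function name `describe_lineage` and the wording of its return values are illustrative only.

```python
def describe_lineage(lineage: str) -> str:
    """Summarize a lineage string from the taxonomy table."""
    if lineage == "Unclassified":
        # No gene matched a marker that carries taxonomy information.
        return "no taxonomy-informative marker hits"
    # Empty fields between semicolons are ranks left unassigned.
    assigned = [name for name in lineage.split(";") if name]
    if not assigned:
        return "empty lineage"
    return f"assigned down to {assigned[-1]}"


unclassified = describe_lineage("Unclassified")
high_rank_only = describe_lineage("Viruses;;;;;;")
print(unclassified)
print(high_rank_only)
```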
