Skip to content

ZFF format, what type of feature to use? CDS is it enough? #5

@Juke34

Description

@Juke34

On the common protocol to train snap is through MAKER annotation pipeline.
They provide a script called maker2zff. Looking at their script I realise that they use only the CDS coordinates to create Esngl, Einit, Eterm, Exon, zff features.
What would be your recommendation to better train snap?
Using CDS only is enough? Can we use exons only?
I checked zoeFeature.h, what about the other features?

Would I get a better training if I provide a zff file with Intron, UTR5, UTR3, Acceptor, Donor, Start, Stop, etc features?
Maybe most of them are compute automatically while training (i.e. start, stop, Acceptor, Donor can be deduced by exon coordinates... )

maker2zff defines Esngl, Einit, Eterm, Exon zff features based on CDS gff features, would I get a better training if I define Esngl, Einit, Eterm, Exon based on Exon gff feature and add Coding zff feature to specify which part of the exon is coding?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions