Jorge edited this page Jan 8, 2022 · 1 revision

Input

  • GenBank files (.gb or .gbk format). Each file must include CDS features that carry a translation qualifier, e.g.:
CDS     join(9347..9379,9435..10346)
        /codon_start=1
        /gene="estA2"
        /note="aflatoxin formation"
        /product="esterase"
        /protein_id="AAN76353.1"
        /transl_table=1
        /translation="METPFAAPWHQFVEDLGQTPCLPGKDLDSILAGWGQLAGTLATRY
        GFPPPDESVTTEDVQLDGLWLRCYTPPNATGQEPVGLYFHGGGWVMGGVKEEDGFCRVI
        SRQCQMRLVSVEYRKAPQTRYPGALNDGVSAALWVLSRYENQPLVLMGTSAGGNLAFGT
        ALRLIDQDMVDKVSGVVALAPITVHPDAVPEYLKEQYTAYEENAELTVNSRAAMQVFFD
        CYKAPVDDVYTSCLLHPRLLALPKVYIAELGLDTLRDDARLMKGALDTAKVPVMYDAYP
        GYPHCLFMFPFKSLGEHQRPFLGGVAKAVRWMS"
  • Amino acid sequences (.fasta format)
  • Binary BGC files (.bgc or .bgccase)
  • Binary Protein files (.proteincase)
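Since GenBank input needs CDS features with a translation qualifier, it can help to check a file before passing it in. The following is a minimal sketch, not part of the tool itself: `count_cds_translations` is a hypothetical helper that uses only the Python standard library and assumes the standard GenBank flat-file layout (feature keys indented by five spaces, qualifiers on their own lines).

```python
import re

# A feature key line starts with exactly five spaces, then the key (e.g. CDS),
# then the location. Qualifier lines are indented further and do not match.
FEATURE_KEY = re.compile(r"^ {5}(\S+)\s+\S")


def count_cds_translations(genbank_text):
    """Return (number of CDS features, number of /translation qualifiers in them)."""
    total = 0
    translated = 0
    in_features = False
    in_cds = False
    for line in genbank_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if line.startswith(("ORIGIN", "//")):
            # The feature table of the current record has ended.
            in_features = False
            in_cds = False
            continue
        if not in_features:
            continue
        match = FEATURE_KEY.match(line)
        if match:
            # A new feature begins; remember whether it is a CDS.
            in_cds = match.group(1) == "CDS"
            if in_cds:
                total += 1
        elif in_cds and line.strip().startswith("/translation="):
            translated += 1
    return total, translated
```

If the two returned numbers differ, at least one CDS lacks a translation and the file may not be usable as input. For real work, a full GenBank parser is a more robust choice than this line-based check.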

To supply these inputs, use the --inputfolders or --files parameters. --inputfolders reads only GenBank files, searching recursively from the given folder.
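To preview which files a recursive GenBank search would pick up from a folder, something like the sketch below may be useful. It is an illustration only and not the tool's actual implementation: `find_genbank_files` is a hypothetical helper, and matching the .gb and .gbk extensions case-insensitively is an assumption.

```python
from pathlib import Path

# Extensions treated as GenBank files (assumed to be matched case-insensitively).
GENBANK_SUFFIXES = {".gb", ".gbk"}


def find_genbank_files(folder):
    """Return a sorted list of GenBank file paths found under folder, recursively."""
    return sorted(
        path
        for path in Path(folder).rglob("*")
        if path.is_file() and path.suffix.lower() in GENBANK_SUFFIXES
    )
```

Calling `find_genbank_files("my_clusters")` (a hypothetical folder name) returns `Path` objects for every .gb or .gbk file in that folder and its subfolders, and skips other file types such as .fasta.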

Learn more about internal file formats here
