EukSpecies BRAKER2

Tomas Bruna, Katharina J Hoff, Alexandre Lomsadze, Mario Stanke, Mark Borodovsky

Georgia Institute of Technology, Atlanta, Georgia, USA

University of Greifswald, Greifswald, Germany

Reference: BRAKER2: Automatic Eukaryotic Genome Annotation with GeneMark-EP+ and AUGUSTUS Supported by a Protein Database

Overview

Genome and annotation preparation protocol for eukaryotic species used in the BRAKER2 project.

This README describes only the shared components. Species-specific commands are in the README file of each species directory.

Installation

git clone git@github.com:gatech-genemark/EukSpecies-BRAKER2.git

cd EukSpecies-BRAKER2
cd bin

# follow installation instructions in "bin" README file

Setup

The project is set up for the bash shell.

Genome sequence

Download genomic sequence and reformat it:

Input sequence should be in FASTA format.
The unique sequence ID should contain no whitespace characters.
Simplify the FASTA definition line (defline): the first word in the defline should be a unique sequence identifier.
Select only nuclear DNA sequences from the genome (exclude organelles).
Convert all sequence letters to uppercase.
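
The reformatting rules above can be sketched as a small bash function. This is only an illustration, not the project's own script: the function name `clean_fasta` and the default organelle pattern are assumptions, and real genomes may label organelle sequences differently, so the pattern should be checked against the actual deflines.

```shell
#!/usr/bin/env bash
# Sketch: clean a genome FASTA file according to the rules listed above.
# "clean_fasta" and the default organelle regex are hypothetical, chosen for illustration.

# Usage: clean_fasta INPUT.fasta [ORGANELLE_REGEX] > OUTPUT.fasta
clean_fasta() {
    local input="$1"
    # Records whose lowercased defline matches this regex are treated as organelle sequences.
    local organelle="${2:-mitochondri|chloroplast|plastid}"

    awk -v pat="$organelle" '
        /^>/ {
            # Decide once per record whether it should be dropped.
            skip = (tolower($0) ~ pat)
            # Keep only the first word of the defline as the sequence ID.
            if (!skip) print $1
            next
        }
        # Sequence lines of kept records are converted to uppercase.
        !skip { print toupper($0) }
    ' "$input"
}
```

A call such as `clean_fasta raw.fasta > genome.fasta` (file names are placeholders) writes the cleaned sequence to a new file and leaves the download untouched.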

Annotation

Use the genomic sequence from GenBank when possible: the GenBank website and sequence accession IDs are usually more stable than genome project websites. Conversely, annotation is often more up to date at genome project websites. Download the most reliable annotation.

Match the sequence IDs in the FASTA file with the sequence IDs in the annotation file.
Use the IDs from the annotation.
Record the mapping between genome sequence IDs and annotation sequence IDs in the file "list.tbl".
The first column of "list.tbl" is the genome sequence ID and the second column is the annotation ID.
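
As a rough sketch of how "list.tbl" could be applied, the bash function below rewrites FASTA deflines so that they carry the annotation IDs. The function name `rename_ids` is hypothetical, the table is assumed to be whitespace-separated and non-empty, and keeping unmatched sequences under their original ID (with a warning) is an assumption made here rather than documented project behavior.

```shell
#!/usr/bin/env bash
# Sketch: replace genome sequence IDs in a FASTA file with annotation IDs from list.tbl.
# "rename_ids" is a hypothetical helper; list.tbl is assumed to be non-empty,
# with column 1 = genome sequence ID and column 2 = annotation ID.

# Usage: rename_ids list.tbl genome.fasta > genome.renamed.fasta
rename_ids() {
    local table="$1" fasta="$2"

    awk '
        # First file (list.tbl): build the ID mapping.
        NR == FNR { if (NF >= 2) map[$1] = $2; next }
        # Second file (FASTA): rewrite deflines.
        /^>/ {
            id = substr($1, 2)          # first word of the defline without ">"
            if (id in map) {
                print ">" map[id]
            } else {
                # Assumption: unmatched sequences are kept as-is and reported.
                print "warning: no annotation ID for " id > "/dev/stderr"
                print ">" id
            }
            next
        }
        { print }                        # sequence lines pass through unchanged
    ' "$table" "$fasta"
}
```

Any warnings printed to standard error point to sequences missing from "list.tbl", which is a quick way to spot IDs that still need to be matched by hand.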

