Skip to content

Generation of OrthoDB protein sets for gene prediction experiments

Notifications You must be signed in to change notification settings

tomasbruna/orthodb-clades

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

orthodb-clades

Workflow for generating OrthoDB v11 protein sets.

All files are automatically downloaded from OrthoDB and parsed using a Snakemake workflow with the following command:

snakemake --cores N

The resulting protein sets are saved into two different folders:

  • clades contains clade-specific (e.g., Arthropoda.fa or Viridiplantae.fa) OrthoDB sets.
  • species contains species-specific protein sets from which the proteins of the same species or proteins of all species in the same taxonomic order were removed. This is intended for gene prediction experiments, see, e.g., the BRAKER2 paper.

About

Generation of OrthoDB protein sets for gene prediction experiments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages