slug | title | description |
---|---|---|
assemblies-and-sequence | Assemblies and sequence | Introduction to the sequence data used by Ensembl |
A genome assembly is a computational representation of a genome sequence. Ensembl does not produce genome assemblies; instead, we provide annotation on publicly available genome assemblies that have been deposited in the International Nucleotide Sequence Database Collaboration (INSDC) databases (ENA, GenBank and DDBJ). Links to data sources and acknowledgements can be found on each species' home page.
We select species to annotate on a case-by-case basis, weighing factors such as phylogenetic position, assembly quality, model organism status, availability of species-specific sequence data (e.g. RNA-seq) and additional funding.
To improve consistency between the data provided by different genome browsers and annotation groups, the Genome Browser Agreement was established between Ensembl, UCSC and NCBI to define the minimum requirements for public display of genome data.