Skip to content

Latest commit

 

History

History
182 lines (127 loc) · 5.17 KB

data_records.rst

File metadata and controls

182 lines (127 loc) · 5.17 KB

Data records

Record pages for different types of data can be accessed following a query by clicking appropriate hyperlinks.

pair: isolate; records

Isolate records

An Isolate record page displays everything known about an isolate.

image

Each record will have some or all of the following sections:

Projects

image

This displays a list of projects that the isolate is a member of. Only projects that have a full description and the 'isolate_display' flag in their settings will be displayed.

Provenance metadata

image

This section includes:

  • provenance fields
  • housekeeping data
    • who sent the isolate
    • who last curated
    • record creation times
    • last update times
    • links to update history

The update link displays page with exact times of who and when updated the record.

image

Publications

image

This section includes full citation for papers linked to the isolate record. Each citation has a button that will return a dataset of all isolates linked to the paper.

If there are five or more references they will be hidden by default to avoid cluttering the page too much. Click the 'Show/hide' button to display them in this case.

Sequence bin summary

image

This section contains basic statistics describing the sequence bin. Clicking the 'Show sequence bin' button navigates to the sequence bin record <sequence_bin_records>.

Scheme and locus data

A hierarchical tree displays available schemes. Click within internal nodes to expand them.

image

Clicking any terminal node will display data available for a scheme or group of schemes.

image

Click an allele number within the scheme profile, will display the appropriate allele definition record <allele_definition_records>. Clicking the green 'S' link will display the appropriate sequence tag record <sequence_tag_records>.

pair: allele definition; records

Allele definition records

An allele definition record displays information about a defined allele in a sequence definition database.

image

If the allele is a member of a scheme profile, e.g. MLST, this will be listed. In this case, there will be a button to display all profiles of that scheme that contain the allele.

Similarly, if a client database <client_databases> has been setup for the database and the allele has been identified in an isolate, there will be a button to display all isolates that have that allele.

pair: sequence tag; records

Sequence tag records

image

A sequence tag record displays information about the location within a contig of a region associated with a locus. The nucleotide sequence will be displayed along with upstream and downstream flanking sequence. The length of these flanking sequences can be modified within the general options <general_options>.

If the tag is for a DNA locus and it is marked as a coding sequence, the three-frame translation will also be displayed.

pair: profile; records

Profile records

image

A profile record displays information about a scheme, e.g. MLST, profile. Each allele number within the profile will be hyperlinked. Clicking these will take you to the appropriate allele definition record <allele_definition_records>.

If a client database <client_databases> has been setup for the database and an isolate has the profile, there will be a button to display all isolates that have the profile.

pair: sequence bin; records

Sequence bin records

image

A sequence bin record contains information about the contigs associated with an isolate record. This includes:

  • Number of contigs
  • Total length
  • Minimum length
  • Maximum length
  • N50, N90 and N95 values
  • Size distribution charts

Charts show the distribution of contig sizes and the cumulative contig length against contig number giving a breakdown indication of contig size.

The record includes an embedded genome browser showing the positions of any loci that have been tagged.

There are also links to download the contigs in FASTA, Genbank or EMBL format, along with annotation in GFF3 format.

Finally there is a table that shows the loci that are tagged on each contig. Individual contigs can also be downloaded in EMBL or Genbank format.