diff --git a/.travis.yml b/.travis.yml
deleted file mode 100644
index c91c8f2..0000000
--- a/.travis.yml
+++ /dev/null
@@ -1,7 +0,0 @@
-language: python
-python:
-  - "2.7"
-# command to install dependencies
-install: "pip install git+https://github.com/edamontology/edamxpathvalidator.git"
-# command to run tests
-script: "edamxpathvalidator EDAM_dev.owl"
diff --git a/EDAM_dev.owl b/EDAM_dev.owl
deleted file mode 100644
index 838e844..0000000
--- a/EDAM_dev.owl
+++ /dev/null
@@ -1,53249 +0,0 @@
- - - - - - - - - - - - - -]> - - - - - EDAM_topic http://edamontology.org/topic_ "EDAM topics" - EDAM_operation http://edamontology.org/operation_ "EDAM operations" - formats "EDAM data formats" - EDAM - Jon Ison, Matus Kalas, Hervé Ménager - identifiers "EDAM types of identifiers" - data "EDAM types of data" - relations "EDAM relations" - edam "EDAM" - EDAM editors: Jon Ison, Matus Kalas, and Herve Menager. Contributors: Inge Jonassen, Dan Bolser, Hamish McWilliam, Mahmut Uludag, James Malone, Rodrigo Lopez, Steve Pettifer, and Peter Rice. Contibutions from these projects: EMBRACE, ELIXIR, and BioMedBridges (EU); EMBOSS (BBSRC, UK); eSysbio, FUGE Bioinformatics Platform, and ELIXIR.NO/Norwegian Bioinformatics Platform (Research Council of Norway). See http://edamontology.org for documentation and licence. - operations "EDAM operations" - Bioinformatics operations, data types, formats, identifiers and topics - EDAM http://edamontology.org/ "EDAM relations and concept properties" - application/rdf+xml - EDAM_data http://edamontology.org/data_ "EDAM types of data" - concept_properties "EDAM concept properties" - Jon Ison - 3730 - Matúš Kalaš - EDAM_format http://edamontology.org/format_ "EDAM data formats" - 1.15_dev - topics "EDAM topics" - 24:02:2016 21:54GMT - Hervé Ménager - EDAM is an ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. 
EDAM is a simple ontology - essentially a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use by curators, software developers and end-users. EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources. EDAM is also suitable for diverse application including for example within workbenches and workflow-management systems, software distributions, and resource registries. - - - - - - - - - - - - - - - Citation - concept_properties - 1.13 - Publication reference - Publication - 'Citation' concept property ('citation' metadata tag) contains a dereferenceable URI, preferrably including a DOI, pointing to a citeable publication of the given data format. - true - - - - - - - - Created in - Version in which a concept was created. - true - concept_properties - - - - - - - - Documentation - Specification - 'Documentation' trailing modifier (qualifier, 'documentation') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to a page with explanation, description, documentation, or specification of the given data format. - true - concept_properties - - - - - - - - Example - 'Example' concept property ('example' metadata tag) lists examples of valid values of types of identifiers (accessions). Applicable to some other types of data, too. - true - Separated by bar ('|'). - concept_properties - - - - - - - - File extension - 'File extension' concept property ('file_extension' metadata tag) lists examples of usual file extensions of formats. - Separated by bar ('|'), without a dot ('.') prefix, preferrably not all capital characters. - concept_properties - true - - - - - - - - isdebtag - When 'true', the term has been proposed or is supported within Debian Med as a tag. - concept_properties - true - - - - - - - - Media type - MIME type - 'Media type' trailing modifier (qualifier, 'media_type') of 'xref' links of 'Format' concepts. 
When 'true', the link is pointing to a page specifying a media type of the given data format. - true - concept_properties - - - - - - - - - - - - - - Obsolete since - true - concept_properties - Version in which a concept was made obsolete. - - - - - - - - Regular expression - 'Regular expression' concept property ('regex' metadata tag) specifies the allowed values of types of identifiers (accessions). Applicable to some other types of data, too. - concept_properties - true - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - has format - "http://purl.obolibrary.org/obo/OBI_0000298" - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. Object B can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology that is a 'Format' or is in the role of a 'Format'. In EDAM, 'has_format' is not explicitly defined between EDAM concepts, only the inverse 'is_format_of'. - false - OBO_REL:is_a - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - false - false - edam - 'A has_format B' defines for the subject A, that it has the object B as its data format. - false - - - - - - - - - - has function - http://wsio.org/has_function - false - OBO_REL:is_a - OBO_REL:bearer_of - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). 
Object B can either be a concept that is (or is in a role of) a function, or an entity outside of an ontology that is (or is in a role of) a function specification. In the scope of EDAM, 'has_function' serves only for relating annotated entities outside of EDAM with 'Operation' concepts. - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - true - 'A has_function B' defines for the subject A, that it has the object B as its function. - "http://purl.obolibrary.org/obo/OBI_0000306" - relations - false - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:bearer_of' is narrower in the sense that it only relates ontological categories (concepts) that are an 'independent_continuant' (snap:IndependentContinuant) with ontological categories that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant), and broader in the sense that it relates with any borne objects not just functions of the subject. - OBO_REL:bearer_of - - - - - In very unusual cases. - true - - - - - - - - - - has identifier - false - false - relations - OBO_REL:is_a - edam - 'A has_identifier B' defines for the subject A, that it has the object B as its identifier. - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is an 'Identifier', or an entity outside of an ontology that is an 'Identifier' or is in the role of an 'Identifier'. In EDAM, 'has_identifier' is not explicitly defined between EDAM concepts, only the inverse 'is_identifier_of'. 
- false - false - - - - - - - - - - has input - OBO_REL:has_participant - "http://purl.obolibrary.org/obo/OBI_0000293" - false - http://wsio.org/has_input - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_input' is explicitly defined between EDAM concepts ('Operation' 'has_input' 'Data'). The inverse, 'is_input_of', is not explicitly defined. - relations - OBO_REL:is_a - false - 'A has_input B' defines for the subject A, that it has the object B as a necessary or actual input or input argument. - false - true - edam - - - - - true - In very unusual cases. - - - - - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just inputs or input arguments of the subject. - OBO_REL:has_participant - - - - - - - - - - has output - http://wsio.org/has_output - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_output' is explicitly defined between EDAM concepts ('Operation' 'has_output' 'Data'). The inverse, 'is_output_of', is not explicitly defined. - edam - "http://purl.obolibrary.org/obo/OBI_0000299" - OBO_REL:is_a - relations - OBO_REL:has_participant - true - 'A has_output B' defines for the subject A, that it has the object B as a necessary or actual output or output argument. 
- false - false - false - - - - - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just outputs or output arguments of the subject. It is also not clear whether an output (result) actually participates in the process that generates it. - OBO_REL:has_participant - - - - - true - In very unusual cases. - - - - - - - - - - has topic - relations - true - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology that is a 'Topic' or is in the role of a 'Topic'. In EDAM, only 'has_topic' is explicitly defined between EDAM concepts ('Operation' or 'Data' 'has_topic' 'Topic'). The inverse, 'is_topic_of', is not explicitly defined. - false - 'A has_topic B' defines for the subject A, that it has the object B as its topic (A is in the scope of a topic B). - edam - OBO_REL:is_a - http://annotation-ontology.googlecode.com/svn/trunk/annotation-core.owl#hasTopic - false - "http://purl.obolibrary.org/obo/IAO_0000136" - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality - "http://purl.obolibrary.org/obo/OBI_0000298" - - - - - - - - - - - - true - In very unusual cases. - - - - - - - - - - is format of - false - OBO_REL:is_a - false - false - false - 'A is_format_of B' defines for the subject A, that it is a data format of the object B. - edam - relations - Subject A can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Format' or is in the role of a 'Format'. 
Object B can be any concept or entity outside of an ontology that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. In EDAM, only 'is_format_of' is explicitly defined between EDAM concepts ('Format' 'is_format_of' 'Data'). The inverse, 'has_format', is not explicitly defined. - OBO_REL:quality_of - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - - - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. - OBO_REL:quality_of - - - - - - - - - - is function of - Subject A can either be concept that is (or is in a role of) a function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) a function specification. Object B can be any concept or entity. Within EDAM itself, 'is_function_of' is not used. - OBO_REL:inheres_in - true - OBO_REL:is_a - false - 'A is_function_of B' defines for the subject A, that it is a function of the object B. - OBO_REL:function_of - edam - http://wsio.org/is_function_of - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - false - - - - - In very unusual cases. - true - - - - - OBO_REL:function_of - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:function_of' only relates subjects that are a 'function' (snap:Function) with objects that are an 'independent_continuant' (snap:IndependentContinuant), so for example no processes. It does not define explicitly that the subject is a function of the object. - - - - - OBO_REL:inheres_in - Is defined anywhere? Not in the 'unknown' version of RO. 
'OBO_REL:inheres_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant) with ontological categories that are an 'independent_continuant' (snap:IndependentContinuant), and broader in the sense that it relates any borne subjects not just functions. - - - - - - - - - - is identifier of - false - false - edam - false - relations - Subject A can either be a concept that is an 'Identifier', or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is an 'Identifier' or is in the role of an 'Identifier'. Object B can be any concept or entity outside of an ontology. In EDAM, only 'is_identifier_of' is explicitly defined between EDAM concepts (only 'Identifier' 'is_identifier_of' 'Data'). The inverse, 'has_identifier', is not explicitly defined. - 'A is_identifier_of B' defines for the subject A, that it is an identifier of the object B. - OBO_REL:is_a - false - - - - - - - - - - - is input of - false - http://wsio.org/is_input_of - relations - true - false - OBO_REL:participates_in - OBO_REL:is_a - "http://purl.obolibrary.org/obo/OBI_0000295" - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_input_of' is not explicitly defined between EDAM concepts, only the inverse 'has_input'. - false - 'A is_input_of B' defines for the subject A, that it as a necessary or actual input or input argument of the object B. - - - - - - true - In very unusual cases. 
- - - - - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just inputs or input arguments. - OBO_REL:participates_in - - - - - - - - - - is output of - OBO_REL:is_a - false - false - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_output_of' is not explicitly defined between EDAM concepts, only the inverse 'has_output'. - edam - false - 'A is_output_of B' defines for the subject A, that it as a necessary or actual output or output argument of the object B. - OBO_REL:participates_in - http://wsio.org/is_output_of - true - relations - "http://purl.obolibrary.org/obo/OBI_0000312" - - - - - - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just outputs or output arguments. It is also not clear whether an output (result) actually participates in the process that generates it. - OBO_REL:participates_in - - - - - In very unusual cases. - true - - - - - - - - - - is topic of - 'A is_topic_of B' defines for the subject A, that it is a topic of the object B (a topic A is the scope of B). 
- relations - OBO_REL:quality_of - false - true - false - Subject A can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Topic' or is in the role of a 'Topic'. Object B can be any concept or entity outside of an ontology. In EDAM, 'is_topic_of' is not explicitly defined between EDAM concepts, only the inverse 'has_topic'. - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - OBO_REL:is_a - edam - - - - - - - - - - - - - OBO_REL:quality_of - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. - - - - - In very unusual cases. - true - - - - - - - - - - - - - - - Resource type - - beta12orEarlier - beta12orEarlier - A type of computational resource used in bioinformatics. - true - - - - - - - - - - Data - - - - - Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - http://semanticscience.org/resource/SIO_000088 - http://semanticscience.org/resource/SIO_000069 - "http://purl.obolibrary.org/obo/IAO_0000030" - "http://purl.obolibrary.org/obo/IAO_0000027" - Data set - Data record - beta12orEarlier - http://wsio.org/data_002 - http://purl.org/biotop/biotop.owl#DigitalEntity - http://www.ifomis.org/bfo/1.1/snap#Continuant - Datum - - - - - EDAM does not distinguish a data record (a tool-understandable information artefact) from data or datum (its content, the tool-understandable encoding of an information). 
- Data record - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Datum - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Data set - - - - - - - - - - Tool - - beta12orEarlier - A bioinformatics package or tool, e.g. a standalone application or web service. - beta12orEarlier - true - - - - - - - - - - Database - - A digital data archive typically based around a relational model but sometimes using an object-oriented, tree or graph-based model. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ontology - - - - - - - - beta12orEarlier - Ontologies - An ontology of biological or bioinformatics concepts and relations, a controlled vocabulary, structured glossary etc. - - - - - - - - - - Directory metadata - - 1.5 - A directory on disk from which files are read. - beta12orEarlier - true - - - - - - - - - - MeSH vocabulary - - beta12orEarlier - true - Controlled vocabulary from National Library of Medicine. The MeSH thesaurus is used to index articles in biomedical journals for the Medline/PubMED databases. - beta12orEarlier - - - - - - - - - - HGNC vocabulary - - beta12orEarlier - beta12orEarlier - Controlled vocabulary for gene names (symbols) from HUGO Gene Nomenclature Committee. - true - - - - - - - - - - UMLS vocabulary - - Compendium of controlled vocabularies for the biomedical domain (Unified Medical Language System). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Identifier - - - - - - - - - - http://semanticscience.org/resource/SIO_000115 - beta12orEarlier - ID - "http://purl.org/dc/elements/1.1/identifier" - http://wsio.org/data_005 - A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things). - - - - - - - Almost exact but limited to identifying resources. 
- - - - - - - - - - - Database entry - - beta12orEarlier - beta12orEarlier - An entry (retrievable via URL) from a biological database. - true - - - - - - - - - - Molecular mass - - Mass of a molecule. - beta12orEarlier - - - - - - - - - - Molecular charge - - Net charge of a molecule. - beta12orEarlier - PDBML:pdbx_formal_charge - - - - - - - - - - Chemical formula - - Chemical structure specification - A specification of a chemical structure. - beta12orEarlier - - - - - - - - - - QSAR descriptor - - A QSAR quantitative descriptor (name-value pair) of chemical structure. - QSAR descriptors have numeric values that quantify chemical information encoded in a symbolic representation of a molecule. They are used in quantitative structure activity relationship (QSAR) applications. Many subtypes of individual descriptors (not included in EDAM) cover various types of protein properties. - beta12orEarlier - - - - - - - - - - Raw sequence - - beta12orEarlier - A raw molecular sequence (string of characters) which might include ambiguity, unknown positions and non-sequence characters. - Non-sequence characters may be used for example for gaps and translation stop. - - - - - - - - - - Sequence record - - http://purl.bioontology.org/ontology/MSH/D058977 - beta12orEarlier - A molecular sequence and associated metadata. - SO:2000061 - - - - - - - - - - Sequence set - - A collection of multiple molecular sequences and associated metadata that do not (typically) correspond to molecular sequence database records or entries and which (typically) are derived from some analytical method. - This concept may be used for arbitrary sequence sets and associated data arising from processing. - beta12orEarlier - SO:0001260 - - - - - - - - - - Sequence mask character - - true - beta12orEarlier - 1.5 - A character used to replace (mask) other characters in a molecular sequence. - - - - - - - - - - Sequence mask type - - A label (text token) describing the type of sequence masking to perform. 
- Sequence masking is where specific characters or positions in a molecular sequence are masked (replaced) with an another (mask character). The mask type indicates what is masked, for example regions that are not of interest or which are information-poor including acidic protein regions, basic protein regions, proline-rich regions, low compositional complexity regions, short-periodicity internal repeats, simple repeats and low complexity regions. Masked sequences are used in database search to eliminate statistically significant but biologically uninteresting hits. - beta12orEarlier - 1.5 - true - - - - - - - - - - DNA sense specification - - DNA strand specification - beta12orEarlier - Strand - The strand of a DNA sequence (forward or reverse). - The forward or 'top' strand might specify a sequence is to be used as given, the reverse or 'bottom' strand specifying the reverse complement of the sequence is to be used. - - - - - - - - - - Sequence length specification - - true - A specification of sequence length(s). - beta12orEarlier - 1.5 - - - - - - - - - - Sequence metadata - - beta12orEarlier - Basic or general information concerning molecular sequences. - This is used for such things as a report including the sequence identifier, type and length. - 1.5 - true - - - - - - - - - - Sequence feature source - - This might be the name and version of a software tool, the name of a database, or 'curated' to indicate a manual annotation (made by a human). - How the annotation of a sequence feature (for example in EMBL or Swiss-Prot) was derived. - beta12orEarlier - - - - - - - - - - Sequence search results - - beta12orEarlier - Database hits (sequence) - - Sequence database hits - Sequence search hits - The score list includes the alignment score, percentage of the query sequence matched, length of the database sequence entry in this alignment, identifier of the database sequence entry, excerpt of the database sequence entry description etc. 
- A report of sequence hits and associated data from searching a database of sequences (for example a BLAST search). This will typically include a list of scores (often with statistical evaluation) and a set of alignments for the hits. - Sequence database search results - - - - - - - - - - Sequence signature matches - - Sequence motif matches - Protein secondary database search results - beta12orEarlier - Report on the location of matches in one or more sequences to profiles, motifs (conserved or functional patterns) or other signatures. - Sequence profile matches - This ncluding reports of hits from a search of a protein secondary or domain database. - Search results (protein secondary database) - - - - - - - - - - Sequence signature model - - Data files used by motif or profile methods. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence signature data - - - - - - - - beta12orEarlier - This can include metadata about a motif or sequence profile such as its name, length, technical details about the profile construction, and so on. - Data concering concerning specific or conserved pattern in molecular sequences and the classifiers used for their identification, including sequence motifs, profiles or other diagnostic element. - - - - - - - - - - Sequence alignment (words) - - 1.5 - beta12orEarlier - true - Sequence word alignment - Alignment of exact matches between subsequences (words) within two or more molecular sequences. - - - - - - - - - - Dotplot - - A dotplot of sequence similarities identified from word-matching or character comparison. - beta12orEarlier - - - - - - - - - - Sequence alignment - - - - - - - - http://en.wikipedia.org/wiki/Sequence_alignment - http://purl.bioontology.org/ontology/MSH/D016415 - http://semanticscience.org/resource/SIO_010066 - beta12orEarlier - Alignment of multiple molecular sequences. 
- - - - - - - - - - Sequence alignment parameter - - Some simple value controlling a sequence alignment (or similar 'match') operation. - true - 1.5 - beta12orEarlier - - - - - - - - - - Sequence similarity score - - A value representing molecular sequence similarity. - beta12orEarlier - - - - - - - - - - Sequence alignment metadata - - Report of general information on a sequence alignment, typically include a description, sequence identifiers and alignment score. - beta12orEarlier - true - 1.5 - - - - - - - - - - Sequence alignment report - - Use this for any computer-generated reports on sequence alignments, and for general information (metadata) on a sequence alignment, such as a description, sequence identifiers and alignment score. - An informative report of molecular sequence alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Profile-profile alignment - - beta12orEarlier - A profile-profile alignment (each profile typically representing a sequence alignment). - Sequence profile alignment - - - - - - - - - - Sequence-profile alignment - - beta12orEarlier - Alignment of one or more molecular sequence(s) to one or more sequence profile(s) (each profile typically representing a sequence alignment). - Data associated with the alignment might also be included, e.g. ranked list of best-scoring sequences and a graphical representation of scores. - - - - - - - - - - Sequence distance matrix - - beta12orEarlier - Moby:phylogenetic_distance_matrix - A matrix of estimated evolutionary distance between molecular sequences, such as is suitable for phylogenetic tree calculation. - Phylogenetic distance matrix - Methods might perform character compatibility analysis or identify patterns of similarity in an alignment or data matrix. - - - - - - - - - - Phylogenetic character data - - Basic character data from which a phylogenetic tree may be generated. 
- As defined, this concept would also include molecular sequences, microsatellites, polymorphisms (RAPDs, RFLPs, or AFLPs), restriction sites and fragments - http://www.evolutionaryontology.org/cdao.owl#Character - beta12orEarlier - - - - - - - - - - Phylogenetic tree - - - - - - - - Phylogeny - Moby:Tree - http://www.evolutionaryontology.org/cdao.owl#Tree - A phylogenetic tree is usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. See also 'Phylogenetic tree image'. - http://purl.bioontology.org/ontology/MSH/D010802 - Moby:phylogenetic_tree - The raw data (not just an image) from which a phylogenetic tree is directly generated or plotted, such as topology, lengths (in time or in expected amounts of variance) and a confidence interval for each length. - beta12orEarlier - Moby:myTree - - - - - - - - - - Comparison matrix - - beta12orEarlier - The comparison matrix might include matrix name, optional comment, height and width (or size) of matrix, an index row/column (of characters) and data rows/columns (of integers or floats). - Matrix of integer or floating point numbers for amino acid or nucleotide sequence comparison. - Substitution matrix - - - - - - - - - - Protein topology - - beta12orEarlier - beta12orEarlier - Predicted or actual protein topology represented as a string of protein secondary structure elements. - true - The location and size of the secondary structure elements and intervening loop regions is usually indicated. - - - - - - - - - - Protein features report (secondary structure) - - beta12orEarlier - 1.8 - true - Secondary structure (predicted or real) of a protein. - - - - - - - - - - Protein features report (super-secondary) - - 1.8 - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - true - beta12orEarlier - Super-secondary structure of protein sequence(s). 
- - - - - - - - - - Secondary structure alignment (protein) - - - Alignment of the (1D representations of) secondary structure of two or more proteins. - beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (protein) - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RNA secondary structure - - - - - - - - An informative report of secondary structure (predicted or real) of an RNA molecule. - This includes thermodynamically stable or evolutionarily conserved structures such as knots, pseudoknots etc. - Moby:RNAStructML - Secondary structure (RNA) - beta12orEarlier - - - - - - - - - - Secondary structure alignment (RNA) - - Moby:RNAStructAlignmentML - Alignment of the (1D representations of) secondary structure of two or more RNA molecules. - beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (RNA) - - true - beta12orEarlier - An informative report of RNA secondary structure alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Structure - - - - - - - - beta12orEarlier - Coordinate model - Structure data - The coordinate data may be predicted or real. - http://purl.bioontology.org/ontology/MSH/D015394 - 3D coordinate and associated data for a macromolecular tertiary (3D) structure or part of a structure. - - - - - - - - - - Tertiary structure record - - true - beta12orEarlier - beta12orEarlier - An entry from a molecular tertiary (3D) structure database. - - - - - - - - - - Structure database search results - - 1.8 - Results (hits) from searching a database of tertiary structure. - beta12orEarlier - true - - - - - - - - - - Structure alignment - - - - - - - - Alignment (superimposition) of molecular tertiary (3D) structures. 
- A tertiary structure alignment will include the untransformed coordinates of one macromolecule, followed by the second (or subsequent) structure(s) with all the coordinates transformed (by rotation / translation) to give a superposition. - beta12orEarlier - - - - - - - - - - Structure alignment report - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - An informative report of molecular tertiary structure alignment-derived data. - - - - - - - - - - Structure similarity score - - beta12orEarlier - A value representing molecular structure similarity, measured from structure alignment or some other type of structure comparison. - - - - - - - - - - Structural profile - - - - - - - - beta12orEarlier - 3D profile - Some type of structural (3D) profile or template (representing a structure or structure alignment). - Structural (3D) profile - - - - - - - - - - Structural (3D) profile alignment - - beta12orEarlier - Structural profile alignment - A 3D profile-3D profile alignment (each profile representing structures or a structure alignment). - - - - - - - - - - Sequence-3D profile alignment - - Sequence-structural profile alignment - 1.5 - An alignment of a sequence to a 3D profile (representing structures or a structure alignment). - beta12orEarlier - true - - - - - - - - - - Protein sequence-structure scoring matrix - - beta12orEarlier - Matrix of values used for scoring sequence-structure compatibility. - - - - - - - - - - Sequence-structure alignment - - beta12orEarlier - An alignment of molecular sequence to structure (from threading sequence(s) through 3D structure or representation of structure(s)). - - - - - - - - - - Amino acid annotation - - An informative report about a specific amino acid. - 1.4 - true - beta12orEarlier - - - - - - - - - - Peptide annotation - - 1.4 - true - An informative report about a specific peptide. 
- beta12orEarlier - - - - - - - - - - Protein report - - Gene product annotation - beta12orEarlier - An informative human-readable report about one or more specific protein molecules or protein structural domains, derived from analysis of primary (sequence or structural) data. - - - - - - - - - - Protein property - - Protein physicochemical property - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a protein molecule or model. - beta12orEarlier - Protein sequence statistics - Protein properties - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Protein structural motifs and surfaces - - true - 1.8 - 3D structural motifs in a protein. - beta12orEarlier - Protein 3D motifs - - - - - - - - - Protein domain classification - - true - Data concerning the classification of the sequences and/or structures of protein structural domain(s). - 1.5 - beta12orEarlier - - - - - - - - - - Protein features report (domains) - - true - structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - beta12orEarlier - - - - - - - - - - Protein architecture report - - 1.4 - An informative report on architecture (spatial arrangement of secondary structure) of a protein structure. - Protein property (architecture) - Protein structure report (architecture) - beta12orEarlier - true - - - - - - - - - - Protein folding report - - beta12orEarlier - A report on an analysis or model of protein folding properties, folding pathways, residues or sites that are key to protein folding, nucleation or stabilization centers etc. - true - 1.8 - - - - - - - - - - Protein features (mutation) - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
- Data on the effect of (typically point) mutation on protein folding, stability, structure and function. - true - beta12orEarlier - Protein property (mutation) - Protein structure report (mutation) - beta13 - Protein report (mutation) - - - - - - - - - - Protein interaction raw data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. - beta12orEarlier - - - - - - - - - - Protein interaction report - - - - - - - - Protein report (interaction) - beta12orEarlier - Protein interaction record - Residue interaction data - Atom interaction data - Protein non-covalent interactions report - An informative report on interactions (predicted or known) within or between a protein, structural domain or part of a protein. This includes intra- and inter-residue contacts and distances, as well as interactions with other proteins and non-protein entities such as nucleic acid, metal atoms, water, ions etc. - - - - - - - - - - - - - Protein family report - - - - - - - - beta12orEarlier - An informative report on a specific protein family or other classification or group of protein sequences or structures. - Protein family annotation - Protein classification data - - - - - - - - - - Vmax - - beta12orEarlier - The maximum initial velocity or rate of a reaction. It is the limiting velocity as substrate concentrations get very large. - - - - - - - - - - Km - - Km is the concentration (usually in Molar units) of substrate that leads to half-maximal velocity of an enzyme-catalysed reaction. - beta12orEarlier - - - - - - - - - - Nucleotide base annotation - - beta12orEarlier - true - An informative report about a specific nucleotide base. 
- 1.4 - - - - - - - - - - Nucleic acid property - - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a nucleic acid molecule. - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used a placeholder for other, more specific types. - Nucleic acid physicochemical property - beta12orEarlier - GC-content - - - - - - - - - - Codon usage data - - - - - - - - beta12orEarlier - Data derived from analysis of codon usage (typically a codon usage table) of DNA sequences. - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Gene report - - Gene structure (repot) - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - Gene and transcript structure (report) - Gene features report - Nucleic acid features (gene and transcript structure) - Moby:gene - This includes any report on a particular locus or gene. This might include the gene name, description, summary and so on. It can include details about the function of a gene, such as its encoded protein or a functional classification of the gene sequence along according to the encoded protein(s). - Gene annotation - beta12orEarlier - Moby_namespace:Human_Readable_Description - Gene function (report) - Moby:GeneInfo - - - - - - - - - - Gene classification - - beta12orEarlier - true - A report on the classification of nucleic acid / gene sequences according to the functional classification of their gene products. - beta12orEarlier - - - - - - - - - - DNA variation - - stable, naturally occuring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. 
- true - 1.8 - beta12orEarlier - - - - - - - - - - Chromosome report - - beta12orEarlier - An informative report on a specific chromosome. - This includes basic information. e.g. chromosome number, length, karyotype features, chromosome sequence etc. - - - - - - - - - - Genotype/phenotype report - - An informative report on the set of genes (or allelic forms) present in an individual, organism or cell and associated with a specific physical characteristic, or a report concerning an organisms traits and phenotypes. - Genotype/phenotype annotation - beta12orEarlier - - - - - - - - - - Nucleic acid features report (primers) - - true - 1.8 - beta12orEarlier - PCR primers and hybridization oligos in a nucleic acid sequence. - - - - - - - - - - PCR experiment report - - true - beta12orEarlier - PCR experiments, e.g. quantitative real-time PCR. - 1.8 - - - - - - - - - - Sequence trace - - - Fluorescence trace data generated by an automated DNA sequencer, which can be interprted as a molecular sequence (reads), given associated sequencing metadata such as base-call quality scores. - This is the raw data produced by a DNA sequencing machine. - beta12orEarlier - - - - - - - - - - Sequence assembly - - beta12orEarlier - An assembly of fragments of a (typically genomic) DNA sequence. - Contigs - http://en.wikipedia.org/wiki/Sequence_assembly - SO:0001248 - Typically, an assembly is a collection of contigs (for example ESTs and genomic DNA fragments) that are ordered, aligned and merged. Annotation of the assembled sequence might be included. - SO:0000353 - - - - - SO:0001248 - Perhaps surprisingly, the definition of 'SO:assembly' is narrower than the 'SO:sequence_assembly'. - - - - - - - - - - Radiation Hybrid (RH) scores - - beta12orEarlier - Radiation Hybrid (RH) scores are used in Radiation Hybrid mapping. - Radiation hybrid scores (RH) scores for one or more markers. 
- - - - - - - - - - Genetic linkage report - - beta12orEarlier - Gene annotation (linkage) - Linkage disequilibrium (report) - An informative report on the linkage of alleles. - This includes linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Gene expression profile - - Data quantifying the level of expression of (typically) multiple genes, derived for example from microarray experiments. - beta12orEarlier - Gene expression pattern - - - - - - - - - - Microarray experiment report - - true - microarray experiments including conditions, protocol, sample:data relationships etc. - 1.8 - beta12orEarlier - - - - - - - - - - Oligonucleotide probe data - - beta12orEarlier - beta13 - true - Data on oligonucleotide probes (typically for use with DNA microarrays). - - - - - - - - - - SAGE experimental data - - beta12orEarlier - true - Output from a serial analysis of gene expression (SAGE) experiment. - Serial analysis of gene expression (SAGE) experimental data - beta12orEarlier - - - - - - - - - - MPSS experimental data - - beta12orEarlier - Massively parallel signature sequencing (MPSS) data. - beta12orEarlier - Massively parallel signature sequencing (MPSS) experimental data - true - - - - - - - - - - SBS experimental data - - beta12orEarlier - beta12orEarlier - true - Sequencing by synthesis (SBS) experimental data - Sequencing by synthesis (SBS) data. - - - - - - - - - - Sequence tag profile (with gene assignment) - - 1.14 - beta12orEarlier - true - Tag to gene assignments (tag mapping) of SAGE, MPSS and SBS data. Typically this is the sequencing-based expression profile annotated with gene identifiers. - - - - - - - - - - Protein X-ray crystallographic data - - X-ray crystallography data. - beta12orEarlier - - - - - - - - - - Protein NMR data - - Protein nuclear magnetic resonance (NMR) raw data. 
- beta12orEarlier - - - - - - - - - - Protein circular dichroism (CD) spectroscopic data - - beta12orEarlier - Protein secondary structure from protein coordinate or circular dichroism (CD) spectroscopic data. - - - - - - - - - - Electron microscopy volume map - - - - - - - - beta12orEarlier - Volume map data from electron microscopy. - EM volume map - - - - - - - - - - Electron microscopy model - - - - - - - - beta12orEarlier - Annotation on a structural 3D model (volume map) from electron microscopy. - This might include the location in the model of the known features of a particular macromolecule. - - - - - - - - - - 2D PAGE image - - - - - - - - beta12orEarlier - Two-dimensional gel electrophoresis image - - - - - - - - - - Mass spectrometry spectra - - - - - - - - beta12orEarlier - Spectra from mass spectrometry. - - - - - - - - - - Peptide mass fingerprint - - - - - - - - - Peak list - Protein fingerprint - A molecular weight standard fingerprint is standard protonated molecular masses e.g. from trypsin (modified porcine trypsin, Promega) and keratin peptides. - A set of peptide masses (peptide mass fingerprint) from mass spectrometry. - beta12orEarlier - Molecular weights standard fingerprint - - - - - - - - - - Peptide identification - - - - - - - - Protein or peptide identifications with evidence supporting the identifications, typically from comparing a peptide mass fingerprint (from mass spectrometry) to a sequence database. - beta12orEarlier - - - - - - - - - - Pathway or network annotation - - beta12orEarlier - true - An informative report about a specific biological pathway or network, typically including a map (diagram) of the pathway. - beta12orEarlier - - - - - - - - - - Biological pathway map - - beta12orEarlier - true - A map (typically a diagram) of a biological pathway. 
- beta12orEarlier - - - - - - - - - - Data resource definition - - beta12orEarlier - true - 1.5 - A definition of a data resource serving one or more types of data, including metadata and links to the resource or data proper. - - - - - - - - - - Workflow metadata - - Basic information, annotation or documentation concerning a workflow (but not the workflow itself). - beta12orEarlier - - - - - - - - - - Mathematical model - - - - - - - - Biological model - beta12orEarlier - A biological model represented in mathematical terms. - - - - - - - - - - Statistical estimate score - - beta12orEarlier - A value representing estimated statistical significance of some observed data; typically sequence database hits. - - - - - - - - - - EMBOSS database resource definition - - beta12orEarlier - Resource definition for an EMBOSS database. - true - 1.5 - - - - - - - - - - Version information - - "http://purl.obolibrary.org/obo/IAO_0000129" - 1.5 - Development status / maturity may be part of the version information, for example in case of tools, standards, or some data records. - http://www.ebi.ac.uk/swo/maturity/SWO_9000061 - beta12orEarlier - Information on a version of software or data, for example name, version number and release date. - http://semanticscience.org/resource/SIO_000653 - true - http://usefulinc.com/ns/doap#Version - - - - - - - - - - Database cross-mapping - - beta12orEarlier - A mapping of the accession numbers (or other database identifier) of entries between (typically) two biological or biomedical databases. - The cross-mapping is typically a table where each row is an accession number and each column is a database being cross-referenced. The cells give the accession number or identifier of the corresponding entry in a database. If a cell in the table is not filled then no mapping could be found for the database. Additional information might be given on version, date etc. 
- - - - - - - - - - Data index - - - - - - - - An index of data of biological relevance. - beta12orEarlier - - - - - - - - - - Data index report - - - - - - - - A report of an analysis of an index of biological data. - Database index annotation - beta12orEarlier - - - - - - - - - - Database metadata - - Basic information on bioinformatics database(s) or other data sources such as name, type, description, URL etc. - beta12orEarlier - - - - - - - - - - Tool metadata - - beta12orEarlier - Basic information about one or more bioinformatics applications or packages, such as name, type, description, or other documentation. - - - - - - - - - - Job metadata - - beta12orEarlier - true - 1.5 - Moby:PDGJOB - Textual metadata on a submitted or completed job. - - - - - - - - - - User metadata - - beta12orEarlier - Textual metadata on a software author or end-user, for example a person or other software. - - - - - - - - - - Small molecule report - - - - - - - - Small molecule annotation - Chemical structure report - An informative report on a specific chemical compound. - beta12orEarlier - Chemical compound annotation - - - - - - - - - - Cell line report - - Organism strain data - Cell line annotation - Report on a particular strain of organism cell line including plants, virus, fungi and bacteria. The data typically includes strain number, organism type, growth conditions, source and so on. - beta12orEarlier - - - - - - - - - - Scent annotation - - beta12orEarlier - An informative report about a specific scent. - 1.4 - true - - - - - - - - - - Ontology term - - Ontology class name - beta12orEarlier - A term (name) from an ontology. - Ontology terms - - - - - - - - - - Ontology concept data - - beta12orEarlier - Ontology class metadata - Ontology term metadata - Data concerning or derived from a concept from a biological ontology. - - - - - - - - - - Keyword - - Phrases - Keyword(s) or phrase(s) used (typically) for text-searching purposes. 
- Boolean operators (AND, OR and NOT) and wildcard characters may be allowed. - Moby:QueryString - beta12orEarlier - Moby:BooleanQueryString - Moby:Wildcard_Query - Moby:Global_Keyword - Terms - Text - - - - - - - - - - Citation - - Bibliographic data that uniquely identifies a scientific article, book or other published material. - A bibliographic reference might include information such as authors, title, journal name, date and (possibly) a link to the abstract or full-text of the article if available. - Moby:GCP_SimpleCitation - Reference - Bibliographic reference - Moby:Publication - beta12orEarlier - - - - - - - - - - Article - - - - - - - - A document of scientific text, typically a full text article from a scientific journal. - beta12orEarlier - - - - - - - - - - Text mining report - - An abstract of the results of text mining. - beta12orEarlier - Text mining output - A text mining abstract will typically include an annotated a list of words or sentences extracted from one or more scientific articles. - - - - - - - - - - Entity identifier - - beta12orEarlier - true - beta12orEarlier - An identifier of a biological entity or phenomenon. - - - - - - - - - - Data resource identifier - - true - An identifier of a data resource. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Identifier (typed) - - beta12orEarlier - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - An identifier that identifies a particular type of data. - - - - - - - - - - - Tool identifier - - An identifier of a bioinformatics tool, e.g. an application or web service. 
- beta12orEarlier - - - - - - - - - - - Discrete entity identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a discrete entity (any biological thing with a distinct, discrete physical existence). - - - - - - - - - - Entity feature identifier - - true - beta12orEarlier - Name or other identifier of an entity feature (a physical part or region of a discrete biological entity, or a feature that can be mapped to such a thing). - beta12orEarlier - - - - - - - - - - Entity collection identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a collection of discrete biological entities. - - - - - - - - - - Phenomenon identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a physical, observable biological occurrence or event. - - - - - - - - - - Molecule identifier - - Name or other identifier of a molecule. - beta12orEarlier - - - - - - - - - - - Atom ID - - Atom identifier - Identifier (e.g. character symbol) of a specific atom. - beta12orEarlier - - - - - - - - - - - Molecule name - - - Name of a specific molecule. - beta12orEarlier - - - - - - - - - - - Molecule type - - For example, 'Protein', 'DNA', 'RNA' etc. - true - 1.5 - beta12orEarlier - A label (text token) describing the type a molecule. - Protein|DNA|RNA - - - - - - - - - - Chemical identifier - - true - beta12orEarlier - beta12orEarlier - Unique identifier of a chemical compound. - - - - - - - - - - Chromosome name - - - - - - - - - beta12orEarlier - Name of a chromosome. - - - - - - - - - - - Peptide identifier - - Identifier of a peptide chain. - beta12orEarlier - - - - - - - - - - - Protein identifier - - - - - - - - beta12orEarlier - Identifier of a protein. - - - - - - - - - - - Compound name - - - Chemical name - Unique name of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number - - beta12orEarlier - Unique registry number of a chemical compound. 
- - - - - - - - - - - Ligand identifier - - true - beta12orEarlier - Code word for a ligand, for example from a PDB file. - beta12orEarlier - - - - - - - - - - Drug identifier - - - - - - - - beta12orEarlier - Identifier of a drug. - - - - - - - - - - - Amino acid identifier - - - - - - - - Identifier of an amino acid. - beta12orEarlier - Residue identifier - - - - - - - - - - - Nucleotide identifier - - beta12orEarlier - Name or other identifier of a nucleotide. - - - - - - - - - - - Monosaccharide identifier - - beta12orEarlier - Identifier of a monosaccharide. - - - - - - - - - - - Chemical name (ChEBI) - - ChEBI chemical name - Unique name from Chemical Entities of Biological Interest (ChEBI) of a chemical compound. - beta12orEarlier - This is the recommended chemical name for use for example in database annotation. - - - - - - - - - - - Chemical name (IUPAC) - - IUPAC recommended name of a chemical compound. - IUPAC chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (INN) - - INN chemical name - beta12orEarlier - International Non-proprietary Name (INN or 'generic name') of a chemical compound, assigned by the World Health Organization (WHO). - - - - - - - - - - - Chemical name (brand) - - Brand name of a chemical compound. - Brand chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (synonymous) - - beta12orEarlier - Synonymous chemical name - Synonymous name of a chemical compound. - - - - - - - - - - - Chemical registry number (CAS) - - CAS chemical registry number - CAS registry number of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number (Beilstein) - - Beilstein chemical registry number - beta12orEarlier - Beilstein registry number of a chemical compound. - - - - - - - - - - - Chemical registry number (Gmelin) - - Gmelin chemical registry number - beta12orEarlier - Gmelin registry number of a chemical compound. 
- - - - - - - - - - - HET group name - - 3-letter code word for a ligand (HET group) from a PDB file, for example ATP. - Short ligand name - Component identifier code - beta12orEarlier - - - - - - - - - - - Amino acid name - - String of one or more ASCII characters representing an amino acid. - beta12orEarlier - - - - - - - - - - - Nucleotide code - - - beta12orEarlier - String of one or more ASCII characters representing a nucleotide. - - - - - - - - - - - Polypeptide chain ID - - - - - - - - beta12orEarlier - WHATIF: chain - Chain identifier - Identifier of a polypeptide chain from a protein. - PDBML:pdbx_PDB_strand_id - Protein chain identifier - PDB strand id - PDB chain identifier - This is typically a character (for the chain) appended to a PDB identifier, e.g. 1cukA - Polypeptide chain identifier - - - - - - - - - - - Protein name - - - Name of a protein. - beta12orEarlier - - - - - - - - - - - Enzyme identifier - - beta12orEarlier - Name or other identifier of an enzyme or record from a database of enzymes. - - - - - - - - - - - EC number - - [0-9]+\.-\.-\.-|[0-9]+\.[0-9]+\.-\.-|[0-9]+\.[0-9]+\.[0-9]+\.-|[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+ - EC code - Moby:EC_Number - An Enzyme Commission (EC) number of an enzyme. - EC - Moby:Annotated_EC_Number - beta12orEarlier - Enzyme Commission number - - - - - - - - - - - Enzyme name - - - Name of an enzyme. - beta12orEarlier - - - - - - - - - - - Restriction enzyme name - - Name of a restriction enzyme. - beta12orEarlier - - - - - - - - - - - Sequence position specification - - 1.5 - A specification (partial or complete) of one or more positions or regions of a molecular sequence or map. - beta12orEarlier - true - - - - - - - - - - Sequence feature ID - - - A unique identifier of molecular sequence feature, for example an ID of a feature that is unique within the scope of the GFF file. 
- beta12orEarlier - - - - - - - - - - - Sequence position - - WHATIF: number - WHATIF: PDBx_atom_site - beta12orEarlier - PDBML:_atom_site.id - SO:0000735 - A position of one or more points (base or residue) in a sequence, or part of such a specification. - - - - - - - - - - Sequence range - - beta12orEarlier - Specification of range(s) of sequence positions. - - - - - - - - - - Nucleic acid feature identifier - - beta12orEarlier - beta12orEarlier - Name or other identifier of an nucleic acid feature. - true - - - - - - - - - - Protein feature identifier - - Name or other identifier of a protein feature. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence feature key - - Sequence feature method - The type of a sequence feature, typically a term or accession from the Sequence Ontology, for example an EMBL or Swiss-Prot sequence feature key. - Sequence feature type - beta12orEarlier - A feature key indicates the biological nature of the feature or information about changes to or versions of the sequence. - - - - - - - - - - Sequence feature qualifier - - beta12orEarlier - Typically one of the EMBL or Swiss-Prot feature qualifiers. - Feature qualifiers hold information about a feature beyond that provided by the feature key and location. - - - - - - - - - - Sequence feature label - - Sequence feature name - Typically an EMBL or Swiss-Prot feature label. - A feature label identifies a feature of a sequence database entry. When used with the database name and the entry's primary accession number, it is a unique identifier of that feature. - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Feature Object - - - beta12orEarlier - UFO - The name of a sequence feature-containing entity adhering to the standard feature naming scheme used by all EMBOSS applications. - - - - - - - - - - Codon name - - beta12orEarlier - beta12orEarlier - String of one or more ASCII characters representing a codon. 
- true - - - - - - - - - - Gene identifier - - - - - - - - Moby:GeneAccessionList - An identifier of a gene, such as a name/symbol or a unique identifier of a gene in a database. - beta12orEarlier - - - - - - - - - - - Gene symbol - - Moby_namespace:Global_GeneSymbol - beta12orEarlier - Moby_namespace:Global_GeneCommonName - The short name of a gene; a single word that does not contain white space characters. It is typically derived from the gene name. - - - - - - - - - - - Gene ID (NCBI) - - - NCBI geneid - Gene identifier (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs:NCBI_Gene - Entrez gene ID - Gene identifier (Entrez) - http://www.geneontology.org/doc/GO.xrf_abbs:LocusID - An NCBI unique identifier of a gene. - NCBI gene ID - beta12orEarlier - - - - - - - - - - - Gene identifier (NCBI RefSeq) - - beta12orEarlier - true - beta12orEarlier - An NCBI RefSeq unique identifier of a gene. - - - - - - - - - - Gene identifier (NCBI UniGene) - - beta12orEarlier - An NCBI UniGene unique identifier of a gene. - beta12orEarlier - true - - - - - - - - - - Gene identifier (Entrez) - - An Entrez unique identifier of a gene. - beta12orEarlier - true - [0-9]+ - beta12orEarlier - - - - - - - - - - Gene ID (CGD) - - CGD ID - Identifier of a gene or feature from the CGD database. - beta12orEarlier - - - - - - - - - - - Gene ID (DictyBase) - - beta12orEarlier - Identifier of a gene from DictyBase. - - - - - - - - - - - Ensembl gene ID - - - beta12orEarlier - Gene ID (Ensembl) - Unique identifier for a gene (or other feature) from the Ensembl database. - - - - - - - - - - - Gene ID (SGD) - - - Identifier of an entry from the SGD database. - S[0-9]+ - SGD identifier - beta12orEarlier - - - - - - - - - - - Gene ID (GeneDB) - - Moby_namespace:GeneDB - GeneDB identifier - beta12orEarlier - [a-zA-Z_0-9\.-]* - Identifier of a gene from the GeneDB database. - - - - - - - - - - - TIGR identifier - - - beta12orEarlier - Identifier of an entry from the TIGR database. 
- - - - - - - - - - - TAIR accession (gene) - - - Gene:[0-9]{7} - beta12orEarlier - Identifier of an gene from the TAIR database. - - - - - - - - - - - Protein domain ID - - - - - - - - - beta12orEarlier - Identifier of a protein structural domain. - This is typically a character or string concatenated with a PDB identifier and a chain identifier. - - - - - - - - - - - SCOP domain identifier - - Identifier of a protein domain (or other node) from the SCOP database. - beta12orEarlier - - - - - - - - - - - CATH domain ID - - 1nr3A00 - beta12orEarlier - CATH domain identifier - Identifier of a protein domain from CATH. - - - - - - - - - - - SCOP concise classification string (sccs) - - A SCOP concise classification string (sccs) is a compact representation of a SCOP domain classification. - beta12orEarlier - An scss includes the class (alphabetical), fold, superfamily and family (all numerical) to which a given domain belongs. - - - - - - - - - - - SCOP sunid - - Unique identifier (number) of an entry in the SCOP hierarchy, for example 33229. - beta12orEarlier - A sunid uniquely identifies an entry in the SCOP hierarchy, including leaves (the SCOP domains) and higher level nodes including entries corresponding to the protein level. - sunid - SCOP unique identifier - 33229 - - - - - - - - - - - CATH node ID - - 3.30.1190.10.1.1.1.1.1 - CATH code - A code number identifying a node from the CATH database. - CATH node identifier - beta12orEarlier - - - - - - - - - - - Kingdom name - - The name of a biological kingdom (Bacteria, Archaea, or Eukaryotes). - beta12orEarlier - - - - - - - - - - - Species name - - The name of a species (typically a taxonomic group) of organism. - Organism species - beta12orEarlier - - - - - - - - - - - Strain name - - - beta12orEarlier - The name of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - URI - - A string of characters that name or otherwise identify a resource on the Internet. 
- URIs - beta12orEarlier - - - - - - - - - - Database ID - - - - - - - - An identifier of a biological or bioinformatics database. - Database identifier - beta12orEarlier - - - - - - - - - - - Directory name - - beta12orEarlier - The name of a directory. - - - - - - - - - - - File name - - The name (or part of a name) of a file (of any type). - beta12orEarlier - - - - - - - - - - - Ontology name - - - - - - - - - beta12orEarlier - Name of an ontology of biological or bioinformatics concepts and relations. - - - - - - - - - - - URL - - A Uniform Resource Locator (URL). - Moby:URL - Moby:Link - beta12orEarlier - - - - - - - - - - URN - - beta12orEarlier - A Uniform Resource Name (URN). - - - - - - - - - - LSID - - beta12orEarlier - LSIDs provide a standard way to locate and describe data. An LSID is represented as a Uniform Resource Name (URN) with the following format: URN:LSID:<Authority>:<Namespace>:<ObjectID>[:<Version>] - Life Science Identifier - A Life Science Identifier (LSID) - a unique identifier of some data. - - - - - - - - - - Database name - - - The name of a biological or bioinformatics database. - beta12orEarlier - - - - - - - - - - - Sequence database name - - The name of a molecular sequence database. - true - beta13 - beta12orEarlier - - - - - - - - - - Enumerated file name - - beta12orEarlier - The name of a file (of any type) with restricted possible values. - - - - - - - - - - - File name extension - - The extension of a file name. - A file extension is the characters appearing after the final '.' in the file name. - beta12orEarlier - - - - - - - - - - - File base name - - beta12orEarlier - The base name of a file. - A file base name is the file name stripped of its directory specification and extension. - - - - - - - - - - - QSAR descriptor name - - - - - - - - - beta12orEarlier - Name of a QSAR descriptor. - - - - - - - - - - - Database entry identifier - - true - This concept is required for completeness. It should never have child concepts. 
- beta12orEarlier - An identifier of an entry from a database where the same type of identifier is used for objects (data) of different semantic type. - beta12orEarlier - - - - - - - - - - Sequence identifier - - - - - - - - An identifier of molecular sequence(s) or entries from a molecular sequence database. - beta12orEarlier - - - - - - - - - - - Sequence set ID - - - - - - - - - An identifier of a set of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence signature identifier - - beta12orEarlier - beta12orEarlier - true - Identifier of a sequence signature (motif or profile) for example from a database of sequence patterns. - - - - - - - - - - - Sequence alignment ID - - - - - - - - - Identifier of a molecular sequence alignment, for example a record from an alignment database. - beta12orEarlier - - - - - - - - - - - Phylogenetic distance matrix identifier - - beta12orEarlier - Identifier of a phylogenetic distance matrix. - true - beta12orEarlier - - - - - - - - - - Phylogenetic tree ID - - - - - - - - - beta12orEarlier - Identifier of a phylogenetic tree for example from a phylogenetic tree database. - - - - - - - - - - - Comparison matrix identifier - - - - - - - - An identifier of a comparison matrix. - Substitution matrix identifier - beta12orEarlier - - - - - - - - - - - Structure ID - - - beta12orEarlier - A unique and persistent identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Structural (3D) profile ID - - - - - - - - - Structural profile identifier - Identifier or name of a structural (3D) profile or template (representing a structure or structure alignment). - beta12orEarlier - - - - - - - - - - - Structure alignment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of tertiary structure alignments. - - - - - - - - - - - Amino acid index ID - - - - - - - - - Identifier of an index of amino acid physicochemical and biochemical property data. 
- beta12orEarlier - - - - - - - - - - - Protein interaction ID - - - - - - - - - beta12orEarlier - Molecular interaction ID - Identifier of a report of protein interactions from a protein interaction database (typically). - - - - - - - - - - - Protein family identifier - - - - - - - - Protein secondary database record identifier - Identifier of a protein family. - beta12orEarlier - - - - - - - - - - - Codon usage table name - - - - - - - - - - - - - - - Unique name of a codon usage table. - beta12orEarlier - - - - - - - - - - - Transcription factor identifier - - - Identifier of a transcription factor (or a TF binding site). - beta12orEarlier - - - - - - - - - - - Experiment annotation ID - - - - - - - - beta12orEarlier - Identifier of an entry from a database of microarray data. - - - - - - - - - - - Electron microscopy model ID - - - - - - - - - Identifier of an entry from a database of electron microscopy data. - beta12orEarlier - - - - - - - - - - - Gene expression report ID - - - - - - - - - Accession of a report of gene expression (e.g. a gene expression profile) from a database. - beta12orEarlier - Gene expression profile identifier - - - - - - - - - - - Genotype and phenotype annotation ID - - - - - - - - - Identifier of an entry from a database of genotypes and phenotypes. - beta12orEarlier - - - - - - - - - - - Pathway or network identifier - - - - - - - - Identifier of an entry from a database of biological pathways or networks. - beta12orEarlier - - - - - - - - - - - Workflow ID - - - beta12orEarlier - Identifier of a biological or biomedical workflow, typically from a database of workflows. - - - - - - - - - - - Data resource definition ID - - beta12orEarlier - Identifier of a data type definition from some provider. - Data resource definition identifier - - - - - - - - - - - Biological model ID - - - - - - - - Biological model identifier - beta12orEarlier - Identifier of a mathematical model, typically an entry from a database. 
- - - - - - - - - - - Compound identifier - - - - - - - - beta12orEarlier - Chemical compound identifier - Identifier of an entry from a database of chemicals. - Small molecule identifier - - - - - - - - - - - Ontology concept ID - - - A unique (typically numerical) identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - Article ID - - - - - - - - - beta12orEarlier - Unique identifier of a scientific article. - Article identifier - - - - - - - - - - - FlyBase ID - - - Identifier of an object from the FlyBase database. - FB[a-zA-Z_0-9]{2}[0-9]{7} - beta12orEarlier - - - - - - - - - - - WormBase name - - - Name of an object from the WormBase database, usually a human-readable name. - beta12orEarlier - - - - - - - - - - - WormBase class - - beta12orEarlier - Class of an object from the WormBase database. - A WormBase class describes the type of object such as 'sequence' or 'protein'. - - - - - - - - - - - Sequence accession - - - beta12orEarlier - A persistent, unique identifier of a molecular sequence database entry. - Sequence accession number - - - - - - - - - - - Sequence type - - 1.5 - Sequence type might reflect the molecule (protein, nucleic acid etc) or the sequence itself (gapped, ambiguous etc). - A label (text token) describing a type of molecular sequence. - true - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Sequence Address - - - EMBOSS USA - beta12orEarlier - The name of a sequence-based entity adhering to the standard sequence naming scheme used by all EMBOSS applications. - - - - - - - - - - - Sequence accession (protein) - - - - - - - - Accession number of a protein sequence database entry. - Protein sequence accession number - beta12orEarlier - - - - - - - - - - - Sequence accession (nucleic acid) - - - - - - - - Accession number of a nucleotide sequence database entry. 
- beta12orEarlier - Nucleotide sequence accession number - - - - - - - - - - - RefSeq accession - - Accession number of a RefSeq database entry. - beta12orEarlier - RefSeq ID - (NC|AC|NG|NT|NW|NZ|NM|NR|XM|XR|NP|AP|XP|YP|ZP)_[0-9]+ - - - - - - - - - - - UniProt accession (extended) - - true - Accession number of a UniProt (protein sequence) database entry. May contain version or isoform number. - [A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9].[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9].[0-9]+|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]-[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]-[0-9]+ - beta12orEarlier - Q7M1G0|P43353-2|P01012.107 - 1.0 - - - - - - - - - - PIR identifier - - - - - - - - An identifier of PIR sequence database entry. - beta12orEarlier - PIR ID - PIR accession number - - - - - - - - - - - TREMBL accession - - beta12orEarlier - Identifier of a TREMBL sequence database entry. - true - 1.2 - - - - - - - - - - Gramene primary identifier - - beta12orEarlier - Gramene primary ID - Primary identifier of a Gramene database entry. - - - - - - - - - - - EMBL/GenBank/DDBJ ID - - Identifier of a (nucleic acid) entry from the EMBL/GenBank/DDBJ databases. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniGene) - - UniGene identifier - UniGene cluster id - UniGene ID - UniGene cluster ID - beta12orEarlier - A unique identifier of an entry (gene cluster) from the NCBI UniGene database. - - - - - - - - - - - dbEST accession - - - dbEST ID - Identifier of a dbEST database entry. - beta12orEarlier - - - - - - - - - - - dbSNP ID - - beta12orEarlier - dbSNP identifier - Identifier of a dbSNP database entry. - - - - - - - - - - - EMBOSS sequence type - - beta12orEarlier - true - See the EMBOSS documentation (http://emboss.sourceforge.net/) for a definition of what this includes. - beta12orEarlier - The EMBOSS type of a molecular sequence. 
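The RefSeq and UniProt (extended) entries above carry regular expressions for their accessions. A minimal sketch of how a consumer of the ontology might apply them is given below. The patterns are copied verbatim from those entries; the `PATTERNS` table, its keys, and the `is_valid` helper are hypothetical names introduced for illustration, and whole-string matching is an assumption, since the entries do not say whether the patterns are anchored.

```python
import re

# Patterns copied verbatim from the removed concept entries.
# The dictionary keys are hypothetical labels, not EDAM identifiers.
PATTERNS = {
    "refseq": r"(NC|AC|NG|NT|NW|NZ|NM|NR|XM|XR|NP|AP|XP|YP|ZP)_[0-9]+",
    "uniprot_extended": (
        r"[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]"
        r"|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]"
        r"|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9].[0-9]+"
        r"|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9].[0-9]+"
        r"|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]-[0-9]+"
        r"|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]-[0-9]+"
    ),
}


def is_valid(kind: str, accession: str) -> bool:
    """Return True if the whole accession matches the pattern for `kind`.

    Assumption: the pattern must cover the entire string (re.fullmatch).
    """
    return re.fullmatch(PATTERNS[kind], accession) is not None


if __name__ == "__main__":
    # Example values listed in the UniProt (extended) entry.
    for value in "Q7M1G0|P43353-2|P01012.107".split("|"):
        print(value, is_valid("uniprot_extended", value))
```

Note that the version separator in the UniProt pattern is an unescaped `.`, so as written it accepts any single character in that position, not only a literal dot. A stricter consumer could escape it, but that would be a change to the pattern as recorded here.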
- - - - - - - - - - EMBOSS listfile - - 1.5 - List of EMBOSS Uniform Sequence Addresses (EMBOSS listfile). - true - beta12orEarlier - - - - - - - - - - Sequence cluster ID - - - - - - - - An identifier of a cluster of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (COG) - - COG ID - beta12orEarlier - Unique identifier of an entry from the COG database. - - - - - - - - - - - Sequence motif identifier - - - - - - - - Identifier of a sequence motif, for example an entry from a motif database. - beta12orEarlier - - - - - - - - - - - Sequence profile ID - - - - - - - - - Identifier of a sequence profile. - beta12orEarlier - A sequence profile typically represents a sequence alignment. - - - - - - - - - - - ELM ID - - Identifier of an entry from the ELMdb database of protein functional sites. - beta12orEarlier - - - - - - - - - - - Prosite accession number - - beta12orEarlier - Accession number of an entry from the Prosite database. - PS[0-9]{5} - Prosite ID - - - - - - - - - - - HMMER hidden Markov model ID - - - - - - - - Unique identifier or name of a HMMER hidden Markov model. - beta12orEarlier - - - - - - - - - - - JASPAR profile ID - - beta12orEarlier - Unique identifier or name of a profile from the JASPAR database. - - - - - - - - - - - Sequence alignment type - - beta12orEarlier - 1.5 - true - Possible values include for example the EMBOSS alignment types, BLAST alignment types and so on. - A label (text token) describing the type of a sequence alignment. - - - - - - - - - - BLAST sequence alignment type - - true - beta12orEarlier - beta12orEarlier - The type of a BLAST sequence alignment. - - - - - - - - - - Phylogenetic tree type - - For example 'nj', 'upgmp' etc. - beta12orEarlier - true - A label (text token) describing the type of a phylogenetic tree. - 1.5 - nj|upgmp - - - - - - - - - - TreeBASE study accession number - - Accession number of an entry from the TreeBASE database. 
- beta12orEarlier - - - - - - - - - - - TreeFam accession number - - beta12orEarlier - Accession number of an entry from the TreeFam database. - - - - - - - - - - - Comparison matrix type - - 1.5 - true - beta12orEarlier - blosum|pam|gonnet|id - A label (text token) describing the type of a comparison matrix. - Substitution matrix type - For example 'blosum', 'pam', 'gonnet', 'id' etc. Comparison matrix type may be required where a series of matrices of a certain type are used. - - - - - - - - - - Comparison matrix name - - - - - - - - - beta12orEarlier - Substitution matrix name - See for example http://www.ebi.ac.uk/Tools/webservices/help/matrix. - Unique name or identifier of a comparison matrix. - - - - - - - - - - - PDB ID - - An identifier of an entry from the PDB database. - [a-zA-Z_0-9]{4} - PDBID - PDB identifier - beta12orEarlier - - - - - - - - - - - AAindex ID - - beta12orEarlier - Identifier of an entry from the AAindex database. - - - - - - - - - - - BIND accession number - - Accession number of an entry from the BIND database. - beta12orEarlier - - - - - - - - - - - IntAct accession number - - EBI\-[0-9]+ - beta12orEarlier - Accession number of an entry from the IntAct database. - - - - - - - - - - - Protein family name - - - beta12orEarlier - Name of a protein family. - - - - - - - - - - - InterPro entry name - - - - - - - - beta12orEarlier - Name of an InterPro entry, usually indicating the type of protein matches for that entry. - - - - - - - - - - - InterPro accession - - - - - - - - Primary accession number of an InterPro entry. - InterPro primary accession - Every InterPro entry has a unique accession number to provide a persistent citation of database records. - beta12orEarlier - InterPro primary accession number - IPR015590 - IPR[0-9]{6} - - - - - - - - - - - InterPro secondary accession - - - - - - - - Secondary accession number of an InterPro entry. 
- beta12orEarlier - InterPro secondary accession number - - - - - - - - - - - Gene3D ID - - beta12orEarlier - Unique identifier of an entry from the Gene3D database. - - - - - - - - - - - PIRSF ID - - PIRSF[0-9]{6} - beta12orEarlier - Unique identifier of an entry from the PIRSF database. - - - - - - - - - - - PRINTS code - - beta12orEarlier - PR[0-9]{5} - The unique identifier of an entry in the PRINTS database. - - - - - - - - - - - Pfam accession number - - PF[0-9]{5} - Accession number of a Pfam entry. - beta12orEarlier - - - - - - - - - - - SMART accession number - - Accession number of an entry from the SMART database. - beta12orEarlier - SM[0-9]{5} - - - - - - - - - - - Superfamily hidden Markov model number - - Unique identifier (number) of a hidden Markov model from the Superfamily database. - beta12orEarlier - - - - - - - - - - - TIGRFam ID - - TIGRFam accession number - Accession number of an entry (family) from the TIGRFam database. - beta12orEarlier - - - - - - - - - - - ProDom accession number - - A ProDom domain family accession number. - PD[0-9]+ - beta12orEarlier - ProDom is a protein domain family database. - - - - - - - - - - - TRANSFAC accession number - - beta12orEarlier - Identifier of an entry from the TRANSFAC database. - - - - - - - - - - - ArrayExpress accession number - - Accession number of an entry from the ArrayExpress database. - beta12orEarlier - [AEP]-[a-zA-Z_0-9]{4}-[0-9]+ - ArrayExpress experiment ID - - - - - - - - - - - PRIDE experiment accession number - - [0-9]+ - beta12orEarlier - PRIDE experiment accession number. - - - - - - - - - - - EMDB ID - - beta12orEarlier - Identifier of an entry from the EMDB electron microscopy database. - - - - - - - - - - - GEO accession number - - Accession number of an entry from the GEO database. - o^GDS[0-9]+ - beta12orEarlier - - - - - - - - - - - GermOnline ID - - beta12orEarlier - Identifier of an entry from the GermOnline database. 
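Several of the protein family entries above (PIRSF, PRINTS, Pfam, SMART, ProDom) pair a concept name with a regular expression. As a rough sketch, those pairs can be used to guess which concept an unlabelled accession belongs to. The patterns and concept names are taken from the entries; the `FAMILY_PATTERNS` table and the `classify` function are hypothetical, and whole-string matching is again an assumption.

```python
import re

# Concept name -> pattern, both copied from the removed entries.
FAMILY_PATTERNS = {
    "PIRSF ID": r"PIRSF[0-9]{6}",
    "PRINTS code": r"PR[0-9]{5}",
    "Pfam accession number": r"PF[0-9]{5}",
    "SMART accession number": r"SM[0-9]{5}",
    "ProDom accession number": r"PD[0-9]+",
}


def classify(accession: str) -> list[str]:
    """Return every concept whose pattern matches the whole accession."""
    return [
        name
        for name, pattern in FAMILY_PATTERNS.items()
        if re.fullmatch(pattern, accession)
    ]


if __name__ == "__main__":
    for value in ["PF00069", "PR00109", "PIRSF000123", "SM00220", "unknown"]:
        print(value, classify(value))
```

Returning a list instead of a single name keeps the helper honest if two patterns ever overlap. For the five patterns above, the differing prefixes mean at most one can match.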
- - - - - - - - - - - EMAGE ID - - Identifier of an entry from the EMAGE database. - beta12orEarlier - - - - - - - - - - - Disease ID - - - Accession number of an entry from a database of disease. - beta12orEarlier - - - - - - - - - - - HGVbase ID - - Identifier of an entry from the HGVbase database. - beta12orEarlier - - - - - - - - - - - HIVDB identifier - - true - beta12orEarlier - Identifier of an entry from the HIVDB database. - beta12orEarlier - - - - - - - - - - OMIM ID - - beta12orEarlier - [*#+%^]?[0-9]{6} - Identifier of an entry from the OMIM database. - - - - - - - - - - - KEGG object identifier - - - beta12orEarlier - Unique identifier of an object from one of the KEGG databases (excluding the GENES division). - - - - - - - - - - - Pathway ID (reactome) - - Identifier of an entry from the Reactome database. - Reactome ID - beta12orEarlier - REACT_[0-9]+(\.[0-9]+)? - - - - - - - - - - - Pathway ID (aMAZE) - - beta12orEarlier - aMAZE ID - true - beta12orEarlier - Identifier of an entry from the aMAZE database. - - - - - - - - - - Pathway ID (BioCyc) - - - BioCyc pathway ID - beta12orEarlier - Identifier of an pathway from the BioCyc biological pathways database. - - - - - - - - - - - Pathway ID (INOH) - - beta12orEarlier - INOH identifier - Identifier of an entry from the INOH database. - - - - - - - - - - - Pathway ID (PATIKA) - - Identifier of an entry from the PATIKA database. - PATIKA ID - beta12orEarlier - - - - - - - - - - - Pathway ID (CPDB) - - This concept refers to identifiers used by the databases collated in CPDB; CPDB identifiers are not independently defined. - CPDB ID - Identifier of an entry from the CPDB (ConsensusPathDB) biological pathways database, which is an identifier from an external database integrated into CPDB. - beta12orEarlier - - - - - - - - - - - Pathway ID (Panther) - - Identifier of a biological pathway from the Panther Pathways database. 
- beta12orEarlier - PTHR[0-9]{5} - Panther Pathways ID - - - - - - - - - - - MIRIAM identifier - - - - - - - - Unique identifier of a MIRIAM data resource. - MIR:00100005 - MIR:[0-9]{8} - beta12orEarlier - This is the identifier used internally by MIRIAM for a data type. - - - - - - - - - - - MIRIAM data type name - - - - - - - - beta12orEarlier - The name of a data type from the MIRIAM database. - - - - - - - - - - - MIRIAM URI - - - - - - - - - beta12orEarlier - The URI (URL or URN) of a data entity from the MIRIAM database. - identifiers.org synonym - urn:miriam:pubmed:16333295|urn:miriam:obo.go:GO%3A0045202 - A MIRIAM URI consists of the URI of the MIRIAM data type (PubMed, UniProt etc) followed by the identifier of an element of that data type, for example PMID for a publication or an accession number for a GO term. - - - - - - - - - - - MIRIAM data type primary name - - beta12orEarlier - The primary name of a MIRIAM data type is taken from a controlled vocabulary. - UniProt|Enzyme Nomenclature - The primary name of a data type from the MIRIAM database. - - - - - - A protein entity has the MIRIAM data type 'UniProt', and an enzyme has the MIRIAM data type 'Enzyme Nomenclature'. - UniProt|Enzyme Nomenclature - - - - - - - - - - MIRIAM data type synonymous name - - A synonymous name of a data type from the MIRIAM database. - A synonymous name for a MIRIAM data type taken from a controlled vocabulary. - beta12orEarlier - - - - - - - - - - - Taverna workflow ID - - beta12orEarlier - Unique identifier of a Taverna workflow. - - - - - - - - - - - Biological model name - - - beta12orEarlier - Name of a biological (mathematical) model. - - - - - - - - - - - BioModel ID - - Unique identifier of an entry from the BioModel database. 
- beta12orEarlier - (BIOMD|MODEL)[0-9]{10} - - - - - - - - - - - PubChem CID - - - [0-9]+ - PubChem compound accession identifier - Chemical structure specified in PubChem Compound Identification (CID), a non-zero integer identifier for a unique chemical structure. - beta12orEarlier - - - - - - - - - - - ChemSpider ID - - Identifier of an entry from the ChemSpider database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ChEBI ID - - Identifier of an entry from the ChEBI database. - ChEBI IDs - ChEBI identifier - CHEBI:[0-9]+ - beta12orEarlier - - - - - - - - - - - BioPax concept ID - - beta12orEarlier - An identifier of a concept from the BioPax ontology. - - - - - - - - - - - GO concept ID - - GO concept identifier - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a concept from The Gene Ontology. - - - - - - - - - - - MeSH concept ID - - beta12orEarlier - An identifier of a concept from the MeSH vocabulary. - - - - - - - - - - - HGNC concept ID - - beta12orEarlier - An identifier of a concept from the HGNC controlled vocabulary. - - - - - - - - - - - NCBI taxonomy ID - - - NCBI taxonomy identifier - [1-9][0-9]{0,8} - NCBI tax ID - A stable unique identifier for each taxon (for a species, a family, an order, or any other group in the NCBI taxonomy database. - 9662|3483|182682 - beta12orEarlier - - - - - - - - - - - Plant Ontology concept ID - - An identifier of a concept from the Plant Ontology (PO). - beta12orEarlier - - - - - - - - - - - UMLS concept ID - - An identifier of a concept from the UMLS vocabulary. - beta12orEarlier - - - - - - - - - - - FMA concept ID - - An identifier of a concept from Foundational Model of Anatomy. - FMA:[0-9]+ - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia). 
Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - - - - - - - - - - - EMAP concept ID - - beta12orEarlier - An identifier of a concept from the EMAP mouse ontology. - - - - - - - - - - - ChEBI concept ID - - beta12orEarlier - An identifier of a concept from the ChEBI ontology. - - - - - - - - - - - MGED concept ID - - beta12orEarlier - An identifier of a concept from the MGED ontology. - - - - - - - - - - - myGrid concept ID - - beta12orEarlier - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology acts provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - An identifier of a concept from the myGrid ontology. - - - - - - - - - - - PubMed ID - - PMID - [1-9][0-9]{0,8} - PubMed unique identifier of an article. - beta12orEarlier - 4963447 - - - - - - - - - - - DOI - - beta12orEarlier - (doi\:)?[0-9]{2}\.[0-9]{4}/.* - Digital Object Identifier - Digital Object Identifier (DOI) of a published article. - - - - - - - - - - - Medline UI - - beta12orEarlier - Medline UI (unique identifier) of an article. - The use of Medline UI has been replaced by the PubMed unique identifier. - Medline unique identifier - - - - - - - - - - - Tool name - - The name of a computer package, application, method or function. - beta12orEarlier - - - - - - - - - - - Tool name (signature) - - beta12orEarlier - The unique name of a signature (sequence classifier) method. 
- Signature methods from http://www.ebi.ac.uk/Tools/InterProScan/help.html#results include BlastProDom, FPrintScan, HMMPIR, HMMPfam, HMMSmart, HMMTigr, ProfileScan, ScanRegExp, SuperFamily and HAMAP. - - - - - - - - - - - Tool name (BLAST) - - This include 'blastn', 'blastp', 'blastx', 'tblastn' and 'tblastx'. - The name of a BLAST tool. - beta12orEarlier - BLAST name - - - - - - - - - - - Tool name (FASTA) - - beta12orEarlier - The name of a FASTA tool. - This includes 'fasta3', 'fastx3', 'fasty3', 'fastf3', 'fasts3' and 'ssearch'. - - - - - - - - - - - Tool name (EMBOSS) - - The name of an EMBOSS application. - beta12orEarlier - - - - - - - - - - - Tool name (EMBASSY package) - - The name of an EMBASSY package. - beta12orEarlier - - - - - - - - - - - QSAR descriptor (constitutional) - - A QSAR constitutional descriptor. - beta12orEarlier - QSAR constitutional descriptor - - - - - - - - - - QSAR descriptor (electronic) - - beta12orEarlier - A QSAR electronic descriptor. - QSAR electronic descriptor - - - - - - - - - - QSAR descriptor (geometrical) - - QSAR geometrical descriptor - A QSAR geometrical descriptor. - beta12orEarlier - - - - - - - - - - QSAR descriptor (topological) - - beta12orEarlier - QSAR topological descriptor - A QSAR topological descriptor. - - - - - - - - - - QSAR descriptor (molecular) - - A QSAR molecular descriptor. - QSAR molecular descriptor - beta12orEarlier - - - - - - - - - - Sequence set (protein) - - Any collection of multiple protein sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. - beta12orEarlier - - - - - - - - - - Sequence set (nucleic acid) - - beta12orEarlier - Any collection of multiple nucleotide sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. 
- - - - - - - - - - Sequence cluster - - - - - - - - A set of sequences that have been clustered or otherwise classified as belonging to a group including (typically) sequence cluster information. - The cluster might include sequences identifiers, short descriptions, alignment and summary information. - beta12orEarlier - - - - - - - - - - Psiblast checkpoint file - - beta12orEarlier - A Psiblast checkpoint file uses ASN.1 Binary Format and usually has the extension '.asn'. - beta12orEarlier - true - A file of intermediate results from a PSIBLAST search that is used for priming the search in the next PSIBLAST iteration. - - - - - - - - - - HMMER synthetic sequences set - - Sequences generated by HMMER package in FASTA-style format. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Proteolytic digest - - - - - - - - beta12orEarlier - A protein sequence cleaved into peptide fragments (by enzymatic or chemical cleavage) with fragment masses. - - - - - - - - - - Restriction digest - - Restriction digest fragments from digesting a nucleotide sequence with restriction sites using a restriction endonuclease. - SO:0000412 - beta12orEarlier - - - - - - - - - - PCR primers - - beta12orEarlier - Oligonucleotide primer(s) for PCR and DNA amplification, for example a minimal primer set. - - - - - - - - - - vectorstrip cloning vector definition file - - beta12orEarlier - true - File of sequence vectors used by EMBOSS vectorstrip application, or any file in same format. - beta12orEarlier - - - - - - - - - - Primer3 internal oligo mishybridizing library - - true - beta12orEarlier - A library of nucleotide sequences to avoid during hybridization events. Hybridization of the internal oligo to sequences in this library is avoided, rather than priming from them. The file is in a restricted FASTA format. 
- beta12orEarlier - - - - - - - - - - Primer3 mispriming library file - - true - A nucleotide sequence library of sequences to avoid during amplification (for example repetitive sequences, or possibly the sequences of genes in a gene family that should not be amplified. The file must is in a restricted FASTA format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - primersearch primer pairs sequence record - - true - beta12orEarlier - beta12orEarlier - File of one or more pairs of primer sequences, as used by EMBOSS primersearch application. - - - - - - - - - - Sequence cluster (protein) - - - Protein sequence cluster - The sequences are typically related, for example a family of sequences. - beta12orEarlier - A cluster of protein sequences. - - - - - - - - - - Sequence cluster (nucleic acid) - - - A cluster of nucleotide sequences. - Nucleotide sequence cluster - beta12orEarlier - The sequences are typically related, for example a family of sequences. - - - - - - - - - - Sequence length - - beta12orEarlier - The size (length) of a sequence, subsequence or region in a sequence, or range(s) of lengths. - - - - - - - - - - Word size - - Word size is used for example in word-based sequence database search methods. - Word length - 1.5 - Size of a sequence word. - true - beta12orEarlier - - - - - - - - - - Window size - - 1.5 - true - A window is a region of fixed size but not fixed position over a molecular sequence. It is typically moved (computationally) over a sequence during scoring. - beta12orEarlier - Size of a sequence window. - - - - - - - - - - Sequence length range - - true - Specification of range(s) of length of sequences. - beta12orEarlier - 1.5 - - - - - - - - - - Sequence information report - - Report on basic information about a molecular sequence such as name, accession number, type (nucleic or protein), length, description etc. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence property - - beta12orEarlier - An informative report about non-positional sequence features, typically a report on general molecular sequence properties derived from sequence analysis. - Sequence properties report - - - - - - - - - - Sequence features - - Sequence features report - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D058977 - SO:0000110 - This includes annotation of positional sequence features, organized into a standard feature table, or any other report of sequence features. General feature reports are a source of sequence feature table information although internal conversion would be required. - General sequence features - Annotation of positional features of molecular sequence(s), i.e. that can be mapped to position(s) in the sequence. - Features - Feature record - - - - - - - - - - Sequence features (comparative) - - Comparative data on sequence features such as statistics, intersections (and data on intersections), differences etc. - beta13 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - true - beta12orEarlier - - - - - - - - - - Sequence property (protein) - - true - A report of general sequence properties derived from protein sequence data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence property (nucleic acid) - - A report of general sequence properties derived from nucleotide sequence data. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence complexity report - - A report on sequence complexity, for example low-complexity or repeat regions in sequences. - beta12orEarlier - Sequence property (complexity) - - - - - - - - - - Sequence ambiguity report - - A report on ambiguity in molecular sequence(s). 
- Sequence property (ambiguity) - beta12orEarlier - - - - - - - - - - Sequence composition report - - beta12orEarlier - A report (typically a table) on character or word composition / frequency of a molecular sequence(s). - Sequence property (composition) - - - - - - - - - - Peptide molecular weight hits - - A report on peptide fragments of certain molecular weight(s) in one or more protein sequences. - beta12orEarlier - - - - - - - - - - Base position variability plot - - beta12orEarlier - A plot of third base position variability in a nucleotide sequence. - - - - - - - - - - Sequence composition table - - A table of character or word composition / frequency of a molecular sequence. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Base frequencies table - - - beta12orEarlier - A table of base frequencies of a nucleotide sequence. - - - - - - - - - - Base word frequencies table - - - A table of word composition of a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Amino acid frequencies table - - - Sequence composition (amino acid frequencies) - A table of amino acid frequencies of a protein sequence. - beta12orEarlier - - - - - - - - - - Amino acid word frequencies table - - - A table of amino acid word composition of a protein sequence. - Sequence composition (amino acid words) - beta12orEarlier - - - - - - - - - - DAS sequence feature annotation - - beta12orEarlier - Annotation of a molecular sequence in DAS format. - beta12orEarlier - true - - - - - - - - - - Feature table - - Sequence feature table - beta12orEarlier - Annotation of positional sequence features, organized into a standard feature table. - - - - - - - - - - Map - - - - - - - - DNA map - beta12orEarlier - A map of (typically one) DNA sequence annotated with positional or non-positional features. - - - - - - - - - - Nucleic acid features - - - An informative report on intrinsic positional features of a nucleotide sequence. 
- beta12orEarlier - Genome features - This includes nucleotide sequence feature annotation in any known sequence feature table format and any other report of nucleic acid features. - Genomic features - Nucleic acid feature table - Feature table (nucleic acid) - - - - - - - - - - Protein features - - - An informative report on intrinsic positional features of a protein sequence. - beta12orEarlier - This includes protein sequence feature annotation in any known sequence feature table format and any other report of protein features. - Feature table (protein) - Protein feature table - - - - - - - - - - Genetic map - - A map showing the relative positions of genetic markers in a nucleic acid sequence, based on estimation of non-physical distance such as recombination frequencies. - beta12orEarlier - A genetic (linkage) map indicates the proximity of two genes on a chromosome, whether two genes are linked and the frequency they are transmitted together to an offspring. They are limited to genetic markers of traits observable only in whole organisms. - Linkage map - Moby:GeneticMap - - - - - - - - - - Sequence map - - A sequence map typically includes annotation on significant subsequences such as contigs, haplotypes and genes. The contigs shown will (typically) be a set of small overlapping clones representing a complete chromosomal segment. - beta12orEarlier - A map of genetic markers in a contiguous, assembled genomic sequence, with the sizes and separation of markers measured in base pairs. - - - - - - - - - - Physical map - - A map of DNA (linear or circular) annotated with physical features or landmarks such as restriction sites, cloned DNA fragments, genes or genetic markers, along with the physical distances between them. - Distance in a physical map is measured in base pairs. A physical map might be ordered relative to a reference map (typically a genetic map) in the process of genome sequencing. 
- beta12orEarlier - - - - - - - - - - Sequence signature map - - true - Image of a sequence with matches to signatures, motifs or profiles. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cytogenetic map - - beta12orEarlier - A map showing banding patterns derived from direct observation of a stained chromosome. - Cytologic map - Chromosome map - Cytogenic map - This is the lowest-resolution physical map and can provide only rough estimates of physical (base pair) distances. Like a genetic map, they are limited to genetic markers of traits observable only in whole organisms. - - - - - - - - - - DNA transduction map - - beta12orEarlier - A gene map showing distances between loci based on relative cotransduction frequencies. - - - - - - - - - - Gene map - - Sequence map of a single gene annotated with genetic features such as introns, exons, untranslated regions, polyA signals, promoters, enhancers and (possibly) mutations defining alleles of a gene. - beta12orEarlier - - - - - - - - - - Plasmid map - - Sequence map of a plasmid (circular DNA). - beta12orEarlier - - - - - - - - - - Genome map - - beta12orEarlier - Sequence map of a whole genome. - - - - - - - - - - Restriction map - - - Image of the restriction enzyme cleavage sites (restriction sites) in a nucleic acid sequence. - beta12orEarlier - - - - - - - - - - InterPro compact match image - - beta12orEarlier - Image showing matches between protein sequence(s) and InterPro Entries. - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Each protein is represented as a scaled horizontal line with colored bars indicating the position of the matches. - beta12orEarlier - true - - - - - - - - - - InterPro detailed match image - - beta12orEarlier - beta12orEarlier - Image showing detailed information on matches between protein sequence(s) and InterPro Entries. 
- The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. - true - - - - - - - - - - InterPro architecture image - - beta12orEarlier - beta12orEarlier - true - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Domain architecture is shown as a series of non-overlapping domains in the protein. - Image showing the architecture of InterPro domains in a protein sequence. - - - - - - - - - - SMART protein schematic - - true - beta12orEarlier - beta12orEarlier - SMART protein schematic in PNG format. - - - - - - - - - - GlobPlot domain image - - beta12orEarlier - beta12orEarlier - true - Images based on GlobPlot prediction of intrinsic disordered regions and globular domains in protein sequences. - - - - - - - - - - Sequence motif matches - - beta12orEarlier - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more sequences. - 1.8 - true - - - - - - - - - - Sequence features (repeats) - - beta12orEarlier - true - 1.5 - Repeat sequence map - The report might include derived data map such as classification, annotation, organization, periodicity etc. - Location of short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. - - - - - - - - - - Gene and transcript structure (report) - - 1.5 - beta12orEarlier - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - true - - - - - - - - - - Mobile genetic elements - - true - beta12orEarlier - regions of a nucleic acid sequence containing mobile genetic elements. - 1.8 - - - - - - - - - - Nucleic acid features report (PolyA signal or site) - - true - regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. 
- 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features (quadruplexes) - - true - 1.5 - A report on quadruplex-forming motifs in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Nucleic acid features report (CpG island and isochore) - - 1.8 - CpG rich regions (isochores) in a nucleotide sequence. - beta12orEarlier - true - - - - - - - - - - Nucleic acid features report (restriction sites) - - beta12orEarlier - true - 1.8 - restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - - - - - - - - - - Nucleosome exclusion sequences - - beta12orEarlier - true - Report on nucleosome formation potential or exclusion sequence(s). - 1.8 - - - - - - - - - - Nucleic acid features report (splice sites) - - splice sites in a nucleotide sequence or alternative RNA splicing events. - beta12orEarlier - true - 1.8 - - - - - - - - - - Nucleic acid features report (matrix/scaffold attachment sites) - - 1.8 - matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Gene features (exonic splicing enhancer) - - beta12orEarlier - beta13 - true - A report on exonic splicing enhancers (ESE) in an exon. - - - - - - - - - - Nucleic acid features (microRNA) - - true - beta12orEarlier - A report on microRNA sequence (miRNA) or precursor, microRNA targets, miRNA binding sites in an RNA sequence etc. - 1.5 - - - - - - - - - - Gene features report (operon) - - true - operons (operators, promoters and genes) from a bacterial genome. - 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features report (promoters) - - 1.8 - whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Coding region - - beta12orEarlier - protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. 
- 1.8 - true - - - - - - - - - - Gene features (SECIS element) - - beta12orEarlier - beta13 - A report on selenocysteine insertion sequence (SECIS) element in a DNA sequence. - true - - - - - - - - - - Transcription factor binding sites - - transcription factor binding sites (TFBS) in a DNA sequence. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein features (sites) - - true - beta12orEarlier - Use this concept for collections of specific sites which are not necessarily contiguous, rather than contiguous stretches of amino acids. - beta12orEarlier - A report on predicted or known key residue positions (sites) in a protein sequence, such as binding or functional sites. - - - - - - - - - - Protein features report (signal peptides) - - true - signal peptides or signal peptide cleavage sites in protein sequences. - 1.8 - beta12orEarlier - - - - - - - - - - Protein features report (cleavage sites) - - true - 1.8 - cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. - beta12orEarlier - - - - - - - - - - Protein features (post-translation modifications) - - true - beta12orEarlier - post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein features report (active sites) - - 1.8 - true - beta12orEarlier - catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein features report (binding sites) - - beta12orEarlier - ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids. - true - 1.8 - - - - - - - - - - Protein features (epitopes) - - A report on antigenic determinant sites (epitopes) in proteins, from sequence and / or structural data. - beta13 - beta12orEarlier - Epitope mapping is commonly done during vaccine design. 
- true - - - - - - - - - - Protein features report (nucleic acid binding sites) - - true - beta12orEarlier - 1.8 - RNA and DNA-binding proteins and binding sites in protein sequences. - - - - - - - - - - MHC Class I epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on epitopes that bind to MHC class I molecules. - - - - - - - - - - MHC Class II epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on predicted epitopes that bind to MHC class II molecules. - - - - - - - - - - Protein features (PEST sites) - - beta12orEarlier - A report or plot of PEST sites in a protein sequence. - true - beta13 - 'PEST' motifs target proteins for proteolytic degradation and reduce the half-lives of proteins dramatically. - - - - - - - - - - Sequence database hits scores list - - Scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database hits alignments list - - beta12orEarlier - Alignments from a sequence database search (for example a BLAST search). - beta12orEarlier - true - - - - - - - - - - Sequence database hits evaluation data - - beta12orEarlier - A report on the evaluation of the significance of sequence similarity scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - - - - - - - - - - MEME motif alphabet - - Alphabet for the motifs (patterns) that MEME will search for. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - MEME background frequencies file - - MEME background frequencies file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - MEME motifs directive file - - beta12orEarlier - true - File of directives for ordering and spacing of MEME motifs. - beta12orEarlier - - - - - - - - - - Dirichlet distribution - - Dirichlet distribution used by hidden Markov model analysis programs. 
- beta12orEarlier - - - - - - - - - - HMM emission and transition counts - - Emission and transition counts of a hidden Markov model, generated once HMM has been determined, for example after residues/gaps have been assigned to match, delete and insert states. - true - 1.4 - beta12orEarlier - - - - - - - - - - - Regular expression - - Regular expression pattern. - beta12orEarlier - - - - - - - - - - Sequence motif - - - - - - - - beta12orEarlier - Any specific or conserved pattern (typically expressed as a regular expression) in a molecular sequence. - - - - - - - - - - Sequence profile - - - - - - - - Some type of statistical model representing a (typically multiple) sequence alignment. - http://semanticscience.org/resource/SIO_010531 - beta12orEarlier - - - - - - - - - - Protein signature - - An informative report about a specific or conserved protein sequence pattern. - InterPro entry - Protein repeat signature - Protein region signature - Protein site signature - beta12orEarlier - Protein family signature - Protein domain signature - - - - - - - - - - Prosite nucleotide pattern - - A nucleotide regular expression pattern from the Prosite database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Prosite protein pattern - - A protein regular expression pattern from the Prosite database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Position frequency matrix - - beta12orEarlier - PFM - A profile (typically representing a sequence alignment) that is a simple matrix of nucleotide (or amino acid) counts per position. - - - - - - - - - - Position weight matrix - - PWM - beta12orEarlier - A profile (typically representing a sequence alignment) that is weighted matrix of nucleotide (or amino acid) counts per position. - Contributions of individual sequences to the matrix might be uneven (weighted). 
- - - - - - - - - - Information content matrix - - beta12orEarlier - ICM - A profile (typically representing a sequence alignment) derived from a matrix of nucleotide (or amino acid) counts per position that reflects information content at each position. - - - - - - - - - - Hidden Markov model - - HMM - beta12orEarlier - A hidden Markov model representation of a set or alignment of sequences. - - - - - - - - - - Fingerprint - - beta12orEarlier - One or more fingerprints (sequence classifiers) as used in the PRINTS database. - - - - - - - - - - Domainatrix signature - - A protein signature of the type used in the EMBASSY Signature package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - HMMER NULL hidden Markov model - - beta12orEarlier - beta12orEarlier - true - NULL hidden Markov model representation used by the HMMER package. - - - - - - - - - - Protein family signature - - Protein family signatures cover all domains in the matching proteins and span >80% of the protein length and with no adjacent protein domain signatures or protein region signatures. - beta12orEarlier - true - 1.5 - A protein family signature (sequence classifier) from the InterPro database. - - - - - - - - - - Protein domain signature - - beta12orEarlier - 1.5 - true - A protein domain signature (sequence classifier) from the InterPro database. - Protein domain signatures identify structural or functional domains or other units with defined boundaries. - - - - - - - - - - Protein region signature - - A protein region signature (sequence classifier) from the InterPro database. - true - beta12orEarlier - 1.5 - A protein region signature defines a region which cannot be described as a protein family or domain signature. - - - - - - - - - - Protein repeat signature - - true - 1.5 - A protein repeat signature is a repeated protein motif, that is not in single copy expected to independently fold into a globular domain. 
- beta12orEarlier - A protein repeat signature (sequence classifier) from the InterPro database. - - - - - - - - - - Protein site signature - - A protein site signature is a classifier for a specific site in a protein. - beta12orEarlier - A protein site signature (sequence classifier) from the InterPro database. - true - 1.5 - - - - - - - - - - Protein conserved site signature - - 1.4 - true - A protein conserved site signature is any short sequence pattern that may contain one or more unique residues and is cannot be described as a active site, binding site or post-translational modification. - A protein conserved site signature (sequence classifier) from the InterPro database. - beta12orEarlier - - - - - - - - - - Protein active site signature - - A protein active site signature (sequence classifier) from the InterPro database. - A protein active site signature corresponds to an enzyme catalytic pocket. An active site typically includes non-contiguous residues, therefore multiple signatures may be required to describe an active site. ; residues involved in enzymatic reactions for which mutational data is typically available. - true - 1.4 - beta12orEarlier - - - - - - - - - - Protein binding site signature - - 1.4 - A protein binding site signature (sequence classifier) from the InterPro database. - true - A protein binding site signature corresponds to a site that reversibly binds chemical compounds, which are not themselves substrates of the enzymatic reaction. This includes enzyme cofactors and residues involved in electron transport or protein structure modification. - beta12orEarlier - - - - - - - - - - Protein post-translational modification signature - - A protein post-translational modification signature (sequence classifier) from the InterPro database. - A protein post-translational modification signature corresponds to sites that undergo modification of the primary structure, typically to activate or de-activate a function. 
For example, methylation, sumoylation, glycosylation etc. The modification might be permanent or reversible. - 1.4 - beta12orEarlier - true - - - - - - - - - - Sequence alignment (pair) - - http://semanticscience.org/resource/SIO_010068 - beta12orEarlier - Alignment of exactly two molecular sequences. - - - - - - - - - - Sequence alignment (multiple) - - beta12orEarlier - beta12orEarlier - Alignment of more than two molecular sequences. - true - - - - - - - - - - Sequence alignment (nucleic acid) - - beta12orEarlier - Alignment of multiple nucleotide sequences. - - - - - - - - - - Sequence alignment (protein) - - - Alignment of multiple protein sequences. - beta12orEarlier - - - - - - - - - - Sequence alignment (hybrid) - - Alignment of multiple molecular sequences of different types. - Hybrid sequence alignments include for example genomic DNA to EST, cDNA or mRNA. - beta12orEarlier - - - - - - - - - - Sequence alignment (nucleic acid pair) - - beta12orEarlier - Alignment of exactly two nucleotide sequences. - true - 1.12 - - - - - - - - - - - Sequence alignment (protein pair) - - true - 1.12 - Alignment of exactly two protein sequences. - beta12orEarlier - - - - - - - - - - - Hybrid sequence alignment (pair) - - true - beta12orEarlier - beta12orEarlier - Alignment of exactly two molecular sequences of different types. - - - - - - - - - - Multiple nucleotide sequence alignment - - beta12orEarlier - Alignment of more than two nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - Multiple protein sequence alignment - - true - beta12orEarlier - beta12orEarlier - Alignment of more than two protein sequences. - - - - - - - - - - Alignment score or penalty - - beta12orEarlier - A simple floating point number defining the penalty for opening or extending a gap in an alignment. - - - - - - - - - - Score end gaps control - - beta12orEarlier - beta12orEarlier - Whether end gaps are scored or not. 
- true - - - - - - - - - - Aligned sequence order - - beta12orEarlier - beta12orEarlier - true - Controls the order of sequences in an output sequence alignment. - - - - - - - - - - Gap opening penalty - - A penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap extension penalty - - A penalty for extending a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap separation penalty - - beta12orEarlier - A penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap penalty - - beta12orEarlier - A penalty for gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - - Match reward score - - beta12orEarlier - The score for a 'match' used in various sequence database search applications with simple scoring schemes. - - - - - - - - - - Mismatch penalty score - - beta12orEarlier - The score (penalty) for a 'mismatch' used in various alignment and sequence database search applications with simple scoring schemes. - - - - - - - - - - Drop off score - - This is the threshold drop in score at which extension of word alignment is halted. - beta12orEarlier - - - - - - - - - - Gap opening penalty (integer) - - beta12orEarlier - true - A simple floating point number defining the penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap opening penalty (float) - - beta12orEarlier - beta12orEarlier - A simple floating point number defining the penalty for opening a gap in an alignment. - true - - - - - - - - - - Gap extension penalty (integer) - - true - A simple floating point number defining the penalty for extending a gap in an alignment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gap extension penalty (float) - - beta12orEarlier - true - A simple floating point number defining the penalty for extending a gap in an alignment. 
- beta12orEarlier - - - - - - - - - - Gap separation penalty (integer) - - A simple floating point number defining the penalty for gaps that are close together in an alignment. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gap separation penalty (float) - - beta12orEarlier - true - beta12orEarlier - A simple floating point number defining the penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap opening penalty - - beta12orEarlier - A number defining the penalty for opening gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - - - - - - - - - - Terminal gap extension penalty - - A number defining the penalty for extending gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Sequence identity - - Sequence identity is the number (%) of matches (identical characters) in positions from an alignment of two molecular sequences. - beta12orEarlier - - - - - - - - - - Sequence similarity - - beta12orEarlier - Sequence similarity is the similarity (expressed as a percentage) of two molecular sequences calculated from their alignment, a scoring matrix for scoring characters substitutions and penalties for gap insertion and extension. - Data Type is float probably. - - - - - - - - - - Sequence alignment metadata (quality report) - - beta12orEarlier - true - beta12orEarlier - Data on molecular sequence alignment quality (estimated accuracy). - - - - - - - - - - Sequence alignment report (site conservation) - - beta12orEarlier - Data on character conservation in a molecular sequence alignment. - 1.4 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - true - - - - - - - - - - Sequence alignment report (site correlation) - - 1.4 - beta12orEarlier - Data on correlations between sites in a molecular sequence alignment, typically to identify possible covarying positions and predict contacts or structural constraints in protein structures. - true - - - - - - - - - - Sequence-profile alignment (Domainatrix signature) - - beta12orEarlier - Alignment of molecular sequences to a Domainatrix signature (representing a sequence alignment). - beta12orEarlier - true - - - - - - - - - - Sequence-profile alignment (HMM) - - beta12orEarlier - 1.5 - true - Alignment of molecular sequence(s) to a hidden Markov model(s). - - - - - - - - - - Sequence-profile alignment (fingerprint) - - Alignment of molecular sequences to a protein fingerprint from the PRINTS database. - 1.5 - beta12orEarlier - true - - - - - - - - - - Phylogenetic continuous quantitative data - - beta12orEarlier - Phylogenetic continuous quantitative characters - Quantitative traits - Continuous quantitative data that may be read during phylogenetic tree calculation. - - - - - - - - - - Phylogenetic discrete data - - Discrete characters - Character data with discrete states that may be read during phylogenetic tree calculation. - Phylogenetic discrete states - beta12orEarlier - Discretely coded characters - - - - - - - - - - Phylogenetic character cliques - - One or more cliques of mutually compatible characters that are generated, for example from analysis of discrete character data, and are used to generate a phylogeny. - Phylogenetic report (cliques) - beta12orEarlier - - - - - - - - - - Phylogenetic invariants - - - - - - - - Phylogenetic invariants data for testing alternative tree topologies. 
- beta12orEarlier - Phylogenetic report (invariants) - - - - - - - - - - Phylogenetic report - - Phylogenetic tree-derived report - This is a broad data type and is used for example for reports on confidence, shape or stratigraphic (age) data derived from phylogenetic tree analysis. - beta12orEarlier - A report of data concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - Phylogenetic tree report - 1.5 - true - - - - - - - - - - DNA substitution model - - Substitution model - Phylogenetic tree report (DNA substitution model) - Sequence alignment report (DNA substitution model) - beta12orEarlier - A model of DNA substitution that explains a DNA sequence alignment, derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic tree report (tree shape) - - beta12orEarlier - true - 1.4 - Data about the shape of a phylogenetic tree. - - - - - - - - - - Phylogenetic tree report (tree evaluation) - - beta12orEarlier - true - 1.4 - Data on the confidence of a phylogenetic tree. - - - - - - - - - - Phylogenetic tree distances - - beta12orEarlier - Phylogenetic tree report (tree distances) - Distances, such as Branch Score distance, between two or more phylogenetic trees. - - - - - - - - - - Phylogenetic tree report (tree stratigraphic) - - beta12orEarlier - 1.4 - true - Molecular clock and stratigraphic (age) data derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic character contrasts - - Phylogenetic report (character contrasts) - Independent contrasts for characters used in a phylogenetic tree, or covariances, regressions and correlations between characters for those contrasts. - beta12orEarlier - - - - - - - - - - Comparison matrix (integers) - - beta12orEarlier - Substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for sequence comparison. 
- true - - - - - - - - - - Comparison matrix (floats) - - beta12orEarlier - beta12orEarlier - true - Matrix of floating point numbers for sequence comparison. - Substitution matrix (floats) - - - - - - - - - - Comparison matrix (nucleotide) - - Matrix of integer or floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix - - - - - - - - - - Comparison matrix (amino acid) - - - Amino acid comparison matrix - beta12orEarlier - Matrix of integer or floating point numbers for amino acid comparison. - Amino acid substitution matrix - - - - - - - - - - Nucleotide comparison matrix (integers) - - Nucleotide substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for nucleotide comparison. - true - beta12orEarlier - - - - - - - - - - Nucleotide comparison matrix (floats) - - beta12orEarlier - true - Matrix of floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix (floats) - - - - - - - - - - Amino acid comparison matrix (integers) - - beta12orEarlier - Matrix of integer numbers for amino acid comparison. - Amino acid substitution matrix (integers) - true - beta12orEarlier - - - - - - - - - - Amino acid comparison matrix (floats) - - beta12orEarlier - Amino acid substitution matrix (floats) - beta12orEarlier - true - Matrix of floating point numbers for amino acid comparison. - - - - - - - - - - Protein features report (membrane regions) - - true - beta12orEarlier - 1.8 - trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Nucleic acid structure - - - - - - - - 3D coordinate and associated data for a nucleic acid tertiary (3D) structure. - beta12orEarlier - - - - - - - - - - Protein structure - - - - - - - - Protein structures - 3D coordinate and associated data for a protein tertiary (3D) structure. 
- beta12orEarlier - - - - - - - - - - Protein-ligand complex - - The structure of a protein in complex with a ligand, typically a small molecule such as an enzyme substrate or cofactor, but possibly another macromolecule. - beta12orEarlier - This includes interactions of proteins with atoms, ions and small molecules or macromolecules such as nucleic acids or other polypeptides. For stable inter-polypeptide interactions use 'Protein complex' instead. - - - - - - - - - - Carbohydrate structure - - - - - - - - - - - - - - beta12orEarlier - 3D coordinate and associated data for a carbohydrate (3D) structure. - - - - - - - - - - Small molecule structure - - - - - - - - 3D coordinate and associated data for the (3D) structure of a small molecule, such as any common chemical compound. - CHEBI:23367 - beta12orEarlier - - - - - - - - - - DNA structure - - beta12orEarlier - 3D coordinate and associated data for a DNA tertiary (3D) structure. - - - - - - - - - - RNA structure - - - - - - - - beta12orEarlier - 3D coordinate and associated data for an RNA tertiary (3D) structure. - - - - - - - - - - tRNA structure - - 3D coordinate and associated data for a tRNA tertiary (3D) structure, including tmRNA, snoRNAs etc. - beta12orEarlier - - - - - - - - - - Protein chain - - beta12orEarlier - 3D coordinate and associated data for the tertiary (3D) structure of a polypeptide chain. - - - - - - - - - - Protein domain - - - - - - - - 3D coordinate and associated data for the tertiary (3D) structure of a protein domain. - beta12orEarlier - - - - - - - - - - Protein structure (all atoms) - - beta12orEarlier - 1.5 - true - 3D coordinate and associated data for a protein tertiary (3D) structure (all atoms). - - - - - - - - - - C-alpha trace - - 3D coordinate and associated data for a protein tertiary (3D) structure (typically C-alpha atoms only). - C-beta atoms from amino acid side-chains may be included. 
- Protein structure (C-alpha atoms) - beta12orEarlier - - - - - - - - - - Protein chain (all atoms) - - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (all atoms). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Protein chain (C-alpha atoms) - - true - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Protein domain (all atoms) - - 3D coordinate and associated data for a protein domain tertiary (3D) structure (all atoms). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Protein domain (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - 3D coordinate and associated data for a protein domain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Structure alignment (pair) - - Alignment (superimposition) of exactly two molecular tertiary (3D) structures. - beta12orEarlier - Pair structure alignment - - - - - - - - - - Structure alignment (multiple) - - beta12orEarlier - beta12orEarlier - true - Alignment (superimposition) of more than two molecular tertiary (3D) structures. - - - - - - - - - - Structure alignment (protein) - - - Protein structure alignment - beta12orEarlier - Alignment (superimposition) of protein tertiary (3D) structures. - - - - - - - - - - Structure alignment (nucleic acid) - - beta12orEarlier - Alignment (superimposition) of nucleic acid tertiary (3D) structures. - Nucleic acid structure alignment - - - - - - - - - - Structure alignment (protein pair) - - 1.12 - Protein pair structural alignment - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures. 
- - - - - - - - - - - Multiple protein tertiary structure alignment - - Alignment (superimposition) of more than two protein tertiary (3D) structures. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structure alignment (protein all atoms) - - 1.5 - Alignment (superimposition) of protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - true - - - - - - - - - - Structure alignment (protein C-alpha atoms) - - Alignment (superimposition) of protein tertiary (3D) structures (typically C-alpha atoms only considered). - C-beta atoms from amino acid side-chains may be considered. - 1.5 - C-alpha trace - true - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (all atoms) - - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). - beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (all atoms) - - beta12orEarlier - true - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (C-alpha atoms) - - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). - true - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Structure alignment (nucleic acid pair) - - beta12orEarlier - 1.12 - true - Nucleic acid pair structure alignment - Alignment (superimposition) of exactly two nucleic acid tertiary (3D) structures. 
- - - - - - - - - - - Multiple nucleic acid tertiary structure alignment - - beta12orEarlier - Alignment (superimposition) of more than two nucleic acid tertiary (3D) structures. - true - beta12orEarlier - - - - - - - - - - Structure alignment (RNA) - - RNA structure alignment - Alignment (superimposition) of RNA tertiary (3D) structures. - beta12orEarlier - - - - - - - - - Structural transformation matrix - - Matrix to transform (rotate/translate) 3D coordinates, typically the transformation necessary to superimpose two molecular structures. - beta12orEarlier - - - - - - - - - - DaliLite hit table - - DaliLite hit table of protein chain tertiary structure alignment data. - The significant and top-scoring hits for regions of the compared structures is shown. Data such as Z-Scores, number of aligned residues, root-mean-square deviation (RMSD) of atoms and sequence identity are given. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Molecular similarity score - - beta12orEarlier - A score reflecting structural similarities of two molecules. - true - beta12orEarlier - - - - - - - - - - Root-mean-square deviation - - RMSD - beta12orEarlier - Root-mean-square deviation (RMSD) is calculated to measure the average distance between superimposed macromolecular coordinates. - - - - - - - - - - Tanimoto similarity score - - beta12orEarlier - A measure of the similarity between two ligand fingerprints. - A ligand fingerprint is derived from ligand structural data from a Protein DataBank file. It reflects the elements or groups present or absent, covalent bonds and bond orders and the bonded environment in terms of SATIS codes and BLEEP atom types. - - - - - - - - - - 3D-1D scoring matrix - - A matrix of 3D-1D scores reflecting the probability of amino acids to occur in different tertiary structural environments. - beta12orEarlier - - - - - - - - - - Amino acid index - - - beta12orEarlier - A table of 20 numerical values which quantify a property (e.g. 
physicochemical or biochemical) of the common amino acids. - - - - - - - - - - Amino acid index (chemical classes) - - Chemical classes (amino acids) - Chemical classification (small, aliphatic, aromatic, polar, charged etc) of amino acids. - beta12orEarlier - - - - - - - - - - Amino acid pair-wise contact potentials - - Contact potentials (amino acid pair-wise) - Statistical protein contact potentials. - beta12orEarlier - - - - - - - - - - Amino acid index (molecular weight) - - Molecular weights of amino acids. - Molecular weight (amino acids) - beta12orEarlier - - - - - - - - - - Amino acid index (hydropathy) - - Hydrophobic, hydrophilic or charge properties of amino acids. - beta12orEarlier - Hydropathy (amino acids) - - - - - - - - - - Amino acid index (White-Wimley data) - - beta12orEarlier - White-Wimley data (amino acids) - Experimental free energy values for the water-interface and water-octanol transitions for the amino acids. - - - - - - - - - - Amino acid index (van der Waals radii) - - van der Waals radii (amino acids) - Van der Waals radii of atoms for different amino acid residues. - beta12orEarlier - - - - - - - - - - Enzyme report - - true - 1.5 - Protein report (enzyme) - beta12orEarlier - An informative report on a specific enzyme. - - - - - - - - - - Restriction enzyme report - - An informative report on a specific restriction enzyme such as enzyme reference data. - This might include name of enzyme, organism, isoschizomers, methylation, source, suppliers, literature references, or data on restriction enzyme patterns such as name of enzyme, recognition site, length of pattern, number of cuts made by enzyme, details of blunt or sticky end cut etc. - Restriction enzyme pattern data - Protein report (restriction enzyme) - beta12orEarlier - true - 1.5 - - - - - - - - - - Peptide molecular weights - - beta12orEarlier - List of molecular weight(s) of one or more proteins or peptides, for example cut by proteolytic enzymes or reagents. 
- The report might include associated data such as frequency of peptide fragment molecular weights. - - - - - - - - - - Peptide hydrophobic moment - - beta12orEarlier - Report on the hydrophobic moment of a polypeptide sequence. - Hydrophobic moment is a peptide's hydrophobicity measured for different angles of rotation. - - - - - - - - - - Protein aliphatic index - - The aliphatic index of a protein. - beta12orEarlier - The aliphatic index is the relative protein volume occupied by aliphatic side chains. - - - - - - - - - - Protein sequence hydropathy plot - - Hydrophobic moment is a peptide's hydrophobicity measured for different angles of rotation. - A protein sequence with annotation on hydrophobic or hydrophilic / charged regions, hydrophobicity plot etc. - beta12orEarlier - - - - - - - - - - Protein charge plot - - beta12orEarlier - A plot of the mean charge of the amino acids within a window of specified length as the window is moved along a protein sequence. - - - - - - - - - - Protein solubility - - beta12orEarlier - The solubility or atomic solvation energy of a protein sequence or structure. - Protein solubility data - - - - - - - - - - Protein crystallizability - - beta12orEarlier - Protein crystallizability data - Data on the crystallizability of a protein sequence. - - - - - - - - - - Protein globularity - - Protein globularity data - beta12orEarlier - Data on the stability, intrinsic disorder or globularity of a protein sequence. - - - - - - - - - - Protein titration curve - - - The titration curve of a protein. - beta12orEarlier - - - - - - - - - - Protein isoelectric point - - beta12orEarlier - The isoelectric point of one protein. - - - - - - - - - - Protein pKa value - - The pKa value of a protein. - beta12orEarlier - - - - - - - - - - Protein hydrogen exchange rate - - beta12orEarlier - The hydrogen exchange rate of a protein. - - - - - - - - - - Protein extinction coefficient - - The extinction coefficient of a protein.
- beta12orEarlier - - - - - - - - - - Protein optical density - - The optical density of a protein. - beta12orEarlier - - - - - - - - - - Protein subcellular localization - - Protein report (subcellular localization) - An informative report on protein subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or destination (exported / extracellular proteins). - beta12orEarlier - true - beta13 - - - - - - - - - - Peptide immunogenicity data - - A report on allergenicity / immunogenicity of peptides and proteins. - Peptide immunogenicity report - beta12orEarlier - Peptide immunogenicity - This includes data on peptide ligands that elicit an immune response (immunogens), allergic cross-reactivity, predicted antigenicity (Hopp and Woods plot) etc. These data are useful in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. - - - - - - - - - - MHC peptide immunogenicity report - - A report on the immunogenicity of MHC class I or class II binding peptides. - beta13 - true - beta12orEarlier - - - - - - - - - - Protein structure report - - - Protein structural property - Protein structure-derived report - This includes for example reports on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure, protein flexibility or motion, and protein architecture (spatial arrangement of secondary structure). - Protein property (structural) - Annotation about, or structural information derived from, one or more specific protein 3D structure(s) or structural domains. - beta12orEarlier - Protein report (structure) - Protein structure report (domain) - - - - - - - - - - Protein structural quality report - - Report on the quality of a protein three-dimensional model.
- Protein structure report (quality evaluation) - Protein structure validation report - Protein property (structural quality) - Model validation might involve checks for atomic packing, steric clashes, agreement with electron density maps etc. - Protein report (structural quality) - beta12orEarlier - - - - - - - - - - Protein non-covalent interactions report - - Data on inter-atomic or inter-residue contacts, distances and interactions in protein structure(s) or on the interactions of protein atoms or residues with non-protein groups. - beta12orEarlier - true - 1.12 - - - - - - - - - - Protein flexibility or motion report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein property (flexibility or motion) - Informative report on flexibility or motion of a protein structure. - Protein flexibility or motion - beta12orEarlier - true - 1.4 - Protein structure report (flexibility or motion) - - - - - - - - - - Protein solvent accessibility report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. This concept covers definitions of the protein surface, interior and interfaces, accessible and buried residues, surface accessible pockets, interior inaccessible cavities etc. - beta12orEarlier - Data on the solvent accessible or buried surface area of a protein structure. - - - - - - - - - - Protein surface report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein structure report (surface) - 1.4 - Data on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure. 
- beta12orEarlier - true - - - - - - - - - - Ramachandran plot - - beta12orEarlier - Phi/psi angle data or a Ramachandran plot of a protein structure. - - - - - - - - - - Protein dipole moment - - Data on the net charge distribution (dipole moment) of a protein structure. - beta12orEarlier - - - - - - - - - - Protein distance matrix - - - beta12orEarlier - A matrix of distances between amino acid residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map - - An amino acid residue contact map for a protein structure. - beta12orEarlier - - - - - - - - - - Protein residue 3D cluster - - beta12orEarlier - Report on clusters of contacting residues in protein structures such as a key structural residue network. - - - - - - - - - - Protein hydrogen bonds - - Patterns of hydrogen bonding in protein structures. - beta12orEarlier - - - - - - - - - - Protein non-canonical interactions - - Protein non-canonical interactions report - true - Non-canonical atomic interactions in protein structures. - 1.4 - beta12orEarlier - - - - - - - - - - CATH node - - Information on a node from the CATH database. - The report (for example http://www.cathdb.info/cathnode/1.10.10.10) includes CATH code (of the node and upper levels in the hierarchy), classification text (of appropriate levels in hierarchy), list of child nodes, representative domain and other relevant data and links. - 1.5 - beta12orEarlier - true - CATH classification node report - - - - - - - - - - SCOP node - - true - SCOP classification node - Information on a node from the SCOP database. - 1.5 - beta12orEarlier - - - - - - - - - - EMBASSY domain classification - - beta12orEarlier - beta12orEarlier - true - An EMBASSY domain classification file (DCF) of classification and other data for domains from SCOP or CATH, in EMBL-like format. - - - - - - - - - - CATH class - - beta12orEarlier - 1.5 - Information on a protein 'class' node from the CATH database. 
- true - - - - - - - - - - CATH architecture - - beta12orEarlier - 1.5 - Information on a protein 'architecture' node from the CATH database. - true - - - - - - - - - - CATH topology - - true - 1.5 - Information on a protein 'topology' node from the CATH database. - beta12orEarlier - - - - - - - - - - CATH homologous superfamily - - 1.5 - true - beta12orEarlier - Information on a protein 'homologous superfamily' node from the CATH database. - - - - - - - - - - CATH structurally similar group - - 1.5 - true - beta12orEarlier - Information on a protein 'structurally similar group' node from the CATH database. - - - - - - - - - - CATH functional category - - Information on a protein 'functional category' node from the CATH database. - true - 1.5 - beta12orEarlier - - - - - - - - - - Protein fold recognition report - - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - beta12orEarlier - A report on known protein structural domains or folds that are recognized (identified) in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Protein-protein interaction report - - protein-protein interaction(s), including interactions between protein domains. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein-ligand interaction report - - Protein-drug interaction report - beta12orEarlier - An informative report on protein-ligand (small molecule) interaction(s). - - - - - - - - - - Protein-nucleic acid interactions report - - true - protein-DNA/RNA interaction(s). - beta12orEarlier - 1.8 - - - - - - - - - - Nucleic acid melting profile - - Nucleic acid stability profile - A melting (stability) profile calculated the free energy required to unwind and separate the nucleic acid strands, plotted for sliding windows over a sequence. 
- Data on the dissociation characteristics of a double-stranded nucleic acid molecule (DNA or a DNA/RNA hybrid) during heating. - beta12orEarlier - - - - - - - - - - Nucleic acid enthalpy - - beta12orEarlier - Enthalpy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - - - - - - - - - - Nucleic acid entropy - - Entropy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - beta12orEarlier - - - - - - - - - - Nucleic acid melting temperature - - Melting temperature of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Nucleic acid stitch profile - - beta12orEarlier - Stitch profile of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - A stitch profile diagram shows partly melted DNA conformations (with probabilities) at a range of temperatures. For example, a stitch profile might show possible loop openings with their location, size, probability and fluctuations at a given temperature. - - - - - - - - - - DNA base pair stacking energies data - - DNA base pair stacking energies data. - beta12orEarlier - - - - - - - - - - DNA base pair twist angle data - - beta12orEarlier - DNA base pair twist angle data. - - - - - - - - - - DNA base trimer roll angles data - - beta12orEarlier - DNA base trimer roll angles data. - - - - - - - - - - Vienna RNA parameters - - RNA parameters used by the Vienna package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA structure constraints - - true - Structure constraints used by the Vienna package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA concentration data - - RNA concentration data used by the Vienna package. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Vienna RNA calculated energy - - beta12orEarlier - beta12orEarlier - true - RNA calculated energy data generated by the Vienna package. 
- - - - - - - - - - Base pairing probability matrix dotplot - - - beta12orEarlier - Such as generated by the Vienna package. - Dotplot of RNA base pairing probability matrix. - - - - - - - - - - Nucleic acid folding report - - Nucleic acid report (folding) - beta12orEarlier - Nucleic acid report (folding model) - RNA secondary structure folding probabilities - A report on an analysis of RNA/DNA folding, minimum folding energies for DNA or RNA sequences, energy landscape of RNA mutants etc. - RNA secondary structure folding classification - - - - - - - - - - Codon usage table - - - - - - - - Table of codon usage data calculated from one or more nucleic acid sequences. - A codon usage table might include the codon usage table name, optional comments and a table with columns for codons and corresponding codon usage data. A genetic code can be extracted from or represented by a codon usage table. - beta12orEarlier - - - - - - - - - - Genetic code - - beta12orEarlier - A genetic code for an organism. - A genetic code need not include detailed codon usage information. - - - - - - - - - - Codon adaptation index - - true - A simple measure of synonymous codon usage bias often used to predict gene expression levels. - CAI - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon usage bias plot - - Synonymous codon usage statistic plot - beta12orEarlier - A plot of the synonymous codon usage calculated for windows over a nucleotide sequence. - - - - - - - - - - Nc statistic - - true - beta12orEarlier - The effective number of codons used in a gene sequence. This reflects how far codon usage of a gene departs from equal usage of synonymous codons. - beta12orEarlier - - - - - - - - - - Codon usage fraction difference - - The differences in codon usage fractions between two codon usage tables.
- beta12orEarlier - - - - - - - - - - Pharmacogenomic test report - - beta12orEarlier - The report might correlate gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - Data on the influence of genotype on drug response. - - - - - - - - - - Disease report - - - - - - - - An informative report on a specific disease. - For example, an informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Linkage disequilibrium (report) - - true - A report on linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - 1.8 - beta12orEarlier - - - - - - - - - - Heat map - - - A graphical 2D tabular representation of gene expression data, typically derived from a DNA microarray experiment. - beta12orEarlier - A heat map is a table where rows and columns correspond to different genes and contexts (for example, cells or samples) and the cell color represents the level of expression of a gene in that context. - - - - - - - - - - Affymetrix probe sets library file - - true - Affymetrix library file of information about which probes belong to which probe set. - CDF file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Affymetrix probe sets information library file - - true - Affymetrix library file of information about the probe sets such as the gene name with which the probe set is associated. - GIN file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Molecular weights standard fingerprint - - beta12orEarlier - true - 1.12 - Standard protonated molecular masses from trypsin (modified porcine trypsin, Promega) and keratin peptides, used in EMBOSS.
- - - - - - - - - - Metabolic pathway report - - This includes carbohydrate, energy, lipid, nucleotide, amino acid, glycan, PK/NRP, cofactor/vitamin, secondary metabolite, xenobiotics etc. - beta12orEarlier - A report typically including a map (diagram) of a metabolic pathway. - 1.8 - true - - - - - - - - - - Genetic information processing pathway report - - beta12orEarlier - 1.8 - true - genetic information processing pathways. - - - - - - - - - - Environmental information processing pathway report - - true - environmental information processing pathways. - beta12orEarlier - 1.8 - - - - - - - - - - Signal transduction pathway report - - A report typically including a map (diagram) of a signal transduction pathway. - 1.8 - true - beta12orEarlier - - - - - - - - - - Cellular process pathways report - - 1.8 - Topic concerning cellular process pathways. - true - beta12orEarlier - - - - - - - - - - Disease pathway or network report - - true - beta12orEarlier - disease pathways, typically of human disease. - 1.8 - - - - - - - - - - Drug structure relationship map - - A report typically including a map (diagram) of drug structure relationships. - beta12orEarlier - - - - - - - - - - Protein interaction networks - - 1.8 - networks of protein interactions. - true - beta12orEarlier - - - - - - - - - - MIRIAM datatype - - A MIRIAM entry describes a MIRIAM data type including the official name, synonyms, root URI, identifier pattern (regular expression applied to a unique identifier of the data type) and documentation. Each data type can be associated with several resources. Each resource is a physical location of a service (typically a database) providing information on the elements of a data type. Several resources may exist for each data type, provided the same (mirrors) or different information. MIRIAM provides a stable and persistent reference to its data types.
- An entry (data type) from the Minimal Information Requested in the Annotation of Biochemical Models (MIRIAM) database of data resources. - beta12orEarlier - true - 1.5 - - - - - - - - - - E-value - - An expectation value (E-Value) is the expected number of observations which are at least as extreme as observations expected to occur by random chance. The E-value describes the number of hits with a given score or better that are expected to occur at random when searching a database of a particular size. It decreases exponentially with the score (S) of a hit. A low E value indicates a more significant score. - beta12orEarlier - A simple floating point number defining the lower or upper limit of an expectation value (E-value). - Expectation value - - - - - - - - - - Z-value - - beta12orEarlier - The z-value is the number of standard deviations a data value is above or below a mean value. - A z-value might be specified as a threshold for reporting hits from database searches. - - - - - - - - - - P-value - - beta12orEarlier - A z-value might be specified as a threshold for reporting hits from database searches. - The P-value is the probability of obtaining by random chance a result that is at least as extreme as an observed result, assuming a NULL hypothesis is true. - - - - - - - - - - Database version information - - true - Ontology version information - 1.5 - Information on a database (or ontology) version, for example name, version number and release date. - beta12orEarlier - - - - - - - - - - Tool version information - - beta12orEarlier - Information on an application version, for example name, version number and release date. - true - 1.5 - - - - - - - - - - CATH version information - - beta12orEarlier - beta12orEarlier - true - Information on a version of the CATH database. - - - - - - - - - - Swiss-Prot to PDB mapping - - Cross-mapping of Swiss-Prot codes to PDB identifiers. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database cross-references - - Cross-references from a sequence record to other databases. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Job status - - Metadata on the status of a submitted job. - beta12orEarlier - 1.5 - true - Values for EBI services are 'DONE' (job has finished and the results can then be retrieved), 'ERROR' (the job failed or no results were found), 'NOT_FOUND' (the job id is no longer available; job results might be deleted), 'PENDING' (the job is in a queue waiting processing), 'RUNNING' (the job is currently being processed). - - - - - - - - - - Job ID - - 1.0 - The (typically numeric) unique identifier of a submitted job. - beta12orEarlier - true - - - - - - - - - - Job type - - 1.5 - true - beta12orEarlier - A label (text token) describing the type of job, for example interactive or non-interactive. - - - - - - - - - - Tool log - - 1.5 - A report of tool-specific metadata on some analysis or process performed, for example a log of diagnostic or error messages. - true - beta12orEarlier - - - - - - - - - - DaliLite log file - - true - beta12orEarlier - DaliLite log file describing all the steps taken by a DaliLite alignment of two protein structures. - beta12orEarlier - - - - - - - - - - STRIDE log file - - STRIDE log file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - NACCESS log file - - beta12orEarlier - beta12orEarlier - true - NACCESS log file. - - - - - - - - - - EMBOSS wordfinder log file - - EMBOSS wordfinder log file. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS domainatrix log file - - beta12orEarlier - EMBOSS (EMBASSY) domainatrix application log file. - beta12orEarlier - true - - - - - - - - - - EMBOSS sites log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS (EMBASSY) sites application log file. - - - - - - - - - - EMBOSS supermatcher error file - - EMBOSS (EMBASSY) supermatcher error file.
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS megamerger log file - - beta12orEarlier - beta12orEarlier - EMBOSS megamerger log file. - true - - - - - - - - - - EMBOSS whichdb log file - - beta12orEarlier - true - EMBOSS megamerger log file. - beta12orEarlier - - - - - - - - - - EMBOSS vectorstrip log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS vectorstrip log file. - - - - - - - - - - Username - - A username on a computer system. - beta12orEarlier - - - - - - - - - - - Password - - beta12orEarlier - A password on a computer system. - - - - - - - - - - - Email address - - beta12orEarlier - Moby:Email - A valid email address of an end-user. - Moby:EmailAddress - - - - - - - - - - - Person name - - beta12orEarlier - The name of a person. - - - - - - - - - - - Number of iterations - - 1.5 - Number of iterations of an algorithm. - true - beta12orEarlier - - - - - - - - - - Number of output entities - - Number of entities (for example database hits, sequences, alignments etc) to write to an output file. - 1.5 - beta12orEarlier - true - - - - - - - - - - Hit sort order - - Controls the order of hits (reported matches) in an output file from a database search. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Drug report - - - - - - - - An informative report on a specific drug. - beta12orEarlier - Drug annotation - - - - - - - - - - - Phylogenetic tree image - - beta12orEarlier - An image (for viewing or printing) of a phylogenetic tree including (typically) a plot of rooted or unrooted phylogenies, cladograms, circular trees or phenograms and associated information. - See also 'Phylogenetic tree' - - - - - - - - - - RNA secondary structure image - - beta12orEarlier - Image of RNA secondary structure, knots, pseudoknots etc. - - - - - - - - - - Protein secondary structure image - - Image of protein secondary structure. 
- beta12orEarlier - - - - - - - - - - Structure image - - beta12orEarlier - Image of one or more molecular tertiary (3D) structures. - - - - - - - - - - Sequence alignment image - - beta12orEarlier - Image of two or more aligned molecular sequences possibly annotated with alignment features. - - - - - - - - - - Chemical structure image - - An image of the structure of a small chemical compound. - The molecular identifier and formula are typically included. - Small molecule structure image - beta12orEarlier - - - - - - - - - - Fate map - - - - - - - - - beta12orEarlier - A fate map is a plan of an early stage of an embryo such as a blastula, showing areas that are significant to development. - - - - - - - - - - Microarray spots image - - - beta12orEarlier - An image of spots from a microarray experiment. - - - - - - - - - - BioPax term - - beta12orEarlier - A term from the BioPax ontology. - beta12orEarlier - true - - - - - - - - - - GO - - beta12orEarlier - Gene Ontology term - Moby:Annotated_GO_Term - Moby:Annotated_GO_Term_With_Probability - true - A term definition from The Gene Ontology (GO). - beta12orEarlier - Moby:GO_Term - Moby:GOTerm - - - - - - - - - - MeSH - - true - A term from the MeSH vocabulary. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HGNC - - beta12orEarlier - true - A term from the HGNC controlled vocabulary. - beta12orEarlier - - - - - - - - - - NCBI taxonomy vocabulary - - beta12orEarlier - beta12orEarlier - true - A term from the NCBI taxonomy vocabulary. - - - - - - - - - - Plant ontology term - - beta12orEarlier - true - beta12orEarlier - A term from the Plant Ontology (PO). - - - - - - - - - - UMLS - - beta12orEarlier - beta12orEarlier - A term from the UMLS vocabulary. - true - - - - - - - - - - FMA - - beta12orEarlier - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia).
Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - A term from Foundational Model of Anatomy. - true - - - - - - - - - - EMAP - - A term from the EMAP mouse ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - ChEBI - - beta12orEarlier - A term from the ChEBI ontology. - true - beta12orEarlier - - - - - - - - - - MGED - - beta12orEarlier - true - A term from the MGED ontology. - beta12orEarlier - - - - - - - - - - myGrid - - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - beta12orEarlier - true - A term from the myGrid ontology. - beta12orEarlier - - - - - - - - - - GO (biological process) - - beta12orEarlier - true - beta12orEarlier - Data Type is an enumerated string. - A term definition for a biological process from the Gene Ontology (GO). - - - - - - - - - - GO (molecular function) - - A term definition for a molecular function from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - true - beta12orEarlier - - - - - - - - - - GO (cellular component) - - beta12orEarlier - true - A term definition for a cellular component from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - - - - - - - - - - Ontology relation type - - 1.5 - beta12orEarlier - true - A relation type defined in an ontology. - - - - - - - - - - Ontology concept definition - - beta12orEarlier - Ontology class definition - The definition of a concept from an ontology.
- - - - - - - - - - Ontology concept comment - - beta12orEarlier - 1.4 - true - A comment on a concept from an ontology. - - - - - - - - - - Ontology concept reference - - beta12orEarlier - true - Reference for a concept from an ontology. - beta12orEarlier - - - - - - - - - - doc2loc document information - - beta12orEarlier - true - The doc2loc output includes the url, format, type and availability code of a document for every service provider. - beta12orEarlier - Information on a published article provided by the doc2loc program. - - - - - - - - - - PDB residue number - - WHATIF: pdb_number - PDBML:PDB_residue_no - beta12orEarlier - A residue identifier (a string) from a PDB file. - - - - - - - - - - Atomic coordinate - - Cartesian coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian coordinate - - - - - - - - - - Atomic x coordinate - - WHATIF: PDBx_Cartn_x - Cartesian x coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_x in PDBML - Cartesian x coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic y coordinate - - WHATIF: PDBx_Cartn_y - Cartesian y coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_y in PDBML - Cartesian y coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic z coordinate - - PDBML:_atom_site.Cartn_z - WHATIF: PDBx_Cartn_z - Cartesian z coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian z coordinate - - - - - - - - - - PDB atom name - - WHATIF: PDBx_type_symbol - beta12orEarlier - WHATIF: PDBx_auth_atom_id - WHATIF: alternate_atom - PDBML:pdbx_PDB_atom_name - WHATIF: atom_type - Identifier (a string) of a specific atom from a PDB file for a molecular structure. - - - - - - - - - - - Protein atom - - Atom data - CHEBI:33250 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
- Data on a single atom from a protein structure. - beta12orEarlier - - - - - - - - - - Protein residue - - beta12orEarlier - Data on a single amino acid residue position in a protein structure. - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Residue - - - - - - - - - - Atom name - - - Name of an atom. - beta12orEarlier - - - - - - - - - - - PDB residue name - - Three-letter amino acid residue names as used in PDB files. - WHATIF: type - beta12orEarlier - - - - - - - - - - - PDB model number - - Identifier of a model structure from a PDB file. - beta12orEarlier - PDBML:pdbx_PDB_model_num - Model number - WHATIF: model_number - - - - - - - - - - - CATH domain report - - beta12orEarlier - true - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - Summary of domain classification information for a CATH domain. - - - - - - - - - - CATH representative domain sequences (ATOM) - - beta12orEarlier - beta12orEarlier - FASTA sequence database (based on ATOM records in PDB) for CATH domains (clustered at different levels of sequence identity). - true - - - - - - - - - - CATH representative domain sequences (COMBS) - - true - FASTA sequence database (based on COMBS sequence data) for CATH domains (clustered at different levels of sequence identity). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (ATOM) - - true - FASTA sequence database for all CATH domains (based on PDB ATOM records). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (COMBS) - - FASTA sequence database for all CATH domains (based on COMBS sequence data). 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence version - - beta12orEarlier - Information on an molecular sequence version. - Sequence version information - - - - - - - - - - Score - - A numerical value, that is some type of scored value arising for example from a prediction method. - beta12orEarlier - - - - - - - - - - Protein report (function) - - true - For properties that can be mapped to a sequence, use 'Sequence report' instead. - beta13 - Report on general functional properties of specific protein(s). - beta12orEarlier - - - - - - - - - - Gene name (ASPGD) - - 1.3 - beta12orEarlier - true - Name of a gene from Aspergillus Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs:ASPGD_LOCUS - - - - - - - - - - Gene name (CGD) - - Name of a gene from Candida Genome Database. - true - http://www.geneontology.org/doc/GO.xrf_abbs:CGD_LOCUS - beta12orEarlier - 1.3 - - - - - - - - - - Gene name (dictyBase) - - http://www.geneontology.org/doc/GO.xrf_abbs:dictyBase - beta12orEarlier - 1.3 - true - Name of a gene from dictyBase database. - - - - - - - - - - Gene name (EcoGene primary) - - http://www.geneontology.org/doc/GO.xrf_abbs:ECOGENE_G - Primary name of a gene from EcoGene Database. - EcoGene primary gene name - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (MaizeGDB) - - http://www.geneontology.org/doc/GO.xrf_abbs:MaizeGDB_Locus - 1.3 - Name of a gene from MaizeGDB (maize genes) database. - true - beta12orEarlier - - - - - - - - - - Gene name (SGD) - - true - 1.3 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs:SGD_LOCUS - Name of a gene from Saccharomyces Genome Database. - - - - - - - - - - Gene name (TGD) - - beta12orEarlier - 1.3 - Name of a gene from Tetrahymena Genome Database. 
- true - http://www.geneontology.org/doc/GO.xrf_abbs:TGD_LOCUS - - - - - - - - - - Gene name (CGSC) - - beta12orEarlier - 1.3 - true - http://www.geneontology.org/doc/GO.xrf_abbs: CGSC - Symbol of a gene from E.coli Genetic Stock Center. - - - - - - - - - - Gene name (HGNC) - - beta12orEarlier - HUGO symbol - 1.3 - true - HGNC symbol - Official gene name - HUGO gene name - http://www.geneontology.org/doc/GO.xrf_abbs: HGNC_gene - HGNC gene name - HUGO gene symbol - HGNC:[0-9]{1,5} - Gene name (HUGO) - HGNC gene symbol - Symbol of a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - Gene name (MGD) - - MGI:[0-9]+ - Symbol of a gene from the Mouse Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs: MGD - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (Bacillus subtilis) - - http://www.geneontology.org/doc/GO.xrf_abbs: SUBTILISTG - Symbol of a gene from Bacillus subtilis Genome Sequence Project. - beta12orEarlier - 1.3 - true - - - - - - - - - - Gene ID (PlasmoDB) - - Identifier of a gene from PlasmoDB Plasmodium Genome Resource. - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ApiDB_PlasmoDB - - - - - - - - - - - Gene ID (EcoGene) - - Identifier of a gene from EcoGene Database. - EcoGene Accession - EcoGene ID - beta12orEarlier - - - - - - - - - - - Gene ID (FlyBase) - - beta12orEarlier - Gene identifier from FlyBase database. - http://www.geneontology.org/doc/GO.xrf_abbs: FB - http://www.geneontology.org/doc/GO.xrf_abbs: FlyBase - - - - - - - - - - - Gene ID (GeneDB Glossina morsitans) - - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Gmorsitans - beta13 - Gene identifier from Glossina morsitans GeneDB database. - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Leishmania major) - - Gene identifier from Leishmania major GeneDB database. 
- true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Lmajor - beta12orEarlier - beta13 - - - - - - - - - - Gene ID (GeneDB Plasmodium falciparum) - - Gene identifier from Plasmodium falciparum GeneDB database. - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Pfalciparum - beta13 - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Schizosaccharomyces pombe) - - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Spombe - beta12orEarlier - true - beta13 - Gene identifier from Schizosaccharomyces pombe GeneDB database. - - - - - - - - - - Gene ID (GeneDB Trypanosoma brucei) - - Gene identifier from Trypanosoma brucei GeneDB database. - true - beta13 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Tbrucei - - - - - - - - - - Gene ID (Gramene) - - http://www.geneontology.org/doc/GO.xrf_abbs: GR_gene - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GR_GENE - Gene identifier from Gramene database. - - - - - - - - - - - Gene ID (Virginia microbial) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: PAMGO_VMD - Gene identifier from Virginia Bioinformatics Institute microbial database. - http://www.geneontology.org/doc/GO.xrf_abbs: VMD - - - - - - - - - - - Gene ID (SGN) - - http://www.geneontology.org/doc/GO.xrf_abbs: SGN - Gene identifier from Sol Genomics Network. - beta12orEarlier - - - - - - - - - - - Gene ID (WormBase) - - - Gene identifier used by WormBase database. - WBGene[0-9]{8} - http://www.geneontology.org/doc/GO.xrf_abbs: WB - http://www.geneontology.org/doc/GO.xrf_abbs: WormBase - beta12orEarlier - - - - - - - - - - - Gene synonym - - Gene name synonym - true - Any name (other than the recommended one) for a gene. - beta12orEarlier - beta12orEarlier - - - - - - - - - - ORF name - - - beta12orEarlier - The name of an open reading frame attributed by a sequencing project. - - - - - - - - - - - Sequence assembly component - - A component of a larger sequence assembly. 
- true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Chromosome annotation (aberration) - - beta12orEarlier - beta12orEarlier - true - A report on a chromosome aberration such as abnormalities in chromosome structure. - - - - - - - - - - Clone ID - - beta12orEarlier - An identifier of a clone (cloned molecular sequence) from a database. - - - - - - - - - - - PDB insertion code - - beta12orEarlier - WHATIF: insertion_code - PDBML:pdbx_PDB_ins_code - An insertion code (part of the residue number) for an amino acid residue from a PDB file. - - - - - - - - - - Atomic occupancy - - WHATIF: PDBx_occupancy - The fraction of an atom type present at a site in a molecular structure. - beta12orEarlier - The sum of the occupancies of all the atom types at a site should not normally significantly exceed 1.0. - - - - - - - - - - Isotropic B factor - - Isotropic B factor (atomic displacement parameter) for an atom from a PDB file. - WHATIF: PDBx_B_iso_or_equiv - beta12orEarlier - - - - - - - - - - Deletion map - - A cytogenetic map is built from a set of mutant cell lines with sub-chromosomal deletions and a reference wild-type line ('genome deletion panel'). The panel is used to map markers onto the genome by comparing mutant to wild-type banding patterns. Markers are linked (occur in the same deleted region) if they share the same banding pattern (presence or absence) as the deletion panel. - beta12orEarlier - A cytogenetic map showing chromosome banding patterns in mutant cell lines relative to the wild type. - Deletion-based cytogenetic map - - - - - - - - - - QTL map - - A genetic map which shows the approximate location of quantitative trait loci (QTL) between two or more markers. - beta12orEarlier - Quantitative trait locus map - - - - - - - - - - Haplotype map - - beta12orEarlier - Moby:Haplotyping_Study_obj - A map of haplotypes in a genome or other sequence, describing common patterns of genetic variation. 
- - - - - - - - - - Map set data - - beta12orEarlier - Data describing a set of multiple genetic or physical maps, typically sharing a common set of features which are mapped. - Moby:GCP_CorrelatedLinkageMapSet - Moby:GCP_CorrelatedMapSet - - - - - - - - - - Map feature - - beta12orEarlier - true - A feature which may mapped (positioned) on a genetic or other type of map. - Moby:MapFeature - beta12orEarlier - Mappable features may be based on Gramene's notion of map features; see http://www.gramene.org/db/cmap/feature_type_info. - - - - - - - - - - - - Map type - - A designation of the type of map (genetic map, physical map, sequence map etc) or map set. - Map types may be based on Gramene's notion of a map type; see http://www.gramene.org/db/cmap/map_type_info. - 1.5 - true - beta12orEarlier - - - - - - - - - - Protein fold name - - The name of a protein fold. - beta12orEarlier - - - - - - - - - - - Taxon - - Moby:PotentialTaxon - Taxonomy rank - beta12orEarlier - Taxonomic rank - For a complete list of taxonomic ranks see https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - The name of a group of organisms belonging to the same taxonomic rank. - Moby:BriefTaxonConcept - - - - - - - - - - - Organism identifier - - - - - - - - beta12orEarlier - A unique identifier of a (group of) organisms. - - - - - - - - - - - Genus name - - beta12orEarlier - The name of a genus of organism. - - - - - - - - - - - Taxonomic classification - - Moby:TaxonName - Moby:GCP_Taxon - beta12orEarlier - The full name for a group of organisms, reflecting their biological classification and (usually) conforming to a standard nomenclature. - Moby:iANT_organism-xml - Taxonomic name - Name components correspond to levels in a taxonomic hierarchy (e.g. 'Genus', 'Species', etc.) Meta information such as a reference where the name was defined and a date might be included. 
- Taxonomic information - Moby:TaxonScientificName - Moby:TaxonTCS - - - - - - - - - - - iHOP organism ID - - beta12orEarlier - Moby_namespace:iHOPorganism - A unique identifier for an organism used in the iHOP database. - - - - - - - - - - - Genbank common name - - Common name for an organism as used in the GenBank database. - beta12orEarlier - - - - - - - - - - - NCBI taxon - - The name of a taxon from the NCBI taxonomy database. - beta12orEarlier - - - - - - - - - - - Synonym - - beta12orEarlier - Alternative name - beta12orEarlier - true - An alternative for a word. - - - - - - - - - - Misspelling - - A common misspelling of a word. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Acronym - - true - An abbreviation of a phrase or word. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Misnomer - - A term which is likely to be misleading of its meaning. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Author ID - - Information on the authors of a published work. - Moby:Author - beta12orEarlier - - - - - - - - - - - DragonDB author identifier - - An identifier representing an author in the DragonDB database. - beta12orEarlier - - - - - - - - - - - Annotated URI - - beta12orEarlier - A URI along with annotation describing the data found at the address. - Moby:DescribedLink - - - - - - - - - - UniProt keywords - - true - beta12orEarlier - beta12orEarlier - A controlled vocabulary for words and phrases that can appear in the keywords field (KW line) of entries from the UniProt database. - - - - - - - - - - Gene ID (GeneFarm) - - Moby_namespace:GENEFARM_GeneID - Identifier of a gene from the GeneFarm database. - beta12orEarlier - - - - - - - - - - - Blattner number - - beta12orEarlier - Moby_namespace:Blattner_number - The blattner identifier for a gene. - - - - - - - - - - - Gene ID (MIPS Maize) - - MIPS genetic element identifier (Maize) - Identifier for genetic elements in MIPS Maize database. 
- beta12orEarlier - Moby_namespace:MIPS_GE_Maize - beta13 - true - - - - - - - - - - Gene ID (MIPS Medicago) - - MIPS genetic element identifier (Medicago) - beta12orEarlier - beta13 - true - Moby_namespace:MIPS_GE_Medicago - Identifier for genetic elements in MIPS Medicago database. - - - - - - - - - - Gene name (DragonDB) - - true - The name of an Antirrhinum Gene from the DragonDB database. - beta12orEarlier - Moby_namespace:DragonDB_Gene - 1.3 - - - - - - - - - - Gene name (Arabidopsis) - - Moby_namespace:ArabidopsisGeneSymbol - true - A unique identifier for an Arabidopsis gene, which is an acronym or abbreviation of the gene name. - beta12orEarlier - 1.3 - - - - - - - - - - iHOP symbol - - - - A unique identifier of a protein or gene used in the iHOP database. - Moby_namespace:iHOPsymbol - beta12orEarlier - - - - - - - - - - - Gene name (GeneFarm) - - 1.3 - true - Name of a gene from the GeneFarm database. - Moby_namespace:GENEFARM_GeneName - GeneFarm gene ID - beta12orEarlier - - - - - - - - - - Locus ID - - - - - - - - - A unique name or other identifier of a genetic locus, typically conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - Locus name - beta12orEarlier - Locus identifier - - - - - - - - - - - Locus ID (AGI) - - AT[1-5]G[0-9]{5} - AGI ID - Locus identifier for Arabidopsis Genome Initiative (TAIR, TIGR and MIPS databases) - http://www.geneontology.org/doc/GO.xrf_abbs:AGI_LocusCode - Arabidopsis gene loci number - AGI locus code - beta12orEarlier - AGI identifier - - - - - - - - - - - Locus ID (ASPGD) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGD - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGDID - Identifier for loci from ASPGD (Aspergillus Genome Database). - - - - - - - - - - - Locus ID (MGG) - - Identifier for loci from Magnaporthe grisea Database at the Broad Institute. 
- http://www.geneontology.org/doc/GO.xrf_abbs: Broad_MGG - beta12orEarlier - - - - - - - - - - - Locus ID (CGD) - - Identifier for loci from CGD (Candida Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: CGDID - beta12orEarlier - CGDID - CGD locus identifier - http://www.geneontology.org/doc/GO.xrf_abbs: CGD - - - - - - - - - - - Locus ID (CMR) - - http://www.geneontology.org/doc/GO.xrf_abbs: TIGR_CMR - Locus identifier for Comprehensive Microbial Resource at the J. Craig Venter Institute. - http://www.geneontology.org/doc/GO.xrf_abbs: JCVI_CMR - beta12orEarlier - - - - - - - - - - - NCBI locus tag - - beta12orEarlier - Moby_namespace:LocusID - Locus ID (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs: NCBI_locus_tag - Identifier for loci from NCBI database. - - - - - - - - - - - Locus ID (SGD) - - - Identifier for loci from SGD (Saccharomyces Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: SGDID - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: SGD - SGDID - - - - - - - - - - - Locus ID (MMP) - - Identifier of loci from Maize Mapping Project. - Moby_namespace:MMP_Locus - beta12orEarlier - - - - - - - - - - - Locus ID (DictyBase) - - Moby_namespace:DDB_gene - Identifier of locus from DictyBase (Dictyostelium discoideum). - beta12orEarlier - - - - - - - - - - - Locus ID (EntrezGene) - - Identifier of a locus from EntrezGene database. - beta12orEarlier - Moby_namespace:EntrezGene_ID - Moby_namespace:EntrezGene_EntrezGeneID - - - - - - - - - - - Locus ID (MaizeGDB) - - Identifier of locus from MaizeGDB (Maize genome database). - Moby_namespace:MaizeGDB_Locus - beta12orEarlier - - - - - - - - - - - Quantitative trait locus - - QTL - A QTL sometimes but does not necessarily correspond to a gene. 
- true - beta12orEarlier - beta12orEarlier - A stretch of DNA that is closely linked to the genes underlying a quantitative trait (a phenotype that varies in degree and depends upon the interactions between multiple genes and their environment). - Moby:SO_QTL - - - - - - - - - - Gene ID (KOME) - - Identifier of a gene from the KOME database. - beta12orEarlier - Moby_namespace:GeneId - - - - - - - - - - - Locus ID (Tropgene) - - Identifier of a locus from the Tropgene database. - Moby:Tropgene_locus - beta12orEarlier - - - - - - - - - - - Alignment - - An alignment of molecular sequences, structures or profiles derived from them. - beta12orEarlier - - - - - - - - - - Atomic property - - General atomic property - Data for an atom (in a molecular structure). - beta12orEarlier - - - - - - - - - - UniProt keyword - - beta12orEarlier - A word or phrase that can appear in the keywords field (KW line) of entries from the UniProt database. - Moby_namespace:SP_KW - http://www.geneontology.org/doc/GO.xrf_abbs: SP_KW - - - - - - - - - - Ordered locus name - - beta12orEarlier - true - A name for a genetic locus conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - beta12orEarlier - - - - - - - - - - Sequence coordinates - - - - Map position - Moby:Position - Locus - Sequence co-ordinates - A position in a map (for example a genetic map), either a single position (point) or a region / interval. - Moby:GenePosition - This includes positions in genomes based on a reference sequence. A position may be specified for any mappable object, i.e. anything that may have positional information such as a physical position in a chromosome. Data might include sequence region name, strand, coordinate system name, assembly name, start position and end position. 
- Moby:HitPosition - beta12orEarlier - Moby:MapPosition - Moby:Locus - Moby:GCP_MapInterval - Moby:GCP_MapPosition - Moby:GCP_MapPoint - PDBML:_atom_site.id - - - - - - - - - - Amino acid property - - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all amino acids. - Amino acid data - beta12orEarlier - - - - - - - - - - Annotation - - beta12orEarlier - true - beta13 - This is a broad data type and is used a placeholder for other, more specific types. - A human-readable collection of information which (typically) is generated or collated by hand and which describes a biological entity, phenomena or associated primary (e.g. sequence or structural) data, as distinct from the primary data itself and computer-generated reports derived from it. - - - - - - - - - - Map data - - - - - - - - Map attribute - A molecular map (genetic or physical), an attribute of such a map, or data extracted from or derived from the analysis of such a map. - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. It includes concepts that are best described as scientific text or closely concerned with or derived from text. - - - - - - - - - - Vienna RNA structural data - - true - Data used by the Vienna RNA analysis package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence mask parameter - - beta12orEarlier - 1.5 - true - Data used to replace (mask) characters in a molecular sequence. - - - - - - - - - - Enzyme kinetics data - - - Data concerning chemical reaction(s) catalysed by enzyme(s). - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Michaelis Menten plot - - A plot giving an approximation of the kinetics of an enzyme-catalysed reaction, assuming simple kinetics (i.e. 
no intermediate or product inhibition, allostericity or cooperativity). It plots initial reaction rate to the substrate concentration (S) from which the maximum rate (vmax) is apparent. - beta12orEarlier - - - - - - - - - - Hanes Woolf plot - - beta12orEarlier - A plot based on the Michaelis Menten equation of enzyme kinetics plotting the ratio of the initial substrate concentration (S) against the reaction velocity (v). - - - - - - - - - - Experimental data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - true - Raw data from or annotation on laboratory experiments. - beta12orEarlier - Experimental measurement data - beta13 - - - - - - - - - - - Genome version information - - beta12orEarlier - true - Information on a genome version. - 1.5 - - - - - - - - - - Evidence - - Typically a statement about some data or results, including evidence or the source of a statement, which may include computational prediction, laboratory experiment, literature reference etc. - beta12orEarlier - - - - - - - - - - Sequence record lite - - beta12orEarlier - A molecular sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - 1.8 - - - - - - - - - - Sequence - - - - - - - - http://purl.bioontology.org/ontology/MSH/D008969 - Sequences - http://purl.org/biotop/biotop.owl#BioMolecularSequenceInformation - This concept is a placeholder of concepts for primary sequence data including raw sequences and sequence records. It should not normally be used for derivatives such as sequence alignments, motifs or profiles. - beta12orEarlier - One or more molecular sequences, possibly with associated annotation. - - - - - - - - - - Nucleic acid sequence record (lite) - - beta12orEarlier - 1.8 - true - A nucleic acid sequence and minimal metadata, typically an identifier of the sequence and/or a comment. 
- - - - - - - - - - Protein sequence record (lite) - - 1.8 - Sequence record lite (protein) - beta12orEarlier - A protein sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - - - - - - - - - - Report - - You can use this term by default for any textual report, in case you can't find another, more specific term. Reports may be generated automatically or collated by hand and can include metadata on the origin, source, history, ownership or location of some thing. - http://semanticscience.org/resource/SIO_000148 - Document - A human-readable collection of information including annotation on a biological entity or phenomena, computer-generated reports of analysis of primary data (e.g. sequence or structural), and metadata (data about primary data) or any other free (essentially unformatted) text, as distinct from the primary data itself. - beta12orEarlier - - - - - - - - - - Molecular property (general) - - General molecular property - General data for a molecule. - beta12orEarlier - - - - - - - - - - Structural data - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - true - Data concerning molecular structural data. - beta13 - - - - - - - - - - - Sequence motif (nucleic acid) - - Nucleic acid sequence motif - DNA sequence motif - A nucleotide sequence motif. - beta12orEarlier - RNA sequence motif - - - - - - - - - - Sequence motif (protein) - - beta12orEarlier - An amino acid sequence motif. - Protein sequence motif - - - - - - - - - - Search parameter - - beta12orEarlier - 1.5 - true - Some simple value controlling a search operation, typically a search of a database. - - - - - - - - - - Database search results - - beta12orEarlier - A report of hits from searching a database of some type. 
- Search results - Database hits - - - - - - - - - - Secondary structure - - 1.5 - true - beta12orEarlier - The secondary structure assignment (predicted or real) of a nucleic acid or protein. - - - - - - - - - - Matrix - - beta12orEarlier - Array - This is a broad data type and is used a placeholder for other, more specific types. - An array of numerical values. - - - - - - - - - - Alignment data - - beta12orEarlier - 1.8 - true - Data concerning, extracted from, or derived from the analysis of molecular alignment of some type. - This is a broad data type and is used a placeholder for other, more specific types. - Alignment report - - - - - - - - - - Nucleic acid report - - An informative human-readable report about one or more specific nucleic acid molecules, derived from analysis of primary (sequence or structural) data. - beta12orEarlier - - - - - - - - - - Structure report - - An informative report on general information, properties or features of one or more molecular tertiary (3D) structures. - beta12orEarlier - Structure-derived report - - - - - - - - - - Nucleic acid structure data - - Nucleic acid property (structural) - This includes reports on the stiffness, curvature, twist/roll data or other conformational parameters or properties. - Nucleic acid structural property - beta12orEarlier - A report on nucleic acid structure-derived data, describing structural properties of a DNA molecule, or any other annotation or information about specific nucleic acid 3D structure(s). - - - - - - - - - - Molecular property - - beta12orEarlier - SO:0000400 - A report on the physical (e.g. structural) or chemical properties of molecules, or parts of a molecule. - Physicochemical property - - - - - - - - - - DNA base structural data - - Structural data for DNA base pairs or runs of bases, such as energy or angle data. 
- beta12orEarlier - - - - - - - - - - Database entry version information - - true - beta12orEarlier - 1.5 - Information on a database (or ontology) entry version, such as name (or other identifier) or parent database, unique identifier of entry, data, author and so on. - - - - - - - - - - Accession - - beta12orEarlier - http://semanticscience.org/resource/SIO_000731 - A persistent (stable) and unique identifier, typically identifying an object (entry) from a database. - http://semanticscience.org/resource/SIO_000675 - - - - - - - - - - - SNP - - single nucleotide polymorphism (SNP) in a DNA sequence. - true - beta12orEarlier - 1.8 - - - - - - - - - - Data reference - - A list of database accessions or identifiers are usually included. - Reference to a dataset (or a cross-reference between two datasets), typically one or more entries in a biological database or ontology. - beta12orEarlier - - - - - - - - - - Job identifier - - http://wsio.org/data_009 - An identifier of a submitted job. - beta12orEarlier - - - - - - - - - - - Name - - http://semanticscience.org/resource/SIO_000116 - http://usefulinc.com/ns/doap#name - "http://www.w3.org/2000/01/rdf-schema#label - beta12orEarlier - A name of a thing, which need not necessarily uniquely identify it. - Symbolic name - - - - - - - Closely related, but focusing on labeling and human readability but not on identification. - - - - - - - - - - - Type - - A label (text token) describing the type of a thing, typically an enumerated string (a string with one of a limited set of values). - http://purl.org/dc/elements/1.1/type - 1.5 - beta12orEarlier - true - - - - - - - - - - User ID - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - KEGG organism code - - - A three-letter code used in the KEGG databases to uniquely identify organisms. 
- beta12orEarlier - - - - - - - - - - - Gene name (KEGG GENES) - - beta12orEarlier - KEGG GENES entry name - [a-zA-Z_0-9]+:[a-zA-Z_0-9\.-]* - Name of an entry (gene) from the KEGG GENES database. - Moby_namespace:GeneId - true - 1.3 - - - - - - - - - - BioCyc ID - - - Identifier of an object from one of the BioCyc databases. - beta12orEarlier - - - - - - - - - - - Compound ID (BioCyc) - - - BioCyc compound identifier - Identifier of a compound from the BioCyc chemical compounds database. - BioCyc compound ID - beta12orEarlier - - - - - - - - - - - Reaction ID (BioCyc) - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from the BioCyc reactions database. - - - - - - - - - - - Enzyme ID (BioCyc) - - - BioCyc enzyme ID - beta12orEarlier - Identifier of an enzyme from the BioCyc enzymes database. - - - - - - - - - - - Reaction ID - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from a database. - - - - - - - - - - - Identifier (hybrid) - - An identifier that is re-used for data objects of fundamentally different types (typically served from a single database). - beta12orEarlier - This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - - - - - - - - - - - Molecular property identifier - - - - - - - - beta12orEarlier - Identifier of a molecular property. - - - - - - - - - - - Codon usage table ID - - - - - - - - - - - - - - Identifier of a codon usage table, for example a genetic code. - Codon usage table identifier - beta12orEarlier - - - - - - - - - - - FlyBase primary identifier - - beta12orEarlier - Primary identifier of an object from the FlyBase database. - - - - - - - - - - - WormBase identifier - - beta12orEarlier - Identifier of an object from the WormBase database. - - - - - - - - - - - WormBase wormpep ID - - - Protein identifier used by WormBase database. 
- CE[0-9]{5} - beta12orEarlier - - - - - - - - - - - Nucleic acid features (codon) - - beta12orEarlier - true - An informative report on a trinucleotide sequence that encodes an amino acid including the triplet sequence, the encoded amino acid or whether it is a start or stop codon. - beta12orEarlier - - - - - - - - - - Map identifier - - - - - - - - An identifier of a map of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Person identifier - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - Nucleic acid identifier - - - - - - - - Name or other identifier of a nucleic acid molecule. - beta12orEarlier - - - - - - - - - - - Translation frame specification - - beta12orEarlier - Frame for translation of DNA (3 forward and 3 reverse frames relative to a chromosome). - - - - - - - - - - Genetic code identifier - - - - - - - - An identifier of a genetic code. - beta12orEarlier - - - - - - - - - - - Genetic code name - - - Informal name for a genetic code, typically an organism name. - beta12orEarlier - - - - - - - - - - - File format name - - - Name of a file format such as HTML, PNG, PDF, EMBL, GenBank and so on. - beta12orEarlier - - - - - - - - - - - Sequence profile type - - true - 1.5 - A label (text token) describing a type of sequence profile such as frequency matrix, Gribskov profile, hidden Markov model etc. - beta12orEarlier - - - - - - - - - - Operating system name - - beta12orEarlier - Name of a computer operating system such as Linux, PC or Mac. - - - - - - - - - - - Mutation type - - beta12orEarlier - true - beta12orEarlier - A type of point or block mutation, including insertion, deletion, change, duplication and moves. - - - - - - - - - - Logical operator - - beta12orEarlier - A logical operator such as OR, AND, XOR, and NOT. - - - - - - - - - - - Results sort order - - Possible options including sorting by score, rank, by increasing P-value (probability, i.e. 
most statistically significant hits given first) and so on. - beta12orEarlier - true - 1.5 - A control of the order of data that is output, for example the order of sequences in an alignment. - - - - - - - - - - Toggle - - beta12orEarlier - A simple parameter that is a toggle (boolean value), typically a control for a modal tool. - true - beta12orEarlier - - - - - - - - - - Sequence width - - true - beta12orEarlier - beta12orEarlier - The width of an output sequence or alignment. - - - - - - - - - - Gap penalty - - beta12orEarlier - A penalty for introducing or extending a gap in an alignment. - - - - - - - - - - Nucleic acid melting temperature - - beta12orEarlier - A temperature concerning nucleic acid denaturation, typically the temperature at which the two strands of a hybridized or double stranded nucleic acid (DNA or RNA/DNA) molecule separate. - Melting temperature - - - - - - - - - - Concentration - - beta12orEarlier - The concentration of a chemical compound. - - - - - - - - - - Window step size - - 1.5 - beta12orEarlier - true - Size of the incremental 'step' a sequence window is moved over a sequence. - - - - - - - - - - EMBOSS graph - - beta12orEarlier - true - beta12orEarlier - An image of a graph generated by the EMBOSS suite. - - - - - - - - - - EMBOSS report - - An application report generated by the EMBOSS suite. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence offset - - true - beta12orEarlier - 1.5 - An offset for a single-point sequence position. - - - - - - - - - - Threshold - - 1.5 - beta12orEarlier - true - A value that serves as a threshold for a tool (usually to control scoring or output). - - - - - - - - - - Protein report (transcription factor) - - beta13 - true - This might include conformational or physicochemical properties, as well as sequence information for transcription factor(s) binding sites. - An informative report on a transcription factor protein. 
- Transcription factor binding site data - beta12orEarlier - - - - - - - - - - Database category name - - true - The name of a category of biological or bioinformatics database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence profile name - - beta12orEarlier - Name of a sequence profile. - true - beta12orEarlier - - - - - - - - - - Color - - Specification of one or more colors. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Rendering parameter - - true - beta12orEarlier - 1.5 - A parameter that is used to control rendering (drawing) to a device or image. - Graphics parameter - Graphical parameter - - - - - - - - - - Sequence name - - - Any arbitrary name of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Date - - 1.5 - A temporal date. - beta12orEarlier - true - - - - - - - - - - Word composition - - beta12orEarlier - Word composition data for a molecular sequence. - true - beta12orEarlier - - - - - - - - - - - Fickett testcode plot - - A plot of Fickett testcode statistic (identifying protein coding regions) in a nucleotide sequences. - beta12orEarlier - - - - - - - - - - Sequence similarity plot - - - Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - beta12orEarlier - Sequence conservation report - A plot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Helical wheel - - beta12orEarlier - An image of peptide sequence sequence looking down the axis of the helix for highlighting amphipathicity and other properties. - - - - - - - - - - Helical net - - beta12orEarlier - Useful for highlighting amphipathicity and other properties. - An image of peptide sequence sequence in a simple 3,4,3,4 repeating pattern that emulates at a simple level the arrangement of residues around an alpha helix. 
- - - - - - - - - - Protein sequence properties plot - - true - beta12orEarlier - beta12orEarlier - A plot of general physicochemical properties of a protein sequence. - - - - - - - - - - Protein ionization curve - - - beta12orEarlier - A plot of pK versus pH for a protein. - - - - - - - - - - Sequence composition plot - - - beta12orEarlier - A plot of character or word composition / frequency of a molecular sequence. - - - - - - - - - - Nucleic acid density plot - - - beta12orEarlier - Density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence trace image - - Image of a sequence trace (nucleotide sequence versus probabilities of each of the 4 bases). - beta12orEarlier - - - - - - - - - - Nucleic acid features (siRNA) - - true - 1.5 - beta12orEarlier - A report on siRNA duplexes in mRNA. - - - - - - - - - - Sequence set (stream) - - beta12orEarlier - true - This concept may be used for sequence sets that are expected to be read and processed a single sequence at a time. - A collection of multiple molecular sequences and (typically) associated metadata that is intended for sequential processing. - beta12orEarlier - - - - - - - - - - FlyBase secondary identifier - - Secondary identifier of an object from the FlyBase database. - Secondary identifier are used to handle entries that were merged with or split from other entries in the database. - beta12orEarlier - - - - - - - - - - - Cardinality - - The number of a certain thing. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 1 - - beta12orEarlier - beta12orEarlier - A single thing. - true - - - - - - - - - - 1 or more - - One or more things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 2 - - Exactly two things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 2 or more - - Two or more things. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence checksum - - A fixed-size datum calculated (by using a hash function) for a molecular sequence, typically for purposes of error detection or indexing. - beta12orEarlier - Hash code - Hash sum - Hash - Hash value - - - - - - - - - - Protein features report (chemical modifications) - - 1.8 - beta12orEarlier - chemical modification of a protein. - true - - - - - - - - - - Error - - beta12orEarlier - Data on an error generated by computer system or tool. - 1.5 - true - - - - - - - - - - Database entry metadata - - beta12orEarlier - Basic information on any arbitrary database entry. - - - - - - - - - - Gene cluster - - beta13 - true - beta12orEarlier - A cluster of similar genes. - - - - - - - - - - Sequence record full - - true - beta12orEarlier - A molecular sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - - - - - - - - - - Plasmid identifier - - An identifier of a plasmid in a database. - beta12orEarlier - - - - - - - - - - - Mutation ID - - - beta12orEarlier - A unique identifier of a specific mutation catalogued in a database. - - - - - - - - - - - Mutation annotation (basic) - - Information describing the mutation itself, the organ site, tissue and type of lesion where the mutation has been identified, description of the patient origin and life-style. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Mutation annotation (prevalence) - - beta12orEarlier - true - An informative report on the prevalence of mutation(s), including data on samples and mutation prevalence (e.g. by tumour type).. - beta12orEarlier - - - - - - - - - - Mutation annotation (prognostic) - - beta12orEarlier - An informative report on mutation prognostic data, such as information on patient cohort, the study settings and the results of the study. 
- beta12orEarlier - true - - - - - - - - - - Mutation annotation (functional) - - An informative report on the functional properties of mutant proteins including transcriptional activities, promotion of cell growth and tumorigenicity, dominant negative effects, capacity to induce apoptosis, cell-cycle arrest or checkpoints in human cells and so on. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon number - - beta12orEarlier - The number of a codon, for instance, at which a mutation is located. - - - - - - - - - - Tumor annotation - - true - 1.4 - An informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Server metadata - - Basic information about a server on the web, such as an SRS server. - beta12orEarlier - 1.5 - true - - - - - - - - - - Database field name - - The name of a field in a database. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (SYSTERS) - - SYSTERS cluster ID - Unique identifier of a sequence cluster from the SYSTERS database. - beta12orEarlier - - - - - - - - - - - Ontology metadata - - - - - - - - beta12orEarlier - Data concerning a biological ontology. - - - - - - - - - - Raw SCOP domain classification - - true - beta12orEarlier - Raw SCOP domain classification data files. - beta13 - These are the parsable data files provided by SCOP. - - - - - - - - - - Raw CATH domain classification - - Raw CATH domain classification data files. - These are the parsable data files provided by CATH. - true - beta13 - beta12orEarlier - - - - - - - - - - Heterogen annotation - - 1.4 - true - beta12orEarlier - An informative report on the types of small molecules or 'heterogens' (non-protein groups) that are represented in PDB files. - - - - - - - - - - Phylogenetic property values - - beta12orEarlier - Phylogenetic property values data. 
- true - beta12orEarlier - - - - - - - - - - Sequence set (bootstrapped) - - 1.5 - beta12orEarlier - Bootstrapping is often performed in phylogenetic analysis. - true - A collection of sequences output from a bootstrapping (resampling) procedure. - - - - - - - - - - Phylogenetic consensus tree - - true - A consensus phylogenetic tree derived from comparison of multiple trees. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Schema - - beta12orEarlier - true - A data schema for organising or transforming data of some type. - 1.5 - - - - - - - - - - DTD - - A DTD (document type definition). - true - beta12orEarlier - 1.5 - - - - - - - - - - XML Schema - - beta12orEarlier - XSD - An XML Schema. - true - 1.5 - - - - - - - - - - Relax-NG schema - - beta12orEarlier - 1.5 - A relax-NG schema. - true - - - - - - - - - - XSLT stylesheet - - 1.5 - beta12orEarlier - An XSLT stylesheet. - true - - - - - - - - - - Data resource definition name - - - beta12orEarlier - The name of a data type. - - - - - - - - - - - OBO file format name - - Name of an OBO file format such as OBO-XML, plain and so on. - beta12orEarlier - - - - - - - - - - - Gene ID (MIPS) - - Identifier for genetic elements in MIPS database. - beta12orEarlier - MIPS genetic element identifier - - - - - - - - - - - Sequence identifier (protein) - - An identifier of protein sequence(s) or protein sequence database entries. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence identifier (nucleic acid) - - An identifier of nucleotide sequence(s) or nucleotide sequence database entries. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - EMBL accession - - EMBL ID - beta12orEarlier - EMBL accession number - EMBL identifier - An accession number of an entry from the EMBL sequence database. - - - - - - - - - - - UniProt ID - - - - - - - - UniProtKB identifier - An identifier of a polypeptide in the UniProt database. 
- UniProtKB entry name - beta12orEarlier - UniProt identifier - UniProt entry name - - - - - - - - - - - GenBank accession - - GenBank ID - GenBank identifier - Accession number of an entry from the GenBank sequence database. - beta12orEarlier - GenBank accession number - - - - - - - - - - - Gramene secondary identifier - - beta12orEarlier - Gramene internal identifier - Gramene internal ID - Secondary (internal) identifier of a Gramene database entry. - Gramene secondary ID - - - - - - - - - - - Sequence variation ID - - - An identifier of an entry from a database of molecular sequence variation. - beta12orEarlier - - - - - - - - - - - Gene ID - - - Gene accession - beta12orEarlier - A unique (and typically persistent) identifier of a gene in a database, that is (typically) different to the gene name/symbol. - Gene code - - - - - - - - - - - Gene name (AceView) - - AceView gene name - 1.3 - true - Name of an entry (gene) from the AceView genes database. - beta12orEarlier - - - - - - - - - - Gene ID (ECK) - - ECK accession - beta12orEarlier - E. coli K-12 gene identifier - Identifier of an E. coli K-12 gene from EcoGene Database. - http://www.geneontology.org/doc/GO.xrf_abbs: ECK - - - - - - - - - - - Gene ID (HGNC) - - HGNC ID - beta12orEarlier - Identifier for a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - - Gene name - - - The name of a gene, (typically) assigned by a person and/or according to a naming scheme. It may contain white space characters and is typically more intuitive and readable than a gene symbol. It (typically) may be used to identify similar genes in different species and to derive a gene symbol. - Allele name - beta12orEarlier - - - - - - - - - - - Gene name (NCBI) - - beta12orEarlier - 1.3 - NCBI gene name - Name of an entry (gene) from the NCBI genes database. - true - - - - - - - - - - SMILES string - - A specification of a chemical structure in SMILES format. 
- beta12orEarlier - - - - - - - - - - STRING ID - - Unique identifier of an entry from the STRING database of protein-protein interactions. - beta12orEarlier - - - - - - - - - - - Virus annotation - - An informative report on a specific virus. - true - 1.4 - beta12orEarlier - - - - - - - - - - Virus annotation (taxonomy) - - An informative report on the taxonomy of a specific virus. - beta12orEarlier - true - 1.4 - - - - - - - - - - Reaction ID (SABIO-RK) - - Identifier of a biological reaction from the SABIO-RK reactions database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - Carbohydrate report - - Annotation on or information derived from one or more specific carbohydrate 3D structure(s). - beta12orEarlier - - - - - - - - - - GI number - - beta12orEarlier - NCBI GI number - gi number - A series of digits that are assigned consecutively to each sequence record processed by NCBI. The GI number bears no resemblance to the Accession number of the sequence record. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - - - - - - - - - - - NCBI version - - beta12orEarlier - NCBI accession.version - Nucleotide sequence version contains two letters followed by six digits, a dot, and a version number (or for older nucleotide sequence records, the format is one letter followed by five digits, a dot, and a version number). Protein sequence version contains three letters followed by five digits, a dot, and a version number. - An identifier assigned to sequence records processed by NCBI, made of the accession number of the database record followed by a dot and a version number. - accession.version - - - - - - - - - - - Cell line name - - beta12orEarlier - The name of a cell line. - - - - - - - - - - - Cell line name (exact) - - beta12orEarlier - The name of a cell line. 
- - - - - - - - - - - Cell line name (truncated) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (no punctuation) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (assonant) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Enzyme ID - - - beta12orEarlier - A unique, persistent identifier of an enzyme. - Enzyme accession - - - - - - - - - - - REBASE enzyme number - - Identifier of an enzyme from the REBASE enzymes database. - beta12orEarlier - - - - - - - - - - - DrugBank ID - - beta12orEarlier - DB[0-9]{5} - Unique identifier of a drug from the DrugBank database. - - - - - - - - - - - GI number (protein) - - beta12orEarlier - protein gi number - A unique identifier assigned to NCBI protein sequence records. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - protein gi - - - - - - - - - - - Bit score - - A score derived from the alignment of two sequences, which is then normalized with respect to the scoring system. - Bit scores are normalized with respect to the scoring system and therefore can be used to compare alignment scores from different searches. - beta12orEarlier - - - - - - - - - - Translation phase specification - - beta12orEarlier - Phase for translation of DNA (0, 1 or 2) relative to a fragment of the coding sequence. - Phase - - - - - - - - - - Resource metadata - - Data concerning or describing some core computational resource, as distinct from primary data. This includes metadata on the origin, source, history, ownership or location of some thing. - This is a broad data type and is used a placeholder for other, more specific types. 
- Provenance metadata - beta12orEarlier - - - - - - - - - - Ontology identifier - - - - - - - - beta12orEarlier - Any arbitrary identifier of an ontology. - - - - - - - - - - - Ontology concept name - - - The name of a concept in an ontology. - beta12orEarlier - - - - - - - - - - - Genome build identifier - - beta12orEarlier - An identifier of a build of a particular genome. - - - - - - - - - - - Pathway or network name - - The name of a biological pathway or network. - beta12orEarlier - - - - - - - - - - - Pathway ID (KEGG) - - - Identifier of a pathway from the KEGG pathway database. - beta12orEarlier - [a-zA-Z_0-9]{2,3}[0-9]{5} - KEGG pathway ID - - - - - - - - - - - Pathway ID (NCI-Nature) - - beta12orEarlier - [a-zA-Z_0-9]+ - Identifier of a pathway from the NCI-Nature pathway database. - - - - - - - - - - - Pathway ID (ConsensusPathDB) - - - beta12orEarlier - Identifier of a pathway from the ConsensusPathDB pathway database. - - - - - - - - - - - Sequence cluster ID (UniRef) - - Unique identifier of an entry from the UniRef database. - UniRef cluster id - UniRef entry accession - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniRef100) - - UniRef100 cluster id - beta12orEarlier - UniRef100 entry accession - Unique identifier of an entry from the UniRef100 database. - - - - - - - - - - - Sequence cluster ID (UniRef90) - - UniRef90 entry accession - beta12orEarlier - UniRef90 cluster id - Unique identifier of an entry from the UniRef90 database. - - - - - - - - - - - Sequence cluster ID (UniRef50) - - beta12orEarlier - UniRef50 cluster id - UniRef50 entry accession - Unique identifier of an entry from the UniRef50 database. - - - - - - - - - - - Ontology data - - - - - - - - Data concerning or derived from an ontology. - Ontological data - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. 
- - - - - - - - - - RNA family report - - beta12orEarlier - An informative report on a specific RNA family or other group of classified RNA sequences. - RNA family annotation - - - - - - - - - - RNA family identifier - - - - - - - - beta12orEarlier - Identifier of an RNA family, typically an entry from a RNA sequence classification database. - - - - - - - - - - - RFAM accession - - - Stable accession number of an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Protein signature type - - beta12orEarlier - true - A label (text token) describing a type of protein family signature (sequence classifier) from the InterPro database. - 1.5 - - - - - - - - - - Domain-nucleic acid interaction report - - 1.5 - true - An informative report on protein domain-DNA/RNA interaction(s). - beta12orEarlier - - - - - - - - - - Domain-domain interactions - - 1.8 - An informative report on protein domain-protein domain interaction(s). - beta12orEarlier - true - - - - - - - - - - Domain-domain interaction (indirect) - - true - beta12orEarlier - beta12orEarlier - Data on indirect protein domain-protein domain interaction(s). - - - - - - - - - - Sequence accession (hybrid) - - - - - - - - Accession number of a nucleotide or protein sequence database entry. - beta12orEarlier - - - - - - - - - - - 2D PAGE data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - beta13 - beta12orEarlier - true - Data concerning two-dimensional polygel electrophoresis. - - - - - - - - - - 2D PAGE report - - beta12orEarlier - two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - true - - - - - - - - - - Pathway or network accession - - - A persistent, unique identifier of a biological pathway or network (typically a database entry). 
- beta12orEarlier - - - - - - - - - - - Secondary structure alignment - - Alignment of the (1D representations of) secondary structure of two or more molecules. - beta12orEarlier - - - - - - - - - - ASTD ID - - - beta12orEarlier - Identifier of an object from the ASTD database. - - - - - - - - - - - ASTD ID (exon) - - beta12orEarlier - Identifier of an exon from the ASTD database. - - - - - - - - - - - ASTD ID (intron) - - beta12orEarlier - Identifier of an intron from the ASTD database. - - - - - - - - - - - ASTD ID (polya) - - Identifier of a polyA signal from the ASTD database. - beta12orEarlier - - - - - - - - - - - ASTD ID (tss) - - Identifier of a transcription start site from the ASTD database. - beta12orEarlier - - - - - - - - - - - 2D PAGE spot report - - 2D PAGE spot annotation - beta12orEarlier - An informative report on individual spot(s) from a two-dimensional (2D PAGE) gel. - 1.8 - true - - - - - - - - - - Spot ID - - - beta12orEarlier - Unique identifier of a spot from a two-dimensional (protein) gel. - - - - - - - - - - - Spot serial number - - Unique identifier of a spot from a two-dimensional (protein) gel in the SWISS-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Spot ID (HSC-2DPAGE) - - Unique identifier of a spot from a two-dimensional (protein) gel from a HSC-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Protein-motif interaction - - beta13 - true - Data on the interaction of a protein (or protein domain) with specific structural (3D) and/or sequence motifs. - beta12orEarlier - - - - - - - - - - Strain identifier - - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - beta12orEarlier - - - - - - - - - - - CABRI accession - - - A unique identifier of an item from the CABRI database. - beta12orEarlier - - - - - - - - - - - Experiment report (genotyping) - - true - Report of genotype experiment including case control, population, and family studies. 
These might use array based methods and re-sequencing methods. - 1.8 - beta12orEarlier - - - - - - - - - - Genotype experiment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of genotype experiment metadata. - - - - - - - - - - - EGA accession - - beta12orEarlier - Identifier of an entry from the EGA database. - - - - - - - - - - - IPI protein ID - - Identifier of a protein entry catalogued in the International Protein Index (IPI) database. - IPI[0-9]{8} - beta12orEarlier - - - - - - - - - - - RefSeq accession (protein) - - RefSeq protein ID - Accession number of a protein from the RefSeq database. - beta12orEarlier - - - - - - - - - - - EPD ID - - beta12orEarlier - Identifier of an entry (promoter) from the EPD database. - EPD identifier - - - - - - - - - - - TAIR accession - - - beta12orEarlier - Identifier of an entry from the TAIR database. - - - - - - - - - - - TAIR accession (At gene) - - beta12orEarlier - Identifier of an Arabidopsis thaliana gene from the TAIR database. - - - - - - - - - - - UniSTS accession - - beta12orEarlier - Identifier of an entry from the UniSTS database. - - - - - - - - - - - UNITE accession - - beta12orEarlier - Identifier of an entry from the UNITE database. - - - - - - - - - - - UTR accession - - beta12orEarlier - Identifier of an entry from the UTR database. - - - - - - - - - - - UniParc accession - - beta12orEarlier - UPI[A-F0-9]{10} - Accession number of a UniParc (protein sequence) database entry. - UniParc ID - UPI - - - - - - - - - - - mFLJ/mKIAA number - - beta12orEarlier - Identifier of an entry from the Rouge or HUGE databases. - - - - - - - - - - - Fungi annotation - - true - beta12orEarlier - 1.4 - An informative report on a specific fungus. - - - - - - - - - - Fungi annotation (anamorph) - - beta12orEarlier - An informative report on a specific fungus anamorph. - 1.4 - true - - - - - - - - - - Gene features report (exon) - - true - exons in a nucleotide sequences. 
- 1.8 - beta12orEarlier - - - - - - - - - - Ensembl protein ID - - - Ensembl ID (protein) - beta12orEarlier - Protein ID (Ensembl) - Unique identifier for a protein from the Ensembl database. - - - - - - - - - - - Gene transcriptional features report - - 1.8 - beta12orEarlier - transcription of DNA into RNA including the regulation of transcription. - true - - - - - - - - - - Toxin annotation - - beta12orEarlier - An informative report on a specific toxin. - 1.4 - true - - - - - - - - - - Protein report (membrane protein) - - beta12orEarlier - true - An informative report on a membrane protein. - beta12orEarlier - - - - - - - - - - Protein-drug interaction report - - true - An informative report on tentative or known protein-drug interaction(s). - 1.12 - beta12orEarlier - - - - - - - - - - Map data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - true - beta13 - Data concerning a map of molecular sequence(s). - - - - - - - - - - - Phylogenetic data - - Data concerning phylogeny, typically of molecular sequences, including reports of information concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - - - - - - - - - - Protein data - - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning one or more protein molecules. - true - beta12orEarlier - - - - - - - - - - Nucleic acid data - - true - Data concerning one or more nucleic acid molecules. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
It includes concepts that are best described as scientific text or closely concerned with or derived from text. - Article report - Data concerning, extracted from, or derived from the analysis of a scientific text (or texts) such as a full text article from a scientific journal. - - - - - - - - - - - Parameter - - http://semanticscience.org/resource/SIO_000144 - Tool-specific parameter - beta12orEarlier - http://www.e-lico.eu/ontologies/dmo/DMOP/DMOP.owl#Parameter - Typically a simple numerical or string value that controls the operation of a tool. - Parameters - Tool parameter - - - - - - - - - - Molecular data - - Molecule-specific data - true - Data concerning a specific type of molecule. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Molecule report - - An informative report on a specific molecule. - beta12orEarlier - Molecular report - 1.5 - true - - - - - - - - - - - Organism report - - An informative report on a specific organism. - beta12orEarlier - Organism annotation - - - - - - - - - - Experiment report - - Experiment metadata - beta12orEarlier - Experiment annotation - Annotation on a wet lab experiment, such as experimental conditions. - - - - - - - - - - Nucleic acid features report (mutation) - - DNA mutation. - 1.8 - true - beta12orEarlier - - - - - - - - - - Sequence attribute - - An attribute of a molecular sequence, possibly in reference to some other sequence. - Sequence parameter - beta12orEarlier - - - - - - - - - - Sequence tag profile - - SAGE, MPSS and SBS experiments are usually performed to study gene expression. The sequence tags are typically subsequently annotated (after a database search) with the mRNA (and therefore gene) the tag was extracted from. - beta12orEarlier - Sequencing-based expression profile - This includes tag to gene assignments (tag mapping) of SAGE, MPSS and SBS data. 
Typically this is the sequencing-based expression profile annotated with gene identifiers. - Sequence tag profile (with gene assignment) - Output from a serial analysis of gene expression (SAGE), massively parallel signature sequencing (MPSS) or sequencing by synthesis (SBS) experiment. In all cases this is a list of short sequence tags and the number of times it is observed. - - - - - - - - - - Mass spectrometry data - - beta12orEarlier - Data concerning a mass spectrometry measurement. - - - - - - - - - - Protein structure raw data - - beta12orEarlier - Raw data from experimental methods for determining protein structure. - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - - - - - - - - - - Mutation identifier - - An identifier of a mutation. - beta12orEarlier - - - - - - - - - - - Alignment data - - This is a broad data type and is used a placeholder for other, more specific types. This includes entities derived from sequences and structures such as motifs and profiles. - true - beta13 - Data concerning an alignment of two or more molecular sequences, structures or derived data. - beta12orEarlier - - - - - - - - - - - Data index data - - true - Data concerning an index of data. - beta12orEarlier - beta13 - Database index - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Amino acid name (single letter) - - beta12orEarlier - Single letter amino acid identifier, e.g. G. - - - - - - - - - - - Amino acid name (three letter) - - beta12orEarlier - Three letter amino acid identifier, e.g. GLY. - - - - - - - - - - - Amino acid name (full name) - - beta12orEarlier - Full name of an amino acid, e.g. Glycine. - - - - - - - - - - - Toxin identifier - - - - - - - - beta12orEarlier - Identifier of a toxin. 
- - - - - - - - - - - ArachnoServer ID - - Unique identifier of a toxin from the ArachnoServer database. - beta12orEarlier - - - - - - - - - - - Expressed gene list - - beta12orEarlier - true - 1.5 - Gene annotation (expressed gene list) - A simple summary of expressed genes. - - - - - - - - - - BindingDB Monomer ID - - Unique identifier of a monomer from the BindingDB database. - beta12orEarlier - - - - - - - - - - - GO concept name - - true - beta12orEarlier - beta12orEarlier - The name of a concept from the GO ontology. - - - - - - - - - - GO concept ID (biological process) - - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a 'biological process' concept from the the Gene Ontology. - - - - - - - - - - - GO concept ID (molecular function) - - beta12orEarlier - [0-9]{7}|GO:[0-9]{7} - An identifier of a 'molecular function' concept from the the Gene Ontology. - - - - - - - - - - - GO concept name (cellular component) - - The name of a concept for a cellular component from the GO ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Northern blot image - - beta12orEarlier - An image arising from a Northern Blot experiment. - - - - - - - - - - Blot ID - - - Unique identifier of a blot from a Northern Blot. - beta12orEarlier - - - - - - - - - - - BlotBase blot ID - - beta12orEarlier - Unique identifier of a blot from a Northern Blot from the BlotBase database. - - - - - - - - - - - Hierarchy - - beta12orEarlier - Raw data on a biological hierarchy, describing the hierarchy proper, hierarchy components and possibly associated annotation. - Hierarchy annotation - - - - - - - - - - Hierarchy identifier - - Identifier of an entry from a database of biological hierarchies. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Brite hierarchy ID - - beta12orEarlier - Identifier of an entry from the Brite database of biological hierarchies. - - - - - - - - - - - Cancer type - - true - A type (represented as a string) of cancer. 
- beta12orEarlier - beta12orEarlier - - - - - - - - - - BRENDA organism ID - - A unique identifier for an organism used in the BRENDA database. - beta12orEarlier - - - - - - - - - - - UniGene taxon - - The name of a taxon using the controlled vocabulary of the UniGene database. - UniGene organism abbreviation - beta12orEarlier - - - - - - - - - - - UTRdb taxon - - beta12orEarlier - The name of a taxon using the controlled vocabulary of the UTRdb database. - - - - - - - - - - - Catalogue ID - - beta12orEarlier - An identifier of a catalogue of biological resources. - Catalogue identifier - - - - - - - - - - - CABRI catalogue name - - - The name of a catalogue of biological resources from the CABRI database. - beta12orEarlier - - - - - - - - - - - Secondary structure alignment metadata - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Molecule interaction report - - An informative report on the physical, chemical or other information concerning the interaction of two or more molecules (or parts of molecules). - beta12orEarlier - Molecular interaction report - Molecular interaction data - - - - - - - - - Pathway or network - - - - - - - - Network - beta12orEarlier - Pathway - Primary data about a specific biological pathway or network (the nodes and connections within the pathway or network). - - - - - - - - - - Small molecule data - - true - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - beta13 - Data concerning one or more small molecules. - - - - - - - - - - Genotype and phenotype data - - beta12orEarlier - true - beta13 - Data concerning a particular genotype, phenotype or a genotype / phenotype relation. - - - - - - - - - - Gene expression data - - - - - - - - beta12orEarlier - Image or hybridisation data for a microarray, typically a study of gene expression. 
- Microarray data - This is a broad data type and is used a placeholder for other, more specific types. See also http://edamontology.org/data_0931 - - - - - - - - - - Compound ID (KEGG) - - - C[0-9]+ - Unique identifier of a chemical compound from the KEGG database. - beta12orEarlier - KEGG compound ID - KEGG compound identifier - - - - - - - - - - - RFAM name - - - Name (not necessarily stable) an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Reaction ID (KEGG) - - - Identifier of a biological reaction from the KEGG reactions database. - R[0-9]+ - beta12orEarlier - - - - - - - - - - - Drug ID (KEGG) - - - beta12orEarlier - Unique identifier of a drug from the KEGG Drug database. - D[0-9]+ - - - - - - - - - - - Ensembl ID - - - beta12orEarlier - ENS[A-Z]*[FPTG][0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl database. - Ensembl IDs - - - - - - - - - - - ICD identifier - - - - - - - - An identifier of a disease from the International Classification of Diseases (ICD) database. - beta12orEarlier - [A-Z][0-9]+(\.[-[0-9]+])? - - - - - - - - - - - Sequence cluster ID (CluSTr) - - Unique identifier of a sequence cluster from the CluSTr database. - [0-9A-Za-z]+:[0-9]+:[0-9]{1,5}(\.[0-9])? - CluSTr ID - beta12orEarlier - CluSTr cluster ID - - - - - - - - - - - KEGG Glycan ID - - - G[0-9]+ - Unique identifier of a glycan ligand from the KEGG GLYCAN database (a subset of KEGG LIGAND). - beta12orEarlier - - - - - - - - - - - TCDB ID - - beta12orEarlier - OBO file for regular expression. - TC number - [0-9]+\.[A-Z]\.[0-9]+\.[0-9]+\.[0-9]+ - A unique identifier of a family from the transport classification database (TCDB) of membrane transport proteins. - - - - - - - - - - - MINT ID - - MINT\-[0-9]{1,5} - Unique identifier of an entry from the MINT database of protein-protein interactions. 
- beta12orEarlier - - - - - - - - - - - DIP ID - - Unique identifier of an entry from the DIP database of protein-protein interactions. - beta12orEarlier - DIP[\:\-][0-9]{3}[EN] - - - - - - - - - - - Signaling Gateway protein ID - - beta12orEarlier - Unique identifier of a protein listed in the UCSD-Nature Signaling Gateway Molecule Pages database. - A[0-9]{6} - - - - - - - - - - - Protein modification ID - - - beta12orEarlier - Identifier of a protein modification catalogued in a database. - - - - - - - - - - - RESID ID - - Identifier of a protein modification catalogued in the RESID database. - AA[0-9]{4} - beta12orEarlier - - - - - - - - - - - RGD ID - - - [0-9]{4,7} - beta12orEarlier - Identifier of an entry from the RGD database. - - - - - - - - - - - TAIR accession (protein) - - - - - - - - - AASequence:[0-9]{10} - Identifier of a protein sequence from the TAIR database. - beta12orEarlier - - - - - - - - - - - Compound ID (HMDB) - - HMDB[0-9]{5} - beta12orEarlier - HMDB ID - Identifier of a small molecule metabolite from the Human Metabolome Database (HMDB). - - - - - - - - - - - LIPID MAPS ID - - beta12orEarlier - LM ID - Identifier of an entry from the LIPID MAPS database. - LM(FA|GL|GP|SP|ST|PR|SL|PK)[0-9]{4}([0-9a-zA-Z]{4})? - - - - - - - - - - - PeptideAtlas ID - - Identifier of a peptide from the PeptideAtlas peptide databases. - PDBML:pdbx_PDB_strand_id - beta12orEarlier - PAp[0-9]{8} - - - - - - - - - - - Molecular interaction ID - - Identifier of a report of molecular interactions from a database (typically). - true - beta12orEarlier - 1.7 - - - - - - - - - - BioGRID interaction ID - - [0-9]+ - beta12orEarlier - A unique identifier of an interaction from the BioGRID database. - - - - - - - - - - - Enzyme ID (MEROPS) - - MEROPS ID - Unique identifier of a peptidase enzyme from the MEROPS database. - beta12orEarlier - S[0-9]{2}\.[0-9]{3} - - - - - - - - - - - Mobile genetic element ID - - - An identifier of a mobile genetic element. 
- beta12orEarlier - - - - - - - - - - - ACLAME ID - - beta12orEarlier - mge:[0-9]+ - An identifier of a mobile genetic element from the Aclame database. - - - - - - - - - - - SGD ID - - - PWY[a-zA-Z_0-9]{2}\-[0-9]{3} - beta12orEarlier - Identifier of an entry from the Saccharomyces genome database (SGD). - - - - - - - - - - - Book ID - - - beta12orEarlier - Unique identifier of a book. - - - - - - - - - - - ISBN - - beta12orEarlier - (ISBN)?(-13|-10)?[:]?[ ]?([0-9]{2,3}[ -]?)?[0-9]{1,5}[ -]?[0-9]{1,7}[ -]?[0-9]{1,6}[ -]?([0-9]|X) - The International Standard Book Number (ISBN) is for identifying printed books. - - - - - - - - - - - Compound ID (3DMET) - - B[0-9]{5} - 3DMET ID - beta12orEarlier - Identifier of a metabolite from the 3DMET database. - - - - - - - - - - - MatrixDB interaction ID - - ([A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9])_.*|([OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]_.*)|(GAG_.*)|(MULT_.*)|(PFRAG_.*)|(LIP_.*)|(CAT_.*) - A unique identifier of an interaction from the MatrixDB database. - beta12orEarlier - - - - - - - - - - - cPath ID - - - [0-9]+ - These identifiers are unique within the cPath database, however, they are not stable between releases. - beta12orEarlier - A unique identifier for pathways, reactions, complexes and small molecules from the cPath (Pathway Commons) database. - - - - - - - - - - - PubChem bioassay ID - - - Identifier of an assay from the PubChem database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PubChem ID - - - PubChem identifier - beta12orEarlier - Identifier of an entry from the PubChem database. - - - - - - - - - - - Reaction ID (MACie) - - beta12orEarlier - M[0-9]{4} - MACie entry number - Identifier of an enzyme reaction mechanism from the MACie database. - - - - - - - - - - - Gene ID (miRBase) - - beta12orEarlier - miRNA name - miRNA ID - Identifier for a gene from the miRBase database. 
- MI[0-9]{7} - miRNA identifier - - - - - - - - - - - Gene ID (ZFIN) - - Identifier for a gene from the Zebrafish information network genome (ZFIN) database. - beta12orEarlier - ZDB\-GENE\-[0-9]+\-[0-9]+ - - - - - - - - - - - Reaction ID (Rhea) - - [0-9]{5} - Identifier of an enzyme-catalysed reaction from the Rhea database. - beta12orEarlier - - - - - - - - - - - Pathway ID (Unipathway) - - UPA[0-9]{5} - upaid - beta12orEarlier - Identifier of a biological pathway from the Unipathway database. - - - - - - - - - - - Compound ID (ChEMBL) - - Identifier of a small molecular from the ChEMBL database. - ChEMBL ID - beta12orEarlier - [0-9]+ - - - - - - - - - - - LGICdb identifier - - Unique identifier of an entry from the Ligand-gated ion channel (LGICdb) database. - beta12orEarlier - [a-zA-Z_0-9]+ - - - - - - - - - - - Reaction kinetics ID (SABIO-RK) - - Identifier of a biological reaction (kinetics entry) from the SABIO-RK reactions database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PharmGKB ID - - - beta12orEarlier - Identifier of an entry from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Pathway ID (PharmGKB) - - - PA[0-9]+ - Identifier of a pathway from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - - - - - - - - - - - Disease ID (PharmGKB) - - - Identifier of a disease from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - PA[0-9]+ - - - - - - - - - - - Drug ID (PharmGKB) - - - beta12orEarlier - Identifier of a drug from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Drug ID (TTD) - - DAP[0-9]+ - Identifier of a drug from the Therapeutic Target Database (TTD). - beta12orEarlier - - - - - - - - - - - Target ID (TTD) - - TTDS[0-9]+ - Identifier of a target protein from the Therapeutic Target Database (TTD). 
- beta12orEarlier - - - - - - - - - - - Cell type identifier - - beta12orEarlier - A unique identifier of a type or group of cells. - - - - - - - - - - - NeuronDB ID - - [0-9]+ - beta12orEarlier - A unique identifier of a neuron from the NeuronDB database. - - - - - - - - - - - NeuroMorpho ID - - beta12orEarlier - A unique identifier of a neuron from the NeuroMorpho database. - [a-zA-Z_0-9]+ - - - - - - - - - - - Compound ID (ChemIDplus) - - Identifier of a chemical from the ChemIDplus database. - ChemIDplus ID - [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (SMPDB) - - beta12orEarlier - Identifier of a pathway from the Small Molecule Pathway Database (SMPDB). - SMP[0-9]{5} - - - - - - - - - - - BioNumbers ID - - Identifier of an entry from the BioNumbers database of key numbers and associated data in molecular biology. - [0-9]+ - beta12orEarlier - - - - - - - - - - - T3DB ID - - beta12orEarlier - T3D[0-9]+ - Unique identifier of a toxin from the Toxin and Toxin Target Database (T3DB) database. - - - - - - - - - - - Carbohydrate identifier - - - - - - - - - - - - - - beta12orEarlier - Identifier of a carbohydrate. - - - - - - - - - - - GlycomeDB ID - - Identifier of an entry from the GlycomeDB database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - LipidBank ID - - beta12orEarlier - [a-zA-Z_0-9]+[0-9]+ - Identifier of an entry from the LipidBank database. - - - - - - - - - - - CDD ID - - beta12orEarlier - cd[0-9]{5} - Identifier of a conserved domain from the Conserved Domain Database. - - - - - - - - - - - MMDB ID - - [0-9]{1,5} - beta12orEarlier - An identifier of an entry from the MMDB database. - MMDB accession - - - - - - - - - - - iRefIndex ID - - Unique identifier of an entry from the iRefIndex database of protein-protein interactions. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ModelDB ID - - Unique identifier of an entry from the ModelDB database. 
- [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (DQCS) - - [0-9]+ - Identifier of a signaling pathway from the Database of Quantitative Cellular Signaling (DQCS). - beta12orEarlier - - - - - - - - - - - Ensembl ID (Homo sapiens) - - beta12orEarlier - true - beta12orEarlier - ENS([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database (Homo sapiens division). - - - - - - - - - - Ensembl ID ('Bos taurus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Bos taurus' division). - true - beta12orEarlier - ENSBTA([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Canis familiaris') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Canis familiaris' division). - true - ENSCAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Cavia porcellus') - - ENSCPO([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Cavia porcellus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Ciona intestinalis') - - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona intestinalis' division). - beta12orEarlier - beta12orEarlier - ENSCIN([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Ciona savignyi') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona savignyi' division). - ENSCSAV([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Ensembl ID ('Danio rerio') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Danio rerio' division). 
- true - beta12orEarlier - beta12orEarlier - ENSDAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Dasypus novemcinctus') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Dasypus novemcinctus' division). - beta12orEarlier - beta12orEarlier - ENSDNO([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Echinops telfairi') - - ENSETE([EGTP])[0-9]{11} - true - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Echinops telfairi' division). - - - - - - - - - - Ensembl ID ('Erinaceus europaeus') - - true - ENSEEU([EGTP])[0-9]{11} - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Erinaceus europaeus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Felis catus') - - beta12orEarlier - true - ENSFCA([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Felis catus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gallus gallus') - - ENSGAL([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gallus gallus' division). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gasterosteus aculeatus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gasterosteus aculeatus' division). - true - ENSGAC([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Homo sapiens') - - ENSHUM([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Homo sapiens' division). 
- true - - - - - - - - - - Ensembl ID ('Loxodonta africana') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Loxodonta africana' division). - ENSLAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Macaca mulatta') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Macaca mulatta' division). - beta12orEarlier - ENSMMU([EGTP])[0-9]{11} - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Monodelphis domestica') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Monodelphis domestica' division). - true - ENSMOD([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Mus musculus') - - ENSMUS([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Mus musculus' division). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Ensembl ID ('Myotis lucifugus') - - beta12orEarlier - ENSMLU([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Myotis lucifugus' division). - - - - - - - - - - Ensembl ID ("Ornithorhynchus anatinus") - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ornithorhynchus anatinus' division). - ENSOAN([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryctolagus cuniculus') - - beta12orEarlier - ENSOCU([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryctolagus cuniculus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryzias latipes') - - ENSORL([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryzias latipes' division). 
- beta12orEarlier - - - - - - - - - - Ensembl ID ('Otolemur garnettii') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Otolemur garnettii' division). - true - beta12orEarlier - ENSSAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Pan troglodytes') - - beta12orEarlier - beta12orEarlier - ENSPTR([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Pan troglodytes' division). - true - - - - - - - - - - Ensembl ID ('Rattus norvegicus') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Rattus norvegicus' division). - ENSRNO([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Spermophilus tridecemlineatus') - - true - beta12orEarlier - ENSSTO([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Spermophilus tridecemlineatus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Takifugu rubripes') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Takifugu rubripes' division). - ENSFRU([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Tupaia belangeri') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Tupaia belangeri' division). - true - ENSTBE([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Xenopus tropicalis') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Xenopus tropicalis' division). - beta12orEarlier - beta12orEarlier - true - ENSXET([EGTP])[0-9]{11} - - - - - - - - - - CATH identifier - - beta12orEarlier - Identifier of a protein domain (or other node) from the CATH database. 
- - - - - - - - - - - CATH node ID (family) - - beta12orEarlier - A code number identifying a family from the CATH database. - 2.10.10.10 - - - - - - - - - - - Enzyme ID (CAZy) - - Identifier of an enzyme from the CAZy enzymes database. - beta12orEarlier - CAZy ID - - - - - - - - - - - Clone ID (IMAGE) - - I.M.A.G.E. cloneID - IMAGE cloneID - A unique identifier assigned by the I.M.A.G.E. consortium to a clone (cloned molecular sequence). - beta12orEarlier - - - - - - - - - - - GO concept ID (cellular compartment) - - An identifier of a 'cellular compartment' concept from the Gene Ontology. - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - GO concept identifier (cellular compartment) - - - - - - - - - - - Chromosome name (BioCyc) - - Name of a chromosome as used in the BioCyc database. - beta12orEarlier - - - - - - - - - - - CleanEx entry name - - beta12orEarlier - An identifier of a gene expression profile from the CleanEx database. - - - - - - - - - - - CleanEx dataset code - - beta12orEarlier - An identifier of (typically a list of) gene expression experiments catalogued in the CleanEx database. - - - - - - - - - - - Genome report - - An informative report of general information concerning a genome as a whole. - beta12orEarlier - - - - - - - - - - Protein ID (CORUM) - - beta12orEarlier - CORUM complex ID - Unique identifier for a protein complex from the CORUM database. - - - - - - - - - - - CDD PSSM-ID - - beta12orEarlier - Unique identifier of a position-specific scoring matrix from the CDD database. - - - - - - - - - - - Protein ID (CuticleDB) - - CuticleDB ID - beta12orEarlier - Unique identifier for a protein from the CuticleDB database. - - - - - - - - - - - DBD ID - - Identifier of a predicted transcription factor from the DBD database. - beta12orEarlier - - - - - - - - - - - Oligonucleotide probe annotation - - - - - - - - General annotation on an oligonucleotide probe, or a set of probes. 
- beta12orEarlier - Oligonucleotide probe sets annotation - - - - - - - - - - Oligonucleotide ID - - - Identifier of an oligonucleotide from a database. - beta12orEarlier - - - - - - - - - - - dbProbe ID - - Identifier of an oligonucleotide probe from the dbProbe database. - beta12orEarlier - - - - - - - - - - - Dinucleotide property - - beta12orEarlier - Physicochemical property data for one or more dinucleotides. - - - - - - - - - - DiProDB ID - - beta12orEarlier - Identifier of an dinucleotide property from the DiProDB database. - - - - - - - - - - - Protein features report (disordered structure) - - 1.8 - true - beta12orEarlier - disordered structure in a protein. - - - - - - - - - - Protein ID (DisProt) - - DisProt ID - beta12orEarlier - Unique identifier for a protein from the DisProt database. - - - - - - - - - - - Embryo report - - Annotation on an embryo or concerning embryological development. - true - Embryo annotation - beta12orEarlier - 1.5 - - - - - - - - - - Ensembl transcript ID - - - beta12orEarlier - Transcript ID (Ensembl) - Unique identifier for a gene transcript from the Ensembl database. - - - - - - - - - - - Inhibitor annotation - - 1.4 - beta12orEarlier - An informative report on one or more small molecules that are enzyme inhibitors. - true - - - - - - - - - - Promoter ID - - - beta12orEarlier - An identifier of a promoter of a gene that is catalogued in a database. - Moby:GeneAccessionList - - - - - - - - - - - EST accession - - Identifier of an EST sequence. - beta12orEarlier - - - - - - - - - - - COGEME EST ID - - beta12orEarlier - Identifier of an EST sequence from the COGEME database. - - - - - - - - - - - COGEME unisequence ID - - Identifier of a unisequence from the COGEME database. - A unisequence is a single sequence assembled from ESTs. - beta12orEarlier - - - - - - - - - - - Protein family ID (GeneFarm) - - GeneFarm family ID - beta12orEarlier - Accession number of an entry (family) from the TIGRFam database. 
- - - - - - - - - - - Family name - - beta12orEarlier - The name of a family of organism. - - - - - - - - - - - Genus name (virus) - - true - The name of a genus of viruses. - beta13 - beta12orEarlier - - - - - - - - - - Family name (virus) - - beta13 - The name of a family of viruses. - true - beta12orEarlier - - - - - - - - - - Database name (SwissRegulon) - - true - beta13 - The name of a SwissRegulon database. - beta12orEarlier - - - - - - - - - - Sequence feature ID (SwissRegulon) - - beta12orEarlier - A feature identifier as used in the SwissRegulon database. - This can be name of a gene, the ID of a TFBS, or genomic coordinates in form "chr:start..end". - - - - - - - - - - - FIG ID - - A FIG ID consists of four parts: a prefix, genome id, locus type and id number. - A unique identifier of gene in the NMPDR database. - beta12orEarlier - - - - - - - - - - - Gene ID (Xenbase) - - A unique identifier of gene in the Xenbase database. - beta12orEarlier - - - - - - - - - - - Gene ID (Genolist) - - beta12orEarlier - A unique identifier of gene in the Genolist database. - - - - - - - - - - - Gene name (Genolist) - - beta12orEarlier - true - Genolist gene name - 1.3 - Name of an entry (gene) from the Genolist genes database. - - - - - - - - - - ABS ID - - ABS identifier - beta12orEarlier - Identifier of an entry (promoter) from the ABS database. - - - - - - - - - - - AraC-XylS ID - - Identifier of a transcription factor from the AraC-XylS database. - beta12orEarlier - - - - - - - - - - - Gene name (HUGO) - - beta12orEarlier - beta12orEarlier - true - Name of an entry (gene) from the HUGO database. - - - - - - - - - - Locus ID (PseudoCAP) - - beta12orEarlier - Identifier of a locus from the PseudoCAP database. - - - - - - - - - - - Locus ID (UTR) - - beta12orEarlier - Identifier of a locus from the UTR database. - - - - - - - - - - - MonosaccharideDB ID - - Unique identifier of a monosaccharide from the MonosaccharideDB database. 
- beta12orEarlier - - - - - - - - - - - Database name (CMD) - - beta12orEarlier - true - The name of a subdivision of the Collagen Mutation Database (CMD) database. - beta13 - - - - - - - - - - Database name (Osteogenesis) - - beta12orEarlier - true - beta13 - The name of a subdivision of the Osteogenesis database. - - - - - - - - - - Genome identifier - - An identifier of a particular genome. - beta12orEarlier - - - - - - - - - - - GenomeReviews ID - - beta12orEarlier - An identifier of a particular genome. - - - - - - - - - - - GlycoMap ID - - [0-9]+ - beta12orEarlier - Identifier of an entry from the GlycosciencesDB database. - - - - - - - - - - - Carbohydrate conformational map - - beta12orEarlier - A conformational energy map of the glycosidic linkages in a carbohydrate molecule. - - - - - - - - - - Gene features report (intron) - - introns in a nucleotide sequences. - true - beta12orEarlier - 1.8 - - - - - - - - - - Transcription factor name - - - The name of a transcription factor. - beta12orEarlier - - - - - - - - - - - TCID - - Identifier of a membrane transport proteins from the transport classification database (TCDB). - beta12orEarlier - - - - - - - - - - - Pfam domain name - - beta12orEarlier - Name of a domain from the Pfam database. - PF[0-9]{5} - - - - - - - - - - - Pfam clan ID - - beta12orEarlier - CL[0-9]{4} - Accession number of a Pfam clan. - - - - - - - - - - - Gene ID (VectorBase) - - VectorBase ID - beta12orEarlier - Identifier for a gene from the VectorBase database. - - - - - - - - - - - UTRSite ID - - Identifier of an entry from the UTRSite database of regulatory motifs in eukaryotic UTRs. - beta12orEarlier - - - - - - - - - - - Sequence signature report - - - - - - - - Sequence motif report - Sequence profile report - An informative report about a specific or conserved pattern in a molecular sequence, such as its context in genes or proteins, its role, origin or method of construction, etc. 
- beta12orEarlier - - - - - - - - - - Locus annotation - - Locus report - true - beta12orEarlier - An informative report on a particular locus. - beta12orEarlier - - - - - - - - - - Protein name (UniProt) - - Official name of a protein as used in the UniProt database. - beta12orEarlier - - - - - - - - - - - Term ID list - - One or more terms from one or more controlled vocabularies which are annotations on an entity. - beta12orEarlier - true - The concepts are typically provided as a persistent identifier or some other link the source ontologies. Evidence of the validity of the annotation might be included. - 1.5 - - - - - - - - - - HAMAP ID - - Name of a protein family from the HAMAP database. - beta12orEarlier - - - - - - - - - - - Identifier with metadata - - Basic information concerning an identifier of data (typically including the identifier itself). For example, a gene symbol with information concerning its provenance. - beta12orEarlier - true - 1.12 - - - - - - - - - - Gene symbol annotation - - true - beta12orEarlier - Annotation about a gene symbol. - beta12orEarlier - - - - - - - - - - Transcript ID - - - - - - - - - Identifier of a RNA transcript. - beta12orEarlier - - - - - - - - - - - HIT ID - - Identifier of an RNA transcript from the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HIX ID - - A unique identifier of gene cluster in the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HPA antibody id - - beta12orEarlier - Identifier of a antibody from the HPA database. - - - - - - - - - - - IMGT/HLA ID - - Identifier of a human major histocompatibility complex (HLA) or other protein from the IMGT/HLA database. - beta12orEarlier - - - - - - - - - - - Gene ID (JCVI) - - A unique identifier of gene assigned by the J. Craig Venter Institute (JCVI). - beta12orEarlier - - - - - - - - - - - Kinase name - - beta12orEarlier - The name of a kinase protein. 
- - - - - - - - - - - ConsensusPathDB entity ID - - - Identifier of a physical entity from the ConsensusPathDB database. - beta12orEarlier - - - - - - - - - - - ConsensusPathDB entity name - - - beta12orEarlier - Name of a physical entity from the ConsensusPathDB database. - - - - - - - - - - - CCAP strain number - - The number of a strain of algae and protozoa from the CCAP database. - beta12orEarlier - - - - - - - - - - - Stock number - - - beta12orEarlier - An identifier of stock from a catalogue of biological resources. - - - - - - - - - - - Stock number (TAIR) - - beta12orEarlier - A stock number from The Arabidopsis information resource (TAIR). - - - - - - - - - - - REDIdb ID - - beta12orEarlier - Identifier of an entry from the RNA editing database (REDIdb). - - - - - - - - - - - SMART domain name - - Name of a domain from the SMART database. - beta12orEarlier - - - - - - - - - - - Protein family ID (PANTHER) - - beta12orEarlier - Panther family ID - Accession number of an entry (family) from the PANTHER database. - - - - - - - - - - - RNAVirusDB ID - - beta12orEarlier - Could list (or reference) other taxa here from https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - A unique identifier for a virus from the RNAVirusDB database. - - - - - - - - - - - Virus ID - - - beta12orEarlier - An accession of annotation on a (group of) viruses (catalogued in a database). - - - - - - - - - - - NCBI Genome Project ID - - An identifier of a genome project assigned by NCBI. - beta12orEarlier - - - - - - - - - - - NCBI genome accession - - A unique identifier of a whole genome assigned by the NCBI. - beta12orEarlier - - - - - - - - - - - Sequence profile data - - 1.8 - Data concerning, extracted from, or derived from the analysis of a sequence profile, such as its name, length, technical details about the profile or it's construction, the biological role or annotation, and so on. 
- true - beta12orEarlier - - - - - - - - - - Protein ID (TopDB) - - beta12orEarlier - TopDB ID - Unique identifier for a membrane protein from the TopDB database. - - - - - - - - - - - Gel ID - - Gel identifier - Identifier of a two-dimensional (protein) gel. - beta12orEarlier - - - - - - - - - - - Reference map name (SWISS-2DPAGE) - - - beta12orEarlier - Name of a reference map gel from the SWISS-2DPAGE database. - - - - - - - - - - - Protein ID (PeroxiBase) - - PeroxiBase ID - beta12orEarlier - Unique identifier for a peroxidase protein from the PeroxiBase database. - - - - - - - - - - - SISYPHUS ID - - beta12orEarlier - Identifier of an entry from the SISYPHUS database of tertiary structure alignments. - - - - - - - - - - - ORF ID - - - beta12orEarlier - Accession of an open reading frame (catalogued in a database). - - - - - - - - - - - ORF identifier - - An identifier of an open reading frame. - beta12orEarlier - - - - - - - - - - - Linucs ID - - Identifier of an entry from the GlycosciencesDB database. - beta12orEarlier - - - - - - - - - - - Protein ID (LGICdb) - - beta12orEarlier - LGICdb ID - Unique identifier for a ligand-gated ion channel protein from the LGICdb database. - - - - - - - - - - - MaizeDB ID - - beta12orEarlier - Identifier of an EST sequence from the MaizeDB database. - - - - - - - - - - - Gene ID (MfunGD) - - beta12orEarlier - A unique identifier of gene in the MfunGD database. - - - - - - - - - - - Orpha number - - - - - - - - beta12orEarlier - An identifier of a disease from the Orpha database. - - - - - - - - - - - Protein ID (EcID) - - beta12orEarlier - Unique identifier for a protein from the EcID database. - - - - - - - - - - - Clone ID (RefSeq) - - - A unique identifier of a cDNA molecule catalogued in the RefSeq database. - beta12orEarlier - - - - - - - - - - - Protein ID (ConoServer) - - beta12orEarlier - Unique identifier for a cone snail toxin protein from the ConoServer database. 
- - - - - - - - - - - GeneSNP ID - - Identifier of a GeneSNP database entry. - beta12orEarlier - - - - - - - - - - - Lipid identifier - - - - - - - - - - - - - - Identifier of a lipid. - beta12orEarlier - - - - - - - - - - - Databank - - true - beta12orEarlier - A flat-file (textual) data archive. - beta12orEarlier - - - - - - - - - - Web portal - - A web site providing data (web pages) on a common theme to a HTTP client. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gene ID (VBASE2) - - Identifier for a gene from the VBASE2 database. - beta12orEarlier - VBASE2 ID - - - - - - - - - - - DPVweb ID - - DPVweb virus ID - beta12orEarlier - A unique identifier for a virus from the DPVweb database. - - - - - - - - - - - Pathway ID (BioSystems) - - beta12orEarlier - Identifier of a pathway from the BioSystems pathway database. - [0-9]+ - - - - - - - - - - - Experimental data (proteomics) - - true - Data concerning a proteomics experiment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Abstract - - beta12orEarlier - An abstract of a scientific article. - - - - - - - - - - Lipid structure - - beta12orEarlier - 3D coordinate and associated data for a lipid structure. - - - - - - - - - - Drug structure - - beta12orEarlier - 3D coordinate and associated data for the (3D) structure of a drug. - - - - - - - - - - Toxin structure - - 3D coordinate and associated data for the (3D) structure of a toxin. - beta12orEarlier - - - - - - - - - - Position-specific scoring matrix - - - beta12orEarlier - PSSM - A simple matrix of numbers, where each value (or column of values) is derived derived from analysis of the corresponding position in a sequence alignment. - - - - - - - - - - Distance matrix - - A matrix of distances between molecular entities, where a value (distance) is (typically) derived from comparison of two entities and reflects their similarity. 
- beta12orEarlier - - - - - - - - - - Structural distance matrix - - Distances (values representing similarity) between a group of molecular structures. - beta12orEarlier - - - - - - - - - - Article metadata - - true - beta12orEarlier - Bibliographic data concerning scientific article(s). - 1.5 - - - - - - - - - - Ontology concept - - beta12orEarlier - This includes any fields from the concept definition such as concept name, definition, comments and so on. - A concept from a biological ontology. - - - - - - - - - - Codon usage bias - - A numerical measure of differences in the frequency of occurrence of synonymous codons in DNA sequences. - beta12orEarlier - - - - - - - - - - Northern blot report - - true - beta12orEarlier - 1.8 - Northern Blot experiments. - - - - - - - - - - Nucleic acid features report (VNTR) - - 1.8 - beta12orEarlier - true - variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - - - - - - - - - - Nucleic acid features report (microsatellite) - - true - microsatellite polymorphism in a DNA sequence. - 1.8 - beta12orEarlier - - - - - - - - - - - Nucleic acid features report (RFLP) - - beta12orEarlier - true - 1.8 - restriction fragment length polymorphisms (RFLP) in a DNA sequence. - - - - - - - - - - Radiation hybrid map - - The radiation method can break very closely linked markers providing a more detailed map. Most genetic markers and subsequences may be located to a defined map position and with a more precise estimates of distance than a linkage map. - A map showing distance between genetic markers estimated by radiation-induced breaks in a chromosome. - beta12orEarlier - RH map - - - - - - - - - - ID list - - A simple list of data identifiers (such as database accessions), possibly with additional basic information on the addressed data. - beta12orEarlier - - - - - - - - - - Phylogenetic gene frequencies data - - beta12orEarlier - Gene frequencies data that may be read during phylogenetic tree calculation. 
- - - - - - - - - - Sequence set (polymorphic) - - beta13 - beta12orEarlier - true - A set of sub-sequences displaying some type of polymorphism, typically indicating the sequence in which they occur, their position and other metadata. - - - - - - - - - - DRCAT resource - - 1.5 - An entry (resource) from the DRCAT bioinformatics resource catalogue. - beta12orEarlier - true - - - - - - - - - - Protein complex - - beta12orEarlier - 3D coordinate and associated data for a multi-protein complex; two or more polypeptides chains in a stable, functional association with one another. - - - - - - - - - - Protein structural motif - - beta12orEarlier - 3D coordinate and associated data for a protein (3D) structural motif; any group of contiguous or non-contiguous amino acid residues but typically those forming a feature with a structural or functional role. - - - - - - - - - - Lipid report - - beta12orEarlier - Annotation on or information derived from one or more specific lipid 3D structure(s). - - - - - - - - - - Secondary structure image - - 1.4 - beta12orEarlier - Image of one or more molecular secondary structures. - true - - - - - - - - - - Secondary structure report - - Secondary structure-derived report - beta12orEarlier - true - An informative report on general information, properties or features of one or more molecular secondary structures. - 1.5 - - - - - - - - - - DNA features - - beta12orEarlier - DNA sequence-specific feature annotation (not in a feature table). - true - beta12orEarlier - - - - - - - - - - RNA features report - - true - beta12orEarlier - 1.5 - Features concerning RNA or regions of DNA that encode an RNA molecule. - RNA features - Nucleic acid features (RNA features) - - - - - - - - - - Plot - - beta12orEarlier - Biological data that has been plotted as a graph of some type. - - - - - - - - - - Nucleic acid features report (polymorphism) - - true - DNA polymorphism. 
- beta12orEarlier - - - - - - - - - - Protein sequence record - - - A protein sequence and associated metadata. - beta12orEarlier - Sequence record (protein) - - - - - - - - - - Nucleic acid sequence record - - - RNA sequence record - Nucleotide sequence record - A nucleic acid sequence and associated metadata. - beta12orEarlier - DNA sequence record - Sequence record (nucleic acid) - - - - - - - - - - Protein sequence record (full) - - A protein sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - beta12orEarlier - true - - - - - - - - - - Nucleic acid sequence record (full) - - true - A nucleic acid sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - beta12orEarlier - 1.8 - - - - - - - - - - Biological model accession - - - beta12orEarlier - Accession of a mathematical model, typically an entry from a database. - - - - - - - - - - - Cell type name - - - The name of a type or group of cells. - beta12orEarlier - - - - - - - - - - - Cell type accession - - - Cell type ID - beta12orEarlier - Accession of a type or group of cells (catalogued in a database). - - - - - - - - - - - Compound accession - - - Small molecule accession - Accession of an entry from a database of chemicals. - beta12orEarlier - Chemical compound accession - - - - - - - - - - - Drug accession - - - Accession of a drug. - beta12orEarlier - - - - - - - - - - - Toxin name - - - Name of a toxin. - beta12orEarlier - - - - - - - - - - - Toxin accession - - - beta12orEarlier - Accession of a toxin (catalogued in a database). - - - - - - - - - - - Monosaccharide accession - - - Accession of a monosaccharide (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Drug name - - - beta12orEarlier - Common name of a drug. 
- - - - - - - - - - - Carbohydrate accession - - - Accession of an entry from a database of carbohydrates. - beta12orEarlier - - - - - - - - - - - Molecule accession - - - Accession of a specific molecule (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Data resource definition accession - - - beta12orEarlier - Accession of a data definition (catalogued in a database). - - - - - - - - - - - Genome accession - - - An accession of a particular genome (in a database). - beta12orEarlier - - - - - - - - - - - Map accession - - - An accession of a map of a molecular sequence (deposited in a database). - beta12orEarlier - - - - - - - - - - - Lipid accession - - - beta12orEarlier - Accession of an entry from a database of lipids. - - - - - - - - - - - Peptide ID - - - beta12orEarlier - Accession of a peptide deposited in a database. - - - - - - - - - - - Protein accession - - - Protein accessions - beta12orEarlier - Accession of a protein deposited in a database. - - - - - - - - - - - Organism accession - - - An accession of annotation on a (group of) organisms (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Organism name - - - Moby:Organism_Name - Moby:OrganismsShortName - Moby:OccurrenceRecord - Moby:BriefOccurrenceRecord - Moby:FirstEpithet - Moby:InfraspecificEpithet - beta12orEarlier - Moby:OrganismsLongName - The name of an organism (or group of organisms). - - - - - - - - - - - Protein family accession - - - beta12orEarlier - Accession of a protein family (that is deposited in a database). - - - - - - - - - - - Transcription factor accession - - - - beta12orEarlier - Accession of an entry from a database of transcription factors or binding sites. - - - - - - - - - - - Strain accession - - - - - - - - - beta12orEarlier - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - Virus identifier - - An accession of annotation on a (group of) viruses (catalogued in a database). 
- beta12orEarlier - - - - - - - - - - - Sequence features metadata - - beta12orEarlier - Metadata on sequence features. - - - - - - - - - - Gramene identifier - - beta12orEarlier - Identifier of a Gramene database entry. - - - - - - - - - - - DDBJ accession - - beta12orEarlier - DDBJ accession number - DDBJ identifier - DDBJ ID - An identifier of an entry from the DDBJ sequence database. - - - - - - - - - - - ConsensusPathDB identifier - - beta12orEarlier - An identifier of an entity from the ConsensusPathDB database. - - - - - - - - - - - Sequence data - - This is a broad data type and is used a placeholder for other, more specific types. - 1.8 - beta12orEarlier - true - Data concerning, extracted from, or derived from the analysis of molecular sequence(s). - - - - - - - - - - Codon usage - - beta12orEarlier - true - beta13 - Data concerning codon usage. - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article report - - beta12orEarlier - 1.5 - Data derived from the analysis of a scientific text such as a full text article from a scientific journal. - true - - - - - - - - - - Sequence report - - An informative report of information about molecular sequence(s), including basic information (metadata), and reports generated from molecular sequence analysis, including positional features and non-positional properties. - beta12orEarlier - Sequence-derived report - - - - - - - - - - Protein secondary structure report - - An informative report about the properties or features of one or more protein secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plot - - - A Hopp and Woods plot of predicted antigenicity of a peptide or protein. - beta12orEarlier - - - - - - - - - - Nucleic acid melting curve - - - Shows the proportion of nucleic acid which are double-stranded versus temperature. - A melting curve of a double-stranded nucleic acid molecule (DNA or DNA/RNA). 
- beta12orEarlier - - - - - - - - - - Nucleic acid probability profile - - A probability profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - beta12orEarlier - Shows the probability of a base pair not being melted (i.e. remaining as double-stranded DNA) at a specified temperature - - - - - - - - - - Nucleic acid temperature profile - - A temperature profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - Plots melting temperature versus base position. - beta12orEarlier - Melting map - - - - - - - - - - Gene regulatory network report - - 1.8 - A report typically including a map (diagram) of a gene regulatory network. - true - beta12orEarlier - - - - - - - - - - 2D PAGE gel report - - An informative report on a two-dimensional (2D PAGE) gel. - 2D PAGE image report - 1.8 - true - 2D PAGE gel annotation - beta12orEarlier - 2D PAGE image annotation - - - - - - - - - - Oligonucleotide probe sets annotation - - beta12orEarlier - 1.14 - true - General annotation on a set of oligonucleotide probes, such as the gene name with which the probe set is associated and which probes belong to the set. - - - - - - - - - - Microarray image - - 1.5 - beta12orEarlier - Gene expression image - An image from a microarray experiment which (typically) allows a visualisation of probe hybridisation and gene-expression data. - true - - - - - - - - - - Image - - http://semanticscience.org/resource/SIO_000081 - Biological or biomedical data has been rendered into an image, typically for display on screen. - http://semanticscience.org/resource/SIO_000079 - Image data - beta12orEarlier - - - - - - - - - - Sequence image - - - Image of a molecular sequence, possibly with sequence features or properties shown. - beta12orEarlier - - - - - - - - - - Protein hydropathy data - - Protein hydropathy report - A report on protein properties concerning hydropathy. 
- beta12orEarlier - - - - - - - - - - Workflow data - - beta12orEarlier - beta13 - Data concerning a computational workflow. - true - - - - - - - - - - Workflow - - true - beta12orEarlier - 1.5 - A computational workflow. - - - - - - - - - - Secondary structure data - - beta13 - true - beta12orEarlier - Data concerning molecular secondary structure data. - - - - - - - - - - Protein sequence (raw) - - - Raw protein sequence - beta12orEarlier - Raw sequence (protein) - A raw protein sequence (string of characters). - - - - - - - - - - Nucleic acid sequence (raw) - - - Nucleic acid raw sequence - beta12orEarlier - Nucleotide sequence (raw) - Raw sequence (nucleic acid) - A raw nucleic acid sequence. - - - - - - - - - - Protein sequence - - One or more protein sequences, possibly with associated annotation. - Protein sequences - beta12orEarlier - http://purl.org/biotop/biotop.owl#AminoAcidSequenceInformation - - - - - - - - - - Nucleic acid sequence - - One or more nucleic acid sequences, possibly with associated annotation. - beta12orEarlier - DNA sequence - Nucleotide sequence - Nucleotide sequences - Nucleic acid sequences - http://purl.org/biotop/biotop.owl#NucleotideSequenceInformation - - - - - - - - - - Reaction data - - Enzyme kinetics annotation - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Reaction annotation - Data concerning a biochemical reaction, typically data and more general annotation on the kinetics of enzyme-catalysed reaction. - - - - - - - - - - Peptide property - - beta12orEarlier - Peptide data - Data concerning small peptides. - - - - - - - - - - Protein classification - - This is a broad data type and is used a placeholder for other, more specific types. - Protein classification data - An informative report concerning the classification of protein sequences or structures. 
- beta12orEarlier - - - - - - - - - Sequence motif data - - true - 1.8 - Data concerning specific or conserved pattern in molecular sequences. - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Sequence profile data - - beta12orEarlier - true - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning models representing a (typically multiple) sequence alignment. - - - - - - - - - - Pathway or network data - - Data concerning a specific biological pathway or network. - beta13 - true - beta12orEarlier - - - - - - - - - - - Pathway or network report - - - - - - - - beta12orEarlier - An informative report concerning or derived from the analysis of a biological pathway or network, such as a map (diagram) or annotation. - - - - - - - - - - Nucleic acid thermodynamic data - - Nucleic acid property (thermodynamic or kinetic) - A thermodynamic or kinetic property of a nucleic acid molecule. - Nucleic acid thermodynamic property - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification of nucleic acid sequences or structures. - Nucleic acid classification data - - - - - - - - - Classification report - - This can include an entire classification, components such as classifiers, assignments of entities to a classification and so on. - beta12orEarlier - true - Classification data - A report on a classification of molecular sequences, structures or other entities. - 1.5 - - - - - - - - - - Protein features report (key folding sites) - - beta12orEarlier - key residues involved in protein folding. 
- 1.8 - true - - - - - - - - - - Protein geometry report - - Torsion angle data - beta12orEarlier - Geometry data for a protein structure, for example bond lengths, bond angles, torsion angles, chiralities, planaraties etc. - - - - - - - - - - Protein structure image - - - An image of protein structure. - beta12orEarlier - Structure image (protein) - - - - - - - - - - Phylogenetic character weights - - Weights for sequence positions or characters in phylogenetic analysis where zero is defined as unweighted. - beta12orEarlier - - - - - - - - - - Annotation track - - beta12orEarlier - Genomic track - Annotation of one particular positional feature on a biomolecular (typically genome) sequence, suitable for import and display in a genome browser. - Genome annotation track - Genome-browser track - Genome track - Sequence annotation track - - - - - - - - - - UniProt accession - - - - - - - - UniProtKB accession number - beta12orEarlier - P43353|Q7M1G0|Q9C199|A5A6J6 - UniProt entry accession - [OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2} - Swiss-Prot entry accession - TrEMBL entry accession - Accession number of a UniProt (protein sequence) database entry. - UniProtKB accession - UniProt accession number - - - - - - - - - - - NCBI genetic code ID - - - Identifier of a genetic code in the NCBI list of genetic codes. - [1-9][0-9]? - 16 - beta12orEarlier - - - - - - - - - - - Ontology concept identifier - - - - - - - - Identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - GO concept name (biological process) - - true - The name of a concept for a biological process from the GO ontology. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GO concept name (molecular function) - - true - beta12orEarlier - The name of a concept for a molecular function from the GO ontology. 
- beta12orEarlier - - - - - - - - - - Taxonomy - - - - - - - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification, identification and naming of organisms. - Taxonomic data - - - - - - - - - - Protein ID (EMBL/GenBank/DDBJ) - - beta13 - EMBL/GENBANK/DDBJ coding feature protein identifier, issued by International collaborators. - This qualifier consists of a stable ID portion (3+5 format with 3 position letters and 5 numbers) plus a version number after the decimal point. When the protein sequence encoded by the CDS changes, only the version number of the /protein_id value is incremented; the stable part of the /protein_id remains unchanged and as a result will permanently be associated with a given protein; this qualifier is valid only on CDS features which translate into a valid protein. - - - - - - - - - - - Core data - - Core data entities typically have a format and may be identified by an accession number. - A type of data that (typically) corresponds to entries from the primary biological databases and which is (typically) the primary input or output of a tool, i.e. the data the tool processes or generates, as distinct from metadata and identifiers which describe and identify such core data, parameters that control the behaviour of tools, reports of derivative data generated by tools and annotation. - 1.5 - true - beta13 - - - - - - - - - - Sequence feature identifier - - - - - - - - beta13 - Name or other identifier of molecular sequence feature(s). - - - - - - - - - - - Structure identifier - - - - - - - - beta13 - An identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Matrix identifier - - - - - - - - An identifier of an array of numerical values, such as a comparison matrix. 
- beta13 - - - - - - - - - - - Protein sequence composition - - beta13 - 1.8 - true - A report (typically a table) on character or word composition / frequency of protein sequence(s). - - - - - - - - - - Nucleic acid sequence composition (report) - - 1.8 - A report (typically a table) on character or word composition / frequency of nucleic acid sequence(s). - true - beta13 - - - - - - - - - - Protein domain classification node - - beta13 - A node from a classification of protein structural domain(s). - true - 1.5 - - - - - - - - - - CAS number - - beta13 - CAS registry number - Unique numerical identifier of chemicals in the scientific literature, as assigned by the Chemical Abstracts Service. - - - - - - - - - - - ATC code - - Unique identifier of a drug conforming to the Anatomical Therapeutic Chemical (ATC) Classification System, a drug classification system controlled by the WHO Collaborating Centre for Drug Statistics Methodology (WHOCC). - beta13 - - - - - - - - - - - UNII - - beta13 - A unique, unambiguous, alphanumeric identifier of a chemical substance as catalogued by the Substance Registration System of the Food and Drug Administration (FDA). - Unique Ingredient Identifier - - - - - - - - - - - Geotemporal metadata - - 1.5 - beta13 - true - Basic information concerning geographical location or time. - - - - - - - - - - System metadata - - Metadata concerning the software, hardware or other aspects of a computer system. - beta13 - - - - - - - - - - Sequence feature name - - - A name of a sequence feature, e.g. the name of a feature to be displayed to an end-user. - beta13 - - - - - - - - - - - Experimental measurement - - beta13 - Raw data such as measurements or other results from laboratory experiments, as generated from laboratory hardware. - Experimental measurement data - Measurement - This is a broad data type and is used a placeholder for other, more specific types. 
It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Measured data - Experimentally measured data - Measurement metadata - Measurement data - Raw experimental data - - - - - - - - - - Raw microarray data - - - beta13 - Raw data (typically MIAME-compliant) for hybridisations from a microarray experiment. - Such data as found in Affymetrix CEL or GPR files. - - - - - - - - - - Processed microarray data - - - - - - - - Data generated from processing and analysis of probe set data from a microarray experiment. - Gene annotation (expression) - Microarray probe set data - beta13 - Gene expression report - Such data as found in Affymetrix .CHP files or data from other software such as RMA or dChip. - - - - - - - - - - Gene expression matrix - - - This combines data from all hybridisations. - beta13 - Normalised microarray data - The final processed (normalised) data for a set of hybridisations in a microarray experiment. - Gene expression data matrix - - - - - - - - - - Sample annotation - - Annotation on a biological sample, for example experimental factors and their values. - This might include compound and dose in a dose response experiment. - beta13 - - - - - - - - - - Microarray metadata - - This might include gene identifiers, genomic coordinates, probe oligonucleotide sequences etc. - Annotation on the array itself used in a microarray experiment. - beta13 - - - - - - - - - - Microarray protocol annotation - - true - This might describe e.g. the normalisation methods used to process the raw data. - beta13 - 1.8 - Annotation on laboratory and/or data processing protocols used in an microarray experiment. - - - - - - - - - - Microarray hybridisation data - - Data concerning the hybridisations measured during a microarray experiment. - beta13 - - - - - - - - - - Protein features report (topological domains) - - 1.8 - beta13 - topological domains such as cytoplasmic regions in a protein. 
- true - - - - - - - - - - Sequence features (compositionally-biased regions) - - 1.5 - beta13 - true - A report of regions in a molecular sequence that are biased to certain characters. - - - - - - - - - - Nucleic acid features (difference and change) - - beta13 - A report on features in a nucleic acid sequence that indicate changes to or differences between sequences. - 1.5 - true - - - - - - - - - - Nucleic acid features report (expression signal) - - true - beta13 - regions within a nucleic acid sequence containing a signal that alters a biological function. - 1.8 - - - - - - - - - - Nucleic acid features report (binding) - - nucleic acids binding to some other molecule. - 1.8 - true - beta13 - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes). - - - - - - - - - - Nucleic acid repeats (report) - - true - repetitive elements within a nucleic acid sequence. - 1.8 - beta13 - - - - - - - - - - Nucleic acid features report (replication and recombination) - - beta13 - true - 1.8 - DNA replication or recombination. - - - - - - - - - - Nucleic acid structure report - - - A report on regions within a nucleic acid sequence which form secondary or tertiary (3D) structures. - Stem loop (report) - d-loop (report) - Nucleic acid features (structure) - Quadruplexes (report) - beta13 - - - - - - - - - - Protein features report (repeats) - - 1.8 - short repetitive subsequences (repeat sequences) in a protein sequence. - beta13 - true - - - - - - - - - - Sequence motif matches (protein) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more protein sequences. - 1.8 - beta13 - true - - - - - - - - - - Sequence motif matches (nucleic acid) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more nucleic acid sequences. 
- beta13 - true - 1.8 - - - - - - - - - - Nucleic acid features (d-loop) - - beta13 - true - 1.5 - A report on displacement loops in a mitochondrial DNA sequence. - A displacement loop is a region of mitochondrial DNA in which one of the strands is displaced by an RNA molecule. - - - - - - - - - - Nucleic acid features (stem loop) - - beta13 - true - A report on stem loops in a DNA sequence. - 1.5 - A stem loop is a hairpin structure; a double-helical structure formed when two complementary regions of a single strand of RNA or DNA molecule form base-pairs. - - - - - - - - - - Gene transcript report - - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Nucleic acid features (mRNA features) - beta13 - Transcript (report) - mRNA features - Gene transcript annotation - Clone or EST (report) - mRNA (report) - An informative report on features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. This includes reports on a specific gene transcript, clone or EST. - - - - - - - - - - - Nucleic acid features report (signal or transit peptide) - - true - coding sequences for a signal or transit peptide. - 1.8 - beta13 - - - - - - - - - - Non-coding RNA - - beta13 - true - features of non-coding or functional RNA molecules, including tRNA and rRNA. - 1.8 - - - - - - - - - - Transcriptional features (report) - - 1.5 - true - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - Features concerning transcription of DNA into RNA including the regulation of transcription. - beta13 - - - - - - - - - - Nucleic acid features report (STS) - - sequence tagged sites (STS) in nucleic acid sequences. 
- 1.8 - true - beta13 - - - - - - - - - - Nucleic acid features (immunoglobulin gene structure) - - true - beta13 - 1.5 - A report on predicted or actual immunoglobulin gene structure including constant, switch and variable regions and diversity, joining and variable segments. - - - - - - - - - - SCOP class - - 1.5 - beta13 - true - Information on a 'class' node from the SCOP database. - - - - - - - - - - SCOP fold - - beta13 - Information on a 'fold' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP superfamily - - beta13 - Information on a 'superfamily' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP family - - 1.5 - true - Information on a 'family' node from the SCOP database. - beta13 - - - - - - - - - - SCOP protein - - Information on a 'protein' node from the SCOP database. - true - beta13 - 1.5 - - - - - - - - - - SCOP species - - 1.5 - true - beta13 - Information on a 'species' node from the SCOP database. - - - - - - - - - - Mass spectrometry experiment - - 1.8 - true - mass spectrometry experiments. - beta13 - - - - - - - - - - Gene family report - - An informative report on a particular family of genes, typically a set of genes with similar sequence that originate from duplication of a common ancestor gene, or any other classification of nucleic acid sequences or structures that reflects gene structure. - This includes reports on on gene homologues between species. - beta13 - Gene annotation (homology information) - Homology information - Gene annotation (homology) - Nucleic acid classification - Gene family annotation - Gene homology (report) - - - - - - - - - - Protein image - - beta13 - An image of a protein. - - - - - - - - - - Protein alignment - - An alignment of protein sequences and/or structures. - beta13 - - - - - - - - - - NGS experiment - - 1.8 - 1.0 - sequencing experiment, including samples, sampling, preparation, sequencing, and analysis. 
- true - - - - - - - - - - Sequence assembly report - - An informative report about a DNA sequence assembly. - 1.1 - This might include an overall quality assement of the assembly and summary statistics including counts, average length and number of bases for reads, matches and non-matches, contigs, reads in pairs etc. - Assembly report - - - - - - - - - - Genome index - - 1.1 - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. - An index of a genome sequence. - - - - - - - - - - GWAS report - - 1.8 - 1.1 - Report concerning genome-wide association study experiments. - true - Genome-wide association study - - - - - - - - - - Cytoband position - - 1.2 - The position of a cytogenetic band in a genome. - Information might include start and end position in a chromosome sequence, chromosome identifier, name of band and so on. - - - - - - - - - - Cell type ontology ID - - - CL ID - Cell type ontology concept ID. - CL_[0-9]{7} - 1.2 - beta12orEarlier - - - - - - - - - - - Kinetic model - - 1.2 - Mathematical model of a network, that contains biochemical kinetics. - - - - - - - - - - COSMIC ID - - COSMIC identifier - cosmic ID - Identifier of a COSMIC database entry. - cosmic identifier - cosmic id - 1.3 - - - - - - - - - - - HGMD ID - - Identifier of a HGMD database entry. - hgmd ID - hgmd identifier - beta12orEarlier - hgmd id - HGMD identifier - - - - - - - - - - - Sequence assembly ID - - Sequence assembly version - Unique identifier of sequence assembly. - 1.3 - - - - - - - - - - - Sequence feature type - - true - A label (text token) describing a type of sequence feature such as gene, transcript, cds, exon, repeat, simple, misc, variation, somatic variation, structural variation, somatic structural variation, constrained or regulatory. - 1.3 - 1.5 - - - - - - - - - - Gene homology (report) - - beta12orEarlier - true - An informative report on gene homologues between species. 
- 1.5 - - - - - - - - - - Ensembl gene tree ID - - - ENSGT00390000003602 - Ensembl ID (gene tree) - Unique identifier for a gene tree from the Ensembl database. - 1.3 - - - - - - - - - - - Gene tree - - 1.3 - A phylogenetic tree that is an estimate of the character's phylogeny. - - - - - - - - - - Species tree - - A phylogenetic tree that reflects phylogeny of the taxa from which the characters (used in calculating the tree) were sampled. - 1.3 - - - - - - - - - - Sample ID - - - - - - - - - 1.3 - Sample accession - Name or other identifier of an entry from a biosample database. - - - - - - - - - - - MGI accession - - - Identifier of an object from the MGI database. - 1.3 - - - - - - - - - - - Phenotype name - - - 1.3 - Name of a phenotype. - Phenotypes - Phenotype - - - - - - - - - - - Transition matrix - - A HMM transition matrix contains the probabilities of switching from one HMM state to another. - Consider for example an HMM with two states (AT-rich and GC-rich). The transition matrix will hold the probabilities of switching from the AT-rich to the GC-rich state, and vica versa. - HMM transition matrix - 1.4 - - - - - - - - - Emission matrix - - A HMM emission matrix holds the probabilities of choosing the four nucleotides (A, C, G and T) in each of the states of a HMM. - 1.4 - Consider for example an HMM with two states (AT-rich and GC-rich). The emission matrix holds the probabilities of choosing each of the four nucleotides (A, C, G and T) in the AT-rich state and in the GC-rich state. - HMM emission matrix - - - - - - - - - Hidden Markov model - - A statistical Markov model of a system which is assumed to be a Markov process with unobserved (hidden) states. - 1.4 - - - - - - - - - Format identifier - - An identifier of a data format. - 1.4 - - - - - - - - - Raw image - - 1.5 - Amino acid data - http://semanticscience.org/resource/SIO_000081 - beta12orEarlier - Image data - Raw biological or biomedical image generated by some experimental technique. 
- - - - - - - - - - Carbohydrate property - - Carbohydrate data - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all carbohydrates. - 1.5 - - - - - - - - - - Proteomics experiment report - - true - 1.8 - Report concerning proteomics experiments. - 1.5 - - - - - - - - - - RNAi report - - 1.5 - RNAi experiments. - true - 1.8 - - - - - - - - - - Simulation experiment report - - 1.5 - biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - true - 1.8 - - - - - - - - - - MRI image - - - - - - - - MRT image - 1.7 - Magnetic resonance tomography image - Nuclear magnetic resonance imaging image - - Magnetic resonance imaging image - - NMRI image - An imaging technique that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - - - - - - - - - - Cell migration track image - - - - - - - - 1.7 - An image from a cell migration track assay. - - - - - - - - - - Rate of association - - kon - 1.7 - Rate of association of a protein with another protein or some other molecule. - - - - - - - - - - Gene order - - Such data are often used for genome rearrangement tools and phylogenetic tree labeling. - Multiple gene identifiers in a specific order. - 1.7 - - - - - - - - - - Spectrum - - 1.7 - The spectrum of frequencies of electromagnetic radiation emitted from a molecule as a result of some spectroscopy experiment. - Spectra - - - - - - - - - - NMR spectrum - - - - - - - - Spectral information for a molecule from a nuclear magnetic resonance experiment. - 1.7 - NMR spectra - - - - - - - - - - Chemical structure sketch - - Chemical structure sketches are used for presentational purposes but also as inputs to various analysis software. - 1.8 - Small molecule sketch - A sketch of a small molecule made with some specialised drawing package. 
- - - - - - - - - - Nucleic acid signature - - 1.8 - An informative report about a specific or conserved nucleic acid sequence pattern. - - - - - - - - - - DNA sequence - - DNA sequences - 1.8 - A DNA sequence. - - - - - - - - - - RNA sequence - - A DNA sequence. - DNA sequences - RNA sequences - 1.8 - - - - - - - - - - RNA sequence (raw) - - - Raw sequence (RNA) - 1.8 - A raw RNA sequence. - RNA raw sequence - - - - - - - - - - DNA sequence (raw) - - - Raw sequence (DNA) - A raw DNA sequence. - 1.8 - DNA raw sequence - - - - - - - - - - Sequence variations - - - - - - - - 1.8 - Data on gene sequence variations resulting large-scale genotyping and DNA sequencing projects. - Gene sequence variations - Variations are stored along with a reference genome. - - - - - - - - - - Bibliography - - 1.8 - A list of publications such as scientic papers or books. - - - - - - - - - - Ontology mapping - - A mapping of supplied textual terms or phrases to ontology concepts (URIs). - beta12orEarlier - - - - - - - - - - Image metadata - - Image-associated data - This can include basic provenance and technical information about the image, scientific annotation and so on. - Any data concerning a specific biological or biomedical image. - 1.9 - Image data - Image-related data - - - - - - - - - - Clinical trial report - - Clinical trial information - A report concerning a clinical trial. - 1.9 - - - - - - - - - - Reference sample report - - 1.10 - A report about a biosample. - Biosample report - - - - - - - - - - Gene Expression Atlas Experiment ID - - Accession number of an entry from the Gene Expression Atlas. - 1.10 - - - - - - - - - - - Disease identifier - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of disease. - - - - - - - - - - - Disease name - - - The name of some disease. - 1.12 - - - - - - - - - - - Training material - - Open educational resource - Some material that is used for educational (training) purposes. 
- OER - 1.12 - - - - - - - - - - Online course - - MOOC - A training course available for use on the Web. - On-line course - 1.12 - Massive open online course - - - - - - - - - - Text - - - Any free or plain text, as often specified as some search query. - Plain text - Free text - 1.12 - - - - - - - - - - Biodiversity report - - Biodiversity information - 1.9 - A report about biodiversity data. - - - - - - - - - - Biosafety report - - A report about biosafety data. - Biosafety information - 1.14 - - - - - - - - - - Isolation report - - Geographic location - Isolation source - 1.14 - A report about any kind of isolation of biological material. - - - - - - - - - - Pathogenicity report - - 1.14 - Information about the ability of an organism to cause disease in a corresponding host. - Pathogenicity - - - - - - - - - - Biosafety classification - - Information about the biosafety classification of an organism according to corresponding law. - Biosafety level - 1.14 - - - - - - - - - - Geographic location - - A report about localisation of the isolaton of biological material e.g. country or coordinates. - 1.14 - - - - - - - - - - Isolation source - - A report about any kind of isolation source of biological material e.g. blood, water, soil. - 1.14 - - - - - - - - - - Physiology parameter - - Experimentally determined parameter of the physiology of an organism, e.g. substrate spectrum. - 1.14 - - - - - - - - - - Morphology parameter - - Experimentally determined parameter of the morphology of an organism, e.g. size & shape. - 1.14 - - - - - - - - - - Cultivation parameter - - Salinity - Carbon source - Experimental determined parameter for the cultivation of an organism. - Cultivation conditions - Temperature - 1.14 - Culture media composition - pH value - Nitrogen source - - - - - - - - - - SMILES - - - Chemical structure specified in Simplified Molecular Input Line Entry System (SMILES) line notation. 
- beta12orEarlier - - - - - - - - - - - - - - InChI - - - Chemical structure specified in IUPAC International Chemical Identifier (InChI) line notation. - beta12orEarlier - - - - - - - - - - mf - - - Chemical structure specified by Molecular Formula (MF), including a count of each element in a compound. - beta12orEarlier - The general MF query format consists of a series of valid atomic symbols, with an optional number or range. - - - - - - - - - - InChIKey - - - An InChIKey identifier is not human- nor machine-readable but is more suitable for web searches than an InChI chemical structure specification. - The InChIKey (hashed InChI) is a fixed length (25 character) condensed digital representation of an InChI chemical structure specification. It uniquely identifies a chemical compound. - beta12orEarlier - - - - - - - - - - smarts - - SMILES ARbitrary Target Specification (SMARTS) format for chemical structure specification, which is a subset of the SMILES line notation. - beta12orEarlier - - - - - - - - - - unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - nucleotide - - - Non-sequence characters may be used for example for gaps. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Nucleotide_sequence - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - protein - - - Alphabet for a protein sequence with possible ambiguity, unknown positions and non-sequence characters. - beta12orEarlier - Non-sequence characters may be used for gaps and translation stop. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Amino_acid_sequence - - - - - - - - - - consensus - - - beta12orEarlier - Alphabet for the consensus of two or more molecular sequences. 
- - - - - - - - - - pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - unambiguous pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence (characters ACGTU only) with possible unknown positions but without ambiguity or non-sequence characters . - - - - - - - - - - dna - - beta12orEarlier - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#DNA_sequence - Alphabet for a DNA sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - rna - - Alphabet for an RNA sequence with possible ambiguity, unknown positions and non-sequence characters. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#RNA_sequence - beta12orEarlier - - - - - - - - - - unambiguous pure dna - - - Alphabet for a DNA sequence (characters ACGT only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure dna - - - Alphabet for a DNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure rna - - - Alphabet for an RNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - UniGene entry format - - beta12orEarlier - Format of an entry from UniGene. 
- A UniGene entry includes a set of transcript sequences assigned to the same transcription locus (gene or expressed pseudogene), with information on protein similarities, gene expression, cDNA clone reagents, and genomic location. - beta12orEarlier - true - - - - - - - - - - COG sequence cluster format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the COG database of clusters of (related) protein sequences. - - - - - - - - - - EMBL feature location - - - beta12orEarlier - Feature location - Format for sequence positions (feature location) as used in DDBJ/EMBL/GenBank database. - - - - - - - - - - quicktandem - - - Report format for tandem repeats in a nucleotide sequence (format generated by the Sanger Centre quicktandem program). - beta12orEarlier - - - - - - - - - - Sanger inverted repeats - - - beta12orEarlier - Report format for inverted repeats in a nucleotide sequence (format generated by the Sanger Centre inverted program). - - - - - - - - - - EMBOSS repeat - - - Report format for tandem repeats in a sequence (an EMBOSS report format). - beta12orEarlier - - - - - - - - - - est2genome format - - - beta12orEarlier - Format of a report on exon-intron structure generated by EMBOSS est2genome. - - - - - - - - - - restrict format - - - Report format for restriction enzyme recognition sites used by EMBOSS restrict program. - beta12orEarlier - - - - - - - - - - restover format - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by EMBOSS restover program. - - - - - - - - - - REBASE restriction sites - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by REBASE database. - - - - - - - - - - FASTA search results format - - - Format of results of a sequence database search using FASTA. - beta12orEarlier - This includes (typically) score data, alignment data and a histogram (of observed and expected distribution of E values.) 
- - - - - - - - - - BLAST results - - - Format of results of a sequence database search using some variant of BLAST. - beta12orEarlier - This includes score data, alignment data and summary table. - - - - - - - - - - mspcrunch - - - beta12orEarlier - Format of results of a sequence database search using some variant of MSPCrunch. - - - - - - - - - - Smith-Waterman format - - - beta12orEarlier - Format of results of a sequence database search using some variant of Smith Waterman. - - - - - - - - - - dhf - - - The hits are relatives to a SCOP or CATH family and are found from a search of a sequence database. - beta12orEarlier - Format of EMBASSY domain hits file (DHF) of hits (sequences) with domain classification information. - - - - - - - - - - lhf - - - beta12orEarlier - Format of EMBASSY ligand hits file (LHF) of database hits (sequences) with ligand classification information. - The hits are putative ligand-binding sequences and are found from a search of a sequence database. - - - - - - - - - - InterPro hits format - - - Results format for searches of the InterPro database. - beta12orEarlier - - - - - - - - - - InterPro protein view report format - - Format of results of a search of the InterPro database showing matches of query protein sequence(s) to InterPro entries. - The report includes a classification of regions in a query protein sequence which are assigned to a known InterPro protein family or group. - beta12orEarlier - - - - - - - - - - InterPro match table format - - Format of results of a search of the InterPro database showing matches between protein sequence(s) and signatures for an InterPro entry. - beta12orEarlier - The table presents matches between query proteins (rows) and signature methods (columns) for this entry. Alternatively the sequence(s) might be from from the InterPro entry itself. The match position in the protein sequence and match status (true positive, false positive etc) are indicated. 
- - - - - - - - - - HMMER Dirichlet prior - - - beta12orEarlier - Dirichlet distribution HMMER format. - - - - - - - - - - MEME Dirichlet prior - - - beta12orEarlier - Dirichlet distribution MEME format. - - - - - - - - - - HMMER emission and transition - - - Format of a report from the HMMER package on the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - prosite-pattern - - - Format of a regular expression pattern from the Prosite database. - beta12orEarlier - - - - - - - - - - EMBOSS sequence pattern - - - Format of an EMBOSS sequence pattern. - beta12orEarlier - - - - - - - - - - meme-motif - - - A motif in the format generated by the MEME program. - beta12orEarlier - - - - - - - - - - prosite-profile - - - Sequence profile (sequence classifier) format used in the PROSITE database. - beta12orEarlier - - - - - - - - - - JASPAR format - - - beta12orEarlier - A profile (sequence classifier) in the format used in the JASPAR database. - - - - - - - - - - MEME background Markov model - - - Format of the model of random sequences used by MEME. - beta12orEarlier - - - - - - - - - - HMMER format - - - Format of a hidden Markov model representation used by the HMMER package. - beta12orEarlier - - - - - - - - - - HMMER-aln - - - - beta12orEarlier - FASTA-style format for multiple sequences aligned by HMMER package to an HMM. - - - - - - - - - - DIALIGN format - - - Format of multiple sequences aligned by DIALIGN package. - beta12orEarlier - - - - - - - - - - daf - - - The format is clustal-like and includes annotation of domain family classification information. - EMBASSY 'domain alignment file' (DAF) format, containing a sequence alignment of protein domains belonging to the same SCOP or CATH family. 
- beta12orEarlier - - - - - - - - - - Sequence-MEME profile alignment - - - beta12orEarlier - Format for alignment of molecular sequences to MEME profiles (position-dependent scoring matrices) as generated by the MAST tool from the MEME package. - - - - - - - - - - HMMER profile alignment (sequences versus HMMs) - - - Format used by the HMMER package for an alignment of a sequence against a hidden Markov model database. - beta12orEarlier - - - - - - - - - - HMMER profile alignment (HMM versus sequences) - - - Format used by the HMMER package for of an alignment of a hidden Markov model against a sequence database. - beta12orEarlier - - - - - - - - - - Phylip distance matrix - - - Data Type must include the distance matrix, probably as pairs of sequence identifiers with a distance (integer or float). - beta12orEarlier - Format of PHYLIP phylogenetic distance matrix data. - - - - - - - - - - ClustalW dendrogram - - - beta12orEarlier - Dendrogram (tree file) format generated by ClustalW. - - - - - - - - - - Phylip tree raw - - - Raw data file format used by Phylip from which a phylogenetic tree is directly generated or plotted. - beta12orEarlier - - - - - - - - - - Phylip continuous quantitative characters - - - beta12orEarlier - PHYLIP file format for continuous quantitative character data. - - - - - - - - - - Phylogenetic property values format - - Format of phylogenetic property data. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Phylip character frequencies format - - - beta12orEarlier - PHYLIP file format for phylogenetics character frequency data. - - - - - - - - - - Phylip discrete states format - - - Format of PHYLIP discrete states data. - beta12orEarlier - - - - - - - - - - Phylip cliques format - - - beta12orEarlier - Format of PHYLIP cliques data. - - - - - - - - - - Phylip tree format - - - Phylogenetic tree data format used by the PHYLIP program. 
- beta12orEarlier - - - - - - - - - - TreeBASE format - - - beta12orEarlier - The format of an entry from the TreeBASE database of phylogenetic data. - - - - - - - - - - TreeFam format - - - beta12orEarlier - The format of an entry from the TreeFam database of phylogenetic data. - - - - - - - - - - Phylip tree distance format - - - Format for distances, such as Branch Score distance, between two or more phylogenetic trees as used by the Phylip package. - beta12orEarlier - - - - - - - - - - dssp - - - beta12orEarlier - The DSSP database is built using the DSSP application which defines secondary structure, geometrical features and solvent exposure of proteins, given atomic coordinates in PDB format. - Format of an entry from the DSSP database (Dictionary of Secondary Structure in Proteins). - - - - - - - - - - hssp - - - Entry format of the HSSP database (Homology-derived Secondary Structure in Proteins). - beta12orEarlier - - - - - - - - - - Dot-bracket format - - - beta12orEarlier - Format of RNA secondary structure in dot-bracket notation, originally generated by the Vienna RNA package/server. - Vienna RNA secondary structure format - Vienna RNA format - - - - - - - - - - Vienna local RNA secondary structure format - - - Format of local RNA secondary structure components with free energy values, generated by the Vienna RNA package/server. - beta12orEarlier - - - - - - - - - - PDB database entry format - - - - - - - - beta12orEarlier - PDB entry format - Format of an entry (or part of an entry) from the PDB database. - - - - - - - - - - PDB - - - PDB format - beta12orEarlier - Entry format of PDB database in PDB format. - - - - - - - - - - mmCIF - - - Chemical MIME (http://www.ch.ic.ac.uk/chemime): chemical/x-mmcif - Entry format of PDB database in mmCIF format. - beta12orEarlier - mmcif - - - - - - - - - - PDBML - - - Entry format of PDB database in PDBML (XML) format. 
- beta12orEarlier - - - - - - - - - - Domainatrix 3D-1D scoring matrix format - - beta12orEarlier - true - beta12orEarlier - Format of a matrix of 3D-1D scores used by the EMBOSS Domainatrix applications. - - - - - - - - - - aaindex - - - Amino acid index format used by the AAindex database. - beta12orEarlier - - - - - - - - - - IntEnz enzyme report format - - beta12orEarlier - beta12orEarlier - Format of an entry from IntEnz (The Integrated Relational Enzyme Database). - IntEnz is the master copy of the Enzyme Nomenclature, the recommendations of the NC-IUBMB on the Nomenclature and Classification of Enzyme-Catalysed Reactions. - true - - - - - - - - - - BRENDA enzyme report format - - true - Format of an entry from the BRENDA enzyme database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - KEGG REACTION enzyme report format - - true - beta12orEarlier - Format of an entry from the KEGG REACTION database of biochemical reactions. - beta12orEarlier - - - - - - - - - - KEGG ENZYME enzyme report format - - beta12orEarlier - true - Format of an entry from the KEGG ENZYME database. - beta12orEarlier - - - - - - - - - - REBASE proto enzyme report format - - Format of an entry from the proto section of the REBASE enzyme database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - REBASE withrefm enzyme report format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the withrefm section of the REBASE enzyme database. - - - - - - - - - - Pcons report format - - - Format of output of the Pcons Model Quality Assessment Program (MQAP). - beta12orEarlier - Pcons ranks protein models by assessing their quality based on the occurrence of recurring common three-dimensional structural patterns. Pcons returns a score reflecting the overall global quality and a score for each individual residue in the protein reflecting the local residue quality. 
- - - - - - - - - - ProQ report format - - - beta12orEarlier - ProQ is a neural network-based predictor that predicts the quality of a protein model based on the number of structural features. - Format of output of the ProQ protein model quality predictor. - - - - - - - - - - SMART domain assignment report format - - beta12orEarlier - true - Format of SMART domain assignment data. - The SMART output file includes data on genetically mobile domains / analysis of domain architectures, including phyletic distributions, functional class, tertiary structures and functionally important residues. - beta12orEarlier - - - - - - - - - - BIND entry format - - Entry format for the BIND database of protein interaction. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - IntAct entry format - - beta12orEarlier - beta12orEarlier - Entry format for the IntAct database of protein interaction. - true - - - - - - - - - - InterPro entry format - - Entry format for the InterPro database of protein signatures (sequence classifiers) and classified sequences. - true - beta12orEarlier - This includes signature metadata, sequence references and a reference to the signature itself. There is normally a header (entry accession numbers and name), abstract, taxonomy information, example proteins etc. Each entry also includes a match list which give a number of different views of the signature matches for the sequences in each InterPro entry. - beta12orEarlier - - - - - - - - - - InterPro entry abstract format - - true - beta12orEarlier - References are included and a functional inference is made where possible. - beta12orEarlier - Entry format for the textual abstract of signatures in an InterPro entry and its protein matches. - - - - - - - - - - Gene3D entry format - - Entry format for the Gene3D protein secondary database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PIRSF entry format - - beta12orEarlier - Entry format for the PIRSF protein secondary database. 
- true - beta12orEarlier - - - - - - - - - - PRINTS entry format - - beta12orEarlier - beta12orEarlier - true - Entry format for the PRINTS protein secondary database. - - - - - - - - - - Panther Families and HMMs entry format - - beta12orEarlier - beta12orEarlier - Entry format for the Panther library of protein families and subfamilies. - true - - - - - - - - - - Pfam entry format - - Entry format for the Pfam protein secondary database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - SMART entry format - - true - beta12orEarlier - Entry format for the SMART protein secondary database. - beta12orEarlier - - - - - - - - - - Superfamily entry format - - Entry format for the Superfamily protein secondary database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - TIGRFam entry format - - beta12orEarlier - true - Entry format for the TIGRFam protein secondary database. - beta12orEarlier - - - - - - - - - - ProDom entry format - - Entry format for the ProDom protein domain classification database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FSSP entry format - - Entry format for the FSSP database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - findkm - - - beta12orEarlier - A report format for the kinetics of enzyme-catalysed reaction(s) in a format generated by EMBOSS findkm. This includes Michaelis Menten plot, Hanes Woolf plot, Michaelis Menten constant (Km) and maximum velocity (Vmax). - - - - - - - - - - Ensembl gene report format - - beta12orEarlier - Entry format of Ensembl genome database. - beta12orEarlier - true - - - - - - - - - - DictyBase gene report format - - true - beta12orEarlier - Entry format of DictyBase genome database. - beta12orEarlier - - - - - - - - - - CGD gene report format - - beta12orEarlier - true - beta12orEarlier - Entry format of Candida Genome database. - - - - - - - - - - DragonDB gene report format - - beta12orEarlier - Entry format of DragonDB genome database. 
- beta12orEarlier - true - - - - - - - - - - EcoCyc gene report format - - Entry format of EcoCyc genome database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FlyBase gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of FlyBase genome database. - - - - - - - - - - Gramene gene report format - - beta12orEarlier - beta12orEarlier - Entry format of Gramene genome database. - true - - - - - - - - - - KEGG GENES gene report format - - true - beta12orEarlier - Entry format of KEGG GENES genome database. - beta12orEarlier - - - - - - - - - - MaizeGDB gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Maize genetics and genomics database (MaizeGDB). - - - - - - - - - - MGD gene report format - - Entry format of the Mouse Genome Database (MGD). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RGD gene report format - - true - beta12orEarlier - Entry format of the Rat Genome Database (RGD). - beta12orEarlier - - - - - - - - - - SGD gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of the Saccharomyces Genome Database (SGD). - - - - - - - - - - GeneDB gene report format - - Entry format of the Sanger GeneDB genome database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - TAIR gene report format - - beta12orEarlier - beta12orEarlier - Entry format of The Arabidopsis Information Resource (TAIR) genome database. - true - - - - - - - - - - WormBase gene report format - - Entry format of the WormBase genomes database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - ZFIN gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Zebrafish Information Network (ZFIN) genome database. - - - - - - - - - - TIGR gene report format - - true - Entry format of the TIGR genome database. 
- beta12orEarlier - beta12orEarlier - - - - - - - - - - dbSNP polymorphism report format - - beta12orEarlier - Entry format for the dbSNP database. - true - beta12orEarlier - - - - - - - - - - OMIM entry format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the OMIM database of genotypes and phenotypes. - - - - - - - - - - HGVbase entry format - - true - Format of a record from the HGVbase database of genotypes and phenotypes. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HIVDB entry format - - beta12orEarlier - beta12orEarlier - true - Format of a record from the HIVDB database of genotypes and phenotypes. - - - - - - - - - - KEGG DISEASE entry format - - beta12orEarlier - Format of an entry from the KEGG DISEASE database. - true - beta12orEarlier - - - - - - - - - - Primer3 primer - - - Report format on PCR primers and hybridization oligos as generated by Whitehead primer3 program. - beta12orEarlier - - - - - - - - - - ABI - - - A format of raw sequence read data from an Applied Biosystems sequencing machine. - beta12orEarlier - - - - - - - - - - mira - - - Format of MIRA sequence trace information file. - beta12orEarlier - - - - - - - - - - CAF - - - Common Assembly Format (CAF). A sequence assembly format including contigs, base-call qualities, and other metadata. - beta12orEarlier - - - - - - - - - - - - exp - - - Sequence assembly project file EXP format. - beta12orEarlier - - - - - - - - - - SCF - - - Staden Chromatogram Files format (SCF) of base-called sequence reads, qualities, and other metadata. - beta12orEarlier - - - - - - - - - - - - PHD - - - beta12orEarlier - PHD sequence trace format to store serialised chromatogram data (reads). - - - - - - - - - - - - dat - - - - - - - - - beta12orEarlier - Format of Affymetrix data file of raw image data. 
- Affymetrix image data file format - - - - - - - - - - cel - - - - - - - - - beta12orEarlier - Affymetrix probe raw data format - Format of Affymetrix data file of information about (raw) expression levels of the individual probes. - - - - - - - - - - affymetrix - - - Format of affymetrix gene cluster files (hc-genes.txt, hc-chips.txt) from hierarchical clustering. - beta12orEarlier - - - - - - - - - - ArrayExpress entry format - - beta12orEarlier - true - Entry format for the ArrayExpress microarrays database. - beta12orEarlier - - - - - - - - - - affymetrix-exp - - - Affymetrix data file format for information about experimental conditions and protocols. - Affymetrix experimental conditions data file format - beta12orEarlier - - - - - - - - - - CHP - - - - - - - - - Affymetrix probe normalised data format - beta12orEarlier - Format of Affymetrix data file of information about (normalised) expression levels of the individual probes. - - - - - - - - - - EMDB entry format - - beta12orEarlier - Format of an entry from the Electron Microscopy DataBase (EMDB). - true - beta12orEarlier - - - - - - - - - - KEGG PATHWAY entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PATHWAY database of pathway maps for molecular interactions and reaction networks. - true - - - - - - - - - - MetaCyc entry format - - true - beta12orEarlier - The format of an entry from the MetaCyc metabolic pathways database. - beta12orEarlier - - - - - - - - - - HumanCyc entry format - - The format of a report from the HumanCyc metabolic pathways database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - INOH entry format - - beta12orEarlier - true - The format of an entry from the INOH signal transduction pathways database. - beta12orEarlier - - - - - - - - - - PATIKA entry format - - beta12orEarlier - The format of an entry from the PATIKA biological pathways database. 
- beta12orEarlier - true - - - - - - - - - - Reactome entry format - - beta12orEarlier - The format of an entry from the reactome biological pathways database. - true - beta12orEarlier - - - - - - - - - - aMAZE entry format - - beta12orEarlier - true - The format of an entry from the aMAZE biological pathways and molecular interactions database. - beta12orEarlier - - - - - - - - - - CPDB entry format - - The format of an entry from the CPDB database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Panther Pathways entry format - - beta12orEarlier - true - beta12orEarlier - The format of an entry from the Panther Pathways database. - - - - - - - - - - Taverna workflow format - - - Format of Taverna workflows. - beta12orEarlier - - - - - - - - - - BioModel mathematical model format - - beta12orEarlier - beta12orEarlier - Format of mathematical models from the BioModel database. - true - Models are annotated and linked to relevant data resources, such as publications, databases of compounds and pathways, controlled vocabularies, etc. - - - - - - - - - - KEGG LIGAND entry format - - The format of an entry from the KEGG LIGAND chemical database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - KEGG COMPOUND entry format - - beta12orEarlier - The format of an entry from the KEGG COMPOUND database. - true - beta12orEarlier - - - - - - - - - - KEGG PLANT entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PLANT database. - true - - - - - - - - - - KEGG GLYCAN entry format - - true - beta12orEarlier - The format of an entry from the KEGG GLYCAN database. - beta12orEarlier - - - - - - - - - - PubChem entry format - - beta12orEarlier - The format of an entry from PubChem. - true - beta12orEarlier - - - - - - - - - - ChemSpider entry format - - beta12orEarlier - The format of an entry from a database of chemical structures and property predictions. 
- beta12orEarlier - true - - - - - - - - - - ChEBI entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from Chemical Entities of Biological Interest (ChEBI). - true - ChEBI includes an ontological classification defining relations between entities or classes of entities. - - - - - - - - - - MSDchem ligand dictionary entry format - - The format of an entry from the MSDchem ligand dictionary. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - HET group dictionary entry format - - - The format of an entry from the HET group dictionary (HET groups from PDB files). - beta12orEarlier - - - - - - - - - - KEGG DRUG entry format - - The format of an entry from the KEGG DRUG database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PubMed citation - - - beta12orEarlier - Format of bibliographic reference as used by the PubMed database. - - - - - - - - - - Medline Display Format - - - beta12orEarlier - Format for abstracts of scientific articles from the Medline database. - Bibliographic reference information including citation information is included - - - - - - - - - - CiteXplore-core - - - beta12orEarlier - CiteXplore 'core' citation format including title, journal, authors and abstract. - - - - - - - - - - CiteXplore-all - - - CiteXplore 'all' citation format includes all known details such as Mesh terms and cross-references. - beta12orEarlier - - - - - - - - - - pmc - - - beta12orEarlier - Article format of the PubMed Central database. - - - - - - - - - - iHOP text mining abstract format - - - beta12orEarlier - iHOP abstract format. - - - - - - - - - - Oscar3 - - - Oscar 3 performs chemistry-specific parsing of chemical documents. It attempts to identify chemical names, ontology concepts and chemical data from a document. - Text mining abstract format from the Oscar 3 application. 
- beta12orEarlier - - - - - - - - - - PDB atom record format - - true - beta13 - beta12orEarlier - Format of an ATOM record (describing data for an individual atom) from a PDB file. - - - - - - - - - - CATH chain report format - - The report (for example http://www.cathdb.info/chain/1cukA) includes chain identifiers, domain identifiers and CATH codes for domains in a given protein chain. - beta12orEarlier - Format of CATH domain classification information for a polypeptide chain. - beta12orEarlier - true - - - - - - - - - - CATH PDB report format - - beta12orEarlier - beta12orEarlier - true - Format of CATH domain classification information for a protein PDB file. - The report (for example http://www.cathdb.info/pdb/1cuk) includes chain identifiers, domain identifiers and CATH codes for domains in a given PDB file. - - - - - - - - - - NCBI gene report format - - true - Entry (gene) format of the NCBI database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GeneIlluminator gene report format - - Report format for biological functions associated with a gene name and its alternative names (synonyms, homonyms), as generated by the GeneIlluminator service. - This includes a gene name and abbreviation of the name which may be in a name space indicating the gene status and relevant organisation. - beta12orEarlier - beta12orEarlier - Moby:GI_Gene - true - - - - - - - - - - BacMap gene card format - - Format of a report on the DNA and protein sequences for a given gene label from a bacterial chromosome maps from the BacMap database. - true - beta12orEarlier - beta12orEarlier - Moby:BacMapGeneCard - - - - - - - - - - ColiCard report format - - Format of a report on Escherichia coli genes, proteins and molecules from the CyberCell Database (CCDB). - true - beta12orEarlier - Moby:ColiCard - beta12orEarlier - - - - - - - - - - PlasMapper TextMap - - - beta12orEarlier - Map of a plasmid (circular DNA) in PlasMapper TextMap format. 
- - - - - - - - - - newick - - - nh - beta12orEarlier - Phylogenetic tree Newick (text) format. - - - - - - - - - - TreeCon format - - - beta12orEarlier - Phylogenetic tree TreeCon (text) format. - - - - - - - - - - Nexus format - - - Phylogenetic tree Nexus (text) format. - beta12orEarlier - - - - - - - - - - Format - - - - http://en.wikipedia.org/wiki/File_format - http://purl.org/biotop/biotop.owl#MachineLanguage - File format - Data model - http://www.onto-med.de/ontologies/gfo.owl#Symbol_structure - Exchange format - "http://purl.obolibrary.org/obo/IAO_0000098" - http://semanticscience.org/resource/SIO_000612 - http://semanticscience.org/resource/SIO_000618 - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - "http://purl.org/dc/elements/1.1/format" - http://wsio.org/compression_004 - A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere. - http://en.wikipedia.org/wiki/List_of_file_formats - http://www.ifomis.org/bfo/1.1/snap#Quality - Data format - http://purl.org/biotop/biotop.owl#Quality - The main focus in EDAM lies on formats as means of structuring data exchanged between different tools or resources. The serialisation, compression, or encoding of concrete data formats/models is not in scope of EDAM. Format 'is format of' Data. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - - - - - Data model - A defined data format has its implicit or explicit data model, and EDAM does not distinguish the two. Some data models however do not have any standard way of serialisation into an exchange format, and those are thus not considered formats in EDAM. (Remark: even broader - or closely related - term to 'Data model' would be an 'Information model'.) - - - - - File format - File format denotes only formats of a computer file, but the same formats apply also to data blobs or exchanged messages. 
- - - - - - - - - - Atomic data format - - beta12orEarlier - beta13 - Data format for an individual atom. - true - - - - - - - - - - Sequence record format - - - - - - - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence feature annotation format - - - - - - - - beta12orEarlier - Data format for molecular sequence feature information. - - - - - - - - - - Alignment format - - - - - - - - Data format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - acedb - - beta12orEarlier - ACEDB sequence format. - - - - - - - - - - clustal sequence format - - true - beta12orEarlier - Clustalw output format. - beta12orEarlier - - - - - - - - - - codata - - - Codata entry format. - beta12orEarlier - - - - - - - - - - dbid - - beta12orEarlier - Fasta format variant with database name before ID. - - - - - - - - - - EMBL format - - - EMBL entry format. - EMBL sequence format - EMBL - beta12orEarlier - - - - - - - - - - Staden experiment format - - - Staden experiment file format. - beta12orEarlier - - - - - - - - - - FASTA - - - beta12orEarlier - FASTA format - FASTA sequence format - FASTA format including NCBI-style IDs. - - - - - - - - - - FASTQ - - FASTQ short read format ignoring quality scores. - beta12orEarlier - FASTAQ - fq - - - - - - - - - - FASTQ-illumina - - FASTQ Illumina 1.3 short read format. - beta12orEarlier - - - - - - - - - - FASTQ-sanger - - FASTQ short read format with phred quality. - beta12orEarlier - - - - - - - - - - FASTQ-solexa - - FASTQ Solexa/Illumina 1.0 short read format. - beta12orEarlier - - - - - - - - - - fitch program - - - Fitch program format. - beta12orEarlier - - - - - - - - - - GCG - - - GCG SSF - beta12orEarlier - GCG SSF (single sequence file) file format. - GCG sequence file format. - - - - - - - - - - GenBank format - - - beta12orEarlier - Genbank entry format. - GenBank - - - - - - - - - - genpept - - beta12orEarlier - Genpept protein entry format. 
- Currently identical to refseqp format - - - - - - - - - - GFF2-seq - - - GFF feature file format with sequence in the header. - beta12orEarlier - - - - - - - - - - GFF3-seq - - - GFF3 feature file format with sequence. - beta12orEarlier - - - - - - - - - - giFASTA format - - FASTA sequence format including NCBI-style GIs. - beta12orEarlier - - - - - - - - - - hennig86 - - - beta12orEarlier - Hennig86 output sequence format. - - - - - - - - - - ig - - - Intelligenetics sequence format. - beta12orEarlier - - - - - - - - - - igstrict - - - beta12orEarlier - Intelligenetics sequence format (strict version). - - - - - - - - - - jackknifer - - - Jackknifer interleaved and non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - mase format - - - beta12orEarlier - Mase program sequence format. - - - - - - - - - - mega-seq - - - beta12orEarlier - Mega interleaved and non-interleaved sequence format. - - - - - - - - - - GCG MSF - - beta12orEarlier - GCG MSF (multiple sequence file) file format. - MSF - - - - - - - - - - nbrf/pir - - NBRF/PIR entry sequence format. - nbrf - beta12orEarlier - pir - - - - - - - - - - nexus-seq - - - - beta12orEarlier - Nexus/paup interleaved sequence format. - - - - - - - - - - pdbatom - - - - pdb format in EMBOSS. - beta12orEarlier - PDB sequence format (ATOM lines). - - - - - - - - - - pdbatomnuc - - - - beta12orEarlier - pdbnuc format in EMBOSS. - PDB nucleotide sequence format (ATOM lines). - - - - - - - - - - pdbseqresnuc - - - - pdbnucseq format in EMBOSS. - PDB nucleotide sequence format (SEQRES lines). - beta12orEarlier - - - - - - - - - - pdbseqres - - - - PDB sequence format (SEQRES lines). - beta12orEarlier - pdbseq format in EMBOSS. - - - - - - - - - - Pearson format - - beta12orEarlier - Plain old FASTA sequence format (unspecified format for IDs). - - - - - - - - - - phylip sequence format - - beta12orEarlier - Phylip interleaved sequence format. 
- true - beta12orEarlier - - - - - - - - - - phylipnon sequence format - - true - Phylip non-interleaved sequence format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - raw - - - beta12orEarlier - Raw sequence format with no non-sequence characters. - - - - - - - - - - refseqp - - - beta12orEarlier - Refseq protein entry sequence format. - Currently identical to genpept format - - - - - - - - - - selex sequence format - - beta12orEarlier - true - beta12orEarlier - Selex sequence format. - - - - - - - - - - Staden format - - - beta12orEarlier - Staden suite sequence format. - - - - - - - - - - - - - - Stockholm format - - - Stockholm multiple sequence alignment format (used by Pfam and Rfam). - beta12orEarlier - - - - - - - - - - - - strider format - - - DNA strider output sequence format. - beta12orEarlier - - - - - - - - - - UniProtKB format - - UniProt format - SwissProt format - beta12orEarlier - UniProtKB entry sequence format. - - - - - - - - - - plain text format (unformatted) - - beta12orEarlier - Plain text sequence format (essentially unformatted). - - - - - - - - - - treecon sequence format - - true - beta12orEarlier - beta12orEarlier - Treecon output sequence format. - - - - - - - - - - ASN.1 sequence format - - - NCBI ASN.1-based sequence format. - beta12orEarlier - - - - - - - - - - DAS format - - - das sequence format - DAS sequence (XML) format (any type). - beta12orEarlier - - - - - - - - - - dasdna - - - beta12orEarlier - DAS sequence (XML) format (nucleotide-only). - The use of this format is deprecated. - - - - - - - - - - debug-seq - - - EMBOSS debugging trace sequence format of full internal data content. - beta12orEarlier - - - - - - - - - - jackknifernon - - - beta12orEarlier - Jackknifer output sequence non-interleaved format. - - - - - - - - - - meganon sequence format - - beta12orEarlier - beta12orEarlier - Mega non-interleaved output sequence format. 
- true - - - - - - - - - - NCBI format - - NCBI FASTA sequence format with NCBI-style IDs. - beta12orEarlier - There are several variants of this. - - - - - - - - - - nexusnon - - - - Nexus/paup non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - GFF2 - - beta12orEarlier - General Feature Format (GFF) of sequence features. - - - - - - - - - - - - GFF3 - - beta12orEarlier - Generic Feature Format version 3 (GFF3) of sequence features. - - - - - - - - - - - - pir - - true - 1.7 - PIR feature format. - beta12orEarlier - - - - - - - - - - swiss feature - - true - Swiss-Prot feature format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - DASGFF - - - DAS GFF (XML) feature format. - das feature - DASGFF feature - beta12orEarlier - - - - - - - - - - debug-feat - - - EMBOSS debugging trace feature format of full internal data content. - beta12orEarlier - - - - - - - - - - EMBL feature - - beta12orEarlier - EMBL feature format. - true - beta12orEarlier - - - - - - - - - - GenBank feature - - beta12orEarlier - Genbank feature format. - beta12orEarlier - true - - - - - - - - - - ClustalW format - - - clustal - beta12orEarlier - ClustalW format for (aligned) sequences. - - - - - - - - - - debug - - - EMBOSS alignment format for debugging trace of full internal data content. - beta12orEarlier - - - - - - - - - - FASTA-aln - - - beta12orEarlier - Fasta format for (aligned) sequences. - - - - - - - - - - markx0 - - beta12orEarlier - Pearson MARKX0 alignment format. - - - - - - - - - - markx1 - - Pearson MARKX1 alignment format. - beta12orEarlier - - - - - - - - - - markx10 - - beta12orEarlier - Pearson MARKX10 alignment format. - - - - - - - - - - markx2 - - beta12orEarlier - Pearson MARKX2 alignment format. - - - - - - - - - - markx3 - - beta12orEarlier - Pearson MARKX3 alignment format. - - - - - - - - - - match - - - Alignment format for start and end of matches between sequence pairs. 
- beta12orEarlier - - - - - - - - - - mega - - Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - meganon - - Mega non-interleaved format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - msf alignment format - - true - beta12orEarlier - beta12orEarlier - MSF format for (aligned) sequences. - - - - - - - - - - nexus alignment format - - Nexus/paup format for (aligned) sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - nexusnon alignment format - - beta12orEarlier - true - Nexus/paup non-interleaved format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - pair - - EMBOSS simple sequence pair alignment format. - beta12orEarlier - - - - - - - - - - PHYLIP format - - phy - beta12orEarlier - ph - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - PHYLIP interleaved format - Phylip format for (aligned) sequences. - - - - - - - - - - phylipnon - - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - beta12orEarlier - PHYLIP sequential format - Phylip non-interleaved format for (aligned) sequences. - - - - - - - - - - scores format - - - Alignment format for score values for pairs of sequences. - beta12orEarlier - - - - - - - - - - selex - - - - beta12orEarlier - SELEX format for (aligned) sequences. - - - - - - - - - - EMBOSS simple format - - - EMBOSS simple multiple alignment format. - beta12orEarlier - - - - - - - - - - srs format - - - beta12orEarlier - Simple multiple sequence (alignment) format for SRS. - - - - - - - - - - srspair - - - beta12orEarlier - Simple sequence pair (alignment) format for SRS. - - - - - - - - - - T-Coffee format - - - T-Coffee program alignment format. - beta12orEarlier - - - - - - - - - - TreeCon-seq - - - - Treecon format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format - - - - - - - - Data format for a phylogenetic tree. 
- beta12orEarlier - - - - - - - - - - Biological pathway or network format - - - - - - - - beta12orEarlier - Data format for a biological pathway or network. - - - - - - - - - - Sequence-profile alignment format - - - - - - - - beta12orEarlier - Data format for a sequence-profile alignment. - - - - - - - - - - Sequence-profile alignment (HMM) format - - beta12orEarlier - beta12orEarlier - true - Data format for a sequence-HMM profile alignment. - - - - - - - - - - Amino acid index format - - - - - - - - Data format for an amino acid index. - beta12orEarlier - - - - - - - - - - Article format - - - - - - - - beta12orEarlier - Literature format - Data format for a full-text scientific article. - - - - - - - - - - Text mining report format - - - - - - - - beta12orEarlier - Data format for an abstract (report) from text mining. - - - - - - - - - - Enzyme kinetics report format - - - - - - - - Data format for reports on enzyme kinetics. - beta12orEarlier - - - - - - - - - - Small molecule report format - - - - - - - - beta12orEarlier - Chemical compound annotation format - Format of a report on a chemical compound. - - - - - - - - - - Gene annotation format - - - - - - - - Format of a report on a particular locus, gene, gene system or groups of genes. - beta12orEarlier - Gene features format - - - - - - - - - - Workflow format - - beta12orEarlier - Format of a workflow. - - - - - - - - - - Tertiary structure format - - beta12orEarlier - Data format for a molecular tertiary structure. - - - - - - - - - - Biological model format - - Data format for a biological model. - beta12orEarlier - 1.2 - true - - - - - - - - - - Chemical formula format - - - - - - - - beta12orEarlier - Text format of a chemical formula. - - - - - - - - - - Phylogenetic character data format - - - - - - - - beta12orEarlier - Format of raw (unplotted) phylogenetic data. 
- - - - - - - - - - Phylogenetic continuous quantitative character format - - - - - - - - Format of phylogenetic continuous quantitative character data. - beta12orEarlier - - - - - - - - - - Phylogenetic discrete states format - - - - - - - - Format of phylogenetic discrete states data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (cliques) format - - - - - - - - Format of phylogenetic cliques data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (invariants) format - - - - - - - - beta12orEarlier - Format of phylogenetic invariants data. - - - - - - - - - - Electron microscopy model format - - beta12orEarlier - true - beta12orEarlier - Annotation format for electron microscopy models. - - - - - - - - - - Phylogenetic tree report (tree distances) format - - - - - - - - Format for phylogenetic tree distance data. - beta12orEarlier - - - - - - - - - - Polymorphism report format - - beta12orEarlier - true - 1.0 - Format for sequence polymorphism data. - - - - - - - - - - Protein family report format - - - - - - - - beta12orEarlier - Format for reports on a protein family. - - - - - - - - - - Protein interaction format - - - - - - - - beta12orEarlier - Format for molecular interaction data. - Molecular interaction format - - - - - - - - - - Sequence assembly format - - - - - - - - beta12orEarlier - Format for sequence assembly data. - - - - - - - - - - Microarray experiment data format - - Format for information about a microarray experimental per se (not the data generated from that experiment). - beta12orEarlier - - - - - - - - - - Sequence trace format - - - - - - - - Format for sequence trace data (i.e. including base call information). - beta12orEarlier - - - - - - - - - - Gene expression report format - - - - - - - - Gene expression data format - Format of a file of gene expression data, e.g. a gene expression matrix or profile. 
- beta12orEarlier - - - - - - - - - - Genotype and phenotype annotation format - - beta12orEarlier - true - Format of a report on genotype / phenotype information. - beta12orEarlier - - - - - - - - - - Map format - - - - - - - - Format of a map of (typically one) molecular sequence annotated with features. - beta12orEarlier - - - - - - - - - - Nucleic acid features (primers) format - - beta12orEarlier - Format of a report on PCR primers or hybridization oligos in a nucleic acid sequence. - - - - - - - - - - Protein report format - - - - - - - - Format of a report of general information about a specific protein. - beta12orEarlier - - - - - - - - - - Protein report (enzyme) format - - Format of a report of general information about a specific enzyme. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix format - - - - - - - - beta12orEarlier - Format of a matrix of 3D-1D scores (amino acid environment probabilities). - - - - - - - - - - Protein structure report (quality evaluation) format - - - - - - - - Format of a report on the quality of a protein three-dimensional model. - beta12orEarlier - - - - - - - - - - Database hits (sequence) format - - - - - - - - Format of a report on sequence hits and associated data from searching a sequence database. - beta12orEarlier - - - - - - - - - - Sequence distance matrix format - - - - - - - - beta12orEarlier - Format of a matrix of genetic distances between molecular sequences. - - - - - - - - - - Sequence motif format - - - - - - - - Format of a sequence motif. - beta12orEarlier - - - - - - - - - - Sequence profile format - - - - - - - - Format of a sequence profile. - beta12orEarlier - - - - - - - - - - Hidden Markov model format - - - - - - - - beta12orEarlier - Format of a hidden Markov model. - - - - - - - - - - Dirichlet distribution format - - - - - - - - Data format of a dirichlet distribution. 
- beta12orEarlier - - - - - - - - - - HMM emission and transition counts format - - - - - - - - - - - - - - Data format for the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - RNA secondary structure format - - - - - - - - beta12orEarlier - Format for secondary structure (predicted or real) of an RNA molecule. - - - - - - - - - - Protein secondary structure format - - Format for secondary structure (predicted or real) of a protein molecule. - beta12orEarlier - - - - - - - - - - Sequence range format - - - - - - - - beta12orEarlier - Format used to specify range(s) of sequence positions. - - - - - - - - - - pure - - - Alphabet for molecular sequence with possible unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unpure - - - Alphabet for a molecular sequence with possible unknown positions but possibly with non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous sequence - - - Alphabet for a molecular sequence with possible unknown positions but without ambiguity characters. - beta12orEarlier - - - - - - - - - - ambiguous - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions and possible ambiguity characters. - - - - - - - - - - Sequence features (repeats) format - - beta12orEarlier - Format used for map of repeats in molecular (typically nucleotide) sequences. - - - - - - - - - - Nucleic acid features (restriction sites) format - - beta12orEarlier - Format used for report on restriction enzyme recognition sites in nucleotide sequences. - - - - - - - - - - Gene features (coding region) format - - beta12orEarlier - Format used for report on coding regions in nucleotide sequences. - true - 1.10 - - - - - - - - - - Sequence cluster format - - - - - - - - beta12orEarlier - Format used for clusters of molecular sequences. - - - - - - - - - - Sequence cluster format (protein) - - Format used for clusters of protein sequences. 
- beta12orEarlier - - - - - - - - - - Sequence cluster format (nucleic acid) - - Format used for clusters of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Gene cluster format - - true - beta13 - beta12orEarlier - Format used for clusters of genes. - - - - - - - - - - EMBL-like (text) - - - This concept may be used for the many non-standard EMBL-like text formats. - beta12orEarlier - A text format resembling EMBL entry format. - - - - - - - - - - FASTQ-like format (text) - - - A text format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - EMBLXML - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - cdsxml - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - insdxml - - beta12orEarlier - XML format for EMBL entries. - - - - - - - - - - geneseq - - Geneseq sequence format. - beta12orEarlier - - - - - - - - - - UniProt-like (text) - - - A text sequence format resembling uniprotkb entry format. - beta12orEarlier - - - - - - - - - - UniProt format - - beta12orEarlier - true - UniProt entry sequence format. - 1.8 - - - - - - - - - - ipi - - 1.8 - beta12orEarlier - ipi sequence format. - true - - - - - - - - - - medline - - - Abstract format used by MedLine database. - beta12orEarlier - - - - - - - - - - Ontology format - - - - - - - - Format used for ontologies. - beta12orEarlier - - - - - - - - - - OBO format - - beta12orEarlier - A serialisation format conforming to the Open Biomedical Ontologies (OBO) model. - - - - - - - - - - OWL format - - A serialisation format conforming to the Web Ontology Language (OWL) model. - beta12orEarlier - - - - - - - - - - FASTA-like (text) - - - This concept may also be used for the many non-standard FASTA-like formats. - http://filext.com/file-extension/FASTA - beta12orEarlier - A text format resembling FASTA format. 
- - - - - - - - - - Sequence record full format - - 1.8 - beta12orEarlier - Data format for a molecular sequence record, typically corresponding to a full entry from a molecular sequence database. - true - - - - - - - - - - Sequence record lite format - - true - 1.8 - beta12orEarlier - Data format for a molecular sequence record 'lite', typically molecular sequence and minimal metadata, such as an identifier of the sequence and/or a comment. - - - - - - - - - - EMBL format (XML) - - beta12orEarlier - An XML format for EMBL entries. - This is a placeholder for other more specific concepts. It should not normally be used for annotation. - - - - - - - - - - GenBank-like format (text) - - - A text format resembling GenBank entry (plain text) format. - This concept may be used for the non-standard GenBank-like text formats. - beta12orEarlier - - - - - - - - - - Sequence feature table format (text) - - Text format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Strain data format - - Format of a report on organism strain data / cell line. - beta12orEarlier - true - 1.0 - - - - - - - - - - CIP strain data format - - Format for a report of strain data as used for CIP database entries. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - phylip property values - - true - PHYLIP file format for phylogenetic property data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - STRING entry format (HTML) - - beta12orEarlier - true - beta12orEarlier - Entry format (HTML) for the STRING database of protein interaction. - - - - - - - - - - STRING entry format (XML) - - - Entry format (XML) for the STRING database of protein interaction. - beta12orEarlier - - - - - - - - - - GFF - - - GFF feature format (of indeterminate version). - beta12orEarlier - - - - - - - - - - GTF - - Gene Transfer Format (GTF), a restricted version of GFF. - beta12orEarlier - - - - - - - - - - - - - FASTA-HTML - - - FASTA format wrapped in HTML elements. 
- beta12orEarlier - - - - - - - - - - EMBL-HTML - - - EMBL entry format wrapped in HTML elements. - beta12orEarlier - - - - - - - - - - BioCyc enzyme report format - - true - beta12orEarlier - beta12orEarlier - Format of an entry from the BioCyc enzyme database. - - - - - - - - - - ENZYME enzyme report format - - Format of an entry from the Enzyme nomenclature database (ENZYME). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - PseudoCAP gene report format - - true - beta12orEarlier - beta12orEarlier - Format of a report on a gene from the PseudoCAP database. - - - - - - - - - - GeneCards gene report format - - Format of a report on a gene from the GeneCards database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Textual format - - http://filext.com/file-extension/TSV - http://www.iana.org/assignments/media-types/text/plain - Textual format. - Data in text format can be compressed into binary format, or can be a value of an XML element or attribute. Markup formats are not considered textual (or more precisely, not plain-textual). - txt - http://filext.com/file-extension/TXT - Plain text - http://www.iana.org/assignments/media-types/media-types.xhtml#text - beta12orEarlier - - - - - - - - - - HTML - - - - - - - - HTML format. - beta12orEarlier - http://filext.com/file-extension/HTML - Hypertext Markup Language - - - - - - - - - - XML - - Data in XML format can be serialised into text, or binary format. - beta12orEarlier - eXtensible Markup Language (XML) format. - xml - - Extensible Markup Language - - - - - - - - - - - - - Binary format - - Only specific native binary formats are listed under 'Binary format' in EDAM. Generic binary formats - such as any data being zipped, or any XML data being serialised into the Efficient XML Interchange (EXI) format - are not modelled in EDAM. Refer to http://wsio.org/compression_004. - beta12orEarlier - Binary format. 
- - - - - - - - - - URI format - - beta13 - true - Typical textual representation of a URI. - beta12orEarlier - - - - - - - - - - NCI-Nature pathway entry format - - beta12orEarlier - true - The format of an entry from the NCI-Nature pathways database. - beta12orEarlier - - - - - - - - - - Format (typed) - - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. The concept branch under 'Format (typed)' provides an alternative organisation of the concepts nested under the other top-level branches ('Binary', 'HTML', 'RDF', 'Text' and 'XML'. All concepts under here are already included under those branches. - beta12orEarlier - A broad class of format distinguished by the scientific nature of the data that is identified. - - - - - - - - - - BioXSD - - - - - - - - - - - - - - - - - - - - - - - - BioXSD XML format - beta12orEarlier - BioXSD XML format of basic bioinformatics types of data (sequence records, alignments, feature records, references to resources, and more). - - - - - - - - - - - - RDF format - - - beta12orEarlier - A serialisation format conforming to the Resource Description Framework (RDF) model. - - - - - - - - - - GenBank-HTML - - - beta12orEarlier - Genbank entry format wrapped in HTML elements. - - - - - - - - - - Protein features (domains) format - - beta12orEarlier - true - beta12orEarlier - Format of a report on protein features (domain composition). - - - - - - - - - - EMBL-like format - - beta12orEarlier - A format resembling EMBL entry (plain text) format. - This concept may be used for the many non-standard EMBL-like formats. - - - - - - - - - - FASTQ-like format - - A format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - FASTA-like - - This concept may be used for the many non-standard FASTA-like formats. - beta12orEarlier - A format resembling FASTA format. 
- - - - - - - - - - uniprotkb-like format - - - beta12orEarlier - A sequence format resembling uniprotkb entry format. - - - - - - - - - - Sequence feature table format - - - - - - - - Format for a sequence feature table. - beta12orEarlier - - - - - - - - - - OBO - - - beta12orEarlier - OBO ontology text format. - - - - - - - - - - OBO-XML - - - beta12orEarlier - OBO ontology XML format. - - - - - - - - - - Sequence record format (text) - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence record format (XML) - - beta12orEarlier - Data format for a molecular sequence record. - - - - - - - - - - Sequence feature table format (XML) - - XML format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Alignment format (text) - - Text format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Alignment format (XML) - - XML format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format (text) - - beta12orEarlier - Text format for a phylogenetic tree. - - - - - - - - - - Phylogenetic tree format (XML) - - beta12orEarlier - XML format for a phylogenetic tree. - - - - - - - - - - EMBL-like (XML) - - - An XML format resembling EMBL entry format. - This concept may be used for the any non-standard EMBL-like XML formats. - beta12orEarlier - - - - - - - - - - GenBank-like format - - A format resembling GenBank entry (plain text) format. - beta12orEarlier - This concept may be used for the non-standard GenBank-like formats. - - - - - - - - - - STRING entry format - - beta12orEarlier - Entry format for the STRING database of protein interaction. - beta12orEarlier - true - - - - - - - - - - Sequence assembly format (text) - - beta12orEarlier - Text format for sequence assembly data. - - - - - - - - - - Amino acid identifier format - - beta13 - Text format (representation) of amino acid residues. 
- true - beta12orEarlier - - - - - - - - - - completely unambiguous - - - beta12orEarlier - Alphabet for a molecular sequence without any unknown positions or ambiguity characters. - - - - - - - - - - completely unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure nucleotide - - - Alphabet for a nucleotide sequence (characters ACGTU only) without unknown positions, ambiguity or non-sequence characters . - beta12orEarlier - - - - - - - - - - completely unambiguous pure dna - - - beta12orEarlier - Alphabet for a DNA sequence (characters ACGT only) without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) without unknown positions, ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - Raw sequence format - - - - - - - - http://www.onto-med.de/ontologies/gfo.owl#Symbol_sequence - beta12orEarlier - Format of a raw molecular sequence (i.e. the alphabet used). - - - - - - - - - - BAM - - - - beta12orEarlier - BAM format, the binary, BGZF-formatted compressed version of SAM format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). May contain base-call and alignment qualities and other data. - - - - - - - - - - - - SAM - - - - The format supports short and long reads (up to 128Mbp) produced by different sequencing platforms and is used to hold mapped data within the GATK and across the Broad Institute, the Sanger Centre, and throughout the 1000 Genomes project. - beta12orEarlier - Sequence Alignment/Map (SAM) format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). May contain base-call and alignment qualities and other data. 
- - - - - - - - - - - - SBML - - - Systems Biology Markup Language (SBML), the standard XML format for models of biological processes such as for example metabolism, cell signaling, and gene regulation. - beta12orEarlier - - - - - - - - - - - - completely unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - Bibliographic reference format - - - - - - - - - - - - - - Format of a bibliographic reference. - beta12orEarlier - - - - - - - - - - Sequence annotation track format - - - - - - - - Format of a sequence annotation track. - beta12orEarlier - - - - - - - - - - Alignment format (pair only) - - - - - - - - beta12orEarlier - Data format for molecular sequence alignment information that can hold sequence alignment(s) of only 2 sequences. - - - - - - - - - - Sequence variation annotation format - - - - - - - - Format of sequence variation annotation. - beta12orEarlier - - - - - - - - - - markx0 variant - - - Some variant of Pearson MARKX alignment format. - beta12orEarlier - - - - - - - - - - mega variant - - - - Some variant of Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylip format variant - - - - beta12orEarlier - Some variant of Phylip format for (aligned) sequences. - - - - - - - - - - AB1 - - - beta12orEarlier - AB1 binary format of raw DNA sequence reads (output of Applied Biosystems' sequencing analysis software). Contains an electropherogram and the DNA base sequence. - AB1 uses the generic binary Applied Biosystems, Inc. Format (ABIF). - - - - - - - - - - ACE - - - ACE sequence assembly format including contigs, base-call qualities, and other metadata (version Aug 1998 and onwards). 
- beta12orEarlier - - - - - - - - - - - - BED - - - beta12orEarlier - BED detail format includes 2 additional columns (http://genome.ucsc.edu/FAQ/FAQformat#format1.7) and BED 15 includes 3 additional columns for experiment scores (http://genomewiki.ucsc.edu/index.php/Microarray_track). - Browser Extensible Data (BED) format of sequence annotation track, typically to be displayed in a genome browser. - - - - - - - - - - - - bigBed - - - beta12orEarlier - bigBed format for large sequence annotation tracks, similar to textual BED format. - - - - - - - - - - - - WIG - - - Wiggle format (WIG) of a sequence annotation track that consists of a value for each sequence position. Typically to be displayed in a genome browser. - beta12orEarlier - - - - - - - - - - - - bigWig - - - beta12orEarlier - bigWig format for large sequence annotation tracks that consist of a value for each sequence position. Similar to textual WIG format. - - - - - - - - - - - - PSL - - - - PSL format of alignments, typically generated by BLAT or psLayout. Can be displayed in a genome browser like a sequence annotation track. - beta12orEarlier - - - - - - - - - - - - MAF - - - - Multiple Alignment Format (MAF) supporting alignments of whole genomes with rearrangements, directions, multiple pieces to the alignment, and so forth. - Typically generated by Multiz and TBA aligners; can be displayed in a genome browser like a sequence annotation track. This should not be confused with MIRA Assembly Format or Mutation Annotation Format. - beta12orEarlier - - - - - - - - - - - - 2bit - - - beta12orEarlier - 2bit binary format of nucleotide sequences using 2 bits per nucleotide. In addition encodes unknown nucleotides and lower-case 'masking'. - - - - - - - - - - - - - .nib - - - beta12orEarlier - .nib (nibble) binary format of a nucleotide sequence using 4 bits per nucleotide (including unknown) and its lower-case 'masking'. 
- - - - - - - - - - - - genePred - - - genePred table format for gene prediction tracks. - genePred format has 3 main variations (http://genome.ucsc.edu/FAQ/FAQformat#format9 http://www.broadinstitute.org/software/igv/genePred). They reflect UCSC Browser DB tables. - beta12orEarlier - - - - - - - - - - - - pgSnp - - - Personal Genome SNP (pgSnp) format for sequence variation tracks (indels and polymorphisms), supported by the UCSC Genome Browser. - beta12orEarlier - - - - - - - - - - - - axt - - - beta12orEarlier - axt format of alignments, typically produced from BLASTZ. - - - - - - - - - - - - LAV - - - beta12orEarlier - LAV format of alignments generated by BLASTZ and LASTZ. - - - - - - - - - - - - Pileup - - - beta12orEarlier - Pileup format of alignment of sequences (e.g. sequencing reads) to (a) reference sequence(s). Contains aligned bases per base of the reference sequence(s). - - - - - - - - - - - - VCF - - - beta12orEarlier - Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - - SRF - - - Sequence Read Format (SRF) of sequence trace data. Supports submission to the NCBI Short Read Archive. - beta12orEarlier - - - - - - - - - - - - ZTR - - - ZTR format for storing chromatogram data from DNA sequencing instruments. - beta12orEarlier - - - - - - - - - - - - GVF - - - Genome Variation Format (GVF). A GFF3-compatible format with defined header and attribute tags for sequence variation. - beta12orEarlier - - - - - - - - - - - - BCF - - - beta12orEarlier - BCF, the binary version of Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - Matrix format - - - - - - - - Format of a matrix (array) of numerical values. - beta13 - - - - - - - - - - Protein domain classification format - - - - - - - - Format of data concerning the classification of the sequences and/or structures of protein structural domain(s). 
- beta13 - - - - - - - - - - Raw SCOP domain classification format - - Format of raw SCOP domain classification data files. - These are the parsable data files provided by SCOP. - beta13 - - - - - - - - - - Raw CATH domain classification format - - These are the parsable data files provided by CATH. - beta13 - Format of raw CATH domain classification data files. - - - - - - - - - - CATH domain report format - - Format of summary of domain classification information for a CATH domain. - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - - - - - - - - - - SBRML - - - 1.0 - Systems Biology Result Markup Language (SBRML), the standard XML format for simulated or calculated results (e.g. trajectories) of systems biology models. - - - - - - - - - - - - BioPAX - - BioPAX is an exchange format for pathway data, with its data model defined in OWL. - 1.0 - - - - - - - - - - - - EBI Application Result XML - - - - EBI Application Result XML is a format returned by sequence similarity search Web services at EBI. - 1.0 - - - - - - - - - - - - PSI MI XML (MIF) - - - 1.0 - XML Molecular Interaction Format (MIF), standardised by HUPO PSI MI. - MIF - - - - - - - - - - - - phyloXML - - - phyloXML is a standardised XML format for phylogenetic trees, networks, and associated data. - 1.0 - - - - - - - - - - - - NeXML - - - 1.0 - NeXML is a standardised XML format for rich phyloinformatic data. - - - - - - - - - - - - MAGE-ML - - - - - - - - - 1.0 - MAGE-ML XML format for microarray expression data, standardised by MGED (now FGED). - - - - - - - - - - - - MAGE-TAB - - - - - - - - - MAGE-TAB textual format for microarray expression data, standardised by MGED (now FGED). 
- 1.0 - - - - - - - - - - - - GCDML - - - GCDML XML format for genome and metagenome metadata according to MIGS/MIMS/MIMARKS information standards, standardised by the Genomic Standards Consortium (GSC). - 1.0 - - - - - - - - - - - - GTrack - - - 1.0 - GTrack is an optimised tabular format for genome/sequence feature tracks unifying the power of other tabular formats (e.g. GFF3, BED, WIG). - - - - - - - - - - - - Biological pathway or network report format - - - - - - - - Data format for a report of information derived from a biological pathway or network. - beta12orEarlier - - - - - - - - - - Experiment annotation format - - - - - - - - beta12orEarlier - Data format for annotation on a laboratory experiment. - - - - - - - - - - Cytoband format - - - - - - - - - 1.2 - Cytoband format for chromosome cytobands. - Reflects a UCSC Browser DB table. - - - - - - - - - - - - CopasiML - - - - 1.2 - CopasiML, the native format of COPASI. - - - - - - - - - - - - CellML - - - CellML, the format for mathematical models of biological and other networks. - 1.2 - - - - - - - - - - - - - - PSI MI TAB (MITAB) - - - 1.2 - Tabular Molecular Interaction format (MITAB), standardised by HUPO PSI MI. - - - - - - - - - - - - PSI-PAR - - Protein affinity format (PSI-PAR), standardised by HUPO PSI MI. It is compatible with PSI MI XML (MIF) and uses the same XML Schema. - 1.2 - - - - - - - - - - - - mzML - - - mzML is the successor and unifier of the mzData format developed by PSI and mzXML developed at the Seattle Proteome Center. - 1.2 - mzML format for raw spectrometer output data, standardised by HUPO PSI MSS. - - - - - - - - - - - - Mass spectrometry data format - - - - - - - - Format for mass spectra and derived data, including peptide sequences etc. - 1.2 - - - - - - - - - - TraML - - - TraML (Transition Markup Language) is the format for mass spectrometry transitions, standardised by HUPO PSI MSS.
- 1.2 - - - - - - - - - - - - mzIdentML - - - mzIdentML is the exchange format for peptides and proteins identified from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of proteomics search engines. - 1.2 - - - - - - - - - - - - mzQuantML - - - mzQuantML is the format for quantitation values associated with peptides, proteins and small molecules from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of quantitation software for proteomics. - 1.2 - - - - - - - - - - - - GelML - - - 1.2 - GelML is the format for describing the process of gel electrophoresis, standardised by HUPO PSI PS. - - - - - - - - - - - - spML - - - 1.2 - spML is the format for describing proteomics sample processing, other than using gels, prior to mass spectrometric protein identification, standardised by HUPO PSI PS. It may also be applicable for metabolomics. - - - - - - - - - - - - OWL Functional Syntax - - - A human-readable encoding for the Web Ontology Language (OWL). - 1.2 - - - - - - - - - - Manchester OWL Syntax - - - A syntax for writing OWL class expressions. - 1.2 - This format was influenced by the OWL Abstract Syntax and the DL style syntax. - - - - - - - - - - KRSS2 Syntax - - - This format is used in Protege 4. - A superset of the "Description-Logic Knowledge Representation System Specification from the KRSS Group of the ARPA Knowledge Sharing Effort". - 1.2 - - - - - - - - - - Turtle - - - The SPARQL Query Language incorporates a very similar syntax. - 1.2 - The Terse RDF Triple Language (Turtle) is a human-friendly serialization format for RDF (Resource Description Framework) graphs. - - - - - - - - - - N-Triples - - - N-Triples should not be confused with Notation 3 which is a superset of Turtle. - 1.2 - A plain text serialisation format for RDF (Resource Description Framework) graphs, and a subset of the Turtle (Terse RDF Triple Language) format. 
- - - - - - - - - - Notation3 - - - N3 - A shorthand non-XML serialization of Resource Description Framework model, designed with human-readability in mind. - - - - - - - - - - RDF/XML - - - - RDF - Resource Description Framework (RDF) XML format. - 1.2 - http://www.ebi.ac.uk/SWO/data/SWO_3000006 - RDF/XML is a serialization syntax for OWL DL, but not for OWL Full. - - - - - - - - - - OWL/XML - - - OWL ontology XML serialisation format. - 1.2 - OWL - - - - - - - - - - A2M - - - The A2M format is used as the primary format for multiple alignments of protein or nucleic-acid sequences in the SAM suite of tools. It is a small modification of FASTA format for sequences and is compatible with most tools that read FASTA. - 1.3 - - - - - - - - - - - - SFF - - - Standard flowgram format - Standard flowgram format (SFF) is a binary file format used to encode results of pyrosequencing from the 454 Life Sciences platform for high-throughput sequencing. - 1.3 - - - - - - - - - - - - MAP - - The MAP file describes SNPs and is used by the Plink package. - 1.3 - Plink MAP - - - - - - - - - - - PED - - Plink PED - 1.3 - The PED file describes individuals and genetic data and is used by the Plink package. - - - - - - - - - - - Individual genetic data format - - Data format for a metadata on an individual and their genetic data. - 1.3 - - - - - - - - - - PED/MAP - - - The PED/MAP file describes data used by the Plink package. - Plink PED/MAP - 1.3 - - - - - - - - - - - CT - - - File format of a CT (Connectivity Table) file from the RNAstructure package. - beta12orEarlier - Connect format - Connectivity Table file format - - - - - - - - - - - - SS - - - beta12orEarlier - XRNA old input style format. - - - - - - - - - - - RNAML - - - - RNA Markup Language. - beta12orEarlier - - - - - - - - - - - GDE - - - Format for the Genetic Data Environment (GDE). 
- beta12orEarlier - - - - - - - - - - - BLC - - 1.3 - Block file format - A multiple alignment in vertical format, as used in the AMPS (Alignment of Multiple Protein Sequences) package. - - - - - - - - - - - Data index format - - - - - - - - - 1.3 - - - - - - - - - - BAI - - - - - - - - 1.3 - BAM indexing format - - - - - - - - - - - HMMER2 - - HMMER profile HMM file for HMMER versions 2.x - 1.3 - - - - - - - - - - - HMMER3 - - 1.3 - HMMER profile HMM file for HMMER versions 3.x - - - - - - - - - - - PO - - EMBOSS simple sequence pair alignment format. - 1.3 - - - - - - - - - - - BLAST XML results format - - - XML format as produced by the NCBI Blast package - 1.3 - - - - - - - - - - CRAM - - - Reference-based compression of alignment format - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - 1.7 - - - - - - - - - - JSON - - 1.7 - JavaScript Object Notation format; a lightweight, text-based format to represent structured data using key-value pairs. - - - - - - - - - - EPS - - Encapsulated PostScript format - 1.7 - - - - - - - - - - GIF - - 1.7 - Graphics Interchange Format. - - - - - - - - - - xls - - - Microsoft Excel spreadsheet format. - Microsoft Excel format - 1.7 - - - - - - - - - - TSV - - Tabular format - http://filext.com/file-extension/CSV - http://www.iana.org/assignments/media-types/text/csv - Tabular data represented as tab-separated values in a text file. - 1.7 - http://filext.com/file-extension/TSV - CSV - - - - - - - - - - Gene expression data format - - true - 1.10 - 1.7 - Format of a file of gene expression data, e.g. a gene expression matrix or profile. - - - - - - - - - - Cytoscape input file format - - - Format of the cytoscape input file of gene expression ratios or values are specified over one or more experiments.
- 1.7 - - - - - - - - - - ebwt - - - - - - - - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie index format - 1.7 - Bowtie format for indexed reference genome for "small" genomes. - - - - - - - - - - RSF - - http://www.molbiol.ox.ac.uk/tutorials/Seqlab_GCG.pdf - RSF-format files contain one or more sequences that may or may not be related. In addition to the sequence data, each sequence can be annotated with descriptive sequence information (from the GCG manual). - Rich sequence format. - 1.7 - GCG RSF - - - - - - - - - - GCG format variant - - - - 1.7 - Some format based on the GCG format. - - - - - - - - - - BSML - - - http://rothlab.ucdavis.edu/genhelp/chapter_2_using_sequences.html#_Creating_and_Editing_Single_Sequenc - Bioinformatics Sequence Markup Language format. - 1.7 - - - - - - - - - - ebwtl - - - - - - - - 1.7 - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie long index format - Bowtie format for indexed reference genome for "large" genomes. - - - - - - - - - - Ensembl variation file format - - - Ensembl standard format for variation data. - 1.8 - - - - - - - - - - - docx - - - 1.8 - Microsoft Word format - doc - Microsoft Word format. - - - - - - - - - - Document format - - Format of documents including word processor, spreadsheet and presentation. - 1.8 - - - - - - - - - - PDF - - - 1.8 - Portable Document Format - - - - - - - - - - Image format - - - - - - - - Format used for images and image metadata. - 1.9 - - - - - - - - - - DICOM format - - - 1.9 - Medical image format corresponding to the Digital Imaging and Communications in Medicine (DICOM) standard. - - - - - - - - - - - - - nii - - - Medical image and metadata format of the Neuroimaging Informatics Technology Initiative. - - - NIfTI-1 format - 1.9 - - - - - - - - - - - mhd - - - Metalmage format - 1.9 - Text-based tagged file format for medical images generated using the MetaImage software package. 
- - - - - - - - - - - nrrd - - - 1.9 - Nearly Raw Rasta Data format designed to support scientific visualization and image processing involving N-dimensional raster data. - - - - - - - - - - - R file format - - File format used for scripts written in the R programming language for execution within the R software environment, typically for statistical computation and graphics. - - 1.9 - - - - - - - - - - SPSS - - 1.9 - File format used for scripts for the Statistical Package for the Social Sciences. - - - - - - - - - - - MHT - MIME HTML format for Web pages, which can include external resources, including images, Flash animations and so on. - - EMBL entry format wrapped in HTML elements. - 1.9 - MHTML - - - - - - - - - - IDAT - - - - - - - - - Proprietary file format for (raw) BeadArray data used by genomewide profiling platforms from Illumina Inc. This format is output directly from the scanner and stores summary intensities for each probe-type on an array. - 1.10 - - - - - - - - - - JPG - - - 1.10 - Joint Picture Group file format for lossy graphics file. - - Sequence of segments with markers. Begins with byte of 0xFF and follows by marker type. - - - - - - - - - - - rcc - - - 1.10 - Reporter Code Count-A data file (.csv) output by the Nanostring nCounter Digital Analyzer, which contains gene sample information, probe information and probe counts. - - - - - - - - - - arff - - ARFF (Attribute-Relation File Format) is an ASCII text file format that describes a list of instances sharing a set of attributes. - 1.11 - This file format is for machine learning. - - - - - - - - - - - - afg - - - 1.11 - AFG is a single text-based file assembly format that holds read and consensus information together - - - - - - - - - - - - bedgraph - - - Holds a tab-delimited chromosome /start /end / datavalue dataset. - 1.11 - The bedGraph format allows display of continuous-valued data in track format. 
This display type is useful for probability scores and transcriptome data - - - - - - - - - - - - bedstrict - - Browser Extensible Data (BED) format of sequence annotation track that strictly does not contain non-standard fields beyond the first 3 columns. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - 1.11 - - - - - - - - - - - - bed6 - - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 6 - BED file format where each feature is described by chromosome, start, end, name, score, and strand. - 1.11 - - - - - - - - - - - - bed12 - - 1.11 - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 12 - A BED file where each feature is described by all twelve columns. - - - - - - - - - - - - chrominfo - - - 1.11 - Tabular format of chromosome names and sizes used by Galaxy. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - - - - - - - - - - - - customtrack - - - 1.11 - Custom Sequence annotation track format used by Galaxy. - Used for tracks/track views within galaxy. - - - - - - - - - - - - csfasta - - - Color space FASTA format sequence variant. - 1.3 - FASTA format extended for color space information. - - - - - - - - - - - - hdf5 - - An HDF5 file appears to the user as a directed graph. The nodes of this graph are the higher-level HDF5 objects that are exposed by the HDF5 APIs: Groups, Datasets, Named datatypes. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. - 1.11 - h5 - Binary format used by Galaxy for hierarchical data. - - - - - - - - - - - - tiff - - - The TIFF format is perhaps the most versatile and diverse bitmap format in existence. 
Its extensible nature and support for numerous data compression schemes allow developers to customize the TIFF format to fit any peculiar data storage needs. - - A versatile bitmap format. - 1.11 - - - - - - - - - - - bmp - - - Standard bitmap storage format in the Microsoft Windows environment. - 1.11 - Although it is based on Windows internal bitmap data structures, it is supported by many non-Windows and non-PC applications. - - - - - - - - - - - im - - - IM is a format used by LabEye and other applications based on the IFUNC image processing library. - IFUNC library reads and writes most uncompressed interchange versions of this format. - - 1.11 - - - - - - - - - - - pcd - - - PCD was developed by Kodak. A PCD file contains five different resolution (ranging from low to high) of a slide or film negative. Due to it PCD is often used by many photographers and graphics professionals for high-end printed applications. - 1.11 - Photo CD format, which is the highest resolution format for images on a CD. - - - - - - - - - - - pcx - - - 1.11 - PCX is an image file format that uses a simple form of run-length encoding. It is lossless. - - - - - - - - - - - - ppm - - - The PPM format is a lowest common denominator color image file format. - - 1.11 - - - - - - - - - - - psd - - - 1.11 - PSD (Photoshop Document) is a proprietary file that allows the user to work with the images’ individual layers even after the file has been saved. - - - - - - - - - - - xbm - - - The XBM format was replaced by XPM for X11 in 1989. - 1.11 - X BitMap is a plain text binary image format used by the X Window System used for storing cursor and icon bitmaps used in the X GUI. - - - - - - - - - - - xpm - - - X PixMap (XPM) is an image file format used by the X Window System, it is intended primarily for creating icon pixmaps, and supports transparent pixels. - - 1.11 - Sequence of segments with markers. Begins with byte of 0xFF and follows by marker type. 
- - - - - - - - - - - rgb - - - RGB file format is the native raster graphics file format for Silicon Graphics workstations. - - 1.11 - - - - - - - - - - - pbm - - - The PBM format is a lowest common denominator monochrome file format. It serves as the common language of a large family of bitmap image conversion filters. - - 1.11 - - - - - - - - - - - pgm - - - It is designed to be extremely easy to learn and write programs for. - The PGM format is a lowest common denominator grayscale file format. - - 1.11 - - - - - - - - - - - PNG - - - 1.11 - png - PNG is a file format for image compression. - - It is expected to replace the Graphics Interchange Format (GIF). - - - - - - - - - - - SVG - - - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - Scalable Vector Graphics (SVG) is an XML-based vector image format for two-dimensional graphics with support for interactivity and animation. - svg - Scalable Vector Graphics - 1.11 - - - - - - - - - - - rast - - - Sun Raster is a raster graphics file format used on SunOS by Sun Microsystems - 1.11 - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - - - - - - - - - - - Sequence quality report format (text) - - - - - - - - - Textual report format for sequence quality for reports from sequencing machines. - 1.11 - - - - - - - - - - qual - - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - Phred quality scores are defined as a property which is logarithmically related to the base-calling error probabilities. - FASTQ format subset for Phred sequencing quality score data only (no sequences).
- - - - - - - - - - qualsolexa - - - Solexa/Illumina 1.0 format can encode a Solexa/Illumina quality score from -5 to 62 using ASCII 59 to 126 (although in raw read data Solexa scores from -5 to 40 only are expected) - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) for Solexa/Illumina 1.0 format. - - - - - - - - - - qualillumina - - - Starting in Illumina 1.5 and before Illumina 1.8, the Phred scores 0 to 2 have a slightly different meaning. The values 0 and 1 are no longer used and the value 2, encoded by ASCII 66 "B", is used also at the end of reads as a Read Segment Quality Control Indicator. - FASTQ format subset for Phred sequencing quality score data only (no sequences) from Illumina 1.5 and before Illumina 1.8. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qualsolid - - For SOLiD data, the sequence is in color space, except the first position. The quality values are those of the Sanger format. - FASTQ format subset for Phred sequencing quality score data only (no sequences) for SOLiD data. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qual454 - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) from 454 sequencers. - - - - - - - - - - ENCODE peak format - - 1.11 - Human ENCODE peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE narrow peak format - - 1.11 - Human ENCODE narrow peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE broad peak format - - 1.11 - Human ENCODE broad peak format. - - - - - - - - - - - - bgzip - - - BAM files are compressed using a variant of GZIP (GNU ZIP), into a format called BGZF (Blocked GNU Zip Format). - Blocked GNU Zip format. 
- 1.11 - - - - - - - - - - - tabix - - - TAB-delimited genome position file index format. - 1.11 - - - - - - - - - - - - Graph format - - Data format for graph data. - 1.11 - - - - - - - - - - xgmml - - XML-based format used to store graph descriptions within Galaxy. - 1.11 - - - - - - - - - - - sif - - 1.11 - SIF (simple interaction file) Format - a network/pathway format used for instance in cytoscape. - - - - - - - - - - - xlsx - - - 1.11 - MS Excel spreadsheet format consisting of a set of XML documents stored in a ZIP-compressed file. - - - - - - - - - - SQLite - - https://www.sqlite.org/fileformat2.html - Data format used by the SQLite database. - 1.11 - - - - - - - - - - GeminiSQLite - - https://gemini.readthedocs.org/en/latest/content/quick_start.html - 1.11 - Data format used by the SQLite database conformant to the Gemini schema. - - - - - - - - - - Index format - - - - - - - - - Format of a data index of some type. - 1.11 - - - - - - - - - - snpeffdb - - An index of a genome database, indexed for use by the snpeff tool. - 1.11 - - - - - - - - - - MAT - - - - - - - - MATLAB file format - Binary format used by MATLAB files to store workspace variables. - 1.12 - MAT file format - .mat file format - - - - - - - - - - - netCDF - - 1.12 - ANDI-MS - Format used by netCDF software library for writing and reading chromatography-MS data files. - - - - - - - - - - - MGF - - Files includes *m*/*z*, intensity pairs separated by headers; headers can contain a bit more information, including search engine instructions. - Mascot Generic Format. Encodes multiple MS/MS spectra in a single file. - 1.12 - - - - - - - - - - dta - - Each file contains one header line for the known or assumed charge and the mass of the precursor peptide ion, calculated from the measured *m*/*z* and the charge. This one line was then followed by all the *m*/*z*, intensity pairs that represent the spectrum. - 1.12 - Spectral data format file where each spectrum is written to a separate file. 
- - - - - - - - - - pkl - - Spectral data file similar to dta. - Differ from .dta only in subtleties of the header line format and content and support the added feature of being able to. - 1.12 - - - - - - - - - - mzXML - - 1.12 - https://dx.doi.org/10.1038%2Fnbt1031 - Common file format for proteomics mass spectrometric data developed at the Seattle Proteome Center/Institute for Systems Biology. - - - - - - - - - - pepXML - - http://sashimi.sourceforge.net/schema_revision/pepXML/pepXML_v118.xsd - Open data format for the storage, exchange, and processing of peptide sequence assignments of MS/MS scans, intended to provide a common data output format for many different MS/MS search engines and subsequent peptide-level analyses. - 1.12 - - - - - - - - - - GPML - - - 1.12 - Graphical Pathway Markup Language (GPML) is an XML format used - for exchanging biological pathways. - - - - - - - - - - - K-mer countgraph - - - 1.12 - oxlicg - http://www.iana.org/assignments/media-types/application/vnd.oxli.countgraph - A list of k-mers and their occurrences in a dataset. Can also be used as an implicit De Bruijn graph. - - - - - - - - - - - mzTab - - - 1.13 - mzTab is a tab-delimited format for mass spectrometry-based proteomics and metabolomics results. - - - - - - - - - - - - - imzML - - - - imzML is a data format for mass spectrometry imaging data. NB.: See comment. - 1.13 - imzML|ibd - Data is recorded in 2 files: '.imzML' is a metadata XML file based on mzML by HUPO-PSI, and '.ibd' is a binary file containing the mass spectra. - - - - - - - - - - - - - qcML - - - - The focus of qcML is towards mass spectrometry based proteomics, but the format is suitable for metabolomics and sequencing as well. - qcML is an XML format for quality-related data of mass spectrometry and other high-throughput measurements.
- 1.13 - - - - - - - - - - - - PRIDE XML - - - - 1.13 - PRIDE XML is an XML format for mass spectra, peptide and protein identifications, and metadata about a corresponding measurement, sample, experiment. - - - - - - - - - - - - SED-ML - - - Simulation Experiment Description Markup Language (SED-ML) is an XML format for encoding simulation setups, according to the MIASE (Minimum Information About a Simulation Experiment) requirements. - 1.13 - - - - - - - - - - - - - - COMBINE OMEX - - - - 1.13 - An OMEX file is a ZIP container that includes a manifest file, listing the content of the archive, an optional metadata file adding information about the archive and its content, and the files describing the model. OMEX is one of the standardised formats within COMBINE (Computational Modeling in Biology Network). - Open Modeling EXchange format (OMEX) is a ZIPped format for encapsulating all information necessary for a modeling and simulation project in systems biology. - - - - - - - - - - - - - ISA-TAB - - - - ISA-TAB is based on MAGE-TAB. Other than tabular, the ISA model can also be represented in RDF, and in JSON (compliable with a set of defined JSON Schemata). - The Investigation / Study / Assay (ISA) tab-delimited (TAB) format incorporates metadata from -experiments employing a combination of technologies. - 1.13 - ISA-Tab - - - - - - - - - - - - SBtab - - - 1.13 - SBtab is a tabular format for biochemical network models. - - - - - - - - - - - - - BCML - - - 1.13 - Biological Connection Markup Language (BCML) is an XML format for biological pathways. - - - - - - - - - - - - BDML - - Biological Dynamics Markup Language (BDML) is an XML format for quantitative data describing biological dynamics. - 1.13 - - - - - - - - - - - - - BEL - - 1.13 - Biological Expression Language (BEL) is a textual format for representing scientific findings in life sciences in a computable form. 
- - - - - - - - - - - - SBGN-ML - - - SBGN-ML is an XML format for Systems Biology Graphical Notation (SBGN) diagrams of biological pathways or networks. - 1.13 - - - - - - - - - - - - AGP - - - 1.13 - AGP is a tabular format for a sequence assembly (a contig, a scaffold/supercontig, or a chromosome). - - - - - - - - - - - - PS - - PostScript - PostScript format - 1.13 - - - - - - - - - - SRA format - - SRA archive format (SRA) is the archive format used for input to the NCBI Sequence Read Archive. - SRA archive format - 1.13 - SRA - - - - - - - - - - - VDB - - VDB ('vertical database') is the native format (SRA) used for export from the NCBI Sequence Read Archive. - SRA native format - 1.13 - SRA - - - - - - - - - - - Tabix index file format - - - - - - - - 1.3 - Index file format used by the samtools package to index TAB-delimited genome position files. - - - - - - - - - - - sequin - - A five-column, tab-delimited table of feature locations and qualifiers for importing annotation into an existing Sequin submission (an NCBI tool for submitting and updating GenBank entries). - 1.13 - - - - - - - - - - MSF - - Magellan storage file format - This format corresponds to an SQLite database, and you can look into the files with e.g. SQLiteStudio3. There are also some readers (http://pubs.acs.org/doi/abs/10.1021/pr2005154) and converters (http://www.sciencedirect.com/science/article/pii/S1874391915300531) for this format available, which re-engineered the database schema, but there is no official DB schema specification of Thermo Scientific for the format. - Proprietary mass-spectrometry format of Thermo Scientific's ProteomeDiscoverer software. - 1.14 - - - - - - - - - - Biodiversity data format - - - - - - - - Data format for biodiversity data.
- 1.14 - - - - - - - - - - ABCD format - - - - - - - - ABCD - Exchange format of the Access to Biological Collections Data (ABCD) Schema; a standard for the access to and exchange of data about specimens and observations (primary biodiversity data). - 1.14 - - - - - - - - - - - GCT/Res format - - - Res format - Tab-delimited text files of GenePattern that contain a column for each sample, a row for each gene, and an expression value for each gene in each sample. - GCT format - 1.14 - - - - - - - - - - WIFF format - - - wiff - wiff - 1.14 - Mass spectrum file format from QSTAR and QTRAP instruments (ABI/Sciex). - - - - - - - - - - X!Tandem XML - - - - Output format used by X! series search engines that is based on the XML language BIOML. - 1.14 - - - - - - - - - - - Thermo RAW - - - Proprietary format for which documentation is not available. - Proprietary file format for mass spectrometry data from Thermo Scientific. - 1.14 - - - - - - - - - - Mascot .dat file - - - "Raw" result file from Mascot database search. - 1.14 - - - - - - - - - - - MaxQuant APL peaklist format - - - 1.14 - MaxQuant APL - Format of peak list files from Andromeda search engine (MaxQuant) that consist of arbitrarily many spectra. - - - - - - - - - - - SBOL - - 1.14 - SBOL introduces a standardized format for the electronic exchange of information on the structural and functional aspects of biological designs. - Synthetic Biology Open Language (SBOL) is an XML format for the specification and exchange of biological design information in synthetic biology. - - - - - - - - - - - PMML - - One or more mining models can be contained in a PMML document. - 1.14 - PMML uses XML to represent mining models. The structure of the models is described by an XML Schema. - - - - - - - - - - - OME-TIFF - - - Image file format used by the Open Microscopy Environment (OME). - - 1.14 - OME develops open-source software and data format standards for the storage and manipulation of biological microscopy data. 
It is a joint project between universities, research establishments, industry and the software development community. - An OME-TIFF dataset consists of one or more files in standard TIFF or BigTIFF format, with the file extension .ome.tif or .ome.tiff, and an identical (or in the case of multiple files, nearly identical) string of OME-XML metadata embedded in the ImageDescription tag of each file’s first IFD (Image File Directory). BigTIFF file extensions are also permitted, with the file extension .ome.tf2, .ome.tf8 or .ome.btf, but note these file extensions are an addition to the original specification, and software using an older version of the specification may not be able to handle these file extensions. - - - - - - - - - - - LocARNA PP - - 1.14 - Format for multiple aligned or single sequences together with the probabilistic description of the (consensus) RNA secondary structure ensemble by probabilities of base pairs, base pair stackings, and base pairs and unpaired bases in the loop of base pairs. - The LocARNA PP format combines sequence or alignment information and (respectively, single or consensus) ensemble probabilities into a PP 2.0 record. - - - - - - - - - - - dbGaP format - - Input format used by the Database of Genotypes and Phenotypes (dbGaP). - The Database of Genotypes and Phenotypes (dbGaP) is a National Institutes of Health (NIH) sponsored repository charged to archive, curate and distribute information produced by studies investigating the interaction of genotype and phenotype. - 1.14 - - - - - - - - - - - Operation - - - A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs).
- http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - Computational tool - Function - http://purl.org/biotop/biotop.owl#Function - http://www.ifomis.org/bfo/1.1/snap#Function - http://en.wikipedia.org/wiki/Function_(mathematics) - Computational method - http://semanticscience.org/resource/SIO_000017 - http://www.ebi.ac.uk/swo/SWO_0000003 - Mathematical operation - sumo:Function - beta12orEarlier - Process - Computational operation - Computational subroutine - http://semanticscience.org/resource/SIO_000649 - Special cases are: a) An operation that consumes no input (has no input arguments). Such operation is either a constant function, or an operation depending only on the underlying state. b) An operation that may modify the underlying state but has no output. c) The singular-case operation with no input or output, that still may modify the underlying state. - http://www.ifomis.org/bfo/1.1/span#Process - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - Computational procedure - Mathematical function - Lambda abstraction - Function (programming) - http://www.onto-med.de/ontologies/gfo.owl#Process - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - http://wsio.org/operation_001 - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#process - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Function - http://en.wikipedia.org/wiki/Function_(computer_science) - http://en.wikipedia.org/wiki/Subroutine - - - - - Process can have a function (as its quality/attribute), and can also perform an operation with inputs and outputs. - Process - - - - - Computational tool - Computational tool provides one or more operations. - - - - - Operation is a function that is computational. It typically has input(s) and output(s), which are always data. 
- Function - - - - - - - - - - Query and retrieval - - - - - - - - - - - - - - beta12orEarlier - Query - Search or query a data resource and retrieve entries and / or annotation. - Database retrieval - - - - - - - - - - Data retrieval (database cross-reference) - - beta12orEarlier - Search database to retrieve all relevant references to a particular entity or entry. - true - beta13 - - - - - - - - - - Annotation - - - - - - - - - - - - - - Annotate an entity (typically a biological or biomedical database entity) with terms from a controlled vocabulary. - beta12orEarlier - This is a broad concept and is used as a placeholder for other, more specific concepts. - - - - - - - - - - Indexing - - - - - - - - Data indexing - beta12orEarlier - Generate an index of (typically a file of) biological data. - Database indexing - - - - - - - - - - Data index analysis - - Database index analysis - Analyse an index of biological data. - beta12orEarlier - true - 1.6 - - - - - - - - - - Annotation retrieval (sequence) - - true - beta12orEarlier - Retrieve basic information about a molecular sequence. - beta12orEarlier - - - - - - - - - - Sequence generation - - - beta12orEarlier - Generate a molecular sequence by some means. - - - - - - - - - - Sequence editing - - - Edit or change a molecular sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence merging - - beta12orEarlier - Merge two or more (typically overlapping) molecular sequences. - Sequence splicing - - - - - - - - - - Sequence conversion - - - Convert a molecular sequence from one type to another. - beta12orEarlier - - - - - - - - - - Sequence complexity calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate sequence complexity, for example to find low-complexity regions in sequences.
- - - - - - - - - - Sequence ambiguity calculation - - - - - - - - - - - - - - Calculate sequence ambiguity, for example identify regions in protein or nucleotide sequences with many ambiguity codes. - beta12orEarlier - - - - - - - - - - Sequence composition calculation - - - - - - - - - - - - - - - beta12orEarlier - Calculate character or word composition or frequency of a molecular sequence. - - - - - - - - - - Repeat sequence analysis - - - - - - - - Find and/or analyse repeat sequences in (typically nucleotide) sequences. - beta12orEarlier - Repeat sequences include tandem repeats, inverted or palindromic repeats, DNA microsatellites (Simple Sequence Repeats or SSRs), interspersed repeats, maximal duplications and reverse, complemented and reverse complemented repeats etc. Repeat units can be exact or imperfect, in tandem or dispersed, of specified or unspecified length. - - - - - - - - - - Sequence motif discovery - - - - - - - - - - - - - - - Motifs and patterns might be conserved or over-represented (occur with improbable frequency). - beta12orEarlier - Discover new motifs or conserved patterns in sequences or sequence alignments (de-novo discovery). - Motif discovery - - - - - - - - - - Sequence motif recognition - - - - - - - - - - - - - - - beta12orEarlier - Sequence signature recognition - Motif scanning - Motif search - Sequence motif search - Protein secondary database search - Motif detection - Sequence signature detection - Sequence profile search - Find (scan for) known motifs, patterns and regular expressions in molecular sequence(s). - Sequence motif detection - Motif recognition - - - - - - - - - - Sequence motif comparison - - - - - - - - - - - - - - - beta12orEarlier - Find motifs shared by molecular sequences. - - - - - - - - - - Transcription regulatory sequence analysis - - beta12orEarlier - beta13 - Analyse the sequence, conformational or physicochemical properties of transcription regulatory elements in DNA sequences.
- For example transcription factor binding sites (TFBS) analysis to predict accessibility of DNA to binding factors. - true - - - - - - - - - - Conserved transcription regulatory sequence identification - - - For example cross-species comparison of transcription factor binding sites (TFBS). Methods might analyse co-regulated or co-expressed genes, or sets of oppositely expressed genes. - beta12orEarlier - Identify common, conserved (homologous) or synonymous transcriptional regulatory motifs (transcription factor binding sites). - - - - - - - - - - Protein property calculation (from structure) - - - - - - - - - - - - - - - This might be a residue-level search for properties such as solvent accessibility, hydropathy, secondary structure, ligand-binding etc. - Extract, calculate or predict non-positional (physical or chemical) properties of a protein from processing a protein (3D) structure. - beta12orEarlier - Protein structural property calculation - - - - - - - - - - Protein flexibility and motion analysis - - - beta12orEarlier - Analyse flexibility and motion in protein structure. - Use this concept for analysis of flexible and rigid residues, local chain deformability, regions undergoing conformational change, molecular vibrations or fluctuational dynamics, domain motions or other large-scale structural transitions in a protein structure. - - - - - - - - - - Protein structural motif recognition - - - - - - - - - Identify or screen for 3D structural motifs in protein structure(s). - This includes conserved substructures and conserved geometry, such as spatial arrangement of secondary structure or protein backbone. Methods might use structure alignment, structural templates, searches for similar electrostatic potential and molecular surface shape, surface-mapping of phylogenetic information etc. 
- beta12orEarlier - Protein structural feature identification - - - - - - - - - - Protein domain recognition - - - - - - - - - beta12orEarlier - Identify structural domains in a protein structure from first principles (for example calculations on structural compactness). - - - - - - - - - - Protein architecture analysis - - beta12orEarlier - Analyse the architecture (spatial arrangement of secondary structure) of protein structure(s). - - - - - - - - - - Residue interaction calculation - - - - - - - - WHATIF: SymShellTenXML - WHATIF:ListContactsRelaxed - WHATIF: SymShellTwoXML - WHATIF:ListSideChainContactsRelaxed - beta12orEarlier - WHATIF:ListSideChainContactsNormal - WHATIF:ListContactsNormal - Calculate or extract inter-atomic, inter-residue or residue-atom contacts, distances and interactions in protein structure(s). - WHATIF: SymShellFiveXML - WHATIF: SymShellOneXML - - - - - - - - - - Protein geometry calculation - - - - - - - - WHATIF:ResidueTorsions - beta12orEarlier - Backbone torsion angle calculation - WHATIF:CysteineTorsions - Calculate, visualise or analyse phi/psi angles of a protein structure. - WHATIF:ResidueTorsionsBB - WHATIF:ShowTauAngle - Torsion angle calculation - Tau angle calculation - Cysteine torsion angle calculation - - - - - - - - - - Protein property calculation - - - - This includes methods to render and visualise the properties of a protein sequence. - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - beta12orEarlier - Protein property rendering - - - - - - - - - - Peptide immunogenicity prediction - - - - - - - - - - - - - - - Immunogenicity prediction - beta12orEarlier - This is usually done in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. 
- This includes methods that generate a graphical rendering of antigenicity of a protein, such as a Hopp and Woods plot. - Hopp and Woods plotting - Predict antigenicity, allergenicity / immunogenicity, allergic cross-reactivity etc of peptides and proteins. - MHC peptide immunogenicity prediction - - - - - - - - - - Sequence feature detection - - - - - - - - - - - - - - - Sequence feature prediction - Predict, recognise and identify positional features in molecular sequences such as key functional sites or regions. - Sequence feature recognition - beta12orEarlier - Motif database search - SO:0000110 - - - - - - - - - - Data retrieval (feature table) - - beta13 - Extract a sequence feature table from a sequence database entry. - true - beta12orEarlier - - - - - - - - - - Feature table query - - 1.6 - beta12orEarlier - true - Query the features (in a feature table) of molecular sequence(s). - - - - - - - - - - Sequence feature comparison - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Compare the feature tables of two or more molecular sequences. - Feature comparison - Feature table comparison - - - - - - - - - - Data retrieval (sequence alignment) - - beta12orEarlier - true - beta13 - Display basic information about a sequence alignment. - - - - - - - - - - Sequence alignment analysis - - - - - - - - Analyse a molecular sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence alignment comparison - - - Compare (typically by aligning) two molecular sequence alignments. - beta12orEarlier - See also 'Sequence profile alignment'. - - - - - - - - - - Sequence alignment conversion - - - beta12orEarlier - Convert a molecular sequence alignment from one type to another (for example amino acid to coding nucleotide sequence). - - - - - - - - - - Nucleic acid property processing - - beta12orEarlier - true - Process (read and / or write) physicochemical property data of nucleic acids. 
- beta13 - - - - - - - - - - Nucleic acid property calculation - - - - - - - - - beta12orEarlier - Calculate or predict physical or chemical properties of nucleic acid molecules, including any non-positional properties of the molecular sequence. - - - - - - - - - - Splice transcript prediction - - - - - - - - beta12orEarlier - Predict splicing alternatives or transcript isoforms from analysis of sequence data. - - - - - - - - - - Frameshift detection - - - - - - - - - Detect frameshifts in DNA sequences, including frameshift sites and signals, and frameshift errors from sequencing projects. - Frameshift error detection - beta12orEarlier - Methods include sequence alignment (if related sequences are available) and word-based sequence comparison. - - - - - - - - - - Vector sequence detection - - - beta12orEarlier - Detect vector sequences in nucleotide sequence, typically by comparison to a set of known vector sequences. - - - - - - - - - - Protein secondary structure prediction - - - - Methods might use amino acid composition, local sequence information, multiple sequence alignments, physicochemical features, estimated energy content, statistical algorithms, hidden Markov models, support vector machines, kernel machines, neural networks etc. - Predict secondary structure of protein sequences. - Secondary structure prediction (protein) - beta12orEarlier - - - - - - - - - - Protein super-secondary structure prediction - - - - - - - - beta12orEarlier - Predict super-secondary structure of protein sequence(s). - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - - - - - - - - - - Transmembrane protein prediction - - - Predict and/or classify transmembrane proteins or transmembrane (helical) domains or regions in protein sequences. 
- beta12orEarlier - - - - - - - - - - Transmembrane protein analysis - - - - - - - - beta12orEarlier - Analyse transmembrane protein(s), typically by processing sequence and / or structural data, and write an informative report for example about the protein and its transmembrane domains / regions. - Use this (or child) concept for analysis of transmembrane domains (buried and exposed faces), transmembrane helices, helix topology, orientation, inter-helical contacts, membrane dipping (re-entrant) loops and other secondary structure etc. Methods might use pattern discovery, hidden Markov models, sequence alignment, structural profiles, amino acid property analysis, comparison to known domains or some combination (hybrid methods). - - - - - - - - - - Structure prediction - - - - - - - - - - - - - - - Predict tertiary structure of a molecular (biopolymer) sequence. - beta12orEarlier - - - - - - - - - - Residue interaction prediction - - - - - - - - - Methods usually involve multiple sequence alignment analysis. - Predict contacts, non-covalent interactions and distance (constraints) between amino acids in protein sequences. - beta12orEarlier - - - - - - - - - - Protein interaction raw data analysis - - - - - - - - - - - - - - Analyse experimental protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. - beta12orEarlier - - - - - - - - - - Protein-protein interaction prediction (from protein sequence) - - beta12orEarlier - 1.12 - true - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein sequences. - - - - - - - - - - Protein-protein interaction prediction (from protein structure) - - true - 1.12 - beta12orEarlier - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein structures. 
- - - - - - - - - - Protein interaction network analysis - - - - - - - - - - - - - - - beta12orEarlier - Analyse a network of protein interactions. - - - - - - - - - - Protein interaction network comparison - - - beta12orEarlier - Compare two or more networks of protein interactions. - - - - - - - - - - RNA secondary structure prediction - - - - - - - - - - Predict RNA secondary structure (for example knots, pseudoknots, alternative structures etc). - beta12orEarlier - Methods might use RNA motifs, predicted intermolecular contacts, or RNA sequence-structure compatibility (inverse RNA folding). - - - - - - - - - - Nucleic acid folding analysis - - - - - - - - - - beta12orEarlier - Analyse some aspect of RNA/DNA folding, typically by processing sequence and/or structural data. - Nucleic acid folding modelling - Nucleic acid folding prediction - Nucleic acid folding - - - - - - - - - - Data retrieval (restriction enzyme annotation) - - beta13 - Restriction enzyme information retrieval - true - Retrieve information on restriction enzymes or restriction enzyme sites. - beta12orEarlier - - - - - - - - - - Genetic marker identification - - true - beta12orEarlier - beta13 - Identify genetic markers in DNA sequences. - A genetic marker is any DNA sequence of known chromosomal location that is associated with and specific to a particular gene or trait. This includes short sequences surrounding a SNP, Sequence-Tagged Sites (STS) which are well suited for PCR amplification, a longer minisatellites sequence etc. - - - - - - - - - - Genetic mapping - - - - - - - - - beta12orEarlier - QTL mapping - This includes mapping of the genetic architecture of dynamic complex traits (functional mapping), e.g. by characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Linkage mapping - Genetic map generation - Mapping involves ordering genetic loci along a chromosome and estimating the physical distance between loci. 
A genetic map shows the relative (not physical) position of known genes and genetic markers. - Generate a genetic (linkage) map of a DNA sequence (typically a chromosome) showing the relative positions of genetic markers based on estimation of non-physical distances. - Genetic map construction - Functional mapping - - - - - - - - - - Linkage analysis - - - - - - - - - - - - - - beta12orEarlier - For example, estimate how close two genes are on a chromosome by calculating how often they are transmitted together to an offspring, ascertain whether two genes are linked and parental linkage, calculate linkage map distance etc. - Analyse genetic linkage. - - - - - - - - - - Codon usage table generation - - - - - - - - - Calculate codon usage statistics and create a codon usage table. - beta12orEarlier - Codon usage table construction - - - - - - - - - - Codon usage table comparison - - - beta12orEarlier - Compare two or more codon usage tables. - - - - - - - - - - Codon usage analysis - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - synon: Codon usage data analysis - Process (read and / or write) codon usage data, e.g. analyse codon usage tables or codon usage in molecular sequences. - synon: Codon usage table analysis - - - - - - - - - - Base position variability plotting - - - - - - - - - - - - - - - Identify and plot third base position variability in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Sequence word comparison - - Find exact character or word matches between molecular sequences without full sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence distance matrix generation - - - - - - - - - - - - - - - Sequence distance matrix construction - Phylogenetic distance matrix generation - beta12orEarlier - Calculate a sequence distance matrix or otherwise estimate genetic distances between molecular sequences. 
- - - - - - - - - - Sequence redundancy removal - - - - - - - - beta12orEarlier - Compare two or more molecular sequences, identify and remove redundant sequences based on some criteria. - - - - - - - - - - Sequence clustering - - - - - - - - - - The clusters may be output or used internally for some other purpose. - Sequence cluster construction - beta12orEarlier - Build clusters of similar sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - Sequence cluster generation - - - - - - - - - - Sequence alignment - - - - - - - - - - Sequence alignment construction - beta12orEarlier - Align (identify equivalent sites within) molecular sequences. - Sequence alignment generation - Sequence alignment computation - - - - - - - - - - Hybrid sequence alignment construction - - Hybrid sequence alignment - true - beta13 - beta12orEarlier - Align two or more molecular sequences of different types (for example genomic DNA to EST, cDNA or mRNA). - Hybrid sequence alignment generation - - - - - - - - - - Structure-based sequence alignment - - Sequence alignment generation (structure-based) - Structure-based sequence alignment construction - beta12orEarlier - Sequence alignment (structure-based) - Structure-based sequence alignment generation - Align molecular sequences using sequence and structural information. - - - - - - - - - - Structure alignment - - - - - - - - - - Align (superimpose) molecular tertiary structures. - Structure alignment generation - Structure alignment construction - beta12orEarlier - Multiple structure alignment construction - Multiple structure alignment generation - - - - - - - - - - Sequence profile generation - - - - - - - - - - - - - - - - - - - - - Sequence profile construction - beta12orEarlier - Generate some type of sequence profile (for example a hidden Markov model) from a sequence alignment. 
- - - - - - - - - - 3D profile generation - - - - - - - - - - - - - - - - - - - - - Structural profile generation - Generate some type of structural (3D) profile or template from a structure or structure alignment. - Structural profile construction - beta12orEarlier - - - - - - - - - - Profile-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence profile alignment - beta12orEarlier - See also 'Sequence alignment comparison'. - Sequence profile alignment construction - Align sequence profiles (representing sequence alignments). - Sequence profile alignment generation - - - - - - - - - - 3D profile-to-3D profile alignment - - - - - - - - - - - - - - beta12orEarlier - 3D profile alignment (multiple) - 3D profile alignment - Multiple 3D profile alignment construction - Structural profile alignment construction (multiple) - Structural profile alignment - Structural profile alignment generation - Structural profile alignment construction - Align structural (3D) profiles or templates (representing structures or structure alignments). - - - - - - - - - - Sequence-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence-profile alignment construction - Sequence-profile alignment generation - beta12orEarlier - Align molecular sequence(s) to sequence profile(s). - Sequence-profile alignment - A sequence profile typically represents a sequence alignment. Methods might perform one-to-one, one-to-many or many-to-many comparisons. - - - - - - - - - - Sequence-to-3D-profile alignment - - - - - - - - - - - - - - - beta12orEarlier - Sequence-3D profile alignment construction - Align molecular sequence(s) to structural (3D) profile(s) or template(s) (representing a structure or structure alignment). - Sequence-3D profile alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Sequence-3D profile alignment - - - - - - - - - - Protein threading - - - - - - - - - - - - - - - beta12orEarlier - Align molecular sequence to structure in 3D space (threading). - Use this concept for methods that evaluate sequence-structure compatibility by assessing residue interactions in 3D. Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Sequence-structure alignment - - - - - - - - - - Protein fold recognition - - - - - beta12orEarlier - Protein domain prediction - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - Recognize (predict and identify) known protein structural domains or folds in protein sequence(s). - Protein fold prediction - - - - - - - - - - Metadata retrieval - - - - - - - - Data retrieval (documentation) - Search for and retrieve data concerning or describing some core data, as distinct from the primary data that is being described. - Data retrieval (metadata) - beta12orEarlier - This includes documentation, general information and other metadata on entities such as databases, database entries and tools. - - - - - - - - - - Literature search - - - - - - - - - - - - - - beta12orEarlier - Query the biomedical and informatics literature. - - - - - - - - - - Text mining - - - - - - - - - - - - - - - - - - - - Text data mining - beta12orEarlier - Process and analyse text (typically the biomedical and informatics literature) to extract information from it. - - - - - - - - - - Virtual PCR - - - - - - - - beta12orEarlier - Perform in-silico (virtual) PCR. 
- - - - - - - - - - PCR primer design - - - - - - - - - - - - - - - - - - - - This includes predicting primers based on gene structure, promoters, exon-exon junctions, predicting primers that are conserved across multiple genomes or species, primers for for gene transcription profiling, for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs), for large scale sequencing, or for methylation PCRs. - PCR primer design (based on gene structure) - PCR primer design (for methylation PCRs) - beta12orEarlier - PCR primer design (for large scale sequencing) - PCR primer prediction - Primer design involves predicting or selecting primers that are specific to a provided PCR template. Primers can be designed with certain properties such as size of product desired, primer size etc. The output might be a minimal or overlapping primer set. - PCR primer design (for conserved primers) - Design or predict oligonucleotide primers for PCR and DNA amplification etc. - PCR primer design (for gene transcription profiling) - PCR primer design (for genotyping polymorphisms) - - - - - - - - - - Microarray probe design - - - - - - - - - - - - - - - - - - - - - - - - - - - Predict and/or optimize oligonucleotide probes for DNA microarrays, for example for transcription profiling of genes, or for genomes and gene families. - beta12orEarlier - Microarray probe prediction - - - - - - - - - - Sequence assembly - - - - - - - - - - - - - - - beta12orEarlier - For example, assemble overlapping reads from paired-end sequencers into contigs (a contiguous sequence corresponding to read overlaps). Or assemble contigs, for example ESTs and genomic DNA fragments, depending on the detected fragment overlaps. - Combine (align and merge) overlapping fragments of a DNA sequence to reconstruct the original sequence. - - - - - - - - - - Microarray data standardization and normalization - - - - - - - - - - - - - - - beta12orEarlier - Standardize or normalize microarray data. 
- This includes statistical analysis, for example of variability amongst microarrays experiments, comparison of heterogeneous microarray platforms etc. - - - - - - - - - - Sequencing-based expression profile data processing - - Process (read and / or write) SAGE, MPSS or SBS experimental data. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gene expression profile clustering - - - - - - - - - beta12orEarlier - Perform cluster analysis of gene expression (microarray) data, for example clustering of similar gene expression profiles. - - - - - - - - - - Gene expression profiling - - - - - - - - - Expression profiling - Gene expression profile construction - Functional profiling - Generate a gene expression profile or pattern, for example from microarray data. - beta12orEarlier - Gene expression profile generation - - - - - - - - - - Gene expression profile comparison - - - - - - - - - beta12orEarlier - Compare gene expression profiles or patterns. - - - - - - - - - - Functional profiling - - true - beta12orEarlier - Interpret (in functional terms) and annotate gene expression data. - beta12orEarlier - - - - - - - - - - EST and cDNA sequence analysis - - Analyse EST or cDNA sequences. - For example, identify full-length cDNAs from EST sequences or detect potential EST antisense transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Structural genomics target selection - - beta12orEarlier - Identify and select targets for protein structural determination. - beta12orEarlier - Methods will typically navigate a graph of protein families of known structure. - true - - - - - - - - - - Protein secondary structure assignment - - - - - - - - - - - - - - beta12orEarlier - Assign secondary structure from protein coordinate or experimental data. - - - - - - - - - - Protein structure assignment - - - - - - - - - - - - - - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw experimental data. 
- - - - - - - - - - Protein model validation - - - - - - - - - - - - - - - WHATIF: UseResidueDB - Evaluate the quality or correctness a protein three-dimensional model. - This includes methods that calculate poor quality residues. The scoring function to identify poor quality residues may consider residues with bad atoms or atoms with high B-factor, residues in the N- or C-terminal position, adjacent to an unstructured residue, non-canonical residues, glycine and proline (or adjacent to these such residues). - Model validation might involve checks for atomic packing, steric clashes (bumps), volume irregularities, agreement with electron density maps, number of amino acid residues, percentage of residues with missing or bad atoms, irregular Ramachandran Z-scores, irregular Chi-1 / Chi-2 normality scores, RMS-Z score on bonds and angles etc. - Residue validation - WHATIF: CorrectedPDBasXML - Protein structure validation - WHATIF: UseFileDB - The PDB file format has had difficulties, inconsistencies and errors. Corrections can include identifying a meaningful sequence, removal of alternate atoms, correction of nomenclature problems, removal of incomplete residues and spurious waters, addition or removal of water, modelling of missing side chains, optimisation of cysteine bonds, regularisation of bond lengths, bond angles and planarities etc. - beta12orEarlier - - - - - - - - - - Molecular model refinement - - - Protein model refinement - WHATIF: CorrectedPDBasXML - beta12orEarlier - Refine (after evaluation) a model of a molecular structure (typically a protein structure) to reduce steric clashes, volume irregularities etc. - - - - - - - - - - Phylogenetic tree generation - - - - - - - - - - - - - - - Phylogenetic trees are usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. - Phylogenetic tree construction - Construct a phylogenetic tree. 
- beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis - - - - - - - - beta12orEarlier - Analyse an existing phylogenetic tree or trees, typically to detect features or make predictions. - - - - - - - - - - Phylogenetic tree comparison - - - beta12orEarlier - Compare two or more phylogenetic trees. - For example, to produce a consensus tree, subtrees, supertrees, calculate distances between trees or test topological similarity between trees (e.g. a congruence index) etc. - - - - - - - - - - Phylogenetic tree editing - - - - - - - - - - - - - - - Edit a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic footprinting / shadowing - - - - - - - - A phylogenetic 'shadow' represents the additive differences between individual sequences. By masking or 'shadowing' variable positions a conserved sequence is produced with few or none of the variations, which is then compared to the sequences of interest to identify significant regions of conservation. - beta12orEarlier - Infer a phylogenetic tree by comparing orthologous sequences in different species, particularly many closely related species (phylogenetic shadowing). - - - - - - - - - - Protein folding simulation - - beta12orEarlier - Simulate the folding of a protein. - - - - - - - - - - Protein folding pathway prediction - - - Predict the folding pathway(s) or non-native structural intermediates of a protein. - beta12orEarlier - - - - - - - - - - Protein SNP mapping - - true - beta12orEarlier - Map and model the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). - 1.12 - - - - - - - - - - Protein modelling (mutation) - - - - - - - - - - - - - - - Protein SNP mapping - Protein mutation modelling - Predict the effect of point mutation on a protein structure, in terms of strucural effects and protein folding, stability and function. 
- Rotamer likelihood prediction - beta12orEarlier - This includes 1) rotamer likelihood prediction: the prediction of rotamer likelihoods for all 20 amino acid types at each position in a protein structure, where output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods. 2) Protein SNP mapping, which maps and modesl the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). Methods might predict silent or pathological mutations. - - - - - - - - - - Immunogen design - - true - Design molecules that elicit an immune response (immunogens). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Zinc finger prediction - - - - - - - - - - - - - - Predict and optimise zinc finger protein domains for DNA/RNA binding (for example for transcription factors and nucleases). - beta12orEarlier - - - - - - - - - - Enzyme kinetics calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate Km, Vmax and derived data for an enzyme reaction. - - - - - - - - - - Formatting - - beta12orEarlier - Reformat a file of data (or equivalent entity in memory). - Format conversion - File formatting - Reformatting - File reformatting - File format conversion - - - - - - - - - - Format validation - - Test and validate the format and content of a data file. - File format validation - beta12orEarlier - - - - - - - - - - Visualisation - - - - - - - - - - - - - - - - - - - - Visualization - beta12orEarlier - Visualise, plot or render (graphically) biomolecular data such as molecular sequences or structures. - Rendering - - - - - - - - - - Sequence database search - - - - - - - - - Search a sequence database by sequence comparison and retrieve similar sequences. - -sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - This excludes direct retrieval methods (e.g. the dbfetch program). 
- - - - - - - - - - Structure database search - - - - - - - - beta12orEarlier - Search a tertiary structure database, typically by sequence and/or structure comparison, or some other means, and retrieve structures and associated data. - - - - - - - - - - Protein secondary database search - - 1.8 - beta12orEarlier - true - Search a secondary protein database (of classification information) to assign a protein sequence(s) to a known protein family or group. - - - - - - - - - - Motif database search - - beta12orEarlier - Screen a sequence against a motif or pattern database. - true - 1.8 - - - - - - - - - - Sequence profile database search - - true - beta12orEarlier - Search a database of sequence profiles with a query sequence. - 1.4 - - - - - - - - - - Transmembrane protein database search - - true - beta12orEarlier - Search a database of transmembrane proteins, for example for sequence or structural similarities. - beta12orEarlier - - - - - - - - - - Sequence retrieval (by code) - - Query a database and retrieve sequences with a given entry code or accession number. - true - 1.6 - beta12orEarlier - - - - - - - - - - Sequence retrieval (by keyword) - - true - Query a database and retrieve sequences containing a given keyword. - beta12orEarlier - 1.6 - - - - - - - - - - Sequence similarity search - - - Structure database search (by sequence) - Sequence database search (by sequence) - beta12orEarlier - Search a sequence database and retrieve sequences that are similar to a query sequence. - - - - - - - - - - Sequence database search (by motif or pattern) - - 1.8 - Search a sequence database and retrieve sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - true - - - - - - - - - - Sequence database search (by amino acid composition) - - true - Search a sequence database and retrieve sequences of a given amino acid composition. 
- 1.6 - beta12orEarlier - - - - - - - - - - Sequence database search (by property) - - Search a sequence database and retrieve sequences with a specified property, typically a physicochemical or compositional property. - beta12orEarlier - - - - - - - - - - Sequence database search (by sequence using word-based methods) - - beta12orEarlier - Word-based methods (for example BLAST, gapped BLAST, MEGABLAST, WU-BLAST etc.) are usually quicker than alignment-based methods. They may or may not handle gaps. - 1.6 - true - Sequence similarity search (word-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a word-based method. - - - - - - - - - - Sequence database search (by sequence using profile-based methods) - - true - Sequence similarity search (profile-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a sequence profile-based method, or with a supplied profile as query. - beta12orEarlier - This includes tools based on PSI-BLAST. - 1.6 - - - - - - - - - - Sequence database search (by sequence using local alignment-based methods) - - Search a sequence database for sequences that are similar to a query sequence using a local alignment-based method. - 1.6 - beta12orEarlier - true - Sequence similarity search (local alignment-based methods) - This includes tools based on the Smith-Waterman algorithm or FASTA. - - - - - - - - - - Sequence database search (by sequence using global alignment-based methods) - - This includes tools based on the Needleman and Wunsch algorithm. - Search sequence(s) or a sequence database for sequences that are similar to a query sequence using a global alignment-based method. 
- 1.6 - Sequence similarity search (global alignment-based methods) - beta12orEarlier - true - - - - - - - - - - Sequence database search (by sequence for primer sequences) - - true - beta12orEarlier - Search a DNA database (for example a database of conserved sequence tags) for matches to Sequence-Tagged Site (STS) primer sequences. - 1.6 - STSs are genetic markers that are easily detected by the polymerase chain reaction (PCR) using specific primers. - Sequence similarity search (primer sequences) - - - - - - - - - - Sequence database search (by molecular weight) - - Search sequence(s) or a sequence database for sequences which match a set of peptide masses, for example a peptide mass fingerprint from mass spectrometry. - 1.6 - true - beta12orEarlier - - - - - - - - - - Sequence database search (by isoelectric point) - - 1.6 - beta12orEarlier - Search sequence(s) or a sequence database for sequences of a given isoelectric point. - true - - - - - - - - - - Structure retrieval (by code) - - Query a tertiary structure database and retrieve entries with a given entry code or accession number. - 1.6 - beta12orEarlier - true - - - - - - - - - - Structure retrieval (by keyword) - - true - 1.6 - Query a tertiary structure database and retrieve entries containing a given keyword. - beta12orEarlier - - - - - - - - - - Structure database search (by sequence) - - beta12orEarlier - true - Search a tertiary structure database and retrieve structures with a sequence similar to a query sequence. - 1.8 - - - - - - - - - - Structural similarity search - - - beta12orEarlier - Search a database of molecular structure and retrieve structures that are similar to a query structure. - Structure database search (by structure) - Structure retrieval by structure - - - - - - - - - - Sequence annotation - - - - - - - - - - - - - - beta12orEarlier - Annotate a molecular sequence record with terms from a controlled vocabulary. 
- - - - - - - - - - Genome annotation - - beta12orEarlier - Metagenome annotation - Annotate a genome sequence with terms from a controlled vocabulary. - - - - - - - - - - Nucleic acid sequence reverse and complement - - beta12orEarlier - Generate the reverse and / or complement of a nucleotide sequence. - - - - - - - - - - Random sequence generation - - Generate a random sequence, for example, with a specific character composition. - beta12orEarlier - - - - - - - - - - Nucleic acid restriction digest - - - - - - - - - beta12orEarlier - Generate digest fragments for a nucleotide sequence containing restriction sites. - - - - - - - - - - Protein sequence cleavage - - - - - - - - - - - - - - - beta12orEarlier - Cleave a protein sequence into peptide fragments (by enzymatic or chemical cleavage) and calculate the fragment masses. - - - - - - - - - - Sequence mutation and randomization - - beta12orEarlier - Mutate a molecular sequence a specified amount or shuffle it to produce a randomized sequence with the same overall composition. - - - - - - - - - - Sequence masking - - Mask characters in a molecular sequence (replacing those characters with a mask character). - For example, SNPs or repeats in a DNA sequence might be masked. - beta12orEarlier - - - - - - - - - - Sequence cutting - - Cut (remove) characters or a region from a molecular sequence. - beta12orEarlier - - - - - - - - - - Restriction site creation - - Create (or remove) restriction sites in sequences, for example using silent mutations. - beta12orEarlier - - - - - - - - - - DNA translation - - - - - - - - beta12orEarlier - Translate a DNA sequence into protein. - - - - - - - - - - DNA transcription - - - - - - - - beta12orEarlier - Transcribe a nucleotide sequence into mRNA sequence(s). - - - - - - - - - - Sequence composition calculation (nucleic acid) - - true - Calculate base frequency or word composition of a nucleotide sequence. 
- 1.8 - beta12orEarlier - - - - - - - - - - Sequence composition calculation (protein) - - 1.8 - Calculate amino acid frequency or word composition of a protein sequence. - beta12orEarlier - true - - - - - - - - - - Repeat sequence detection - - - beta12orEarlier - Find (and possibly render) short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. - - - - - - - - - - Repeat sequence organisation analysis - - - beta12orEarlier - Analyse repeat sequence organization such as periodicity. - - - - - - - - - - Protein hydropathy calculation (from structure) - - true - Analyse the hydrophobic, hydrophilic or charge properties of a protein structure. - 1.12 - beta12orEarlier - - - - - - - - - - Accessible surface calculation - - - - - - - - beta12orEarlier - WHATIF:AtomAccessibilitySolventPlus - Protein solvent accessibility calculation - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate solvent accessible or buried surface areas in protein or other molecular structures. - WHATIF:AtomAccessibilitySolvent - - - - - - - - - - Protein hydropathy cluster calculation - - true - 1.12 - beta12orEarlier - Identify clusters of hydrophobic or charged residues in a protein structure. - - - - - - - - - - Protein dipole moment calculation - - - - - - - - beta12orEarlier - Calculate whether a protein structure has an unusually large net charge (dipole moment). - - - - - - - - - - Molecular surface calculation - - WHATIF:ResidueAccessibilityMolecular - Protein surface calculation - Protein surface and interior calculation - WHATIF:AtomAccessibilityMolecularPlus - WHATIF:TotAccessibilityMolecular - Protein atom surface calculation - Calculate the molecular surface area in proteins and other macromolecules. 
- Protein residue surface calculation - WHATIF:ResidueAccessibilityVacuum - beta12orEarlier - WHATIF:TotAccessibilitySolvent - WHATIF:ResidueAccessibilitySolvent - WHATIF:ResidueAccessibilityVacuumMolecular - WHATIF:AtomAccessibilityMolecular - - - - - - - - - - Protein binding site prediction (from structure) - - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein structures. - beta12orEarlier - 1.12 - true - - - - - - - - - - Protein-nucleic acid binding site analysis - - - - - - - - Analyse RNA or DNA-binding sites in protein structure. - beta12orEarlier - - - - - - - - - - Protein peeling - - beta12orEarlier - Decompose a structure into compact or globular fragments (protein peeling). - - - - - - - - - - Protein distance matrix calculation - - - - - - - - beta12orEarlier - Calculate a matrix of distance between residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map calculation - - - - - - - - beta12orEarlier - Calculate a residue contact map (typically all-versus-all inter-residue contacts) for a protein structure. - - - - - - - - - - Residue cluster calculation - - - - - - - - Calculate clusters of contacting residues in protein structures. - This includes for example clusters of hydrophobic or charged residues, or clusters of contacting residues which have a key structural or functional role. - beta12orEarlier - - - - - - - - - - Hydrogen bond calculation - - - - - - - - WHATIF:ShowHydrogenBonds - WHATIF:HasHydrogenBonds - The output might include the atoms involved in the bond, bond geometric parameters and bond enthalpy. - beta12orEarlier - WHATIF:ShowHydrogenBondsM - Identify potential hydrogen bonds between amino acids and other groups. - - - - - - - - - - Residue non-canonical interaction detection - - beta12orEarlier - 1.12 - Calculate non-canonical atomic interactions in protein structures. 
- true - - - - - - - - - - Ramachandran plot calculation - - - - - - - - Calculate a Ramachandran plot of a protein structure. - beta12orEarlier - - - - - - - - - - Ramachandran plot validation - - - - - - - - - - - - - - beta12orEarlier - Validate a Ramachandran plot of a protein structure. - - - - - - - - - - Protein molecular weight calculation - - - - - - - - - - - - - - Calculate the molecular weight of a protein sequence or fragments. - beta12orEarlier - - - - - - - - - - Protein extinction coefficient calculation - - - - - - - - beta12orEarlier - Predict extinction coefficients or optical density of a protein sequence. - - - - - - - - - - Protein pH-dependent property calculation - - - - - - - - - - - - - - Calculate pH-dependent properties from pKa calculations of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydropathy calculation (from sequence) - - 1.12 - Hydropathy calculation on a protein sequence. - beta12orEarlier - true - - - - - - - - - - Protein titration curve plotting - - - - - - - - - beta12orEarlier - Plot a protein titration curve. - - - - - - - - - - Protein isoelectric point calculation - - - - - - - - beta12orEarlier - Calculate isoelectric point of a protein sequence. - - - - - - - - - - Protein hydrogen exchange rate calculation - - - - - - - - Estimate hydrogen exchange rate of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydrophobic region calculation - - Calculate hydrophobic or hydrophilic / charged regions of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein aliphatic index calculation - - - - - - - - beta12orEarlier - Calculate aliphatic index (relative volume occupied by aliphatic side chains) of a protein. - - - - - - - - - - Protein hydrophobic moment plotting - - - - - - - - - beta12orEarlier - Hydrophobic moment is a peptides hydrophobicity measured for different angles of rotation. - Calculate the hydrophobic moment of a peptide sequence and recognize amphiphilicity. 
- - - - - - - - - - Protein globularity prediction - - - - - - - - Predict the stability or globularity of a protein sequence, whether it is intrinsically unfolded etc. - beta12orEarlier - - - - - - - - - - Protein solubility prediction - - - - - - - - Predict the solubility or atomic solvation energy of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein crystallizability prediction - - - - - - - - beta12orEarlier - Predict crystallizability of a protein sequence. - - - - - - - - - - Protein signal peptide detection (eukaryotes) - - beta12orEarlier - Detect or predict signal peptides (and typically predict subcellular localization) of eukaryotic proteins. - - - - - - - - - - Protein signal peptide detection (bacteria) - - Detect or predict signal peptides (and typically predict subcellular localization) of bacterial proteins. - beta12orEarlier - - - - - - - - - - MHC peptide immunogenicity prediction - - true - - Predict MHC class I or class II binding peptides, promiscuous binding peptides, immunogenicity etc. - beta12orEarlier - 1.12 - - - - - - - - - - Protein feature prediction (from sequence) - - Methods typically involve scanning for known motifs, patterns and regular expressions. - beta12orEarlier - true - Sequence feature detection (protein) - 1.6 - Predict, recognise and identify positional features in protein sequences such as functional sites or regions and secondary structure. - - - - - - - - - - Nucleic acid feature detection - - - - - - - - - - - - - - - Sequence feature detection (nucleic acid) - Predict, recognise and identify features in nucleotide sequences such as functional sites or regions, typically by scanning for known motifs, patterns and regular expressions. - Methods typically involve scanning for known motifs, patterns and regular expressions. 
- beta12orEarlier - Nucleic acid feature recognition - Nucleic acid feature prediction - - - - - - - - - - Epitope mapping - - - - - - - - - beta12orEarlier - Predict antigenic determinant sites (epitopes) in protein sequences. - Epitope mapping is commonly done during vaccine design. - - - - - - - - - - Protein post-translation modification site prediction - - - - - - - - Predict post-translation modification sites in protein sequences. - beta12orEarlier - Methods might predict sites of methylation, N-terminal myristoylation, N-terminal acetylation, sumoylation, palmitoylation, phosphorylation, sulfation, glycosylation, glycosylphosphatidylinositol (GPI) modification sites (GPI lipid anchor signals) etc. - - - - - - - - - - Protein signal peptide detection - - - - - - - - - beta12orEarlier - Methods might use sequence motifs and features, amino acid composition, profiles, machine-learned classifiers, etc. - Detect or predict signal peptides and signal peptide cleavage sites in protein sequences. - - - - - - - - - - Protein binding site prediction (from sequence) - - 1.12 - Predict catalytic residues, active sites or other ligand-binding sites in protein sequences. - true - beta12orEarlier - - - - - - - - - - Protein-nucleic acid binding prediction - - beta12orEarlier - Predict RNA and DNA-binding binding sites in protein sequences. - - - - - - - - - - Protein folding site prediction - - - Predict protein sites that are key to protein folding, such as possible sites of nucleation or stabilization. - beta12orEarlier - - - - - - - - - - Protein cleavage site prediction - - - - - - - - beta12orEarlier - Detect or predict cleavage sites (enzymatic or chemical) in protein sequences. - - - - - - - - - - Epitope mapping (MHC Class I) - - 1.8 - true - beta12orEarlier - Predict epitopes that bind to MHC class I molecules. - - - - - - - - - - Epitope mapping (MHC Class II) - - Predict epitopes that bind to MHC class II molecules. 
- 1.8 - true - beta12orEarlier - - - - - - - - - - - Whole gene prediction - - beta12orEarlier - 1.12 - true - Detect, predict and identify whole gene structure in DNA sequences. This includes protein coding regions, exon-intron structure, regulatory regions etc. - - - - - - - - - - Gene component prediction - - true - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - beta12orEarlier - Detect, predict and identify genetic elements such as promoters, coding regions, splice sites, etc in DNA sequences. - 1.12 - - - - - - - - - - Transposon prediction - - beta12orEarlier - Detect or predict transposons, retrotransposons / retrotransposition signatures etc. - - - - - - - - - - PolyA signal detection - - Detect polyA signals in nucleotide sequences. - beta12orEarlier - - - - - - - - - - Quadruplex formation site detection - - - - - - - - beta12orEarlier - Quadruplex structure prediction - Detect quadruplex-forming motifs in nucleotide sequences. - Quadruplex (4-stranded) structures are formed by guanine-rich regions and are implicated in various important biological processes and as therapeutic targets. - - - - - - - - - - CpG island and isochore detection - - - - - - - - An isochore is long region (> 3 KB) of DNA with very uniform GC content, in contrast to the rest of the genome. Isochores tend tends to have more genes, higher local melting or denaturation temperatures, and different flexibility. Methods might calculate fractional GC content or variation of GC content, predict methylation status of CpG islands etc. This includes methods that visualise CpG rich regions in a nucleotide sequence, for example plot isochores in a genome sequence. - beta12orEarlier - Find CpG rich regions in a nucleotide sequence or isochores in genome sequences. 
- CpG island and isochores rendering - CpG island and isochores detection - - - - - - - - - - Restriction site recognition - - - - - - - - beta12orEarlier - Find and identify restriction enzyme cleavage sites (restriction sites) in (typically) DNA sequences, for example to generate a restriction map. - - - - - - - - - - Nucleosome formation or exclusion sequence prediction - - beta12orEarlier - Identify or predict nucleosome exclusion sequences (nucleosome free regions) in DNA. - - - - - - - - - - Splice site prediction - - - - - - - - beta12orEarlier - Identify, predict or analyse splice sites in nucleotide sequences. - Methods might require a pre-mRNA or genomic DNA sequence. - - - - - - - - - - Integrated gene prediction - - Predict whole gene structure using a combination of multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Operon prediction - - Find operons (operators, promoters and genes) in bacteria genes. - beta12orEarlier - - - - - - - - - - Coding region prediction - - Predict protein-coding regions (CDS or exon) or open reading frames in nucleotide sequences. - ORF prediction - ORF finding - beta12orEarlier - - - - - - - - - - Selenocysteine insertion sequence (SECIS) prediction - - - - - - - - Predict selenocysteine insertion sequence (SECIS) in a DNA sequence. - SECIS elements are around 60 nucleotides in length with a stem-loop structure directs the cell to translate UGA codons as selenocysteines. - beta12orEarlier - - - - - - - - - - Regulatory element prediction - - - - - - - - Identify or predict transcription regulatory motifs, patterns, elements or regions in DNA sequences. - Translational regulatory element prediction - Transcription regulatory element prediction - This includes promoters, enhancers, silencers and boundary elements / insulators, regulatory protein or transcription factor binding sites etc. 
Methods might be specific to a particular genome and use motifs, word-based / grammatical methods, position-specific frequency matrices, discriminative pattern analysis etc. - beta12orEarlier - - - - - - - - - - Translation initiation site prediction - - - - - - - - Predict translation initiation sites, possibly by searching a database of sites. - beta12orEarlier - - - - - - - - - - Promoter prediction - - Identify or predict whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in DNA sequences. - Methods might recognize CG content, CpG islands, splice sites, polyA signals etc. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (DNA-cis) - - beta12orEarlier - Cis-regulatory elements (cis-elements) regulate the expression of genes located on the same strand. Cis-elements are found in the 5' promoter region of the gene, in an intron, or in the 3' untranslated region. Cis-elements are often binding sites of one or more trans-acting factors. - Identify, predict or analyse cis-regulatory elements (TATA box, Pribnow box, SOS box, CAAT box, CCAAT box, operator etc.) in DNA sequences. - - - - - - - - - - Transcription regulatory element prediction (RNA-cis) - - Cis-regulatory elements (cis-elements) regulate genes located on the same strand from which the element was transcribed. A riboswitch is a region of an mRNA molecule that bind a small target molecule that regulates the gene's activity. - Identify, predict or analyse cis-regulatory elements (for example riboswitches) in RNA sequences. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (trans) - - - - - - - - beta12orEarlier - Trans-regulatory elements regulate genes distant from the gene from which they were transcribed. - Identify or predict functional RNA sequences with a gene regulatory role (trans-regulatory elements) or targets. 
- Functional RNA identification - - - - - - - - - - Matrix/scaffold attachment site prediction - - MAR/SAR sites often flank a gene or gene cluster and are found nearby cis-regulatory sequences. They might contribute to transcription regulation. - Identify matrix/scaffold attachment regions (MARs/SARs) in DNA sequences. - beta12orEarlier - - - - - - - - - - Transcription factor binding site prediction - - beta12orEarlier - Identify or predict transcription factor binding sites in DNA sequences. - - - - - - - - - - Exonic splicing enhancer prediction - - - - - - - - An exonic splicing enhancer (ESE) is 6-base DNA sequence motif in an exon that enhances or directs splicing of pre-mRNA or hetero-nuclear RNA (hnRNA) into mRNA. - Identify or predict exonic splicing enhancers (ESE) in exons. - beta12orEarlier - - - - - - - - - - Sequence alignment validation - - - Evaluation might be purely sequence-based or use structural information. - Sequence alignment quality evaluation - Evaluate molecular sequence alignment accuracy. - beta12orEarlier - - - - - - - - - - Sequence alignment analysis (conservation) - - beta12orEarlier - Analyse character conservation in a molecular sequence alignment, for example to derive a consensus sequence. - Residue conservation analysis - Use this concept for methods that calculate substitution rates, estimate relative site variability, identify sites with biased properties, derive a consensus sequence, or identify highly conserved or very poorly conserved sites, regions, blocks etc. - - - - - - - - - - Sequence alignment analysis (site correlation) - - - Analyse correlations between sites in a molecular sequence alignment. - This is typically done to identify possible covarying positions and predict contacts or structural constraints in protein structures. - beta12orEarlier - - - - - - - - - - Chimeric sequence detection - - beta12orEarlier - A chimera includes regions from two or more phylogenetically distinct sequences. 
They are usually artifacts of PCR and are thought to occur when a prematurely terminated amplicon reanneals to another DNA strand and is subsequently copied to completion in later PCR cycles. - Detects chimeric sequences (chimeras) from a sequence alignment. - Sequence alignment analysis (chimeric sequence detection) - - - - - - - - - - Recombination detection - - Sequence alignment analysis (recombination detection) - beta12orEarlier - Detect recombination (hotspots and coldspots) and identify recombination breakpoints in a sequence alignment. - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - - - - - - - - - - Indel detection - - - beta12orEarlier - Sequence alignment analysis (indel detection) - Indel discovery - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - Identify insertion, deletion and duplication events from a sequence alignment. - - - - - - - - - - Nucleosome formation potential prediction - - true - beta12orEarlier - Predict nucleosome formation potential of DNA sequences. - beta12orEarlier - - - - - - - - - - Nucleic acid thermodynamic property calculation - - - - - - - - Calculate a thermodynamic property of DNA or DNA/RNA, such as melting temperature, enthalpy and entropy. - beta12orEarlier - - - - - - - - - - Nucleic acid melting profile plotting - - - - - - - - - Calculate and plot a DNA or DNA/RNA melting profile. - A melting profile is used to visualise and analyse partly melted DNA conformations. - beta12orEarlier - - - - - - - - - - Nucleic acid stitch profile plotting - - - - - - - - A stitch profile represents the alternative conformations that partly melted DNA can adopt in a temperature range. - beta12orEarlier - Calculate and plot a DNA or DNA/RNA stitch profile. - - - - - - - - - - Nucleic acid melting curve plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA melting curve. 
- beta12orEarlier - - - - - - - - - - Nucleic acid probability profile plotting - - - - - - - - beta12orEarlier - Calculate and plot a DNA or DNA/RNA probability profile. - - - - - - - - - - Nucleic acid temperature profile plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA temperature profile. - beta12orEarlier - - - - - - - - - - Nucleic acid curvature calculation - - - - - - - - Calculate curvature and flexibility / stiffness of a nucleotide sequence. - beta12orEarlier - This includes properties such as. - - - - - - - - - - microRNA detection - - Identify or predict microRNA sequences (miRNA) and precursors or microRNA targets / binding sites in a DNA sequence. - beta12orEarlier - - - - - - - - - - tRNA gene prediction - - - - - - - - Identify or predict tRNA genes in genomic sequences (tRNA). - beta12orEarlier - - - - - - - - - - siRNA binding specificity prediction - - - - - - - - beta12orEarlier - Assess binding specificity of putative siRNA sequence(s), for example for a functional assay, typically with respect to designing specific siRNA sequences. - - - - - - - - - - Protein secondary structure prediction (integrated) - - Predict secondary structure of protein sequence(s) using multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (helices) - - beta12orEarlier - Predict helical secondary structure of protein sequences. - - - - - - - - - - Protein secondary structure prediction (turns) - - Predict turn structure (for example beta hairpin turns) of protein sequences. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (coils) - - beta12orEarlier - Predict open coils, non-regular secondary structure and intrinsically disordered / unstructured regions of protein sequences. - - - - - - - - - - Protein secondary structure prediction (disulfide bonds) - - beta12orEarlier - Predict cysteine bonding state and disulfide bond partners in protein sequences. 
- - - - - - - - - - GPCR prediction - - - beta12orEarlier - G protein-coupled receptor (GPCR) prediction - Predict G protein-coupled receptors (GPCR). - - - - - - - - - - GPCR analysis - - - - - - - - Analyse G-protein coupled receptor proteins (GPCRs). - beta12orEarlier - G protein-coupled receptor (GPCR) analysis - - - - - - - - - - Protein structure prediction - - - - - - - - - - - beta12orEarlier - Predict tertiary structure (backbone and side-chain conformation) of protein sequences. - - - - - - - - - - Nucleic acid structure prediction - - - - - - - - - - beta12orEarlier - Methods might identify thermodynamically stable or evolutionarily conserved structures. - Predict tertiary structure of DNA or RNA. - - - - - - - - - - Ab initio structure prediction - - Predict tertiary structure of protein sequence(s) without homologs of known structure. - de novo structure prediction - beta12orEarlier - - - - - - - - - - Protein modelling - - - - - - - - - - Comparative modelling - beta12orEarlier - Build a three-dimensional protein model based on known (for example homologs) structures. - The model might be of a whole, part or aspect of protein structure. Molecular modelling methods might use sequence-structure alignment, structural templates, molecular dynamics, energy minimization etc. - Homology modelling - Homology structure modelling - Protein structure comparative modelling - - - - - - - - - - Molecular docking - - - - - - - - - - - - - - - Model the structure of a protein in complex with a small molecule or another macromolecule. - beta12orEarlier - This includes protein-protein interactions, protein-nucleic acid, protein-ligand binding etc. Methods might predict whether the molecules are likely to bind in vivo, their conformation when bound, the strength of the interaction, possible mutations to achieve bonding and so on. - Docking simulation - Protein docking - - - - - - - - - - Protein modelling (backbone) - - Model protein backbone conformation. 
- Methods might require a preliminary C(alpha) trace. - beta12orEarlier - - - - - - - - - - Protein modelling (side chains) - - beta12orEarlier - Methods might use a residue rotamer library. - Model, analyse or edit amino acid side chain conformation in protein structure, optimize side-chain packing, hydrogen bonding etc. - - - - - - - - - - Protein modelling (loops) - - beta12orEarlier - Model loop conformation in protein structures. - - - - - - - - - - Protein-ligand docking - - - - - - - - - - - - - - beta12orEarlier - Methods aim to predict the position and orientation of a ligand bound to a protein receptor or enzyme. - Ligand-binding simulation - Model protein-ligand (for example protein-peptide) binding using comparative modelling or other techniques. - Virtual ligand screening - - - - - - - - - - Structured RNA prediction and optimisation - - - - - - - - Nucleic acid folding family identification - RNA inverse folding - beta12orEarlier - Predict or optimise RNA sequences (sequence pools) with likely secondary and tertiary structure for in vitro selection. - - - - - - - - - - SNP detection - - - - Find single nucleotide polymorphisms (SNPs) between sequences. - Single nucleotide polymorphism detection - beta12orEarlier - This includes functional SNPs for large-scale genotyping purposes, disease-associated non-synonymous SNPs etc. - SNP discovery - - - - - - - - - - Radiation Hybrid Mapping - - - - - - - - Generate a physical (radiation hybrid) map of genetic markers in a DNA sequence using provided radiation hybrid (RH) scores for one or more markers. - beta12orEarlier - - - - - - - - - - Functional mapping - - beta12orEarlier - true - This can involve characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Map the genetic architecture of dynamic complex traits. 
- beta12orEarlier - - - - - - - - - - Haplotype mapping - - - - - - - - - Haplotype map generation - Haplotype inference - Infer haplotypes, either alleles at multiple loci that are transmitted together on the same chromosome, or a set of single nucleotide polymorphisms (SNPs) on a single chromatid that are statistically associated. - beta12orEarlier - Haplotype inference can help in population genetic studies and the identification of complex disease genes, , and is typically based on aligned single nucleotide polymorphism (SNP) fragments. Haplotype comparison is a useful way to characterize the genetic variation between individuals. An individual's haplotype describes which nucleotide base occurs at each position for a set of common SNPs. Tools might use combinatorial functions (for example parsimony) or a likelihood function or model with optimization such as minimum error correction (MEC) model, expectation-maximization algorithm (EM), genetic algorithm or Markov chain Monte Carlo (MCMC). - Haplotype reconstruction - - - - - - - - - - Linkage disequilibrium calculation - - - - - - - - beta12orEarlier - Linkage disequilibrium is identified where a combination of alleles (or genetic markers) occurs more or less frequently in a population than expected by chance formation of haplotypes. - Calculate linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Genetic code prediction - - - - - - - - - beta12orEarlier - Predict genetic code from analysis of codon usage data. - - - - - - - - - - Dotplot plotting - - - - - - - - - - beta12orEarlier - Draw a dotplot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Pairwise sequence alignment - - - - - - - - Pairwise sequence alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Align exactly two molecular sequences. 
- Pairwise sequence alignment construction - beta12orEarlier - - - - - - - - - - Multiple sequence alignment - - Multiple sequence alignment construction - Align two or more molecular sequences. - This includes methods that use an existing alignment, for example to incorporate sequences into an alignment, or combine several multiple alignments into a single, improved alignment. - beta12orEarlier - Multiple sequence alignment generation - - - - - - - - - - Pairwise sequence alignment generation (local) - - beta12orEarlier - Local pairwise sequence alignment construction - Locally align exactly two molecular sequences. - Pairwise sequence alignment (local) - true - Local alignment methods identify regions of local similarity. - 1.6 - Pairwise sequence alignment construction (local) - - - - - - - - - - - Pairwise sequence alignment generation (global) - - Pairwise sequence alignment construction (global) - Global pairwise sequence alignment construction - 1.6 - true - Globally align exactly two molecular sequences. - beta12orEarlier - Global alignment methods identify similarity across the entire length of the sequences. - Pairwise sequence alignment (global) - - - - - - - - - - - Local sequence alignment - - Multiple sequence alignment (local) - Local multiple sequence alignment construction - beta12orEarlier - Local alignment methods identify regions of local similarity. - Multiple sequence alignment construction (local) - Sequence alignment generation (local) - Sequence alignment (local) - Locally align two or more molecular sequences. - Smith-Waterman - - - - - - - - - - Global sequence alignment - - Global multiple sequence alignment construction - Multiple sequence alignment (global) - beta12orEarlier - Sequence alignment (global) - Multiple sequence alignment construction (global) - Globally align two or more molecular sequences. - Sequence alignment generation (global) - Global alignment methods identify similarity across the entire length of the sequences. 
- - - - - - - - - - Constrained sequence alignment - - beta12orEarlier - Align two or more molecular sequences with user-defined constraints. - Multiple sequence alignment construction (constrained) - Sequence alignment generation (constrained) - Multiple sequence alignment (constrained) - Sequence alignment (constrained) - Constrained multiple sequence alignment construction - - - - - - - - - - Consensus-based sequence alignment - - Consensus multiple sequence alignment construction - Sequence alignment (consensus) - beta12orEarlier - Align two or more molecular sequences using multiple methods to achieve higher quality. - Sequence alignment generation (consensus) - Multiple sequence alignment construction (consensus) - Multiple sequence alignment (consensus) - - - - - - - - - - Tree-based sequence alignment - - - - - - - - Sequence alignment generation (phylogenetic tree-based) - This is supposed to give a more biologically meaningful alignment than standard alignments. - beta12orEarlier - Phylogenetic tree-based multiple sequence alignment construction - Align multiple sequences using relative gap costs calculated from neighbors in a supplied phylogenetic tree. - Sequence alignment (phylogenetic tree-based) - Multiple sequence alignment construction (phylogenetic tree-based) - Multiple sequence alignment (phylogenetic tree-based) - - - - - - - - - - Secondary structure alignment generation - - beta12orEarlier - 1.6 - Secondary structure alignment construction - Secondary structure alignment - true - Align molecular secondary structure (represented as a 1D string). - - - - - - - - - - Protein secondary structure alignment generation - - - - - - - - - Protein secondary structure alignment construction - Align protein secondary structures. 
- beta12orEarlier - Secondary structure alignment (protein) - Protein secondary structure alignment - - - - - - - - - - RNA secondary structure alignment - - - - - - - - - - - - - - - RNA secondary structure alignment generation - Align RNA secondary structures. - RNA secondary structure alignment construction - Secondary structure alignment (RNA) - beta12orEarlier - - - - - - - - - - Pairwise structure alignment - - beta12orEarlier - Pairwise structure alignment generation - Pairwise structure alignment construction - Align (superimpose) exactly two molecular tertiary structures. - - - - - - - - - - Multiple structure alignment construction - - Align (superimpose) two or more molecular tertiary structures. - This includes methods that use an existing alignment. - 1.6 - true - Multiple structure alignment - beta12orEarlier - - - - - - - - - - Structure alignment (protein) - - beta13 - true - beta12orEarlier - Align protein tertiary structures. - - - - - - - - - - Structure alignment (RNA) - - beta13 - true - Align RNA tertiary structures. - beta12orEarlier - - - - - - - - - - Pairwise structure alignment generation (local) - - Locally align (superimpose) exactly two molecular tertiary structures. - Pairwise structure alignment (local) - Local alignment methods identify regions of local similarity, common substructures etc. - Pairwise structure alignment construction (local) - 1.6 - true - Local pairwise structure alignment construction - beta12orEarlier - - - - - - - - - - - Pairwise structure alignment generation (global) - - Global pairwise structure alignment construction - Global alignment methods identify similarity across the entire structures. - true - beta12orEarlier - 1.6 - Pairwise structure alignment construction (global) - Globally align (superimpose) exactly two molecular tertiary structures. 
- Pairwise structure alignment (global) - - - - - - - - - - - Local structure alignment - - Local multiple structure alignment construction - Local alignment methods identify regions of local similarity, common substructures etc. - Structure alignment construction (local) - beta12orEarlier - Locally align (superimpose) two or more molecular tertiary structures. - Multiple structure alignment construction (local) - Multiple structure alignment (local) - Structure alignment generation (local) - - - - - - - - - - Global structure alignment - - Structure alignment construction (global) - Multiple structure alignment (global) - Structure alignment generation (global) - Multiple structure alignment construction (global) - beta12orEarlier - Global alignment methods identify similarity across the entire structures. - Global multiple structure alignment construction - Globally align (superimpose) two or more molecular tertiary structures. - - - - - - - - - - Profile-to-profile alignment (pairwise) - - Sequence alignment generation (pairwise profile) - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Pairwise sequence profile alignment construction - Sequence profile alignment construction (pairwise) - Sequence profile alignment (pairwise) - beta12orEarlier - Align exactly two molecular profiles. - Sequence profile alignment generation (pairwise) - - - - - - - - - - Sequence alignment generation (multiple profile) - - Align two or more molecular profiles. - 1.6 - true - Sequence profile alignment generation (multiple) - beta12orEarlier - Sequence profile alignment (multiple) - Sequence profile alignment construction (multiple) - Multiple sequence profile alignment construction - - - - - - - - - - 3D profile-to-3D profile alignment (pairwise) - - Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Pairwise structural (3D) profile alignment construction - Structural (3D) profile alignment (pairwise) - Structural profile alignment construction (pairwise) - Align exactly two molecular Structural (3D) profiles. - beta12orEarlier - Structural profile alignment generation (pairwise) - - - - - - - - - - Structural profile alignment generation (multiple) - - true - Structural profile alignment construction (multiple) - Align two or more molecular 3D profiles. - Multiple structural (3D) profile alignment construction - beta12orEarlier - Structural (3D) profile alignment (multiple) - 1.6 - - - - - - - - - - Data retrieval (tool metadata) - - Data retrieval (tool annotation) - 1.6 - Search and retrieve names of or documentation on bioinformatics tools, for example by keyword or which perform a particular function. - beta12orEarlier - true - Tool information retrieval - - - - - - - - - - Data retrieval (database metadata) - - beta12orEarlier - true - Data retrieval (database annotation) - Search and retrieve names of or documentation on bioinformatics databases or query terms, for example by keyword. - Database information retrieval - 1.6 - - - - - - - - - - PCR primer design (for large scale sequencing) - - 1.13 - Predict primers for large scale sequencing. - beta12orEarlier - true - - - - - - - - - - PCR primer design (for genotyping polymorphisms) - - true - beta12orEarlier - Predict primers for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs). - 1.13 - - - - - - - - - - PCR primer design (for gene transcription profiling) - - Predict primers for gene transcription profiling. - beta12orEarlier - true - 1.13 - - - - - - - - - - PCR primer design (for conserved primers) - - 1.13 - Predict primers that are conserved across multiple genomes or species. 
- beta12orEarlier - true - - - - - - - - - - PCR primer design (based on gene structure) - - 1.13 - true - beta12orEarlier - - - - - - - - - - PCR primer design (for methylation PCRs) - - true - beta12orEarlier - Predict primers for methylation PCRs. - 1.13 - - - - - - - - - - Mapping assembly - - Sequence assembly by combining fragments using an existing backbone sequence, typically a reference genome. - beta12orEarlier - Sequence assembly (mapping assembly) - The final sequence will resemble the backbone sequence. Mapping assemblers are usually much faster and less memory intensive than de-novo assemblers. - - - - - - - - - - De-novo assembly - - De Bruijn graph - Sequence assembly by combining fragments without the aid of a reference sequence or genome. - Sequence assembly (de-novo assembly) - De-novo assemblers are much slower and more memory intensive than mapping assemblers. - beta12orEarlier - - - - - - - - - - Genome assembly - - The process of assembling many short DNA sequences together such thay they represent the original chromosomes from which the DNA originated. - beta12orEarlier - Sequence assembly (genome assembly) - - - - - - - - - - EST assembly - - beta12orEarlier - Sequence assembly (EST assembly) - Sequence assembly for EST sequences (transcribed mRNA). - Assemblers must handle (or be complicated by) alternative splicing, trans-splicing, single-nucleotide polymorphism (SNP), recoding, and post-transcriptional modification. - - - - - - - - - - Tag mapping - - - Tag mapping might assign experimentally obtained tags to known transcripts or annotate potential virtual tags in a genome. - Tag to gene assignment - Make gene to tag assignments (tag mapping) of SAGE, MPSS and SBS data, by annotating tags with ontology concepts. - beta12orEarlier - - - - - - - - - - SAGE data processing - - beta12orEarlier - Serial analysis of gene expression data processing - beta12orEarlier - Process (read and / or write) serial analysis of gene expression (SAGE) data. 
- true - - - - - - - - - - MPSS data processing - - beta12orEarlier - Process (read and / or write) massively parallel signature sequencing (MPSS) data. - true - Massively parallel signature sequencing data processing - beta12orEarlier - - - - - - - - - - SBS data processing - - beta12orEarlier - Sequencing by synthesis data processing - beta12orEarlier - Process (read and / or write) sequencing by synthesis (SBS) data. - true - - - - - - - - - - Heat map generation - - - - - - - - - beta12orEarlier - The heat map usually uses a coloring scheme to represent clusters. They can show how expression of mRNA by a set of genes was influenced by experimental conditions. - Heat map construction - Generate a heat map of gene expression from microarray data. - - - - - - - - - - Gene expression profile analysis - - true - Functional profiling - beta12orEarlier - Analyse one or more gene expression profiles, typically to interpret them in functional terms. - 1.6 - - - - - - - - - - Gene expression profile pathway mapping - - - - - - - - - - beta12orEarlier - Map a gene expression profile to known biological pathways, for example, to identify or reconstruct a pathway. - - - - - - - - - - Protein secondary structure assignment (from coordinate data) - - - beta12orEarlier - Assign secondary structure from protein coordinate data. - - - - - - - - - - Protein secondary structure assignment (from CD data) - - - - - - - - Assign secondary structure from circular dichroism (CD) spectroscopic data. - beta12orEarlier - - - - - - - - - - Protein structure assignment (from X-ray crystallographic data) - - true - 1.7 - Assign a protein tertiary structure (3D coordinates) from raw X-ray crystallography data. - beta12orEarlier - - - - - - - - - - Protein structure assignment (from NMR data) - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw NMR spectroscopy data. 
- true - 1.7 - - - - - - - - - - Phylogenetic tree generation (data centric) - - Phylogenetic tree construction (data centric) - beta12orEarlier - Construct a phylogenetic tree from a specific type of data. - - - - - - - - - - Phylogenetic tree generation (method centric) - - Phylogenetic tree construction (method centric) - Construct a phylogenetic tree using a specific method. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (from molecular sequences) - - - Phylogenetic tree construction from molecular sequences. - beta12orEarlier - Phylogenetic tree construction (from molecular sequences) - Methods typically compare multiple molecular sequence and estimate evolutionary distances and relationships to infer gene families or make functional predictions. - - - - - - - - - - Phylogenetic tree generation (from continuous quantitative characters) - - - - - - - - Phylogenetic tree construction (from continuous quantitative characters) - beta12orEarlier - Phylogenetic tree construction from continuous quantitative character data. - - - - - - - - - - Phylogenetic tree generation (from gene frequencies) - - - - - - - - - - - - - - Phylogenetic tree construction (from gene frequencies) - Phylogenetic tree construction from gene frequency data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree construction (from polymorphism data) - - - - - - - - Phylogenetic tree construction from polymorphism data including microsatellites, RFLP (restriction fragment length polymorphisms), RAPD (random-amplified polymorphic DNA) and AFLP (amplified fragment length polymorphisms) data. - Phylogenetic tree generation (from polymorphism data) - beta12orEarlier - - - - - - - - - - Phylogenetic species tree construction - - Construct a phylogenetic species tree, for example, from a genome-wide sequence comparison. 
- Phylogenetic species tree generation - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (parsimony methods) - - Phylogenetic tree construction (parsimony methods) - Construct a phylogenetic tree by computing a sequence alignment and searching for the tree with the fewest number of character-state changes from the alignment. - This includes evolutionary parsimony (invariants) methods. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (minimum distance methods) - - This includes neighbor joining (NJ) clustering method. - beta12orEarlier - Phylogenetic tree construction (minimum distance methods) - Construct a phylogenetic tree by computing (or using precomputed) distances between sequences and searching for the tree with minimal discrepancies between pairwise distances. - - - - - - - - - - Phylogenetic tree generation (maximum likelihood and Bayesian methods) - - Phylogenetic tree construction (maximum likelihood and Bayesian methods) - Construct a phylogenetic tree by relating sequence data to a hypothetical tree topology using a model of sequence evolution. - Maximum likelihood methods search for a tree that maximizes a likelihood function, i.e. that is most likely given the data and model. Bayesian analysis estimate the probability of tree for branch lengths and topology, typically using a Monte Carlo algorithm. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (quartet methods) - - beta12orEarlier - Phylogenetic tree construction (quartet methods) - Construct a phylogenetic tree by computing four-taxon trees (4-trees) and searching for the phylogeny that matches most closely. - - - - - - - - - - Phylogenetic tree generation (AI methods) - - Construct a phylogenetic tree by using artificial-intelligence methods, for example genetic algorithms. 
- Phylogenetic tree construction (AI methods) - beta12orEarlier - - - - - - - - - - DNA substitution modelling - - - - - - - - - - - - - - - Identify a plausible model of DNA substitution that explains a molecular (DNA or protein) sequence alignment. - Sequence alignment analysis (phylogenetic modelling) - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (shape) - - Phylogenetic tree topology analysis - Analyse the shape (topology) of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree bootstrapping - - - Apply bootstrapping or other measures to estimate confidence of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (gene family prediction) - - - - - - - - - - - - - - Predict families of genes and gene function based on their position in a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (natural selection) - - beta12orEarlier - Stabilizing/purifying (directional) selection favors a single phenotype and tends to decrease genetic diversity as a population stabilizes on a particular trait, selecting out trait extremes or deleterious mutations. In contrast, balancing selection maintain genetic polymorphisms (or multiple alleles), whereas disruptive (or diversifying) selection favors individuals at both extremes of a trait. - Analyse a phylogenetic tree to identify allele frequency distribution and change that is subject to evolutionary pressures (natural selection, genetic drift, mutation and gene flow). Identify type of natural selection (such as stabilizing, balancing or disruptive). - - - - - - - - - - Phylogenetic tree generation (consensus) - - - Compare two or more phylogenetic trees to produce a consensus tree. - Methods typically test for topological similarity between trees using for example a congruence index. 
- beta12orEarlier - Phylogenetic tree construction (consensus) - - - - - - - - - - Phylogenetic sub/super tree detection - - beta12orEarlier - Compare two or more phylogenetic trees to detect subtrees or supertrees. - - - - - - - - - - Phylogenetic tree distances calculation - - - - - - - - beta12orEarlier - Compare two or more phylogenetic trees to calculate distances between trees. - - - - - - - - - - Phylogenetic tree annotation - - beta12orEarlier - http://www.evolutionaryontology.org/cdao.owl#CDAOAnnotation - Annotate a phylogenetic tree with terms from a controlled vocabulary. - - - - - - - - - - Immunogenicity prediction - - true - 1.12 - beta12orEarlier - Peptide immunogen prediction - Predict and optimise peptide ligands that elicit an immunological response. - - - - - - - - - - DNA vaccine design - - - - - - - - beta12orEarlier - Predict or optimise DNA to elicit (via DNA vaccination) an immunological response. - - - - - - - - - - Sequence formatting - - 1.12 - beta12orEarlier - Reformat (a file or other report of) molecular sequence(s). - true - - - - - - - - - - Sequence alignment formatting - - Reformat (a file or other report of) molecular sequence alignment(s). - beta12orEarlier - true - 1.12 - - - - - - - - - - Codon usage table formatting - - Reformat a codon usage table. - true - beta12orEarlier - 1.12 - - - - - - - - - - Sequence visualisation - - - - - - - - - - - - - - - beta12orEarlier - Visualise, format or render a molecular sequence, possibly with sequence features or properties shown. - Sequence rendering - - - - - - - - - - Sequence alignment visualisation - - - - - - - - - - - - - - - Sequence alignment rendering - Visualise, format or print a molecular sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence cluster visualisation - - - - - - - - Sequence cluster rendering - beta12orEarlier - Visualise, format or render sequence clusters. 
- - - - - - - - - - Phylogenetic tree visualisation - - - - - - - - - Render or visualise a phylogenetic tree. - Phylogenetic tree rendering - beta12orEarlier - - - - - - - - - - RNA secondary structure visualisation - - - - - - - - - RNA secondary structure rendering - Visualise RNA secondary structure, knots, pseudoknots etc. - beta12orEarlier - - - - - - - - - - Protein secondary structure rendering - Protein secondary structure visualisation - - - - - - - - Render and visualise protein secondary structure. - beta12orEarlier - - - - - - - - - - Structure visualisation - - - - - - - - - - - - - - - Structure rendering - Visualise or render a molecular tertiary structure, for example a high-quality static picture or animation. - beta12orEarlier - - - - - - - - - - Microarray data rendering - - - - - - - - - - Visualise microarray data. - beta12orEarlier - - - - - - - - - - Protein interaction network rendering - Protein interaction network visualisation - - - - - - - - - beta12orEarlier - Identify and analyse networks of protein interactions. - - - - - - - - - - Map drawing - - - - - - - - beta12orEarlier - DNA map drawing - Map rendering - Draw or visualise a DNA map. - - - - - - - - - - Sequence motif rendering - - Render a sequence with motifs. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Restriction map drawing - - - - - - - - - Draw or visualise restriction maps in DNA sequences. - beta12orEarlier - - - - - - - - - - DNA linear map rendering - - beta12orEarlier - beta12orEarlier - true - Draw a linear maps of DNA. - - - - - - - - - - Plasmid map drawing - - beta12orEarlier - DNA circular map rendering - Draw a circular maps of DNA, for example a plasmid map. - - - - - - - - - - Operon drawing - - - - - - - - Visualise operon structure etc. - beta12orEarlier - Operon rendering - - - - - - - - - - Nucleic acid folding family identification - - true - beta12orEarlier - Identify folding families of related RNAs. 
- beta12orEarlier - - - - - - - - - - Nucleic acid folding energy calculation - - beta12orEarlier - Compute energies of nucleic acid folding, e.g. minimum folding energies for DNA or RNA sequences or energy landscape of RNA mutants. - - - - - - - - - - Annotation retrieval - - beta12orEarlier - Use this concepts for tools which retrieve pre-existing annotations, not for example prediction methods that might make annotations. - Retrieve existing annotation (or documentation), typically annotation on a database entity. - beta12orEarlier - true - - - - - - - - - - Protein function prediction - - - - - - - - - beta12orEarlier - Predict general functional properties of a protein. - For functional properties that can be mapped to a sequence, use 'Sequence feature detection (protein)' instead. - - - - - - - - - - Protein function comparison - - - - - - - - - Compare the functional properties of two or more proteins. - beta12orEarlier - - - - - - - - - - Sequence submission - - Submit a molecular sequence to a database. - beta12orEarlier - 1.6 - true - - - - - - - - - - Gene regulatory network analysis - - - - - - - - beta12orEarlier - Analyse a known network of gene regulation. - - - - - - - - - - - Loading - - - - - - - - Data loading - WHATIF:UploadPDB - Prepare or load a user-specified data file so that it is available for use. - beta12orEarlier - - - - - - - - - - Sequence retrieval - - This includes direct retrieval methods (e.g. the dbfetch program) but not those that perform calculations on the sequence. - Data retrieval (sequences) - 1.6 - Query a sequence data resource (typically a database) and retrieve sequences and / or annotation. - beta12orEarlier - true - - - - - - - - - - Structure retrieval - - true - WHATIF:EchoPDB - beta12orEarlier - WHATIF:DownloadPDB - This includes direct retrieval methods but not those that perform calculations on the sequence or structure. 
- Query a tertiary structure data resource (typically a database) and retrieve structures, structure-related data and annotation. - 1.6 - - - - - - - - - - Surface rendering - - - beta12orEarlier - WHATIF:GetSurfaceDots - Calculate the positions of dots that are homogeneously distributed over the surface of a molecule. - A dot has three coordinates (x,y,z) and (typically) a color. - - - - - - - - - - Protein atom surface calculation (accessible) - - beta12orEarlier - 1.12 - true - Calculate the solvent accessibility ('accessible surface') for each atom in a structure. - Waters are not considered. - - - - - - - - - - Protein atom surface calculation (accessible molecular) - - beta12orEarlier - 1.12 - Calculate the solvent accessibility ('accessible molecular surface') for each atom in a structure. - Waters are not considered. - true - - - - - - - - - - Protein residue surface calculation (accessible) - - true - 1.12 - beta12orEarlier - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('accessible surface') for each residue in a structure. - - - - - - - - - - Protein residue surface calculation (vacuum accessible) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('vacuum accessible surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. - 1.12 - true - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (accessible molecular) - - Calculate the solvent accessibility ('accessible molecular surface') for each residue in a structure. - true - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). 
- 1.12 - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (vacuum molecular) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - true - beta12orEarlier - Calculate the solvent accessibility ('vacuum molecular surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. - 1.12 - - - - - - - - - - Protein surface calculation (accessible molecular) - - true - 1.12 - beta12orEarlier - Calculate the solvent accessibility ('accessible molecular surface') for a structure as a whole. - - - - - - - - - - Protein surface calculation (accessible) - - Calculate the solvent accessibility ('accessible surface') for a structure as a whole. - beta12orEarlier - 1.12 - true - - - - - - - - - - Backbone torsion angle calculation - - 1.12 - beta12orEarlier - true - Calculate for each residue in a protein structure all its backbone torsion angles. - - - - - - - - - - Full torsion angle calculation - - 1.12 - beta12orEarlier - Calculate for each residue in a protein structure all its torsion angles. - true - - - - - - - - - - Cysteine torsion angle calculation - - beta12orEarlier - Calculate for each cysteine (bridge) all its torsion angles. - 1.12 - true - - - - - - - - - - Tau angle calculation - - beta12orEarlier - Tau is the backbone angle N-Calpha-C (angle over the C-alpha). - 1.12 - For each amino acid in a protein structure calculate the backbone angle tau. - true - - - - - - - - - - Cysteine bridge detection - - WHATIF:ShowCysteineBridge - Detect cysteine bridges (from coordinate data) in a protein structure. - beta12orEarlier - - - - - - - - - - Free cysteine detection - - beta12orEarlier - A free cysteine is neither involved in a cysteine bridge, nor functions as a ligand to a metal. - Detect free cysteines in a protein structure. 
- WHATIF:ShowCysteineFree - - - - - - - - - - Metal-bound cysteine detection - - - beta12orEarlier - WHATIF:ShowCysteineMetal - Detect cysteines that are bound to metal in a protein structure. - - - - - - - - - - Residue contact calculation (residue-nucleic acid) - - beta12orEarlier - 1.12 - true - Calculate protein residue contacts with nucleic acids in a structure. - - - - - - - - - - Protein-metal contact calculation - - beta12orEarlier - Calculate protein residue contacts with metal in a structure. - Residue-metal contact calculation - - - - - - - - - - Residue contact calculation (residue-negative ion) - - Calculate ion contacts in a structure (all ions for all side chain atoms). - beta12orEarlier - true - 1.12 - - - - - - - - - - Residue bump detection - - WHATIF:ShowBumps - beta12orEarlier - Detect 'bumps' between residues in a structure, i.e. those with pairs of atoms whose Van der Waals' radii interpenetrate more than a defined distance. - - - - - - - - - - Residue symmetry contact calculation - - Calculate the number of symmetry contacts made by residues in a protein structure. - true - 1.12 - WHATIF:SymmetryContact - A symmetry contact is a contact between two atoms in different asymmetric unit. - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-ligand) - - true - beta12orEarlier - 1.12 - Calculate contacts between residues and ligands in a protein structure. - - - - - - - - - - Salt bridge calculation - - Salt bridges are interactions between oppositely charged atoms in different residues. The output might include the inter-atomic distance. - WHATIF:HasSaltBridgePlus - WHATIF:ShowSaltBridges - beta12orEarlier - WHATIF:HasSaltBridge - WHATIF:ShowSaltBridgesH - Calculate (and possibly score) salt bridges in a protein structure. 
- - - - - - - - - - Rotamer likelihood prediction - - WHATIF:ShowLikelyRotamers - WHATIF:ShowLikelyRotamers500 - 1.12 - Predict rotamer likelihoods for all 20 amino acid types at each position in a protein structure. - WHATIF:ShowLikelyRotamers600 - WHATIF:ShowLikelyRotamers800 - WHATIF:ShowLikelyRotamers900 - true - Output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods. - WHATIF:ShowLikelyRotamers700 - WHATIF:ShowLikelyRotamers400 - WHATIF:ShowLikelyRotamers300 - WHATIF:ShowLikelyRotamers200 - WHATIF:ShowLikelyRotamers100 - beta12orEarlier - - - - - - - - - - Proline mutation value calculation - - true - 1.12 - Calculate for each position in a protein structure the chance that a proline, when introduced at this position, would increase the stability of the whole protein. - WHATIF:ProlineMutationValue - beta12orEarlier - - - - - - - - - - Residue packing validation - - beta12orEarlier - Identify poorly packed residues in protein structures. - WHATIF: PackingQuality - - - - - - - - - - Protein geometry validation - - WHATIF: ImproperQualitySum - beta12orEarlier - Validate protein geometry, for example bond lengths, bond angles, torsion angles, chiralities, planaraties etc. - WHATIF: ImproperQualityMax - - - - - - - - - - PDB file sequence retrieval - - Extract a molecular sequence from a PDB file. - beta12orEarlier - WHATIF: PDB_sequence - true - beta12orEarlier - - - - - - - - - - HET group detection - - true - Identify HET groups in PDB files. - beta12orEarlier - 1.12 - A HET group usually corresponds to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids. Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. - - - - - - - - - - DSSP secondary structure assignment - - Determine for residue the DSSP determined secondary structure in three-state (HSC). 
- beta12orEarlier - WHATIF: ResidueDSSP - beta12orEarlier - true - - - - - - - - - - Structure formatting - - 1.12 - true - Reformat (a file or other report of) tertiary structure data. - beta12orEarlier - WHATIF: PDBasXML - - - - - - - - - - Protein cysteine and disulfide bond assignment - - - - - - - - Assign cysteine bonding state and disulfide bond partners in protein structures. - beta12orEarlier - - - - - - - - - - Residue validation - - 1.12 - Identify poor quality amino acid positions in protein structures. - beta12orEarlier - true - - - - - - - - - - Structure retrieval (water) - - beta12orEarlier - 1.6 - WHATIF:MovedWaterPDB - true - Query a tertiary structure database and retrieve water molecules. - - - - - - - - - - siRNA duplex prediction - - - - - - - - beta12orEarlier - Identify or predict siRNA duplexes in RNA. - - - - - - - - - - Sequence alignment refinement - - - Refine an existing sequence alignment. - beta12orEarlier - - - - - - - - - - Listfile processing - - 1.6 - Process an EMBOSS listfile (list of EMBOSS Uniform Sequence Addresses). - true - beta12orEarlier - - - - - - - - - - Sequence file editing - - - beta12orEarlier - Perform basic (non-analytical) operations on a report or file of sequences (which might include features), such as file concatenation, removal or ordering of sequences, creation of subset or a new file of sequences. - - - - - - - - - - Sequence alignment file processing - - beta12orEarlier - Perform basic (non-analytical) operations on a sequence alignment file, such as copying or removal and ordering of sequences. - 1.6 - true - - - - - - - - - - Small molecule data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) physicochemical property data for small molecules. - - - - - - - - - - Data retrieval (ontology annotation) - - beta13 - Ontology information retrieval - true - Search and retrieve documentation on a bioinformatics ontology. 
- beta12orEarlier - - - - - - - - - - Data retrieval (ontology concept) - - Query an ontology and retrieve concepts or relations. - true - beta13 - beta12orEarlier - Ontology retrieval - - - - - - - - - - Representative sequence identification - - Identify a representative sequence from a set of sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - beta12orEarlier - - - - - - - - - - Structure file processing - - Perform basic (non-analytical) operations on a file of molecular tertiary structural data. - 1.6 - beta12orEarlier - true - - - - - - - - - - Data retrieval (sequence profile) - - Query a profile data resource and retrieve one or more profile(s) and / or associated annotation. - true - This includes direct retrieval methods that retrieve a profile by, e.g. the profile name. - beta13 - beta12orEarlier - - - - - - - - - - Statistical calculation - - Statistics - Statistical testing - Statistical analysis - Perform a statistical data operation of some type, e.g. calibration or validation. - Gibbs sampling - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix generation - - - - - - - - - - - - - - - - beta12orEarlier - 3D-1D scoring matrix construction - A 3D-1D scoring matrix scores the probability of amino acids occurring in different structural environments. - Calculate a 3D-1D scoring matrix from analysis of protein sequence and structural data. - - - - - - - - - - Transmembrane protein visualisation - - - - - - - - - Visualise transmembrane proteins, typically the transmembrane regions within a sequence. - beta12orEarlier - Transmembrane protein rendering - - - - - - - - - - Demonstration - - beta12orEarlier - true - An operation performing purely illustrative (pedagogical) purposes. - beta13 - - - - - - - - - - Data retrieval (pathway or network) - - beta12orEarlier - true - Query a biological pathways database and retrieve annotation on one or more pathways. 
- beta13 - - - - - - - - - - Data retrieval (identifier) - - beta12orEarlier - Query a database and retrieve one or more data identifiers. - beta13 - true - - - - - - - - - - Nucleic acid density plotting - - - beta12orEarlier - Calculate a density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence analysis - - - - - - - - Analyse one or more known molecular sequences. - beta12orEarlier - Sequence analysis (general) - - - - - - - - - - Sequence motif analysis - - Analyse molecular sequence motifs. - beta12orEarlier - Sequence motif processing - - - - - - - - - - Protein interaction data processing - - 1.6 - Process (read and / or write) protein interaction data. - true - beta12orEarlier - - - - - - - - - - Protein structure analysis - - - - - - - - - - - - - - - Structure analysis (protein) - beta12orEarlier - Analyse protein tertiary structural data. - - - - - - - - - - Annotation processing - - true - beta12orEarlier - beta12orEarlier - Process (read and / or write) annotation of some type, typically annotation on an entry from a biological or biomedical database entity. - - - - - - - - - - Sequence feature analysis - - beta12orEarlier - true - Analyse features in molecular sequences. - beta12orEarlier - - - - - - - - - - Data handling - - - - - - - - beta12orEarlier - File processing - Report handling - File handling - Utility operation - Processing - Basic (non-analytical) operations of some data, either a file or equivalent entity in memory, such that the same basic type of data is consumed as input and generated as output. - - - - - - - - - - Gene expression analysis - - Analyse gene expression and regulation data. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structural profile processing - - beta12orEarlier - 1.6 - Process (read and / or write) one or more structural (3D) profile(s) or template(s) of some type. 
- 3D profile processing - true - - - - - - - - - - Data index processing - - Database index processing - true - Process (read and / or write) an index of (typically a file of) biological data. - 1.6 - beta12orEarlier - - - - - - - - - - Sequence profile processing - - true - beta12orEarlier - Process (read and / or write) some type of sequence profile. - 1.6 - - - - - - - - - - Protein function analysis - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - beta12orEarlier - Analyse protein function, typically by processing protein sequence and/or structural data, and generate an informative report. - - - - - - - - - - Protein folding analysis - - - - - - - - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - Analyse protein folding, typically by processing sequence and / or structural data, and write an informative report. - Protein folding modelling - beta12orEarlier - - - - - - - - - - Protein secondary structure analysis - - - - - - - - - - - - - - Analyse known protein secondary structure data. - beta12orEarlier - Secondary structure analysis (protein) - - - - - - - - - - Physicochemical property data processing - - beta13 - true - Process (read and / or write) data on the physicochemical property of a molecule. - beta12orEarlier - - - - - - - - - - Primer and probe design - - - - - - - - - Primer and probe prediction - beta12orEarlier - Predict oligonucleotide primers or probes. - - - - - - - - - - Operation (typed) - - true - Process (read and / or write) data of a specific type, for example applying analytical methods. - beta12orEarlier - 1.12 - - - - - - - - - - Database search - - - - - - - - beta12orEarlier - Typically the query is compared to each entry and high scoring matches (hits) are returned. For example, a BLAST search of a sequence database. 
- Search a database (or other data resource) with a supplied query and retrieve entries (or parts of entries) that are similar to the query. - Search - - - - - - - - - - Data retrieval - - - - - - - - Information retrieval - beta12orEarlier - Retrieve an entry (or part of an entry) from a data resource that matches a supplied query. This might include some primary data and annotation. The query is a data identifier or other indexed term. For example, retrieve a sequence record with the specified accession number, or matching supplied keywords. - Retrieval - - - - - - - - - - Prediction and recognition - - beta12orEarlier - Recognition - Prediction - Predict, recognise, detect or identify some properties of a biomolecule. - Detection - - - - - - - - - - Comparison - - beta12orEarlier - Compare two or more things to identify similarities. - - - - - - - - - - Optimisation and refinement - - beta12orEarlier - Refine or optimise some data model. - - - - - - - - - - Modelling and simulation - - - - - - - - beta12orEarlier - Model or simulate some biological entity or system, typically using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - Mathematical modelling - - - - - - - - - - Data handling - - true - beta12orEarlier - Perform basic operations on some data or a database. - beta12orEarlier - - - - - - - - - - Validation - - beta12orEarlier - Validation and standardisation - Quality control - Validate some data. - - - - - - - - - - Mapping - - This is a broad concept and is used a placeholder for other, more specific concepts. - Map properties to positions on an biological entity (typically a molecular sequence or structure), or assemble such an entity from constituent parts. - beta12orEarlier - - - - - - - - - - Design - - beta12orEarlier - Design a biological entity (typically a molecular sequence or structure) with specific properties. 
- - - - - - - - - - Microarray data processing - - beta12orEarlier - Process (read and / or write) microarray data. - beta12orEarlier - true - - - - - - - - - - Codon usage table processing - - Process (read and / or write) a codon usage table. - beta12orEarlier - - - - - - - - - - Data retrieval (codon usage table) - - Retrieve a codon usage table and / or associated annotation. - beta12orEarlier - true - beta13 - - - - - - - - - - Gene expression profile processing - - 1.6 - Process (read and / or write) a gene expression profile. - true - beta12orEarlier - - - - - - - - - - Functional enrichment - - - - - - - - - Analyse a set of genes (genes corresponding to an expression profile, or any other set) to find functional annotations (such as cellular processes or metaobolic pathways) that the sets are significantly associated with, providing biological insight into the a set of genes. - beta12orEarlier - The Gene Ontology (GO) is invariably used, the input is a set of Gene IDs and the output of the analysis is typically a ranked list of GO terms, each associated with a p-value. - GO term enrichment - - - - - - - - - - Gene regulatory network prediction - - - - - - - - - - - - - - - Predict a network of gene regulation. - beta12orEarlier - - - - - - - - - - Pathway or network processing - - Generate, analyse or handle a biological pathway or network. - beta12orEarlier - true - 1.12 - - - - - - - - - - RNA secondary structure analysis - - - - - - - - beta12orEarlier - Process (read and / or write) RNA secondary structure data. - - - - - - - - - - Structure processing (RNA) - - Process (read and / or write) RNA tertiary structure data. - beta12orEarlier - beta13 - true - - - - - - - - - - RNA structure prediction - - - - - - - - beta12orEarlier - Predict RNA tertiary structure. - - - - - - - - - - DNA structure prediction - - - - - - - - Predict DNA tertiary structure. 
- beta12orEarlier - - - - - - - - - - Phylogenetic tree processing - - beta12orEarlier - 1.12 - true - Generate, process or analyse phylogenetic tree or trees. - - - - - - - - - - Protein secondary structure processing - - Process (read and / or write) protein secondary structure data. - 1.6 - true - beta12orEarlier - - - - - - - - - - Protein interaction network processing - - true - beta12orEarlier - Process (read and / or write) a network of protein interactions. - 1.6 - - - - - - - - - - Sequence processing - - Sequence processing (general) - Process (read and / or write) one or more molecular sequences and associated annotation. - true - beta12orEarlier - 1.6 - - - - - - - - - - Sequence processing (protein) - - Process (read and / or write) a protein sequence and associated annotation. - beta12orEarlier - true - 1.6 - - - - - - - - - - Sequence processing (nucleic acid) - - 1.6 - true - beta12orEarlier - Process (read and / or write) a nucleotide sequence and associated annotation. - - - - - - - - - - Sequence comparison - - - - - - - - - - - - - - - Compare two or more molecular sequences. - beta12orEarlier - - - - - - - - - - Sequence cluster processing - - Process (read and / or write) a sequence cluster. - true - beta12orEarlier - 1.6 - - - - - - - - - - Feature table processing - - Process (read and / or write) a sequence feature table. - 1.6 - true - beta12orEarlier - - - - - - - - - - Gene prediction - - - - - - - - - - - - - - Gene component prediction - Detect, predict and identify genes or components of genes in DNA sequences, including promoters, coding regions, splice sites, etc. - Whole gene prediction - Gene and gene component prediction - beta12orEarlier - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. 
- Gene finding - - - - - - - - - - GPCR classification - - - - - - - - - beta12orEarlier - G protein-coupled receptor (GPCR) classification - Classify G-protein coupled receptors (GPCRs) into families and subfamilies. - - - - - - - - - - GPCR coupling selectivity prediction - - - - - - - - - - Predict G-protein coupled receptor (GPCR) coupling selectivity. - beta12orEarlier - - - - - - - - - - Structure processing (protein) - - true - 1.6 - beta12orEarlier - Process (read and / or write) a protein tertiary structure. - - - - - - - - - - Protein atom surface calculation - - Waters are not considered. - Calculate the solvent accessibility for each atom in a structure. - beta12orEarlier - 1.12 - true - - - - - - - - - - Protein residue surface calculation - - beta12orEarlier - true - Calculate the solvent accessibility for each residue in a structure. - 1.12 - - - - - - - - - - Protein surface calculation - - beta12orEarlier - Calculate the solvent accessibility of a structure as a whole. - 1.12 - true - - - - - - - - - - Sequence alignment processing - - beta12orEarlier - true - Process (read and / or write) a molecular sequence alignment. - 1.6 - - - - - - - - - - Protein-protein interaction prediction - - - - - - - - - - - - - - - Identify or predict protein-protein interactions, interfaces, binding sites etc. - beta12orEarlier - - - - - - - - - - Structure processing - - true - 1.6 - Process (read and / or write) a molecular tertiary structure. - beta12orEarlier - - - - - - - - - - Map annotation - - Annotate a DNA map of some type with terms from a controlled vocabulary. - true - beta12orEarlier - 1.6 - - - - - - - - - - Data retrieval (protein annotation) - - Retrieve information on a protein. - beta13 - true - Protein information retrieval - beta12orEarlier - - - - - - - - - - Data retrieval (phylogenetic tree) - - beta12orEarlier - beta13 - Retrieve a phylogenetic tree from a data resource. 
- true - - - - - - - - - - Data retrieval (protein interaction annotation) - - Retrieve information on a protein interaction. - true - beta13 - beta12orEarlier - - - - - - - - - - Data retrieval (protein family annotation) - - beta12orEarlier - Protein family information retrieval - beta13 - Retrieve information on a protein family. - true - - - - - - - - - - Data retrieval (RNA family annotation) - - true - Retrieve information on an RNA family. - RNA family information retrieval - beta12orEarlier - beta13 - - - - - - - - - - Data retrieval (gene annotation) - - beta12orEarlier - Gene information retrieval - Retrieve information on a specific gene. - true - beta13 - - - - - - - - - - Data retrieval (genotype and phenotype annotation) - - Retrieve information on a specific genotype or phenotype. - Genotype and phenotype information retrieval - beta12orEarlier - beta13 - true - - - - - - - - - - Protein architecture comparison - - - Compare the architecture of two or more protein structures. - beta12orEarlier - - - - - - - - - - Protein architecture recognition - - - - beta12orEarlier - Includes methods that try to suggest the most likely biological unit for a given protein X-ray crystal structure based on crystal symmetry and scoring of putative protein-protein interfaces. - Identify the architecture of a protein structure. - - - - - - - - - - Molecular dynamics simulation - - - - - - - - - - - - - - - - - - - - - - Simulate molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence analysis - - - - - - - - - - - - - - - Analyse a nucleic acid sequence (using methods that are only applicable to nucleic acid sequences). - beta12orEarlier - Sequence analysis (nucleic acid) - - - - - - - - - - Protein sequence analysis - - - - - - - - - Analyse a protein sequence (using methods that are only applicable to protein sequences). 
- Sequence analysis (protein) - beta12orEarlier - - - - - - - - - - Structure analysis - - - - - - - - beta12orEarlier - Analyse known molecular tertiary structures. - - - - - - - - - - Nucleic acid structure analysis - - - - - - - - - - - - - - - Analyse nucleic acid tertiary structural data. - beta12orEarlier - - - - - - - - - - Secondary structure processing - - 1.6 - Process (read and / or write) a molecular secondary structure. - true - beta12orEarlier - - - - - - - - - - Structure comparison - - - - - - - - - beta12orEarlier - Compare two or more molecular tertiary structures. - - - - - - - - - - Helical wheel drawing - - - - - - - - Helical wheel rendering - beta12orEarlier - Render a helical wheel representation of protein secondary structure. - - - - - - - - - - Topology diagram drawing - - - - - - - - Topology diagram rendering - beta12orEarlier - Render a topology diagram of protein secondary structure. - - - - - - - - - - Protein structure comparison - - - - - - - - - - beta12orEarlier - Structure comparison (protein) - Methods might identify structural neighbors, find structural similarities or define a structural core. - Compare protein tertiary structures. - - - - - - - - - - Protein secondary structure comparison - - - - Compare protein secondary structures. - beta12orEarlier - Secondary structure comparison (protein) - Protein secondary structure - - - - - - - - - - Protein subcellular localization prediction - - - - - - - - - The prediction might include subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or export (extracellular proteins) of a protein. - Predict the subcellular localization of a protein sequence. - Protein targeting prediction - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-residue) - - true - beta12orEarlier - Calculate contacts between residues in a protein structure. 
- 1.12 - - - - - - - - - - Hydrogen bond calculation (inter-residue) - - Identify potential hydrogen bonds between amino acid residues. - 1.12 - true - beta12orEarlier - - - - - - - - - - Protein interaction prediction - - - - - - - - - - - - - - - Predict the interactions of proteins with other molecules. - beta12orEarlier - - - - - - - - - - Codon usage data processing - - beta12orEarlier - beta13 - Process (read and / or write) codon usage data. - true - - - - - - - - - - Gene expression data analysis - - - - - - - - Gene expression (microarray) data processing - Gene expression profile analysis - beta12orEarlier - Microarray data processing - Gene expression data processing - Gene expression analysis - Process (read and / or write) gene expression (typically microarray) data, including analysis of one or more gene expression profiles, typically to interpret them in functional terms. - - - - - - - - - - Gene regulatory network processing - - 1.6 - beta12orEarlier - Process (read and / or write) a network of gene regulation. - true - - - - - - - - - - Pathway or network analysis - - - - - - - - Pathway analysis - Generate, process or analyse a biological pathway or network. - Network analysis - beta12orEarlier - - - - - - - - - - Sequencing-based expression profile data analysis - - Analyse SAGE, MPSS or SBS experimental data, typically to identify or quantify mRNA transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Splicing model analysis - - - - - - - - - - Analyse, characterize and model alternative splicing events from comparing multiple nucleic acid sequences. - Splicing analysis - beta12orEarlier - - - - - - - - - - Microarray raw data analysis - - beta12orEarlier - beta12orEarlier - true - Analyse raw microarray data. - - - - - - - - - - Nucleic acid analysis - - - - - - - - Process (read and / or write) nucleic acid sequence or structural data. 
- Nucleic acid data processing - beta12orEarlier - - - - - - - - - - Protein analysis - - - - - - - - beta12orEarlier - Protein data processing - Process (read and / or write) protein sequence or structural data. - - - - - - - - - - Sequence data processing - - beta12orEarlier - Process (read and / or write) molecular sequence data. - beta13 - true - - - - - - - - - - Structural data processing - - Process (read and / or write) molecular structural data. - beta13 - true - beta12orEarlier - - - - - - - - - - Text processing - - true - beta12orEarlier - Process (read and / or write) text. - 1.6 - - - - - - - - - - Protein sequence alignment analysis - - - - - - - - - - Analyse a protein sequence alignment, typically to detect features or make predictions. - beta12orEarlier - Sequence alignment analysis (protein) - - - - - - - - - - Nucleic acid sequence alignment analysis - - - - - - - - - - beta12orEarlier - Sequence alignment analysis (nucleic acid) - Analyse a nucleic acid sequence alignment, typically to detect features or make predictions. - - - - - - - - - - Nucleic acid sequence comparison - - - - Sequence comparison (nucleic acid) - Compare two or more nucleic acid sequences. - beta12orEarlier - - - - - - - - - - Protein sequence comparison - - - - beta12orEarlier - Sequence comparison (protein) - Compare two or more protein sequences. - - - - - - - - - - DNA back-translation - - - - - - - - beta12orEarlier - Back-translate a protein sequence into DNA. - - - - - - - - - - Sequence editing (nucleic acid) - - 1.8 - true - Edit or change a nucleic acid sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence editing (protein) - - Edit or change a protein sequence, either randomly or specifically. - beta12orEarlier - true - 1.8 - - - - - - - - - - Sequence generation (nucleic acid) - - Generate a nucleic acid sequence by some means.
- beta12orEarlier - - - - - - - - - - Sequence generation (protein) - - - Generate a protein sequence by some means. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence visualisation - - Visualise, format or render a nucleic acid sequence. - true - Various nucleic acid sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - 1.8 - beta12orEarlier - - - - - - - - - - Protein sequence visualisation - - true - beta12orEarlier - Visualise, format or render a protein sequence. - 1.8 - Various protein sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - - - - - - - - - - Nucleic acid structure comparison - - - - Compare nucleic acid tertiary structures. - beta12orEarlier - Structure comparison (nucleic acid) - - - - - - - - - - Structure processing (nucleic acid) - - 1.6 - beta12orEarlier - true - Process (read and / or write) nucleic acid tertiary structure data. - - - - - - - - - - - DNA mapping - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Generate a map of a DNA sequence annotated with positional or non-positional features of some type. - - - - - - - - - - Map data processing - - DNA map data processing - Process (read and / or write) a DNA map of some type. - beta12orEarlier - true - 1.6 - - - - - - - - - - Protein hydropathy calculation - - - - - - - - - - - - - - beta12orEarlier - Analyse the hydrophobic, hydrophilic or charge properties of a protein (from analysis of sequence or structural information). - - - - - - - - - - Protein binding site prediction - - - - - - - - - beta12orEarlier - Active site prediction - Binding site prediction - Protein binding site detection - Ligand-binding site prediction - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein sequences or structures. 
- - - - - - - - - - Sequence tagged site (STS) mapping - - - - - - - - beta12orEarlier - Sequence mapping - An STS is a short subsequence of known sequence and location that occurs only once in the chromosome or genome that is being mapped. Sources of STSs include 1. expressed sequence tags (ESTs), simple sequence length polymorphisms (SSLPs), and random genomic sequences from cloned genomic DNA or database sequences. - Generate a physical DNA map (sequence map) from analysis of sequence tagged sites (STS). - - - - - - - - - - Alignment - - - - - - - - - Compare two or more entities, typically the sequence or structure (or derivatives) of macromolecules, to identify equivalent subunits. - Alignment generation - beta12orEarlier - Alignment construction - - - - - - - - - - Protein fragment weight comparison - - - Calculate the molecular weight of a protein (or fragments) and compare it to another protein or reference data. Generally used for protein identification. - Peptide mass fingerprinting - Protein fingerprinting - beta12orEarlier - PMF - - - - - - - - - - Protein property comparison - - - - - - - - Compare the physicochemical properties of two or more proteins (or reference data). - beta12orEarlier - - - - - - - - - - Secondary structure comparison - - - - - - - - Compare two or more molecular secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plotting - - beta12orEarlier - 1.12 - Generate a Hopp and Woods plot of antigenicity of a protein. - true - - - - - - - - - - Microarray cluster textual view generation - - beta12orEarlier - Visualise gene clusters with gene names. - - - - - - - - - - Microarray wave graph plotting - - Microarray wave graph rendering - Microarray cluster temporal graph rendering - beta12orEarlier - This view can be rendered as a pie graph. The distance matrix is sorted by cluster number and typically represented as a diagonal matrix with distance values displayed in different color shades. 
- Visualise clustered gene expression data as a set of waves, where each wave corresponds to a gene across samples on the X-axis. - - - - - - - - - - Microarray dendrograph plotting - - Microarray dendrograph rendering - Generate a dendrograph of raw, preprocessed or clustered microarray data. - beta12orEarlier - Microarray checks view rendering - Microarray view rendering - - - - - - - - - - Microarray proximity map plotting - - beta12orEarlier - Microarray distance map rendering - Generate a plot of distances (distance matrix) between genes. - Microarray proximity map rendering - - - - - - - - - - Microarray tree or dendrogram rendering - - Microarray 2-way dendrogram rendering - beta12orEarlier - Visualise clustered gene expression data using a gene tree, array tree and color coded band of gene expression. - Microarray matrix tree plot rendering - - - - - - - - - - Microarray principal component plotting - - beta12orEarlier - Microarray principal component rendering - Generate a line graph drawn as sum of principal components (Eigen value) and individual expression values. - - - - - - - - - - Microarray scatter plot plotting - - Generate a scatter plot of microarray data, typically after principal component analysis. - beta12orEarlier - Microarray scatter plot rendering - - - - - - - - - - Whole microarray graph plotting - - Visualise gene expression data where each band (or line graph) corresponds to a sample. - beta12orEarlier - Whole microarray graph rendering - - - - - - - - - - Microarray tree-map rendering - - beta12orEarlier - Visualise gene expression data after hierarchical clustering for representing hierarchical relationships. - - - - - - - - - - Microarray Box-Whisker plot plotting - - beta12orEarlier - Visualise raw and pre-processed gene expression data, via a plot showing over- and under-expression along with mean, upper and lower quartiles. 
- - - - - - - - - - Physical mapping - - - - - - - - - - - - - - beta12orEarlier - Generate a physical (sequence) map of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. - - - - - - - - - - Analysis - - Apply analytical methods to existing data of a specific type. - This excludes non-analytical methods that read and write the same basic type of data (for that, see 'Data handling'). - beta12orEarlier - - - - - - - - - - Alignment analysis - - Process or analyse an alignment of molecular sequences or structures. - true - beta12orEarlier - 1.8 - - - - - - - - - - Article analysis - - - - - - - - - - - - - - - - - - - - Analyse a body of scientific text (typically a full text article from a scientific journal.) - beta12orEarlier - - - - - - - - - - Molecular interaction analysis - - Analyse the interactions of two or more molecules (or parts of molecules) that are known to interact. - beta12orEarlier - beta13 - true - - - - - - - - - - Protein interaction analysis - - - - - - - - - - - - - - beta12orEarlier - Analyse known protein-protein, protein-DNA/RNA or protein-ligand interactions. - - - - - - - - - - Residue distance calculation - - WHATIF:HasNegativeIonContacts - Residue contact calculation (residue-ligand) - Residue contact calculation (residue-metal) - WHATIF:SymmetryContact - Residue contact calculation (residue-negative ion) - This includes identifying HET groups, which usually correspond to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids. Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. It can also include calculation of symmetry contacts, i.e. a contact between two atoms in different asymmetric unit. 
- WHATIF:HasMetalContactsPlus - Calculate contacts between residues, or between residues and other groups, in a protein structure, on the basis of distance calculations. - Residue contact calculation (residue-nucleic acid) - WHATIF: HETGroupNames - HET group detection - WHATIF:ShowDrugContacts - WHATIF:ShowLigandContacts - WHATIF:HasNucleicContacts - WHATIF:ShowDrugContactsShort - WHATIF:ShowProteiNucleicContacts - beta12orEarlier - WHATIF:HasMetalContacts - WHATIF:HasNegativeIonContactsPlus - - - - - - - - - - Alignment processing - - true - Process (read and / or write) an alignment of two or more molecular sequences, structures or derived data. - 1.6 - beta12orEarlier - - - - - - - - - - - Structure alignment processing - - Process (read and / or write) a molecular tertiary (3D) structure alignment. - 1.6 - beta12orEarlier - true - - - - - - - - - - Codon usage bias calculation - - - - - - - - Calculate codon usage bias. - beta12orEarlier - - - - - - - - - - Codon usage bias plotting - - - - - - - - - beta12orEarlier - Generate a codon usage bias plot. - - - - - - - - - - Codon usage fraction calculation - - - - - - - - Calculate the differences in codon usage fractions between two sequences, sets of sequences, codon usage tables etc. - beta12orEarlier - - - - - - - - - - Classification - - beta12orEarlier - Assign molecular sequences, structures or other biological data to a specific group or category according to qualities it shares with that group or category. - - - - - - - - - - Molecular interaction data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) molecular interaction data. - - - - - - - - - - Sequence classification - - - beta12orEarlier - Assign molecular sequence(s) to a group or category. - - - - - - - - - - Structure classification - - - Assign molecular structure(s) to a group or category. 
- beta12orEarlier - - - - - - - - - - Protein comparison - - Compare two or more proteins (or some aspect) to identify similarities. - beta12orEarlier - - - - - - - - - - Nucleic acid comparison - - beta12orEarlier - Compare two or more nucleic acids to identify similarities. - - - - - - - - - - Prediction and recognition (protein) - - beta12orEarlier - Predict, recognise, detect or identify some properties of proteins. - - - - - - - - - - Prediction and recognition (nucleic acid) - - beta12orEarlier - Predict, recognise, detect or identify some properties of nucleic acids. - - - - - - - - - - Structure editing - - - - - - - - beta13 - Edit, convert or otherwise change a molecular tertiary structure, either randomly or specifically. - - - - - - - - - - Sequence alignment editing - - Edit, convert or otherwise change a molecular sequence alignment, either randomly or specifically. - beta13 - - - - - - - - - - Pathway or network visualisation - - - - - - - - - Render (visualise) a biological pathway or network. - Pathway or network rendering - beta13 - - - - - - - - - - Protein function prediction (from sequence) - - beta13 - true - Predict general (non-positional) functional properties of a protein from analysing its sequence. - For functional properties that are positional, use 'Protein site detection' instead. - 1.6 - - - - - - - - - - Protein sequence feature detection - - - - Protein site recognition - Predict, recognise and identify functional or other key sites within protein sequences, typically by scanning for known motifs, patterns and regular expressions. 
- Protein site prediction - Sequence profile database search - Protein site detection - Protein secondary database search - Sequence feature detection (protein) - beta13 - - - - - - - - - - Protein property calculation (from sequence) - - - beta13 - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - - - - - - - - - - Protein feature prediction (from structure) - - beta13 - 1.6 - true - Predict, recognise and identify positional features in proteins from analysing protein structure. - - - - - - - - - - Protein feature detection - - - - - - - - - - - - - - - Features includes functional sites or regions, secondary structure, structural domains and so on. Methods might use fingerprints, motifs, profiles, hidden Markov models, sequence alignment etc to provide a mapping of a query protein sequence to a discriminatory element. This includes methods that search a secondary protein database (Prosite, Blocks, ProDom, Prints, Pfam etc.) to assign a protein sequence(s) to a known protein family or group. - - Predict, recognise and identify positional features in proteins from analysing protein sequences or structures. - beta13 - Protein feature recognition - Protein feature prediction - - - - - - - - - - Database search (by sequence) - - Sequence screening - true - 1.6 - Screen a molecular sequence(s) against a database (of some type) to identify similarities between the sequence and database entries. - beta13 - - - - - - - - - - Protein interaction network prediction - - - - - - - - - - - - - - beta13 - Predict a network of protein interactions. - - - - - - - - - - Nucleic acid design - - - beta13 - Design (or predict) nucleic acid sequences with specific chemical or physical properties. - - - - - - - - - - Editing - - beta13 - Edit a data entity, either randomly or specifically. 
- - - - - - - - - - Sequence assembly validation - - - - - - - - - - - - - - - - - - - - - Assembly quality evaluation - Assembly QC - Sequence assembly quality evaluation - Sequence assembly QC - Evaluate a DNA sequence assembly, typically for purposes of quality control. - 1.1 - - - - - - - - - - Genome alignment - - Align two or more (typically huge) molecular sequences that represent genomes. - Genome alignment construction - 1.1 - - - - - - - - - - Localized reassembly - - Reconstruction of a sequence assembly in a localised area. - 1.1 - - - - - - - - - - Sequence assembly visualisation - - Assembly rendering - Sequence assembly rendering - Render and visualise a DNA sequence assembly. - 1.1 - Assembly visualisation - - - - - - - - - - Base-calling - - - - - - - - Phred base calling - 1.1 - Identify base (nucleobase) sequence from fluorescence 'trace' data generated by an automated DNA sequencer. - Base calling - Phred base-calling - - - - - - - - - - Bisulfite mapping - - 1.1 - Bisulfite mapping follows high-throughput sequencing of DNA which has undergone bisulfite treatment followed by PCR amplification; unmethylated cytosines are specifically converted to thymine, allowing the methylation status of cytosine in the DNA to be detected. - The mapping of methylation sites in a DNA (genome) sequence. - Bisulfite sequence alignment - Bisulfite sequence mapping - - - - - - - - - - Sequence contamination filtering - - - - - - - - beta12orEarlier - Identify and filter a (typically large) sequence data set to remove sequences from contaminants in the sample that was sequenced. - - - - - - - - - - Trim ends - - 1.1 - Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. - 1.12 - For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence.
- true - - - - - - - - - - Trim vector - - true - Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - 1.12 - 1.1 - - - - - - - - - - Trim to reference - - true - 1.1 - 1.12 - Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - - - - - - - - - - Sequence trimming - - 1.1 - Cut (remove) the end from a molecular sequence. - Barcode sequence removal - Trim vector - Trimming - Trim ends - Trim to reference - This includes - -end trimming -Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. -For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence. - -trimming to a reference sequence, -Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - -vector trimming -Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - - - - - - - - - - - Genome feature comparison - - Genomic elements that might be compared include genes, indels, single nucleotide polymorphisms (SNPs), retrotransposons, tandem repeats and so on. - Compare the features of two genome sequences. - 1.1 - - - - - - - - - - Sequencing error detection - - - - - - - - Short read error correction - Short-read error correction - beta12orEarlier - Detect errors in DNA sequences generated from sequencing projects. - - - - - - - - - - Genotyping - - 1.1 - Methods might consider cytogenetic analyses, copy number polymorphism (and calculate copy number calls for copy-number variation (CNV) regions), single nucleotide polymorphism (SNP), rare copy number variation (CNV) identification, loss of heterozygosity data and so on.
- Analyse DNA sequence data to identify differences between the genetic composition (genotype) of an individual compared to other individuals or a reference sequence. - - - - - - - - - - Genetic variation analysis - - - 1.1 - Sequence variation analysis - Genetic variation annotation provides contextual interpretation of coding SNP consequences in transcripts. It allows comparisons to be made between variation data in different populations or strains for the same transcript. - Genetic variation annotation - Analyse a genetic variation, for example to annotate its location, alleles, classification, and effects on individual transcripts predicted for a gene model. - - - - - - - - - - Read mapping - - - Short oligonucleotide alignment - Oligonucleotide mapping - Oligonucleotide alignment generation - Short read mapping - Oligonucleotide alignment construction - The purpose of read mapping is to identify the location of sequenced fragments within a reference genome and assumes that there is, in fact, at least local similarity between the fragment and reference sequences. - Oligonucleotide alignment - Read alignment - 1.1 - Short read alignment - Align short oligonucleotide sequences (reads) to a larger (genomic) sequence. - Short sequence read mapping - - - - - - - - - - Split read mapping - - A variant of oligonucleotide mapping where a read is mapped to two separate locations because of possible structural variation. - 1.1 - - - - - - - - - - Community profiling - - - Analyse DNA sequences in order to identify a DNA 'barcode'; marker genes or any short fragment(s) of DNA that are useful to diagnose the taxa of biological organisms. - 1.1 - DNA barcoding - Sample barcoding - - - - - - - - - - SNP calling - - Identify single nucleotide change in base positions in sequencing data that differ from a reference genome and which might, especially by reference to population frequency or functional data, indicate a polymorphism.
- Operations usually score confidence in the prediction or some other statistical measure of evidence. - 1.1 - - - - - - - - - - Polymorphism detection - - Polymorphism detection - Detect mutations in multiple DNA sequences, for example, from the alignment and comparison of the fluorescent traces produced by DNA sequencing hardware. - 1.1 - Mutation detection - - - - - - - - - - Chromatogram visualisation - - Visualise, format or render an image of a Chromatogram. - Chromatogram viewing - 1.1 - - - - - - - - - - Methylation analysis - - 1.1 - Determine cytosine methylation states in nucleic acid sequences. - - - - - - - - - - Methylation calling - - - 1.1 - Determine cytosine methylation status of specific positions in a nucleic acid sequences. - - - - - - - - - - Methylation level analysis (global) - - 1.1 - Global methylation analysis - Measure the overall level of methyl cytosines in a genome from analysis of experimental data, typically from chromatographic methods and methyl accepting capacity assay. - - - - - - - - - - Methylation level analysis (gene-specific) - - Gene-specific methylation analysis - Many different techniques are available for this. - Measure the level of methyl cytosines in specific genes. - 1.1 - - - - - - - - - - Genome visualisation - - 1.1 - Genome visualization - Visualise, format or render a nucleic acid sequence that is part of (and in context of) a complete genome sequence. - Genome rendering - Genome browser - Genome viewing - Genome browsing - - - - - - - - - - Genome comparison - - Compare the sequence or features of two or more genomes, for example, to find matching regions. - 1.1 - Genomic region matching - - - - - - - - - - Genome indexing - - - - - - - - Genome indexing (Burrows-Wheeler) - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. 
The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. A suffix array consists of the lexicographically sorted list of suffixes of a genome. - Genome indexing (suffix arrays) - Generate an index of a genome sequence. - Suffix arrays - Burrows-Wheeler - 1.1 - - - - - - - - - - Genome indexing (Burrows-Wheeler) - - The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. - 1.12 - true - Generate an index of a genome sequence using the Burrows-Wheeler algorithm. - 1.1 - - - - - - - - - - Genome indexing (suffix arrays) - - 1.1 - Generate an index of a genome sequence using a suffix array algorithm. - A suffix array consists of the lexicographically sorted list of suffixes of a genome. - true - 1.12 - Suffix arrays - - - - - - - - - - Spectral analysis - - - - - - - - 1.1 - Analyse one or more spectra from mass spectrometry (or other) experiments. - Spectrum analysis - Mass spectrum analysis - - - - - - - - - - Peak detection - - - - - - - - 1.1 - Peak finding - Peak assignment - Identify peaks in a spectrum from a mass spectrometry, NMR, or some other spectrum-generating experiment. - - - - - - - - - - Scaffolding - - - - - - - - - Scaffold construction - Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contigs; contiguous sequences corresponding to read overlaps. - 1.1 - Scaffold may be positioned along a chromosome physical map to create a "golden path". - Scaffold generation - - - - - - - - - - Scaffold gap completion - - Fill the gaps in a sequence assembly (scaffold) by merging in additional sequences. - Different techniques are used to generate gap sequences to connect contigs, depending on the size of the gap. For small (5-20kb) gaps, PCR amplification and sequencing is used. For large (>20kb) gaps, fragments are cloned (e.g.
in BAC (Bacterial artificial chromosomes) vectors) and then sequenced. - 1.1 - - - - - - - - - - Sequencing quality control - - - Raw sequence data quality control. - Analyse raw sequence data from a sequencing pipeline and identify (and possibly fix) problems. - Sequencing QC - 1.1 - - - - - - - - - - Read pre-processing - - - Sequence read pre-processing - Pre-process sequence reads to ensure (or improve) quality and reliability. - For example process paired end reads to trim low quality ends, remove short sequences, identify sequence inserts, detect chimeric reads, or remove low quality sequences including vector, adaptor, low complexity and contaminant sequences. Sequences might come from genomic DNA library, EST libraries, SSH library and so on. - 1.1 - - - - - - - - - - Species frequency estimation - - - - - - - - Estimate the frequencies of different species from analysis of the molecular sequences, typically of DNA recovered from environmental samples. - 1.1 - - - - - - - - - - Peak calling - - Peak-pair calling - Chip-sequencing combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to generate a set of reads, which are aligned to a genome sequence. The enriched areas contain the binding sites of DNA-associated proteins. For example, a transcription factor binding site. ChIP-on-chip in contrast combines chromatin immunoprecipitation ('ChIP') with microarray ('chip'). "Peak-pair calling" is similar to "Peak calling" in the context of ChIP-exo. - Identify putative protein-binding regions in a genome sequence from analysis of Chip-sequencing data or ChIP-on-chip data. - Protein binding peak detection - 1.1 - - - - - - - - - - Differential expression analysis - - Identify (typically from analysis of microarray or RNA-seq data) genes whose expression levels are significantly different between two sample groups.
- Differentially expressed gene identification - Differential expression analysis is used, for example, to identify which genes are up-regulated (increased expression) or down-regulated (decreased expression) between a group treated with a drug and a control group. - 1.1 - - - - - - - - - - Gene set testing - - 1.1 - Gene sets can be defined beforehand by biological function, chromosome locations and so on. - Analyse gene expression patterns (typically from DNA microarray datasets) to identify sets of genes that are associated with a specific trait, condition, clinical outcome etc. - - - - - - - - - - Variant classification - - - Classify variants based on their potential effect on genes, especially functional effects on the expressed proteins. - 1.1 - Variants are typically classified by their position (intronic, exonic, etc.) in a gene transcript and (for variants in coding exons) by their effect on the protein sequence (synonymous, non-synonymous, frameshifting, etc.) - - - - - - - - - - Variant prioritization - - Variant prioritization can be used for example to produce a list of variants responsible for 'knocking out' genes in specific genomes. Methods include amino acid substitution, aggregative approaches, probabilistic approach, inheritance and unified likelihood-frameworks. - Identify biologically interesting variants by prioritizing individual variants, for example, homozygous variants absent in control genomes. - 1.1 - - - - - - - - - - Variant calling - - Allele calling - Somatic variant calling - Germ line variant calling - Somatic variant calling is the detection of variations established in somatic cells and hence not inherited as a germ line variant. - Methods often utilise a database of aligned reads. - Variant mapping - 1.1 - Variant detection - Identify and map genomic alterations, including single nucleotide polymorphisms, short indels and structural variants, in a genome sequence.
- - - - - - - - - - Structural variation discovery - - Detect large regions in a genome subject to copy-number variation, or other structural variations in genome(s). - 1.1 - Methods might involve analysis of whole-genome array comparative genome hybridization or single-nucleotide polymorphism arrays, paired-end mapping of sequencing data, or from analysis of short reads from new sequencing technologies. - - - - - - - - - - Exome assembly - Exome analysis - - 1.1 - Exome sequence analysis - Analyse sequencing data from experiments aiming to selectively sequence the coding regions of the genome. - - - - - - - - - - Read depth analysis - - 1.1 - Analyse mapping density (read depth) of (typically) short reads from sequencing platforms, for example, to detect deletions and duplications. - - - - - - - - - - Gene expression QTL analysis - - - - - - - - expression quantitative trait loci profiling - 1.1 - eQTL profiling - Combine classical quantitative trait loci (QTL) analysis with gene expression profiling, for example, to describe cis- and trans-controlling elements for the expression of phenotype associated genes. - expression QTL profiling - - - - - - - - - - Copy number estimation - - Methods typically implement some statistical model for hypothesis testing, and methods estimate total copy number, i.e. do not distinguish the two inherited chromosomes quantities (specific copy number). - Transcript copy number estimation - 1.1 - Estimate the number of copies of loci of particular gene(s) in DNA sequences typically from gene-expression profiling technology based on microarray hybridization-based experiments. For example, estimate copy number (or marker dosage) of a dominant marker in samples from polyploid plant cells or tissues, or chromosomal gains and losses in tumors. - - - - - - - - - - Primer removal - - 1.2 - Remove forward and/or reverse primers from nucleic acid sequences (typically PCR products).
- Adapter removal - - - - - - - - - - Transcriptome assembly - - - - - - - - - - - - - - Infer a transcriptome sequence by analysis of short sequence reads. - 1.2 - - - - - - - - - - Transcriptome assembly (de novo) - - de novo transcriptome assembly - true - 1.6 - 1.2 - Infer a transcriptome sequence without the aid of a reference genome, i.e. by comparing short sequences (reads) to each other. - - - - - - - - - - Transcriptome assembly (mapping) - - Infer a transcriptome sequence by mapping short reads to a reference genome. - 1.6 - 1.2 - true - - - - - - - - - - Sequence coordinate conversion - - - - - - - - - - - - - - 1.3 - Convert one set of sequence coordinates to another, e.g. convert coordinates of one assembly to another, cDNA to genomic, CDS to genomic, protein translation to genomic etc. - - - - - - - - - - Document similarity calculation - - Calculate similarity between 2 or more documents. - 1.3 - - - - - - - - - - Document clustering - - - Cluster (group) documents on the basis of their calculated similarity. - 1.3 - - - - - - - - - - Named entity recognition - - - Entity identification - Entity chunking - Entity extraction - Recognise named entities (text tokens) within documents. - 1.3 - - - - - - - - - - ID mapping - - - Identifier mapping - The mapping can be achieved by comparing identifier values or some other means, e.g. exact matches to a provided sequence. - 1.3 - Accession mapping - Map data identifiers to one another for example to establish a link between two biological databases for the purposes of data integration. - - - - - - - - - - Anonymisation - - Process data in such a way that makes it hard to trace to the person which the data concerns. - 1.3 - Data anonymisation - - - - - - - - - - ID retrieval - - - - - - - - id retrieval - Data retrieval (accession) - Data retrieval (ID) - Identifier retrieval - Data retrieval (id) - Accession retrieval - Search for and retrieve a data identifier of some kind, e.g. a database entry accession. 
- 1.3 - - - - - - - - - - Sequence checksum generation - - - - - - - - - - - - - - Generate a checksum of a molecular sequence. - 1.4 - - - - - - - - - - Bibliography generation - - - - - - - - Bibliography construction - Construct a bibliography from the scientific literature. - 1.4 - - - - - - - - - - Protein quaternary structure prediction - - 1.4 - Predict the structure of a multi-subunit protein and particularly how the subunits fit together. - - - - - - - - - - Molecular surface analysis - - - - - - - - - - - - - - 1.4 - Analyse the surface properties of proteins or other macromolecules, including surface accessible pockets, interior inaccessible cavities etc. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - 1.9 - - - - - - - - - - Format detection - - - - - - - - - - - - - - Recognition of which format the given data is in. - 1.4 - Format identification - Format recognition - 'Format recognition' is not a bioinformatics-specific operation, but of great relevance in bioinformatics. Should be removed from EDAM if/when captured satisfactorily in a suitable domain-generic ontology. - Format inference - - - - - - The has_input "Data" (data_0006) may cause visualisation or other problems although ontologically correct. But on the other hand it may be useful to distinguish from nullary operations without inputs. - - - - - - - - - - - Splitting - - File splitting - Split a file containing multiple data items into many files, each containing one item - 1.4 - - - - - - - - - - Generation - - Construction - beta12orEarlier - For non-analytical operations, see the 'Processing' branch. - Construct some data entity. 
- - - - - - - - - - Nucleic acid sequence feature detection - - - Nucleic acid site prediction - Predict, recognise and identify functional or other key sites within nucleic acid sequences, typically by scanning for known motifs, patterns and regular expressions. - Nucleic acid site recognition - 1.6 - Nucleic acid site detection - - - - - - - - - - Deposition - - Deposit some data in a database or some other type of repository or software system. - 1.6 - Database submission - Submission - Data submission - Data deposition - Database deposition - For non-analytical operations, see the 'Processing' branch. - - - - - - - - - - Clustering - - 1.6 - Group together some data entities on the basis of similarities such that entities in the same group (cluster) are more similar to each other than to those in other groups (clusters). - - - - - - - - - - Assembly - - 1.6 - Construct some entity (typically a molecule sequence) from component pieces. - - - - - - - - - - Conversion - - Convert a data set from one form to another. - 1.6 - - - - - - - - - - Standardization and normalization - - Normalization - 1.6 - Standardization - Standardize or normalize data. - - - - - - - - - - Aggregation - - Combine multiple files or data items into a single file or object. - 1.6 - - - - - - - - - - Article comparison - - Compare two or more scientific articles. - 1.6 - - - - - - - - - - Calculation - - Mathemetical determination of the value of something, typically a properly of a molecule. - 1.6 - - - - - - - - - - Pathway or network prediction - - - 1.6 - Predict a molecular pathway or network. - - - - - - - - - - Genome assembly - - 1.12 - 1.6 - The process of assembling many short DNA sequences together such thay they represent the original chromosomes from which the DNA originated. - true - - - - - - - - - - - Plotting - - Generate a graph, or other visual representation, of data, showing the relationship between two or more variables. 
- 1.6 - - - - - - - - - - Image analysis - - - - - - - - 1.7 - The analysis of a image (typically a digital image) of some type in order to extract information from it. - Image processing - - - - - - - - - - - Diffraction data analysis - - 1.7 - Analysis of data from a diffraction experiment. - - - - - - - - - - Cell migration analysis - - - - - - - - 1.7 - Analysis of cell migration images in order to study cell migration, typically in order to study the processes that play a role in the disease progression. - - - - - - - - - - Diffraction data reduction - - 1.7 - Processing of diffraction data into a corrected, ordered, and simplified form. - - - - - - - - - - Neurite measurement - - - - - - - - Measurement of neurites; projections (axons or dendrites) from the cell body of a neuron, from analysis of neuron images. - 1.7 - - - - - - - - - - Diffraction data integration - - 1.7 - Diffraction summation integration - Diffraction profile fitting - The evaluation of diffraction intensities and integration of diffraction maxima from a diffraction experiment. - - - - - - - - - - Phasing - - Phase a macromolecular crystal structure, for example by using molecular replacement or experimental phasing methods. - 1.7 - - - - - - - - - - Molecular replacement - - 1.7 - A technique used to construct an atomic model of an unknown structure from diffraction data, based upon an atomic model of a known structure, either a related protein or the same protein from a different crystal form. - The technique solves the phase problem, i.e. retrieve information concern phases of the structure. - - - - - - - - - - Rigid body refinement - - 1.7 - Rigid body refinement usually follows molecular replacement in the assignment of a structure from diffraction data. - A method used to refine a structure by moving the whole molecule or parts of it as a rigid unit, rather than moving individual atoms. 
- - - - - - - - - - Single particle analysis - - - - - - - - - An image processing technique that combines and analyze multiple images of a particulate sample, in order to produce an image with clearer features that are more easily interpreted. - 1.7 - Single particle analysis is used to improve the information that can be obtained by relatively low resolution techniques, , e.g. an image of a protein or virus from transmission electron microscopy (TEM). - - - - - - - - - - Single particle alignment and classification - - - Compare (align and classify) multiple particle images from a micrograph in order to produce a representative image of the particle. - 1.7 - A micrograph can include particles in multiple different orientations and/or conformations. Particles are compared and organised into sets based on their similarity. Typically iterations of classification and alignment and are performed to optimise the final image; average images produced by classification are used as a reference image for subsequent alignment of the whole image set. - - - - - - - - - - Functional clustering - - - - - - - - 1.7 - Clustering of molecular sequences on the basis of their function, typically using information from an ontology of gene function, or some other measure of functional phenotype. - Functional sequence clustering - - - - - - - - - - Taxonomic classification - - Taxonomy assignment - Classifiication (typically of molecular sequences) by assignment to some taxonomic hierarchy. - 1.7 - - - - - - - - - - Virulence prediction - - - - - - - - - Pathogenicity prediction - The prediction of the degree of pathogenicity of a microorganism from analysis of molecular sequences. - 1.7 - - - - - - - - - - Gene expression correlation analysis - - - 1.7 - Gene co-expression network analysis - Analyse the correlation patterns among genes across across a variety of experiments, microarray samples etc. - - - - - - - - - - - Correlation - - - - - - - - 1.7 - Identify a correlation, i.e. 
a statistical relationship between two random variables or two sets of data. - - - - - - - - - - RNA structure covariance model generation - - - - - - - - - Compute the covariance model for (a family of) RNA secondary structures. - 1.7 - - - - - - - - - - RNA secondary structure prediction (shape-based) - - RNA shape prediction - Predict RNA secondary structure by analysis, e.g. probabilistic analysis, of the shape of RNA folds. - 1.7 - - - - - - - - - - Nucleic acid folding prediction (alignment-based) - - 1.7 - Prediction of nucleic-acid folding using sequence alignments as a source of data. - - - - - - - - - - k-mer counting - - Count k-mers (substrings of length k) in DNA sequence data. - 1.7 - k-mer counting is used in genome and transcriptome assembly, metagenomic sequencing, and for error correction of sequence reads. - - - - - - - - - - Phylogenetic tree reconstruction - - - - - - - - Reconstructing the inner node labels of a phylogenetic tree from its leafes. - Note that this is somewhat different from simply analysing an existing tree or constructing a completely new one. - 1.7 - - - - - - - - - - Probabilistic data generation - - Generate some data from a choosen probibalistic model, possibly to evaluate algorithms. - 1.7 - - - - - - - - - - Probabilistic sequence generation - - - 1.7 - Generate sequences from some probabilistic model, e.g. a model that simulates evolution. - - - - - - - - - - Antimicrobial resistance prediction - - - - - - - - - 1.7 - Identify or predict causes for antibiotic resistance from molecular sequence analysis. - - - - - - - - - - Enrichment - - - - - - - - - A relevant ontology will be used. The input is typically a set of identifiers or other data, and the output of the analysis is typically a ranked list of ontology terms, each associated with a p-value. - Term enrichment - 1.8 - Analyse a dataset with respect to concepts from an ontology. 
- - - - - - - - - - Chemical class enrichment - - - - - - - - - 1.8 - Analyse a dataset with respect to concepts from an ontology of chemical structure. - - - - - - - - - - Incident curve plotting - - 1.8 - Plot an incident curve such as a survival curve, death curve, mortality curve. - - - - - - - - - - Variant pattern analysis - - Methods often utilise a database of aligned reads. - Identify and map patterns of genomic variations. - 1.8 - - - - - - - - - - Mathematical modelling - - 1.12 - Model some biological system using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - true - beta12orEarlier - - - - - - - - - - Microscope image visualisation - - - - - - - - Visualise images resulting from various types of microscopy. - 1.9 - Microscopy image visualisation - - - - - - - - - - Image annotation - - 1.9 - Annotate an image of some sort, typically with terms from a controlled vocabulary. - - - - - - - - - - Imputation - - Data imputation - Replace missing data with substituted values, usually by using some statistical or other mathematical approach. - 1.9 - - - - - - - - - - Ontology visualisation - - 1.9 - Visualise, format or render data from an ontology, typically a tree of terms. - Ontology browsing - - - - - - - - - - Maximum occurence analysis - - A method for making numerical assessments about the maximum percent of time that a conformer of a flexible macromolecule can exist and still be compatible with the experimental data. - beta12orEarlier - - - - - - - - - - Database comparison - - - 1.9 - Data model comparison - Compare the models or schemas used by two or more databases, or any other general comparison of databases rather than a detailed comparison of the entries themselves. - Schema comparison - - - - - - - - - - Network simulation - - - - - - - - Simulate the bevaviour of a biological pathway or network. 
- Pathway simulation - Network topology simulation - 1.9 - - - - - - - - - - RNA-seq read count analysis - - Analyze read counts from RNA-seq experiments. - 1.9 - - - - - - - - - - Chemical redundancy removal - - 1.9 - Identify and remove redudancy from a set of small molecule structures. - - - - - - - - - - RNA-seq time series data analysis - - 1.9 - Analyze time series data from an RNA-seq experiment. - - - - - - - - - - Simulated gene expression data generation - - 1.9 - Simulate gene expression data, e.g. for purposes of benchmarking. - - - - - - - - - - Relationship inference - - - - - - - - - - - - - - - - - - - - 1.12 - Identify semantic relationships within a text or between two or more texts using text mining techniques. - - - - - - - - - - Mass spectra calibration - - - - - - - - Re-adjust the output of mass spectrometry experiments with shifted ppm values. - 1.12 - - - - - - - - - - Chromatographic alignment - - - - - - - - Align multiple data sets using information from chromatography and/or peptide identification, from mass spectrometry experiments. - 1.12 - - - - - - - - - - Deisotoping - - - - - - - - The removal of isotope peaks in a spectrum, to represent the fragment ion as one data point. - Deconvolution - 1.12 - Deisotoping is commonly done to reduce complexity, and done in conjunction with the charge state deconvolution. - - - - - - - - - - Quantification - - - - - - - - Technique for determining the amount of proteins in a sample. - 1.12 - Quantitation - - - - - - - - - - Peptide identification - - - - - - - - Peptide-spectrum-matching - Determination of peptide sequence from mass spectrum. - 1.12 - - - - - - - - - - Isotopic distributions calculation - - - - - - - - - - - - - - 1.12 - Calculate the isotope distribution of a given chemical species. 
- - - - - - - - - - Retention times prediction - - Retention times calculation - Prediction of retention times in a mass spectrometry experiment based on compositional and structural properties of the separated species. - 1.12 - - - - - - - - - - Label-free quantification - - 1.12 - Quantification without the use of chemical tags. - - - - - - - - - - Labeled quantification - - 1.12 - Quantification based on the use of chemical tags. - - - - - - - - - - MRM/SRM - - 1.12 - Quantification by Selected/multiple Reaction Monitoring workflow (XIC quantitation of precursor / fragment mass pair). - - - - - - - - - - Spectral counting - - 1.12 - Calculate number of identified MS2 spectra as approximation of peptide / protein quantity. - - - - - - - - - - SILAC - - Quantification analysis using stable isotope labeling by amino acids in cell culture. - 1.12 - - - - - - - - - - iTRAQ - - 1.12 - Quantification analysis using the AB SCIEX iTRAQ isobaric labelling workflow, wherein 2-8 reporter ions are measured in MS2 spectra near 114 m/z. - - - - - - - - - - 18O labeling - - 1.12 - Quantification analysis using labeling based on 18O-enriched H2O. - - - - - - - - - - TMT-tag - - 1.12 - Quantification analysis using the Thermo Fisher tandem mass tag labelling workflow. - - - - - - - - - - Dimethyl - - 1.12 - Quantification analysis using chemical labeling by stable isotope dimethylation - - - - - - - - - - Tag-based peptide identification - - Peptide sequence tags are used as piece of information about a peptide obtained by tandem mass spectrometry. - 1.12 - - - - - - - - - - de Novo sequencing - - - Analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum (MS/MS) without the assistance of a sequence database. - 1.12 - - - - - - - - - - PTM identification - - Identification of post-translational modifications (PTMs) of peptides/proteins in mass spectrum. 
- 1.12 - - - - - - - - - - Peptide database search - - - 1.12 - Determination of best matches between MS/MS spectrum and a database of protein or nucleic acid sequences. - - - - - - - - - - Blind peptide database search - - Modification-tolerant peptide database search - Unrestricted peptide database search - 1.12 - Peptide database search for identification of known and unknown PTMs looking for mass difference mismatches. - - - - - - - - - - Validation of peptide-spectrum matches - - - Statistical estimation of false discovery rate from score distribution for peptide-spectrum-matches, following a peptide database search. - 1.12 - - - - - - - - - - Target-Decoy - - Estimation of false discovery rate by comparison to search results with a database containing incorrect information. - 1.12 - - - - - - - - - - Statistical inference - - 1.12 - Empirical Bayes - Analyse data in order to deduce properties of an underlying distribution or population. - - - - - - - - - - Regression analysis - - A statistical calculation to estimate the relationships among variables. - Regression - 1.12 - - - - - - - - - - Metabolic network modelling - - - - - - - - Model a metabolic network, for example, to reconstruct pathways or to simulate metabolism. - Metabolic reconstruction - Metabolic network reconstruction - Metabolic network simulation - 1.12 - - - - - - - - - - SNP annotation - - Predict the effect or function of an individual single nucleotide polymorphism (SNP). - 1.12 - - - - - - - - - - Ab-initio gene prediction - - Prediction of genes or gene components from first principles, i.e. without reference to existing genes. - 1.12 - Gene prediction (ab-initio) - - - - - - - - - - Homology-based gene prediction - - Gene prediction (homology-based) - Prediction of genes or gene components by reference to homologous genes. 
- 1.12 - - - - - - - - - - Statistical modelling - - 1.12 - Construction of a statistical model, or a set of assumptions around some observed data, usually by describing a set of probability distributions which approximate the distribution of data. - - - - - - - - - - Molecular surface comparison - - - 1.12 - Compare two or more molecular surfaces. - - - - - - - - - - Gene functional annotation - - 1.12 - Annotate one or more sequences with functional information, such as cellular processes or metaobolic pathways, by reference to a controlled vocabulary - invariably the Gene Ontology (GO). - - - - - - - - - - Variant filtering - - - 1.12 - Variant filtering is used to eliminate false positive variants based for example on base calling quality, strand and position information, and mapping info. - - - - - - - - - - Differential binding analysis - - 1.12 - Differential binding analysis identifies binding sites in nucleic acid sequences that are statistically significantly differentially bound between sample groups. - - - - - - - - - - RNA-Seq analysis - - Analyze data from RNA-seq experiments. - 1.13 - - - - - - - - - - Mass spectrum visualisation - - 1.1 - Visualise, format or render a mass spectrum. - - - - - - - - - - Filtering - - Filter a set of files or data items according to some property. - 1.13 - Sequence filtering - - - - - - - - - - Reference identification - - Identification of the best reference for mapping for a specific dataset from a list of potential references, when performing genetic variation analysis. - 1.1 - - - - - - - - - - Ion counting - - Ion current integration - Label-free quantification by integration of ion current (ion counting). - 1.14 - - - - - - - - - - Isotope-coded protein label - - Chemical tagging free amino groups of intact proteins with stable isotopes. - ICPL - 1.14 - - - - - - - - - - Metabolic labeling - - Labeling all proteins and (possibly) all amino acids using C-13 or N-15 enriched grown medium or feed. 
- 1.14 - This includes N-15 metabolic labeling (labeling all proteins and (possibly) all amino acids using N-15 enriched grown medium or feed) and C-13 metabolic labeling (labeling all proteins and (possibly) all amino acids using C-13 enriched grown medium or feed). - N-15 metabolic labeling - C-13 metabolic labeling - - - - - - - - - - Topic - - http://purl.org/biotop/biotop.owl#Quality - http://bioontology.org/ontologies/ResearchArea.owl#Area_of_Research - http://www.onto-med.de/ontologies/gfo.owl#Category - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. Topics have no clearly defined borders between each other. - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - sumo:FieldOfStudy - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - - - - - - - - - - Nucleic acid analysis - - The processing and analysis of nucleic acid sequence, structural and other data. - Nucleic acid bioinformatics - Nucleic acids - Nucleic acid informatics - http://purl.bioontology.org/ontology/MSH/D017423 - Nucleic acid properties - Nucleic acid physicochemistry - http://purl.bioontology.org/ontology/MSH/D017422 - true - beta12orEarlier - - - - - - - - - - Protein analysis - - Protein informatics - Proteins - http://purl.bioontology.org/ontology/MSH/D020539 - Protein bioinformatics - Protein databases - true - beta12orEarlier - Archival, processing and analysis of protein data, typically molecular sequence and structural data. - - - - - - - - - - Metabolites - - 1.13 - true - The structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. 
- beta12orEarlier - - - - - - - - - - Sequence analysis - - true - beta12orEarlier - Sequence databases - Sequences - http://purl.bioontology.org/ontology/MSH/D017421 - The archival, processing and analysis of molecular sequences (monomer composition of polymers) including molecular sequence data resources, sequence sites, alignments, motifs and profiles. - - - - - - - - - - - Structure analysis - - Computational structural biology - true - The curation, processing and analysis of the structure of biological molecules, typically proteins and nucleic acids and other macromolecules. - http://purl.bioontology.org/ontology/MSH/D015394 - Structural bioinformatics - Structure databases - This includes related concepts such as structural properties, alignments and structural motifs. - Structure data resources - beta12orEarlier - - - - - - - - - - - Structure prediction - - Protein fold recognition - The prediction of molecular structure, including the prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features, and the folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. - - - Nucleic acid structure prediction - beta12orEarlier - Protein structure prediction - true - DNA structure prediction - Nucleic acid design - Nucleic acid folding - RNA structure prediction - This includes the recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s), for example by threading, or the alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - - - - - - - - - - Alignment - - beta12orEarlier - true - The alignment (equivalence between sites) of molecular sequences, structures or profiles (representing a sequence or structure alignment). 
- beta12orEarlier - - - - - - - - - - - Phylogeny - - - Phylogeny reconstruction - Phylogenetic stratigraphy - beta12orEarlier - Phylogenetic dating - Phylogenetic clocks - true - http://purl.bioontology.org/ontology/MSH/D010802 - The study of evolutionary relationships amongst organisms. - Phylogenetic simulation - This includes diverse phylogenetic methods, including phylogenetic tree construction, typically from molecular sequence or morphological data, methods that simulate DNA sequence evolution, a phylogenetic tree or the underlying data, or which estimate or use molecular clock and stratigraphic (age) data, methods for studying gene evolution etc. - - - - - - - - - - - Functional genomics - - - beta12orEarlier - true - The study of gene or protein functions and their interactions in totality in a given organism, tissue, cell etc. - - - - - - - - - - - Ontology and terminology - - true - Terminology - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D002965 - Applied ontology - Ontology - The conceptualisation, categorisation and nomenclature (naming) of entities or phenomena within biology or bioinformatics. This includes formal ontologies, controlled vocabularies, structured glossary, symbols and terminology or other related resource. - Ontologies - - - - - - - - - - - Information retrieval - - beta12orEarlier - 1.13 - true - The search and query of data sources (typically databases or ontologies) in order to retrieve entries or other information. - VT 1.3.3 Information retrieval - - - - - - - - - - Bioinformatics - - This includes data processing in general, including basic handling of files and databases, datatypes, workflows and annotation. - VT 1.5.6 Bioinformatics - The archival, curation, processing and analysis of complex biological data. 
- http://purl.bioontology.org/ontology/MSH/D016247 - beta12orEarlier - true - - - - - - - - - - - Data visualisation - - Data rendering - Rendering (drawing on a computer screen) or visualisation of molecular sequences, structures or other biomolecular data. - true - VT 1.2.5 Computer graphics - beta12orEarlier - Computer graphics - - - - - - - - - - Nucleic acid thermodynamics - - true - The study of the thermodynamic properties of a nucleic acid. - 1.3 - - - - - - - - - - Nucleic acid structure analysis - - - Includes secondary and tertiary nucleic acid structural data, nucleic acid thermodynamic, thermal and conformational properties including DNA or DNA/RNA denaturation (melting) etc. - DNA melting - Nucleic acid denaturation - RNA alignment - The archival, curation, processing and analysis of nucleic acid structural information, such as whole structures, structural features and alignments, and associated annotation. - beta12orEarlier - RNA structure alignment - Nucleic acid structure - Nucleic acid thermodynamics - RNA structure - - - - - - - - - - RNA - - beta12orEarlier - Small RNA - RNA sequences and structures. - - - - - - - - - - Nucleic acid restriction - - 1.3 - beta12orEarlier - Topic for the study of restriction enzymes, their cleavage sites and the restriction of nucleic acids. - true - - - - - - - - - - Mapping - - The mapping of complete (typically nucleotide) sequences. Mapping (in the sense of short read alignment, or more generally, just alignment) has application in RNA-Seq analysis (mapping of transcriptomics reads), variant discovery (e.g. mapping of exome capture), and re-sequencing (mapping of WGS reads). - Genetic linkage - Linkage - Linkage mapping - true - Synteny - This includes resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. 
It also includes resources for physical (sequence) maps of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. It also covers for example the alignment of sequences of (typically millions) of short reads to a reference genome. - DNA mapping - beta12orEarlier - - - - - - - - - Genetic codes and codon usage - - beta12orEarlier - true - 1.3 - Codon usage analysis - The study of codon usage in nucleotide sequence(s), genetic codes and so on. - - - - - - - - - - Protein expression - - Translation - The translation of mRNA into protein and subsequent protein processing in the cell. - beta12orEarlier - - - - - - - - - - - Gene finding - - 1.3 - This includes the study of promoters, coding regions, splice sites, etc. Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - Gene discovery - Methods that aims to identify, predict, model or analyse genes or gene structure in DNA sequences. - beta12orEarlier - Gene prediction - true - - - - - - - - - - Transcription - - 1.3 - The transcription of DNA into mRNA. - beta12orEarlier - true - - - - - - - - - - Promoters - - true - beta12orEarlier - Promoters in DNA sequences (region of DNA that facilitates the transcription of a particular gene by binding RNA polymerase and transcription factor proteins). - beta13 - - - - - - - - - - Nucleic acid folding - - beta12orEarlier - The folding (in 3D space) of nucleic acid molecules. - true - beta12orEarlier - - - - - - - - - - Gene structure - - This includes the study of promoters, coding regions etc. - beta12orEarlier - Fusion genes - Gene features - true - Gene structure, regions which make an RNA product and features such as promoters, coding regions, gene fusion, splice sites etc. - - This incudes operons (operators, promoters and genes) from a bacterial genome. 
For example the operon leader and trailer gene, gene composition of the operon and associated information. - - - - - - - - - - Proteomics - - beta12orEarlier - Protein and peptide identification, especially in the study of whole proteomes of organisms. - Protein and peptide identification - Peptide identification - Proteomics includes any methods (especially high-throughput) that separate, characterize and identify expressed proteins such as mass spectrometry, two-dimensional gel electrophoresis and protein microarrays, as well as in-silico methods that perform proteolytic or mass calculations on a protein sequence and other analyses of protein expression data, for example in different cells or tissues. - true - http://purl.bioontology.org/ontology/MSH/D040901 - Protein expression - - - - - - - - - - - Structural genomics - - - true - beta12orEarlier - The elucidation of the three dimensional structure for all (available) proteins in a given organism. - - - - - - - - - - - Protein properties - - The study of the physical and biochemical properties of peptides and proteins, for example the hydrophobic, hydrophilic and charge properties of a protein. - Protein hydropathy - true - Protein physicochemistry - beta12orEarlier - - - - - - - - - - Protein interactions - - - Protein-protein, protein-DNA/RNA and protein-ligand interactions, including analysis of known interactions and prediction of putative interactions. - Protein-nucleic acid interactions - Protein-RNA interaction - Protein interaction networks - This includes experimental (e.g. yeast two-hybrid) and computational analysis techniques. 
- Protein-protein interactions - Protein-ligand interactions - beta12orEarlier - Protein-DNA interaction - true - - - - - - - - - - Protein folding, stability and design - - beta12orEarlier - Protein residue interactions - Protein design - true - Protein folding - Protein stability - Protein stability, folding (in 3D space) and protein sequence-structure-function relationships. This includes for example study of inter-atomic or inter-residue interactions in protein (3D) structures, the effect of mutation, and the design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - Rational protein design - - - - - - - - - - Two-dimensional gel electrophoresis - - Two-dimensional gel electrophoresis image and related data. - beta13 - beta12orEarlier - true - - - - - - - - - - Mass spectrometry - - beta12orEarlier - true - 1.13 - An analytical chemistry technique that measures the mass-to-charge ratio and abundance of irons in the gas phase. - - - - - - - - - - Protein microarrays - - Protein microarray data. - true - beta12orEarlier - beta13 - - - - - - - - - - Protein hydropathy - - beta12orEarlier - true - The study of the hydrophobic, hydrophilic and charge properties of a protein. - 1.3 - - - - - - - - - - Protein targeting and localization - - Protein targeting - Protein sorting - The study of how proteins are transported within and without the cell, including signal peptides, protein subcellular localization and export. - Protein localization - beta12orEarlier - - - - - - - - - - Protein cleavage sites and proteolysis - - true - beta12orEarlier - 1.3 - Enzyme or chemical cleavage sites and proteolytic or mass calculations on a protein sequence. - - - - - - - - - - Protein structure comparison - - The comparison of two or more protein structures. - beta12orEarlier - true - Use this concept for methods that are exclusively for protein structure. 
- beta12orEarlier - - - - - - - - - - Protein residue interactions - - The processing and analysis of inter-atomic or inter-residue interactions in protein (3D) structures. - true - 1.3 - beta12orEarlier - - - - - - - - - - Protein-protein interactions - - Protein interaction networks - true - Protein-protein interactions, individual interactions and networks, protein complexes, protein functional coupling etc. - beta12orEarlier - 1.3 - - - - - - - - - - Protein-ligand interactions - - beta12orEarlier - true - 1.3 - Protein-ligand (small molecule) interactions. - - - - - - - - - - Protein-nucleic acid interactions - - beta12orEarlier - 1.3 - Protein-DNA/RNA interactions. - true - - - - - - - - - - Protein design - - 1.3 - beta12orEarlier - The design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - true - - - - - - - - - - G protein-coupled receptors (GPCR) - - G-protein coupled receptors (GPCRs). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Carbohydrates - - beta12orEarlier - Carbohydrates, typically including structural information. - true - - - - - - - - - - Lipids - - beta12orEarlier - true - Lipidomics - Lipids and their structures. - - - - - - - - - - Small molecules - - Drugs and target structures - Amino acids - Targets - Drug structures - Metabolite structures - Target structures - Small molecules of biological significance, typically archival, curation, processing and analysis of structural information. - Small molecules include organic molecules, metal-organic compounds, small polypeptides, small polysaccharides and oligonucleotides. Structural data is usually included. - true - This concept excludes macromolecules such as proteins and nucleic acids. 
- Toxins and targets - CHEBI:23367 - Toxins - Metabolites - Drug targets - Peptides and amino acids - beta12orEarlier - Chemical structures - This includes the structures of drugs, drug target, their interactions and binding affinities. Also the structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. Also the physicochemical, biochemical or structural properties of amino acids or peptides. Also structural and associated data for toxic chemical substances. - Peptides - - - - - - - - - - Sequence editing - - beta12orEarlier - true - beta12orEarlier - Edit, convert or otherwise change a molecular sequence, either randomly or specifically. - - - - - - - - - - - Sequence composition, complexity and repeats - - Repeat sequences - This includes short repetitive subsequences (repeat sequences) in a protein sequence. - true - The archival, processing and analysis of the basic character composition of molecular sequences, for example character or word frequency, ambiguity, complexity, particularly regions of low complexity, and repeats or the repetitive nature of molecular sequences. - beta12orEarlier - Protein sequence repeats - Nucleic acid repeats - This includes repetitive elements within a nucleic acid sequence, e.g. -long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a sequence and other types of repeating unit. - Sequence complexity - Low complexity sequences - Sequence repeats - Sequence composition - Protein repeats - - - - - - - - - - Sequence motifs - - beta12orEarlier - Motifs - true - 1.3 - Conserved patterns (motifs) in molecular sequences, that (typically) describe functional or other key sites. - - - - - - - - - - Sequence comparison - - true - The comparison might be on the basis of sequence, physico-chemical or some other properties of the sequences. 
- beta12orEarlier - 1.12 - The comparison of two or more molecular sequences, for example sequence alignment and clustering. - - - - - - - - - - Sequence sites, features and motifs - - Sequence features - true - Functional sites - The archival, detection, prediction and analysis of positional features such as functional and other key sites, in molecular sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Sequence motifs - Sequence profiles - Sequence sites - HMMs - beta12orEarlier - - - - - - - - - - Sequence database search - - beta12orEarlier - Search and retrieve molecular sequences that are similar to a sequence-based query (typically a simple sequence). - beta12orEarlier - true - The query is a sequence-based entity such as another sequence, a motif or profile. - - - - - - - - - - Sequence clustering - - This includes systems that generate, process and analyse sequence clusters. - beta12orEarlier - true - 1.7 - The comparison and grouping together of molecular sequences on the basis of their similarities. - Sequence clusters - - - - - - - - - - Protein structural motifs and surfaces - - This includes conformation of conserved substructures, conserved geometry (spatial arrangement) of secondary structure or protein backbone, solvent-exposed surfaces, internal cavities, the analysis of shape, hydropathy, electrostatic patches, role and functions etc. - Protein structural features - Structural motifs - Protein 3D motifs - true - beta12orEarlier - Protein structural motifs - Structural features or common 3D motifs within protein structures, including the surface of a protein structure, such as biological interfaces with other molecules. - Protein surfaces - - - - - - - - - - Structural (3D) profiles - - The processing, analysis or use of some type of structural (3D) profile or template; a computational entity (typically a numerical matrix) that is derived from and represents a structure or structure alignment. 
- true - beta12orEarlier - 1.3 - Structural profiles - - - - - - - - - - Protein structure prediction - - true - beta12orEarlier - The prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features. - 1.12 - - - - - - - - - - Nucleic acid structure prediction - - The folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. - 1.12 - true - beta12orEarlier - - - - - - - - - - Ab initio structure prediction - - 1.7 - The prediction of three-dimensional structure of a (typically protein) sequence from first principles, using a physics-based or empirical scoring function and without using explicit structural templates. - true - beta12orEarlier - - - - - - - - - - Homology modelling - - 1.4 - The modelling of the three-dimensional structure of a protein using known sequence and structural data. - true - beta12orEarlier - - - - - - - - - - Molecular dynamics - - This includes resources concerning flexibility and motion in protein and other molecular structures. - Protein dynamics - true - Molecular flexibility - Molecular motions - beta12orEarlier - The study and simulation of molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - - - - - - - - - - Molecular docking - - beta12orEarlier - true - The modelling of the structure of proteins in complex with small molecules or other macromolecules. - true - 1.12 - - - - - - - - - - Protein secondary structure prediction - - beta12orEarlier - 1.3 - The prediction of secondary or supersecondary structure of protein sequences. - true - - - - - - - - - - Protein tertiary structure prediction - - 1.3 - true - The prediction of tertiary structure of protein sequences.
- beta12orEarlier - - - - - - - - - - Protein fold recognition - - 1.12 - The recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Sequence alignment - - This includes the generation of alignments (the identification of equivalent sites), the analysis of alignments, editing, visualisation, alignment databases, the alignment (equivalence between sites) of sequence profiles (representing sequence alignments) and so on. - beta12orEarlier - 1.7 - The alignment of molecular sequences or sequence profiles (representing sequence alignments). - true - - - - - - - - - - Structure alignment - - The superimposition of molecular tertiary structures or structural (3D) profiles (representing a structure or structure alignment). - This includes the generation, storage, analysis, rendering etc. of structure alignments. - true - 1.7 - beta12orEarlier - - - - - - - - - - Threading - - Sequence-structure alignment - 1.3 - beta12orEarlier - The alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - true - - - - - - - - - - Sequence profiles and HMMs - - true - Sequence profiles; typically a positional, numerical matrix representing a sequence alignment. - beta12orEarlier - 1.3 - Sequence profiles include position-specific scoring matrix (position weight matrix), hidden Markov models etc. - - - - - - - - - - Phylogeny reconstruction - - The reconstruction of a phylogeny (evolutionary relatedness amongst organisms), for example, by building a phylogenetic tree. - 1.3 - true - Currently too specific for the topic sub-ontology (but might be unobsoleted). - beta12orEarlier - - - - - - - - - - Phylogenomics - - - beta12orEarlier - The integrated study of evolutionary relationships and whole genome data, for example, in the analysis of species trees, horizontal gene transfer and evolutionary reconstruction. 
- true - - - - - - - - - - - Virtual PCR - - beta13 - Polymerase chain reaction - beta12orEarlier - Simulated polymerase chain reaction (PCR). - PCR - true - - - - - - - - - - Sequence assembly - - true - Assembly - The assembly of fragments of a DNA sequence to reconstruct the original sequence. - beta12orEarlier - Assembly has two broad types, de-novo and re-sequencing. Re-sequencing is a specialized case of assembly, where an assembled (typically de-novo assembled) reference genome is available and is about 95% identical to the re-sequenced genome. All other cases of assembly are 'de-novo'. - - - - - - - - - - Genetic variation - - Mutation - beta12orEarlier - Polymorphism - Somatic mutations - Stable, naturally occurring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. - http://purl.bioontology.org/ontology/MSH/D014644 - DNA variation - true - - - - - - - - - - Microarrays - - true - http://purl.bioontology.org/ontology/MSH/D046228 - Microarrays, for example, to process microarray data or design probes and experiments. - 1.3 - DNA microarrays - beta12orEarlier - - - - - - - - - - Pharmacology - - Computational pharmacology - beta12orEarlier - Pharmacoinformatics - The study of drugs and their effects or responses in living systems. - VT 3.1.7 Pharmacology and pharmacy - true - - - - - - - - - - - Gene expression - - This includes the study of codon usage in nucleotide sequence(s), genetic codes and so on. - Transcription - Gene expression profiling - Expression profiling - beta12orEarlier - http://edamontology.org/topic_0197 - Gene expression levels are analysed by identifying, quantifying or comparing mRNA transcripts, for example using microarrays, RNA-seq, northern blots, gene-indexed expression profiles etc.
- http://purl.bioontology.org/ontology/MSH/D015870 - Gene expression analysis - DNA microarrays - The analysis of levels and patterns of synthesis of gene products (proteins and functional RNA) including interpretation in functional terms of gene expression data. - Codon usage - true - - - - - - - - - - - Gene regulation - - true - Regulatory genomics - beta12orEarlier - The regulation of gene expression. - - - - - - - - - - Pharmacogenomics - - - true - beta12orEarlier - Pharmacogenetics - The influence of genotype on drug response, for example by correlating gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - - - - - - - - - - - Medicinal chemistry - - - VT 3.1.4 Medicinal chemistry - The design and chemical synthesis of bioactive molecules, for example drugs or potential drug compounds, for medicinal purposes. - This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - true - Drug design - beta12orEarlier - - - - - - - - - - - Fish - - beta12orEarlier - true - 1.3 - Information on a specific fish genome including molecular sequences, genes and annotation. - - - - - - - - - - Flies - - 1.3 - true - beta12orEarlier - Information on a specific fly genome including molecular sequences, genes and annotation. - - - - - - - - - - Mice or rats - - Information on a specific mouse or rat genome including molecular sequences, genes and annotation. - The resource may be specific to a group of mice / rats or all mice / rats. - beta12orEarlier - - - - - - - - - - Worms - - true - 1.3 - beta12orEarlier - Information on a specific worm genome including molecular sequences, genes and annotation. - - - - - - - - - - Literature analysis - - beta12orEarlier - 1.3 - The processing and analysis of the bioinformatics literature and bibliographic data, such as literature search and query. 
- true - - - - - - - - - - Text mining - - beta12orEarlier - The analysis of the biomedical and informatics literature. - Literature analysis - Literature mining - Text data mining - - - - - - - - - - - Data submission, annotation and curation - - Database curation - Deposition and curation of database accessions, including annotation, typically with terms from a controlled vocabulary. - beta12orEarlier - - - - - - - - - - - Document, record and content management - - true - The management and manipulation of digital documents, including database records, files and reports. - VT 1.3.6 Multimedia, hypermedia - 1.13 - beta12orEarlier - - - - - - - - - - Sequence annotation - - beta12orEarlier - beta12orEarlier - true - Annotation of a molecular sequence. - - - - - - - - - - Genome annotation - - Annotation of a genome. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - - NMR - - - ROESY - NOESY - Nuclear Overhauser Effect Spectroscopy - An analytical technique that exploits the magnetic properties of certain atomic nuclei to provide information on the structure, dynamics, reaction state and chemical environment of molecules. - HOESY - beta12orEarlier - Heteronuclear Overhauser Effect Spectroscopy - Nuclear magnetic resonance spectroscopy - Spectroscopy - NMR spectroscopy - Rotational Frame Nuclear Overhauser Effect Spectroscopy - - - - - - - - - - - Sequence classification - - 1.12 - true - beta12orEarlier - The classification of molecular sequences based on some measure of their similarity. - Methods including sequence motifs, profile and other diagnostic elements which (typically) represent conserved patterns (of residues or properties) in molecular sequences. - - - - - - - - - - Protein classification - - 1.3 - true - beta12orEarlier - Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc.
- - - - - - - - - - Sequence motif or profile - - beta12orEarlier - true - Sequence motifs, or sequence profiles derived from an alignment of molecular sequences of a particular type. - This includes comparison, discovery, recognition etc. of sequence motifs. - beta12orEarlier - - - - - - - - - - Protein modifications - - GO:0006464 - Protein chemical modifications - Protein post-translational modification - Protein chemical modifications, e.g. post-translational modifications. - true - EDAM does not describe all possible protein modifications. For fine-grained annotation of protein modification use the Gene Ontology (children of concept GO:0006464) and/or the Protein Modifications ontology (children of concept MOD:00000) - Protein post-translational modifications - Post-translation modifications - MOD:00000 - beta12orEarlier - - - - - - - - - - Molecular interactions, pathways and networks - - Networks - Environmental information processing pathways - Pathways - Biological networks - Disease pathways - true - Signal transduction pathways - beta13 - Biological models - Cellular process pathways - Molecular interactions - Gene regulatory networks - Molecular interactions, biological pathways, networks and other models. - Biological pathways - Interactions - Genetic information processing pathways - Signaling pathways - http://edamontology.org/topic_3076 - - - - - - - - - - - Informatics - - true - The study and practice of information processing and use of computer information systems. - VT 1.3.99 Other - Knowledge management - VT 1.3.4 Information management - beta12orEarlier - Information management - VT 1.3.5 Knowledge management - VT 1.3.3 Information retrieval - VT 1.3 Information sciences - Information science - - - - - - - - - Literature data resources - - Data resources for the biological or biomedical literature, either a primary source of literature or some derivative. 
- true - 1.3 - beta12orEarlier - - - - - - - - - - Laboratory information management - - true - Laboratory management and resources, for example, catalogues of biological resources for use in the lab including cell lines, viruses, plasmids, phages, DNA probes and primers and so on. - beta12orEarlier - Laboratory resources - - - - - - - - - - - - Cell and tissue culture - - Tissue culture - 1.3 - true - General cell culture or data on a specific cell lines. - Cell culture - beta12orEarlier - - - - - - - - - - Ecology - - true - The ecological and environmental sciences and especially the application of information technology (ecoinformatics). - http://purl.bioontology.org/ontology/MSH/D004777 - Ecological informatics - VT 1.5.15 Ecology - Computational ecology - beta12orEarlier - Ecoinformatics - Environmental science - - - - - - - - - - - Electron microscopy - - - SEM - Scanning electron microscopy - TEM - The study of matter by studying the interference pattern from firing electrons at a sample, to analyse structures at resolutions higher than can be achieved using light. - - Transmission electron microscopy - beta12orEarlier - Electron crystallography - Electron diffraction experiment - Single particle electron microscopy - - - - - - - - - - - Cell cycle - - beta13 - beta12orEarlier - true - The cell cycle including key genes and proteins. - - - - - - - - - - Peptides and amino acids - - beta12orEarlier - The physicochemical, biochemical or structural properties of amino acids or peptides. - 1.13 - true - - - - - - - - - - Organelles - - Cell membrane - Cytoplasm - Organelle genes and proteins - Smooth endoplasmic reticulum - beta12orEarlier - Lysosome - Centriole - Ribosome - Nucleus - true - A specific organelle, or organelles in general, typically the genes and proteins (or genome and proteome). 
- Mitochondria - Golgi apparatus - Rough endoplasmic reticulum - 1.3 - - - - - - - - - - Ribosomes - - beta12orEarlier - Ribosomes, typically of ribosome-related genes and proteins. - Ribosome genes and proteins - 1.3 - true - - - - - - - - - - Scents - - A database about scents. - beta12orEarlier - beta13 - true - - - - - - - - - - Drugs and target structures - - beta12orEarlier - The structures of drugs, drug target, their interactions and binding affinities. - true - 1.13 - - - - - - - - - - Model organisms - - This may include information on the genome (including molecular sequences and map, genes and annotation), proteome, as well as more general information about an organism. - beta12orEarlier - A specific organism, or group of organisms, used to study a particular aspect of biology. - true - Organisms - - - - - - - - - - - Genomics - - http://purl.bioontology.org/ontology/MSH/D023281 - Personal genomics - beta12orEarlier - Whole genomes of one or more organisms, or genomes in general, such as meta-information on genomes, genome projects, gene names etc. - true - - - - - - - - - - - Gene and protein families - - - beta12orEarlier - Gene family - A protein families database might include the classifier (e.g. a sequence profile) used to build the classification. - Protein families - Genes, gene family or system - Gene system - Protein sequence classification - Particular gene(s), gene family or other gene group or system and their encoded proteins.Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc., curation of a particular protein or protein family, or any other proteins that have been classified as members of a common group. - true - Gene families - - - - - - - - - - - Chromosomes - - beta12orEarlier - Study of chromosomes. 
- 1.13 - true - - - - - - - - - - Genotype and phenotype - - Genotype and phenotype resources - The study of genetic constitution of a living entity, such as an individual, an organism, a cell and so on, typically with respect to particular observable phenotypic traits, or resources concerning such traits, which might be an aspect of biochemistry, physiology, morphology, anatomy, development and so on. - Genotyping - Phenotyping - true - beta12orEarlier - - - - - - - - - - - Gene expression and microarray - - true - beta12orEarlier - beta12orEarlier - Gene expression e.g. microarray data, northern blots, gene-indexed expression profiles etc. - - - - - - - - - - Probes and primers - - Probes - This includes the design of primers for PCR and DNA amplification or the design of molecular probes. - http://purl.bioontology.org/ontology/MSH/D015335 - Primers - true - beta12orEarlier - Molecular probes (e.g. a peptide probe or DNA microarray probe) or PCR primers and hybridization oligos in a nucleic acid sequence. - - - - - - - - - - - Pathology - - Disease - Diseases, including diseases in general and the genes, gene variations and proteins involved in one or more specific diseases. - true - beta12orEarlier - VT 3.1.6 Pathology - - - - - - - - - - - Specific protein resources - - 1.3 - A particular protein, protein family or other group of proteins. - true - Specific protein - beta12orEarlier - - - - - - - - - - Taxonomy - - true - beta12orEarlier - VT 1.5.25 Taxonomy - Organism classification, identification and naming. - - - - - - - - - - Protein sequence analysis - - beta12orEarlier - Archival, processing and analysis of protein sequences and sequence-based entities such as alignments, motifs and profiles. - 1.8 - true - - - - - - - - - - Nucleic acid sequence analysis - - beta12orEarlier - 1.8 - true - The archival, processing and analysis of nucleotide sequences and sequence-based entities such as alignments, motifs and profiles.
- - - - - - - - - - - Repeat sequences - - true - The repetitive nature of molecular sequences. - beta12orEarlier - 1.3 - - - - - - - - - - Low complexity sequences - - true - The (character) complexity of molecular sequences, particularly regions of low complexity. - 1.3 - beta12orEarlier - - - - - - - - - - Proteome - - A specific proteome including protein sequences and annotation. - beta12orEarlier - beta13 - true - - - - - - - - - - DNA - - DNA analysis - beta12orEarlier - Ancient DNA - Chromosomes - DNA sequences and structure, including processes such as methylation and replication. - The DNA sequences might be coding or non-coding sequences. - - - - - - - - - - Coding RNA - - Protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames - 1.13 - beta12orEarlier - true - - - - - - - - - - Functional, regulatory and non-coding RNA - - - true - small interfering RNA - small nucleolar RNA - ncRNA - Non-coding RNA - Functional RNA - snRNA - Non-coding or functional RNA sequences, including regulatory RNA sequences, ribosomal RNA (rRNA) and transfer RNA (tRNA). - Non-coding RNA includes piwi-interacting RNA (piRNA), small nuclear RNA (snRNA) and small nucleolar RNA (snoRNA). Regulatory RNA includes microRNA (miRNA) - short single stranded RNA molecules that regulate gene expression, and small interfering RNA (siRNA). - Regulatory RNA - siRNA - piRNA - snoRNA - small nuclear RNA - beta12orEarlier - miRNA - microRNA - piwi-interacting RNA - - - - - - - - - - rRNA - - 1.3 - One or more ribosomal RNA (rRNA) sequences. - true - - - - - - - - - - tRNA - - 1.3 - true - One or more transfer RNA (tRNA) sequences. - - - - - - - - - - Protein secondary structure - - true - beta12orEarlier - 1.8 - Protein secondary structure or secondary structure alignments. - This includes assignment, analysis, comparison, prediction, rendering etc. of secondary structure data. 
- - - - - - - - - - RNA structure - - 1.3 - RNA secondary or tertiary structure and alignments. - beta12orEarlier - true - - - - - - - - - - Protein tertiary structure - - 1.8 - true - Protein tertiary structures. - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - Classification of nucleic acid sequences and structures. - 1.3 - true - beta12orEarlier - - - - - - - - - - Protein families - - beta12orEarlier - true - Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc., curation of a particular protein or protein family, or any other proteins that have been classified as members of a common group. - 1.14 - - - - - - - - - - Protein folds and structural domains - - Protein tertiary structural domains and folds in a protein or polypeptide chain. - This includes topological domains such as cytoplasmic regions in a protein. - Protein transmembrane regions - Protein domains - Protein membrane regions - Intramembrane regions - beta12orEarlier - Protein topological domains - true - This includes trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. For example, the location and size of the membrane spanning segments and intervening loop regions, transmembrane region IN/OUT orientation relative to the membrane, plus the following data for each amino acid: A Z-coordinate (the distance to the membrane center), the free energy of membrane insertion (calculated in a sliding window over the sequence) and a reliability score. The z-coordinate implies information about re-entrant helices, interfacial helices, the tilt of a transmembrane helix and loop lengths. - Protein folds - Transmembrane regions - Protein structural domains - - - - - - - - - - Nucleic acid sequence alignment - - beta12orEarlier - true - 1.3 - Nucleotide sequence alignments. 
- - - - - - - - - - Protein sequence alignment - - 1.3 - Protein sequence alignments. - beta12orEarlier - true - A sequence profile typically represents a sequence alignment. - - - - - - - - - - Nucleic acid sites and features - - beta12orEarlier - 1.3 - true - The archival, detection, prediction and analysis of -positional features such as functional sites in nucleotide sequences. - - - - - - - - - - - Protein sites and features - - beta12orEarlier - The detection, identification and analysis of positional features in proteins, such as functional sites. - 1.3 - true - - - - - - - - - - - Transcription factors and regulatory sites - - - - CpG islands - Proteins that bind to DNA and control transcription of DNA to mRNA (transcription factors) and also transcriptional regulatory sites, elements and regions (such as promoters, enhancers, silencers and boundary elements / insulators) in nucleotide sequences. - Enhancers - Attenuators - CAAT signals - Transcriptional regulatory sites - TFBS - CAT box - CCAAT box - This includes CpG rich regions (isochores) in a nucleotide sequence. - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - -10 signals - Transcription factor proteins either promote (as an activator) or block (as a repressor) the binding to DNA of RNA polymerase. Regulatory sites including transcription factor binding site as well as promoters, enhancers, silencers and boundary elements / insulators. - Terminators - TATA signals - GC signals - Promoters - -35 signals - Transcription factors - Isochores - beta12orEarlier - Transcription factor binding sites - - - - - - - - - - Phosphorylation sites - - 1.0 - Protein phosphorylation and phosphorylation sites in protein sequences. 
- true - beta12orEarlier - - - - - - - - - - - Metabolic pathways - - beta12orEarlier - 1.13 - true - Metabolic pathways. - - - - - - - - - - Signaling pathways - - true - Signaling pathways. - 1.13 - beta12orEarlier - - - - - - - - - - Protein and peptide identification - - 1.3 - beta12orEarlier - true - - - - - - - - - - Workflows - - Pipelines - Biological or biomedical analytical workflows or pipelines. - beta12orEarlier - - - - - - - - - Data types and objects - - Structuring data into basic types and (computational) objects. - beta12orEarlier - 1.0 - true - - - - - - - - - - Theoretical biology - - 1.3 - true - - - - - - - - - - Mitochondria - - beta12orEarlier - true - Mitochondria, typically of mitochondrial genes and proteins. - 1.3 - - - - - - - - - - Plants - - The resource may be specific to a plant, a group of plants or all plants. - Plant science - Plants, e.g. information on a specific plant genome including molecular sequences, genes and annotation. - Plant biology - Botany - VT 1.5.22 Plant science - Plant - VT 1.5.10 Botany - beta12orEarlier - - - - - - - - - - Viruses - - Virology - VT 1.5.28 Virology - beta12orEarlier - Viruses, e.g. sequence and structural data, interactions of viral proteins, or a viral genome including molecular sequences, genes and annotation. - The resource may be specific to a virus, a group of viruses or all viruses. - - - - - - - - - - Fungi - - Mycology - beta12orEarlier - The resource may be specific to a fungus, a group of fungi or all fungi. - Yeast - VT 1.5.21 Mycology - Fungi and molds, e.g. information on a specific fungal genome including molecular sequences, genes and annotation. - - - - - - - - - - Pathogens - - Pathogens, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a pathogen, a group of pathogens or all pathogens. - - - - - - - - - - Arabidopsis - - beta12orEarlier - Arabidopsis-specific data. 
- 1.3 - true - - - - - - - - - - Rice - - Rice-specific data. - true - 1.3 - beta12orEarlier - - - - - - - - - - Genetic mapping and linkage - - Linkage mapping - beta12orEarlier - 1.3 - true - Genetic linkage - Informatics resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. - - - - - - - - - - Comparative genomics - - The study (typically comparison) of the sequence, structure or function of multiple genomes. - true - beta12orEarlier - - - - - - - - - - - Mobile genetic elements - - Transposons - beta12orEarlier - Mobile genetic elements, such as transposons, Plasmids, Bacteriophage elements and Group II introns. - - - - - - - - - - Human disease - - Human diseases, typically describing the genes, mutations and proteins implicated in disease. - beta13 - true - beta12orEarlier - - - - - - - - - - Immunology - - VT 3.1.3 Immunology - Immunoinformatics - http://purl.bioontology.org/ontology/MSH/D007120 - http://purl.bioontology.org/ontology/MSH/D007125 - beta12orEarlier - true - Computational immunology - The application of information technology to immunology such as immunological processes, immunological genes, proteins and peptide ligands, antigens and so on. - - - - - - - - - - - Membrane and lipoproteins - - Lipoproteins (protein-lipid assemblies), and proteins or region of a protein that spans or are associated with a membrane. - true - beta12orEarlier - Membrane proteins - Lipoproteins - Transmembrane proteins - - - - - - - - - - Enzymes - - Proteins that catalyze chemical reaction, the kinetics of enzyme-catalysed reactions, enzyme nomenclature etc. - beta12orEarlier - Enzymology - true - - - - - - - - - - Primers - - true - 1.13 - PCR primers and hybridization oligos in a nucleic acid sequence. 
- beta12orEarlier - - - - - - - - - - PolyA signal or sites - - beta12orEarlier - 1.13 - true - Regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. - - - - - - - - - - CpG island and isochores - - beta12orEarlier - 1.13 - true - CpG rich regions (isochores) in a nucleotide sequence. - - - - - - - - - - Restriction sites - - Restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Splice sites - - beta12orEarlier - Splice sites in a nucleotide sequence or alternative RNA splicing events. - 1.13 - true - - - - - - - - - - - Matrix/scaffold attachment sites - - 1.13 - true - beta12orEarlier - Matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - - - - - - - - - - Operon - - beta12orEarlier - 1.13 - true - Operons (operators, promoters and genes) from a bacterial genome. - - - - - - - - - - Promoters - - true - 1.13 - Whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - beta12orEarlier - - - - - - - - - - Structural biology - - Structural assignment - Structure determination - This includes experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - 1.3 - This includes Informatics concerning data generated from the use of microscopes, including optical, electron and scanning probe microscopy. Includes methods for digitizing microscope images and viewing the produced virtual slides and associated data on a computer screen. - The molecular structure of biological molecules, particularly macromolecules such as proteins and nucleic acids. 
- true - VT 1.5.24 Structural biology - Structural determination - - - - - - - - - - - Protein membrane regions - - 1.8 - 1.13 - true - Trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Structure comparison - - This might involve comparison of secondary or tertiary (3D) structural information. - true - The comparison of two or more molecular structures, for example structure alignment and clustering. - 1.13 - beta12orEarlier - - - - - - - - - - Function analysis - - true - Protein function prediction - The study of gene and protein function including the prediction of functional properties of a protein. - Protein function analysis - beta12orEarlier - - - - - - - - - - - Prokaryotes and archae - - The resource may be specific to a prokaryote, a group of prokaryotes or all prokaryotes. - VT 1.5.2 Bacteriology - Bacteriology - beta12orEarlier - Specific bacteria or archaea, e.g. information on a specific prokaryote genome including molecular sequences, genes and annotation. - - - - - - - - - - Protein databases - - true - 1.3 - Protein data resources. - beta12orEarlier - Protein data resources - - - - - - - - - - Structure determination - - Experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - beta12orEarlier - true - 1.3 - - - - - - - - - - Cell biology - - beta12orEarlier - true - VT 1.5.11 Cell biology - Cellular processes - Cells, such as key genes and proteins involved in the cell cycle. - - - - - - - - - - Classification - - beta13 - beta12orEarlier - Topic focused on identifying, grouping, or naming things in a structured way according to some schema based on observable relationships. 
- true - - - - - - - - - - Lipoproteins - - true - 1.3 - beta12orEarlier - Lipoproteins (protein-lipid assemblies). - - - - - - - - - - Phylogeny visualisation - - true - Visualise a phylogeny, for example, render a phylogenetic tree. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cheminformatics - - The application of information technology to chemistry in biological research environment. - Chemical informatics - beta12orEarlier - Chemoinformatics - true - - - - - - - - - - - Systems biology - - http://en.wikipedia.org/wiki/Systems_biology - This includes databases of models and methods to construct or analyse a model. - Biological models - http://purl.bioontology.org/ontology/MSH/D049490 - true - beta12orEarlier - Biological modelling - Biological system modelling - The holistic modelling and analysis of complex biological systems and the interactions therein. - - - - - - - - - - - Statistics and probability - - Biostatistics - Probability - http://en.wikipedia.org/wiki/Biostatistics - beta12orEarlier - The application of statistical methods to biological problems. - Statistics - http://purl.bioontology.org/ontology/MSH/D056808 - - - - - - - - - - - Structure database search - - The query is a structure-based entity such as another structure, a 3D (structural) motif, 3D profile or template. - beta12orEarlier - Search for and retrieve molecular structures that are similar to a structure-based query (typically another structure or part of a structure). - beta12orEarlier - true - - - - - - - - - - Molecular modelling - - Molecular docking - Homology modeling - beta12orEarlier - Comparative modelling - Homology modelling - Molecular modeling - Comparative modeling - true - The construction, analysis, evaluation, refinement etc. of models of a molecules properties or behaviour, including the modelling the structure of proteins in complex with small molecules or other macromolecules (docking). 
- - - - - - - - - - Protein function prediction - - 1.2 - beta12orEarlier - true - The prediction of functional properties of a protein. - - - - - - - - - - SNP - - true - Single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. - beta12orEarlier - 1.13 - - - - - - - - - - Transmembrane protein prediction - - Predict transmembrane domains and topology in protein sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - - Nucleic acid structure comparison - - The comparison two or more nucleic acid (typically RNA) secondary or tertiary structures. - beta12orEarlier - true - beta12orEarlier - Use this concept for methods that are exclusively for nucleic acid structures. - - - - - - - - - - - Exons - - beta12orEarlier - true - Exons in a nucleotide sequences. - 1.13 - - - - - - - - - - Gene transcription - - Transcription of DNA into RNA including the regulation of transcription. - true - 1.13 - beta12orEarlier - - - - - - - - - - DNA mutation - - - beta12orEarlier - DNA mutation. - - - - - - - - - - Oncology - - beta12orEarlier - VT 3.2.16 Oncology - Cancer - true - The study of cancer, for example, genes and proteins implicated in cancer. - Cancer biology - - - - - - - - - - - Toxins and targets - - 1.13 - beta12orEarlier - true - Structural and associated data for toxic chemical substances. - - - - - - - - - - Introns - - 1.13 - Introns in a nucleotide sequences. - beta12orEarlier - true - - - - - - - - - - Tool topic - - beta12orEarlier - A topic concerning primarily bioinformatics software tools, typically the broad function or purpose of a tool. - true - beta12orEarlier - - - - - - - - - - Study topic - - A general area of bioinformatics study, typically the broad scope or category of content of a bioinformatics journal or conference proceeding. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Nomenclature - - true - 1.3 - beta12orEarlier - Biological nomenclature (naming), symbols and terminology. - - - - - - - - - - Disease genes and proteins - - 1.3 - true - beta12orEarlier - The genes, gene variations and proteins involved in one or more specific diseases. - - - - - - - - - - Protein structure analysis - - - Protein structure - true - Protein secondary or tertiary structural data and/or associated annotation. - http://edamontology.org/topic_3040 - beta12orEarlier - - - - - - - - - - - Humans - - beta12orEarlier - The human genome, including molecular sequences, genes, annotation, maps and viewers, the human proteome or human beings in general. - - - - - - - - - - Gene resources - - Gene resource - beta12orEarlier - 1.3 - Informatics resource (typically a database) primarily focussed on genes. - Gene database - true - - - - - - - - - - Yeast - - beta12orEarlier - Yeast, e.g. information on a specific yeast genome including molecular sequences, genes and annotation. - true - 1.3 - - - - - - - - - - Eukaryotes - - Eukaryote - Eukaryotes or data concerning eukaryotes, e.g. information on a specific eukaryote genome including molecular sequences, genes and annotation. - The resource may be specific to a eukaryote, a group of eukaryotes or all eukaryotes. - beta12orEarlier - - - - - - - - - - Invertebrates - - The resource may be specific to an invertebrate, a group of invertebrates or all invertebrates. - beta12orEarlier - Invertebrates, e.g. information on a specific invertebrate genome including molecular sequences, genes and annotation. - - - - - - - - - - Vertebrates - - The resource may be specific to a vertebrate, a group of vertebrates or all vertebrates. - Vertebrates, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - - - - - - - - - - Unicellular eukaryotes - - Unicellular eukaryotes, e.g. 
information on a unicellular eukaryote genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a unicellular eukaryote, a group of unicellular eukaryotes or all unicellular eukaryotes. - - - - - - - - - - Protein structure alignment - - Protein secondary or tertiary structure alignments. - beta12orEarlier - true - 1.3 - - - - - - - - - - X-ray diffraction - - - The study of matter and their structure by means of the diffraction of X-rays, typically the diffraction pattern caused by the regularly spaced atoms of a crystalline sample. - beta12orEarlier - X-ray microscopy - Crystallography - X-ray crystallography - - - - - - - - - - - Ontologies, nomenclature and classification - - true - Conceptualisation, categorisation and naming of entities or phenomena within biology or bioinformatics. - 1.3 - http://purl.bioontology.org/ontology/MSH/D002965 - beta12orEarlier - - - - - - - - - - Immunoproteins, genes and antigens - - - Immunopeptides - Immunity-related genes, proteins and their ligands. - Antigens - This includes T cell receptors (TR), major histocompatibility complex (MHC), immunoglobulin superfamily (IgSF) / antibodies, major histocompatibility complex superfamily (MhcSF), etc." - beta12orEarlier - Immunoproteins - Immunogenes - - - - - - - - - - - Molecules - - CHEBI:23367 - beta12orEarlier - beta12orEarlier - Specific molecules, including large molecules built from repeating subunits (macromolecules) and small molecules of biological significance. - true - - - - - - - - - - Toxicology - - - Toxins and the adverse effects of these chemical substances on living organisms. - VT 3.1.9 Toxicology - Toxicoinformatics - true - beta12orEarlier - Computational toxicology - - - - - - - - - - - High-throughput sequencing - - Next-generation sequencing - beta13 - true - beta12orEarlier - Parallelized sequencing processes that are capable of sequencing many thousands of sequences simultaneously. 
- - - - - - - - - - Structural clustering - - The comparison and grouping together of molecular structures on the basis of similarity; generate, process or analyse structural clusters. - 1.7 - Structure classification - true - beta12orEarlier - - - - - - - - - - Gene regulatory networks - - Gene regulatory networks. - true - 1.13 - beta12orEarlier - - - - - - - - - - Disease (specific) - - Informatics resources dedicated to one or more specific diseases (not diseases in general). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - VNTR - - Variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Microsatellites - - true - 1.13 - beta12orEarlier - Microsatellite polymorphism in a DNA sequence. - - - - - - - - - - - RFLP - - Restriction fragment length polymorphisms (RFLP) in a DNA sequence. - true - 1.13 - beta12orEarlier - - - - - - - - - - - DNA polymorphism - - - Includes restriction fragment length polymorphisms (RFLP) in a DNA sequence. An RFLP is defined by the presence or absence of a specific restriction site of a bacterial restriction enzyme. - true - RFLP - Single nucleotide polymorphism - Microsatellites - VNTR - SNP - Includes microsatellite polymorphism in a DNA sequence. A microsatellite polymorphism is a very short subsequence that is repeated a variable number of times between individuals. These repeats consist of the nucleotides cytosine and adenosine. - DNA polymorphism. - Variable number of tandem repeat polymorphism - Includes single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. A SNP is a DNA sequence variation where a single nucleotide differs between members of a species or paired chromosomes in an individual. - beta12orEarlier - Includes variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. 
VNTRs occur in non-coding regions of DNA and consists sub-sequence that is repeated a multiple (and varied) number of times. - - - - - - - - - - Nucleic acid design - - Topic for the design of nucleic acid sequences with specific conformations. - 1.3 - beta12orEarlier - true - - - - - - - - - - Primer or probe design - - 1.3 - true - beta13 - The design of primers for PCR and DNA amplification or the design of molecular probes. - - - - - - - - - - Structure databases - - beta13 - true - 1.2 - Structure data resources - Molecular secondary or tertiary (3D) structural data resources, typically of proteins and nucleic acids. - - - - - - - - - - Nucleic acid structure - - true - beta13 - Nucleic acid (secondary or tertiary) structure, such as whole structures, structural features and associated annotation. - 1.2 - - - - - - - - - - Sequence databases - - Molecular sequence data resources, including sequence sites, alignments, motifs and profiles. - true - beta13 - Sequence data resources - Sequence data - Sequence data resource - 1.3 - - - - - - - - - - Nucleic acid sequences - - Nucleotide sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - Nucleotide sequences - - - - - - - - - - Protein sequences - - Protein sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - - - - - - - - - - Protein interaction networks - - 1.3 - true - - - - - - - - - - Molecular biology - - true - VT 1.5.4 Biochemistry and molecular biology - beta13 - The molecular basis of biological activity, particularly the macromolecules (e.g. proteins and nucleic acids) that are essential to life. - - - - - - - - - - - Mammals - - true - beta13 - 1.3 - Mammals, e.g. information on a specific mammal genome including molecular sequences, genes and annotation. - - - - - - - - - - Biodiversity - - The degree of variation of life forms within a given ecosystem, biome or an entire planet. 
- beta13 - VT 1.5.5 Biodiversity conservation - true - http://purl.bioontology.org/ontology/MSH/D044822 - - - - - - - - - - - Sequence clusters and classification - - This includes the results of sequence clustering, ortholog identification, assignment to families, annotation etc. - The comparison, grouping together and classification of macromolecules on the basis of sequence similarity. - Sequence families - 1.3 - true - Sequence clusters - beta13 - - - - - - - - - - Genetics - - http://purl.bioontology.org/ontology/MSH/D005823 - true - The study of genes, genetic variation and heredity in living organisms. - beta13 - Heredity - - - - - - - - - - - Quantitative genetics - - beta13 - The genes and genetic mechanisms such as Mendelian inheritance that underly continuous phenotypic traits (such as height or weight). - true - - - - - - - - - - Population genetics - - The distribution of allele frequencies in a population of organisms and its change subject to evolutionary processes including natural selection, genetic drift, mutation and gene flow. - true - beta13 - - - - - - - - - - - Regulatory RNA - - 1.3 - Regulatory RNA sequences including microRNA (miRNA) and small interfering RNA (siRNA). - true - beta13 - - - - - - - - - - Documentation and help - - The documentation of resources such as tools, services and databases and how to get help. - true - beta13 - 1.13 - - - - - - - - - - Genetic organisation - - The structural and functional organisation of genes and other genetic elements. - 1.3 - beta13 - true - - - - - - - - - - Medical informatics - - true - Health informatics - Clinical informatics - Biomedical informatics - Translational medicine - The application of information technology to health, disease and biomedicine. - Healthcare informatics - beta13 - Health and disease - Molecular medicine - - - - - - - - - - - Developmental biology - - VT 1.5.14 Developmental biology - true - beta13 - How organisms grow and develop. 
- - - - - - - - - - - Embryology - - true - beta13 - The development of organisms between the one-cell stage (typically the zygote) and the end of the embryonic stage. - - - - - - - - - - - Anatomy - - VT 3.1.1 Anatomy and morphology - beta13 - The form and function of the structures of living organisms. - true - - - - - - - - - - - Literature and reference - - beta13 - true - http://purl.bioontology.org/ontology/MSH/D011642 - The scientific literature, reference information and documentation. - Literature sources - Bibliography - This includes the documentation of resources such as tools, services and databases, user support, how to get help etc. - Documentation - - - - - - - - - - - Biology - - VT 1.5.8 Biology - beta13 - VT 1.5 Biological sciences - VT 1.5.23 Reproductive biology - Cryobiology - Biological rhythms - A particular biological science, especially observable traits such as aspects of biochemistry, physiology, morphology, anatomy, development and so on. - VT 1.5.7 Biological rhythm - Biological science - Aerobiology - VT 1.5.99 Other - Chronobiology - true - VT 1.5.13 Cryobiology - - VT 1.5.1 Aerobiology - VT 1.5.3 Behavioural biology - Reproductive biology - Behavioural biology - - - - - - - - - - - Data management - - The development and use of architectures, policies, practices and procedures for management of data. - true - beta13 - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - VT 1.3.1 Data management - - - - - - - - - - - Sequence feature detection - - 1.3 - true - beta13 - The detection of the positional features, such as functional and other key sites, in molecular sequences. - http://purl.bioontology.org/ontology/MSH/D058977 - - - - - - - - - - Nucleic acid feature detection - - The detection of positional features such as functional sites in nucleotide sequences. 
- true - beta13 - 1.3 - - - - - - - - - - Protein feature detection - - The detection, identification and analysis of positional protein sequence features, such as functional sites. - beta13 - 1.3 - true - - - - - - - - - - Biological system modelling - - 1.2 - true - beta13 - Topic for modelling biological systems in mathematical terms. - - - - - - - - - - Data acquisition - - The acquisition of data, typically measurements of physical systems using any type of sampling system, or by another other means. - beta13 - - - - - - - - - - Genes and proteins resources - - 1.3 - Gene family - beta13 - Gene and protein families - Specific genes and/or their encoded proteins or a family or other grouping of related genes and proteins. - true - - - - - - - - - - Protein topological domains - - 1.13 - Topological domains such as cytoplasmic regions in a protein. - true - 1.8 - - - - - - - - - - Protein variants - - beta13 - true - Protein sequence variants produced e.g. from alternative splicing, alternative promoter usage, alternative initiation and ribosomal frameshifting. - - - - - - - - - - - Expression signals - - beta13 - true - 1.12 - Regions within a nucleic acid sequence containing a signal that alters a biological function. - - - - - - - - - - DNA binding sites - - - Matrix-attachment region - beta13 - Nucleosome exclusion sequences - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes), restriction enzyme recognition sites (restriction sites) etc. - Restriction sites - Ribosome binding sites - Scaffold-attachment region - This includes sites involved with DNA replication and recombination. This includes binding sites for initiation of replication (origin of replication), regions where transfer is initiated during the conjugation or mobilization (origin of transfer), starting sites for DNA duplication (origin of replication) and regions which are eliminated through any of kind of recombination. Also nucleosome exclusion regions, i.e. 
specific patterns or regions which exclude nucleosomes (the basic structural units of eukaryotic chromatin which play a significant role in regulating gene expression). - Nucleic acids binding to some other molecule. - Matrix/scaffold attachment region - - - - - - - - - - - Nucleic acid repeats - - true - beta13 - This includes long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a defined sequence and other types of repeating unit. - Repetitive elements within a nucleic acid sequence. - 1,13 - - - - - - - - - - DNA replication and recombination - - DNA replication or recombination. - beta13 - true - - - - - - - - - - Signal or transit peptide - - beta13 - 1.13 - true - Coding sequences for a signal or transit peptide. - - - - - - - - - - Sequence tagged sites - - beta13 - 1.13 - Sequence tagged sites (STS) in nucleic acid sequences. - true - - - - - - - - - - Sequencing - - Resequencing - true - http://purl.bioontology.org/ontology/MSH/D059014 - Chromosome walking - NGS - Next gen sequencing - DNA-Seq - High throughput sequencing - 1.1 - Primer walking - Next generation sequencing - The determination of complete (typically nucleotide) sequences, including those of genomes (full genome sequencing, de novo sequencing and resequencing), amplicons and transcriptomes. - - - - - - - - - - - ChIP-seq - - - Chip sequencing - 1.1 - The analysis of protein-DNA interactions where chromatin immunoprecipitation (ChIP) is used in combination with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. - Chip Seq - Chip-sequencing - - - - - - - - - RNA-Seq - - Small RNA-seq - Whole transcriptome shotgun sequencing - RNA-seq - miRNA-seq - 1.1 - A topic concerning high-throughput sequencing of cDNA to measure the RNA content (transcriptome) of a sample, for example, to investigate how different alleles of a gene are expressed, detect post-transcriptional mutations or identify gene fusions. 
- Small RNA-Seq - WTSS - This includes small RNA profiling (small RNA-Seq), for example to find novel small RNAs, characterize mutations and analyze expression of small RNAs. - - - - - - - - - DNA methylation - - true - DNA methylation including bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - 1.3 - http://purl.bioontology.org/ontology/MSH/D019175 - 1.1 - - - - - - - - - - Metabolomics - - The systematic study of metabolites, the chemical processes they are involved, and the chemical fingerprints of specific cellular processes in a whole cell, tissue, organ or organism. - true - http://purl.bioontology.org/ontology/MSH/D055432 - 1.1 - - - - - - - - - - - Epigenomics - - - Epigenetics concerns the heritable changes in gene expression owing to mechanisms other than DNA sequence variation. - 1.1 - http://purl.bioontology.org/ontology/MSH/D057890 - The study of the epigenetic modifications of a whole cell, tissue, organism etc. - true - - - - - - - - - - - Metagenomics - - - http://purl.bioontology.org/ontology/MSH/D056186 - Ecogenomics - Community genomics - Environmental genomics - true - 1.1 - The study of genetic material recovered from environmental samples, and associated environmental data. - - - - - - - - - - - DNA structural variation - - - 1.1 - Variation in chromosome structure including microscopic and submicroscopic types of variation such as deletions, duplications, copy-number variants, insertions, inversions and translocations. - Structural variation - Genomic structural variation - - - - - - - - - - DNA packaging - - Nucleosome positioning - beta12orEarlier - DNA-histone complexes (chromatin), organisation of chromatin into nucleosomes and packaging into higher-order structures. 
- http://purl.bioontology.org/ontology/MSH/D042003 - - - - - - - - - - DNA-Seq - - 1.1 - A topic concerning high-throughput sequencing of randomly fragmented genomic DNA, for example, to investigate whole-genome sequencing and resequencing, SNP discovery, identification of copy number variations and chromosomal rearrangements. - 1.3 - DNA-seq - true - - - - - - - - - - RNA-Seq alignment - - true - 1.3 - RNA-seq alignment - The alignment of sequences of (typically millions) of short reads to a reference genome. This is a specialised topic within sequence alignment, especially because of complications arising from RNA splicing. - beta12orEarlier - - - - - - - - - - ChIP-on-chip - - ChiP - ChIP-Chip - 1.1 - Experimental techniques that combine chromatin immunoprecipitation ('ChIP') with microarray ('chip'). ChIP-on-chip is used for high-throughput study protein-DNA interactions. - ChIP-chip - - - - - - - - - Data security - - 1.3 - Data privacy - The protection of data, such as patient health data, from damage or unwanted access from unauthorized users. - - - - - - - - - - Sample collections - - samples - biobanking - 1.3 - biosamples - Biological samples and specimens. - Specimen collections - - - - - - - - - - - Biochemistry - - - VT 1.5.4 Biochemistry and molecular biology - Chemical biology - 1.3 - Biological chemistry - true - Chemical substances and physico-chemical processes and that occur within living organisms. - - - - - - - - - - - Phylogenetics - - - The study of evolutionary relationships amongst organisms from analysis of genetic information (typically gene or protein sequences). - 1.3 - http://purl.bioontology.org/ontology/MSH/D010802 - true - - - - - - - - - - Epigenetics - - Topic concerning the study of heritable changes, for example in gene expression or phenotype, caused by mechanisms other than changes in the DNA sequence. - This includes sub-topics such as histone modification and DNA methylation. 
DNA methylation includes bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - http://purl.bioontology.org/ontology/MSH/D019175 - DNA methylation - Bisulfite sequencing - Histone modification - true - 1.3 - - - - - - - - - - - Biotechnology - - true - 1.3 - The exploitation of biological process, structure and function for industrial purposes, for example the genetic manipulation of microorganisms for the antibody production. - - - - - - - - - - - Phenomics - - - - Phenomes, or the study of the change in phenotype (the physical and biochemical traits of organisms) in response to genetic and environmental factors. - 1.3 - true - - - - - - - - - - - Evolutionary biology - - VT 1.5.16 Evolutionary biology - true - 1.3 - The evolutionary processes, from the genetic to environmental scale, that produced life in all its diversity. - - - - - - - - - - - Physiology - - The functions of living organisms and their constituent parts. - 1.3 - VT 3.1.8 Physiology - true - - - - - - - - - - - Microbiology - - true - The biology of microorganisms. - 1.3 - VT 1.5.20 Microbiology - - - - - - - - - - - Parasitology - - true - 1.3 - The biology of parasites. - - - - - - - - - - - Medicine - - General medicine - Research in support of healing by diagnosis, treatment, and prevention of disease. - true - 1.3 - VT 3.1 Basic medicine - VT 3.2.9 General and internal medicine - Experimental medicine - Biomedical research - Clinical medicine - VT 3.2 Clinical medicine - Internal medicine - - - - - - - - - - - Neurobiology - - Neuroscience - 1.3 - true - The study of the nervous system and brain; its anatomy, physiology and function. - VT 3.1.5 Neuroscience - - - - - - - - - - - Public health and epidemiology - - VT 3.3.1 Epidemiology - Topic concerning the the patterns, cause, and effect of disease within populations. 
- true - 1.3 - Public health - Epidemiology - - - - - - - - - - - Biophysics - - - 1.3 - true - VT 1.5.9 Biophysics - The use of physics to study biological system. - - - - - - - - - - - Computational biology - - VT 1.5.19 Mathematical biology - VT 1.5.12 Computational biology - This includes the modeling and treatment of biological processes and systems in mathematical terms (theoretical biology). - Mathematical biology - VT 1.5.26 Theoretical biology - Theoretical biology - 1.3 - The development and application of theory, analytical methods, mathematical models and computational simulation of biological systems. - true - Biomathematics - - - - - - - - - - - Transcriptomics - - - Comparative transcriptomics - Metatranscriptomics - The analysis of transcriptomes, or a set of all the RNA molecules in a specific cell, tissue etc. - Transcriptome - 1.3 - true - - - - - - - - - - - Chemistry - - VT 1.7.10 Polymer science - VT 1.7.7 Mathematical chemistry - VT 1.7.3 Colloid chemistry - 1.3 - Mathematical chemistry - Physical chemistry - VT 1.7.9 Physical chemistry - Polymer science - Chemical science - Organic chemistry - VT 1.7.6 Inorganic and nuclear chemistry - VT 1.7 Chemical sciences - VT 1.7.5 Electrochemistry - Inorganic chemistry - VT 1.7.2 Chemistry - Nuclear chemistry - VT 1.7.8 Organic chemistry - The composition and properties of matter, reactions, and the use of reactions to create new substances. - - - - - - - - - - - Mathematics - - The study of numbers (quantity) and other topics including structure, space, and change. - VT:1.1 Mathematics - Maths - VT 1.1.99 Other - 1.3 - - - - - - - - - - - Computer science - - 1.3 - VT 1.2 Computer sciences - VT 1.2.99 Other - The theory and practical use of computer systems. - - - - - - - - - - - Physics - - The study of matter, space and time, and related concepts such as energy and force. 
- 1.3 - - - - - - - - - - - RNA splicing - - - This includes the study of splice sites, splicing patterns, alternative splicing events and variants, isoforms, etc.. - Splice sites - RNA splicing; post-transcription RNA modification involving the removal of introns and joining of exons. - 1.3 - Alternative splicing - true - - - - - - - - - - Molecular genetics - - - 1.3 - The structure and function of genes at a molecular level. - true - - - - - - - - - - - Respiratory medicine - - true - VT 3.2.25 Respiratory systems - Pulmonology - The study of respiratory system. - Pulmonary medicine - Respiratory disease - 1.3 - Pulmonary disorders - - - - - - - - - - - Metabolic disease - - The study of metabolic diseases. - 1.4 - 1.3 - true - - - - - - - - - - Infectious disease - - Transmissable disease - VT 3.3.4 Infectious diseases - Communicable disease - The branch of medicine that deals with the prevention, diagnosis and management of transmissable disease with clinically evident illness resulting from infection with pathogenic biological agents (viruses, bacteria, fungi, protozoa, parasites and prions). - 1.3 - - - - - - - - - - - Rare diseases - - 1.3 - The study of rare diseases. - - - - - - - - - - - Computational chemistry - - - 1.3 - VT 1.7.4 Computational chemistry - true - Topic concerning the development and application of theory, analytical methods, mathematical models and computational simulation of chemical systems. - - - - - - - - - - - Neurology - - Neurological disorders - true - 1.3 - The branch of medicine that deals with the anatomy, functions and disorders of the nervous system. - - - - - - - - - - - Cardiology - - true - Cardiovascular disease - VT 3.2.4 Cardiac and Cardiovascular systems - 1.3 - Cardiovascular medicine - Heart disease - VT 3.2.22 Peripheral vascular disease - The diseases and abnormalities of the heart and circulatory system. - - - - - - - - - - - Drug discovery - - - The discovery and design of drugs or potential drug compounds. 
- This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - 1.3 - true - - - - - - - - - - - Biobank - - true - biobanking - 1.3 - Repositories of biological samples, typically human, for basic biological and clinical research. - Tissue collection - - - - - - - - - - - Mouse clinic - - 1.3 - Laboratory study of mice, for example, phenotyping, and mutagenesis of mouse cell lines. - - - - - - - - - - - Microbial collection - - Collections of microbial cells including bacteria, yeasts and moulds. - 1.3 - - - - - - - - - - - Cell culture collection - - 1.3 - Collections of cells grown under laboratory conditions, specifically, cells from multi-cellular eukaryotes and especially animal cells. - - - - - - - - - - - Clone library - - 1.3 - Collections of DNA, including both collections of cloned molecules, and populations of micro-organisms that store and propagate cloned DNA. - - - - - - - - - - - Translational medicine - - 'translating' the output of basic and biomedical research into better diagnostic tools, medicines, medical procedures, policies and advice. - true - 1.3 - - - - - - - - - - - Compound libraries and screening - - Translational medicine - Chemical library - Collections of chemicals, typically for use in high-throughput screening experiments. - Compound library - Chemical screening - 1.3 - - - - - - - - - - - Biomedical science - - Topic concerning biological science that is (typically) performed in the context of medicine. - true - VT 3.3 Health sciences - Health science - 1.3 - - - - - - - - - - - Data identity and mapping - - Topic concerning the identity of biological entities, or reports on such entities, and the mapping of entities and records in different databases. - 1.3 - - - - - - - - - - - Sequence search - - 1.3 - Sequence database search - true - 1.12 - The search and retrieval from a database on the basis of molecular sequence similarity. 
- - - - - - - - - - Biomarkers - - Diagnostic markers - 1.4 - Objective indicators of biological state often used to assess health, and determinate treatment. - true - - - - - - - - - - Laboratory techniques - - The procedures used to conduct an experiment. - Lab techniques - 1.4 - - - - - - - - - - - Data architecture, analysis and design - - The development of policies, models and standards that cover data acquisitioin, storage and integration, such that it can be put to use, typically through a process of systematically applying statistical and / or logical techniques to describe, illustrate, summarise or evaluate data. - Data analysis - Data design - 1.4 - Data architecture - - - - - - - - - - - Data integration and warehousing - - The combination and integration of data from different sources, for example into a central repository or warehouse, to provide users with a unified view of these data. - - - Data integration - 1.4 - Data warehousing - - - - - - - - - - - Biomaterials - - Any matter, surface or construct that interacts with a biological system. - Diagnostic markers - 1.4 - - - - - - - - - - - Chemical biology - - - true - 1.4 - The use of synthetic chemistry to study and manipulate biological systems. - - - - - - - - - - - Analytical chemistry - - 1.4 - The study of the separation, identification, and quantification of the chemical components of natural and artificial materials. - VT 1.7.1 Analytical chemistry - - - - - - - - - - - Synthetic chemistry - - Synthetic organic chemistry - The use of chemistry to create new compounds. - 1.4 - - - - - - - - - - - Software engineering - - VT 1.2.1 Algorithms - Programming languages - VT 1.2.7 Data structures - Software development - Software engineering - Computer programming - 1.4 - 1.2.12 Programming languages - The process that leads from an original formulation of a computing problem to executable programs. 
- Data structures - Algorithms - VT 1.2.14 Software engineering - - - - - - - - - - - Drug development - - 1.4 - Medicine development - The process of bringing a new drug to market once a lead compound has been identified through drug discovery. - Drug development science - Medicines development - true - - - - - - - - - - - Drug formulation and delivery - - The process of formulating and administering a pharmaceutical compound to achieve a therapeutic effect. - Drug delivery - Drug formulation - 1.4 - - - - - - - - - - - Pharmacokinetics and pharmacodynamics - - Pharmacodynamics - Pharmacokinetics - Drug distribution - true - 1.4 - Drug excretion - The study of how a drug interacts with the body. - Drug absorption - ADME - Drug metabolism - Drug metabolism - - - - - - - - - - - Medicines research and development - Medicine research and development - - The discovery, development and approval of medicines. - Health care research - Drug discovery and development - 1.4 - Health care science - - - - - - - - - - - Safety sciences - - 1.4 - Drug safety - The safety (or lack) of drugs and other medical interventions. - - - - - - - - - - - Pharmacovigilance - - 1.4 - Pharmacovigilance concerns safety once a drug has gone to market. - The detection, assessment, understanding and prevention of adverse effects of medicines. - - - - - - - - - - - Preclinical and clinical studies - - - The testing of new medicines, vaccines or procedures on animals (preclinical) and humans (clinical) prior to their approval by regulatory authorities. - Preclinical studies - 1.4 - Clinical study - Preclinical study - Clinical studies - - - - - - - - - - - Imaging - - true - Microscopy imaging - Microscopy - Diffraction experiment - The visual representation of an object.
- This includes diffraction experiments that are based upon the interference of waves, typically electromagnetic waves such as X-rays or visible light, by some object being studied, typical in order to produce an image of the object or determine its structure. - 1.4 - - - - - - - - - - - Biological imaging - - The use of imaging techniques to understand biology. - 1.4 - - - - - - - - - - - Medical imaging - - VT 3.2.24 Radiology - The use of imaging techniques for clinical purposes for medical research. - 1.4 - Radiology - VT 3.2.14 Nuclear medicine - Nuclear medicine - VT 3.2.13 Medical imaging - - - - - - - - - - - Light microscopy - - The use of optical instruments to magnify the image of an object. - 1.4 - - - - - - - - - - - Laboratory animal science - - 1.4 - The use of animals and alternatives in experimental research. - - - - - - - - - - - Marine biology - - 1.4 - VT 1.5.18 Marine and Freshwater biology - true - The study of organisms in the ocean or brackish waters. - - - - - - - - - - - Molecular medicine - - The identification of molecular and genetic causes of disease and the development of interventions to correct them. - 1.4 - true - - - - - - - - - - - Nutritional science - - 1.4 - VT 3.3.7 Nutrition and Dietetics - Dietetics - The study of the effects of food components on the metabolism, health, performance and disease resistance of humans and animals. It also includes the study of human behaviours related to food choices. - Nutrition science - - - - - - - - - - - Omics - - true - The collective characterisation and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms. - 1.4 - - - - - - - - - - - Quality affairs - - The processes that need to be in place to ensure the quality of products for human or animal use. 
- Good clinical practice - Good manufacturing practice - Quality assurance - Good laboratory practice - 1.4 - - - - - - - - - - - Regulatory affairs - - The protection of public health by controlling the safety and efficacy of products in areas including pharmaceuticals, veterinary medicine, medical devices, pesticides, agrochemicals, cosmetics, and complementary medicines. - 1.4 - - - - - - - - - - - Regenerative medicine - - Stem cell research - Biomedical approaches to clinical interventions that involve the use of stem cells. - true - 1.4 - - - - - - - - - - - Systems medicine - - true - 1.4 - An interdisciplinary field of study that looks at the dynamic systems of the human body as part of an integrated whole, incorporating biochemical, physiological, and environmental interactions that sustain life. - - - - - - - - - - - Veterinary medicine - - Topic concerning the branch of medicine that deals with the prevention, diagnosis, and treatment of disease, disorder and injury in animals. - 1.4 - - - - - - - - - - - Bioengineering - - 1.4 - The application of biological concepts and methods to the analytical and synthetic methodologies of engineering. - Diagnostic markers - - - - - - - - - - - Geriatric medicine - - The branch of medicine dealing with the diagnosis, treatment and prevention of disease in older people, and the problems specific to aging. - VT 3.2.10 Geriatrics and gerontology - true - Ageing - Gerontology - Aging - 1.4 - Geriatrics - - - - - - - - - - - Allergy, clinical immunology and immunotherapeutics. - - VT 3.2.1 Allergy - Health issues related to the immune system and their prevention, diagnosis and management. - 1.4 - true - Immune disorders - Clinical immunology - Immunomodulators - Allergy - Immunotherapeutics - - - - - - - - - - - Pain medicine - - 1.4 - Algiatry - true - The prevention of pain and the evaluation, treatment and rehabilitation of persons in pain.
- - - - - - - - - - - Anaesthesiology - - Anaesthetics - Anaesthesia and anaesthetics. - 1.4 - VT 3.2.2 Anaesthesiology - - - - - - - - - - - Critical care medicine - - Acute medicine - VT 3.2.5 Critical care/Emergency medicine - Emergency medicine - 1.4 - The multidisciplinary that cares for patients with acute, life-threatening illness or injury. - - - - - - - - - - - Dermatology - - The branch of medicine that deals with prevention, diagnosis and treatment of disorders of the skin, scalp, hair and nails. - Dermatological disorders - 1.4 - VT 3.2.7 Dermatology and venereal diseases - - - - - - - - - - - Dentistry - - 1.4 - The study, diagnosis, prevention and treatments of disorders of the oral cavity, maxillofacial area and adjacent structures. - - - - - - - - - - - Ear, nose and throat medicine - - Otolaryngology - 1.4 - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the ear, nose and throat. - Otorhinolaryngology - Head and neck disorders - VT 3.2.20 Otorhinolaryngology - Audiovestibular medicine - - - - - - - - - - - Endocrinology and metabolism - - 1.4 - Metabolic disorders - true - The branch of medicine dealing with diseases of endocrine organs, hormone systems, their target organs, and disorders of the pathways of glucose and lipid metabolism. - Metabolism - Endocrinology - Endocrine disorders - - - - - - - - - - - Haematology - - VT 3.2.11 Hematology - true - The branch of medicine that deals with the blood, blood-forming organs and blood diseases. - Haematological disorders - 1.4 - Blood disorders - - - - - - - - - - - Gastroenterology - - true - The branch of medicine that deals with disorders of the oesophagus, stomach, duodenum, jejenum, ileum, large intestine, sigmoid colon and rectum. 
- Gastrointestinal disorders - VT 3.2.8 Gastroenterology and hepatology - 1.4 - - - - - - - - - - - Gender medicine - - The study of the biological and physiological differences between males and females and how they effect differences in disease presentation and management. - 1.4 - - - - - - - - - - - Gynaecology and obstetrics - - The branch of medicine that deals with the health of the female reproductive system, pregnancy and birth. - true - 1.4 - VT 3.2.15 Obstetrics and gynaecology - Gynaecology - Gynaecological disorders - Obstetrics - - - - - - - - - - - Hepatic and biliary medicine - - Hepatobiliary medicine - Liver disorders - 1.4 - true - The branch of medicine that deals with the liver, gallbladder, bile ducts and bile. - - - - - - - - - - - Infectious tropical disease - - The branch of medicine that deals with the infectious diseases of the tropics. - 1.13 - true - 1.4 - - - - - - - - - - Trauma medicine - - 1.4 - The branch of medicine that treats body wounds or shock produced by sudden physical injury, as from violence or accident. - - - - - - - - - - - Medical toxicology - - true - The branch of medicine that deals with the diagnosis, management and prevention of poisoning and other adverse health effects caused by medications, occupational and environmental toxins, and biological agents. - 1.4 - - - - - - - - - - - Musculoskeletal medicine - - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the muscle, bone and connective tissue. It incorporates aspects of orthopaedics, rheumatology, rehabilitation medicine and pain medicine. 
- VT 3.2.26 Rheumatology - VT 3.2.19 Orthopaedics - Musculoskeletal disorders - Orthopaedics - Rheumatology - 1.4 - - - - - - - - - - - Opthalmology - - Eye disoders - VT 3.2.18 Optometry - 1.4 - Optometry - VT 3.2.17 Ophthalmology - Audiovestibular medicine - The branch of medicine that deals with disorders of the eye, including eyelid, optic nerve/visual pathways and occular muscles. - - - - - - - - - - - Paediatrics - - 1.4 - The branch of medicine that deals with the medical care of infants, children and adolescents. - VT 3.2.21 Paediatrics - Child health - - - - - - - - - - - Psychiatry - - The branch of medicine that deals with the mangement of mental illness, emotional disturbance and abnormal behaviour. - 1.4 - Psychiatric disorders - VT 3.2.23 Psychiatry - Mental health - - - - - - - - - - - Reproductive health - - Reproductive disorders - Audiovestibular medicine - VT 3.2.3 Andrology - Andrology - 1.4 - Family planning - The health of the reproductive processes, functions and systems at all stages of life. - Fertility medicine - - - - - - - - - - - Surgery - - Transplantation - VT 3.2.28 Transplantation - The use of operative, manual and instrumental techniques on a patient to investigate and/or treat a pathological condition or help improve bodily function or appearance. - 1.4 - - - - - - - - - - - Urology and nephrology - - The branches of medicine and physiology focussing on the function and disorders of the urinary system in males and females, the reproductive system in males, and the kidney. - VT 3.2.29 Urology and nephrology - 1.4 - Urology - Kidney disease - Urological disorders - Nephrology - - - - - - - - - - - Complementary medicine - - Medical therapies that fall beyond the scope of conventional medicine but may be used alongside it in the treatment of disease and ill health. 
- VT 3.2.12 Integrative and Complementary medicine - Holistic medicine - 1.4 - Alternative medicine - Integrative medicine - - - - - - - - - - - MRI - - Nuclear magnetic resonance imaging - 1.7 - MRT - Magnetic resonance tomography - Techniques that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - NMRI - Magnetic resonance imaging - - - - - - - - - - - Neutron diffraction - - - The study of matter by studying the diffraction pattern from firing neutrons at a sample, typically to determine atomic and/or magnetic structure. - Neutron microscopy - Elastic neutron scattering - 1.7 - Neutron diffraction experiment - - - - - - - - - - Tomography - - X-ray tomography - Imaging in sections (sectioning), through the use of a wave-generating device (tomograph) that generates an image (a tomogram). - Electron tomography - 1.7 - - - - - - - - - - Data mining - - 1.7 - VT 1.3.2 Data mining - The discovery of patterns in large data sets and the extraction and trasnsformation of those patterns into a useful format. - true - KDD - Knowledge discovery in databases - - - - - - - - - - Machine learning - - A topic concerning the application of artificial intelligence methods to algorithms, in order to create methods that can learn from data in order to generate an ouput, rather than relying on explicitly encoded information only. - Artificial Intelligence - 1.7 - VT 1.2.2 Artificial Intelligence (expert systems, machine learning, robotics) - - - - - - - - - - Database management - - File management - Document, record and content management - Database administration - This includes databases for the results of scientific experiments, the application of high-throughput technology, computational analysis and the scientific literature. It covers the management and manipulation of digital documents, including database records, files and reports. 
- Document management - Content management - 1.8 - Databases - Data maintenance - The general handling of data stored in digital archives such as databanks, databases proper, web portals and other data resources. - - Record management - Biological databases - - - - - - - - - - Animals - - 1.8 - Animal biology - Animals, e.g. information on a specific animal genome including molecular sequences, genes and annotation. - Zoology - Animal - VT 1.5.29 Zoology - The resource may be specific to a plant, a group of plants or all plants. - Metazoa - - - - - - - - - - Protein sites, features and motifs - - - A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Protein sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in protein sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Signal peptide cleavage sites - - - - - - - - - - Nucleic acid sites, features and motifs - - - Primer binding sites - Nucleic acid functional sites - Sequence tagged sites - Nucleic acid sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in nucleic acid sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Sequence tagged sites are short DNA sequences that are unique within a genome and serve as a mapping landmark, detectable by PCR they allow a genome to be mapped via an ordering of STSs. 
- - - - - - - - - - Gene transcripts - - - EST - This includes Introns, and protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. Also expressed sequence tag (EST) or complementary DNA (cDNA) sequences. - Transcription - mRNA features - This includes regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. A polyA signal is required for endonuclease cleavage of an RNA transcript that is followed by polyadenylation. A polyA site is a site on an RNA transcript to which adenine residues will be added during post-transcriptional polyadenylation. - cDNA - Introns - PolyA site - Fusion transcripts - Exons - Signal peptide coding sequence - This includes coding sequences for a signal or transit peptide. A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Transcription of DNA into RNA and features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. - 1.8 - PolyA signal - mRNA - Transit peptide coding sequence - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Coding RNA - Gene transcript features - - - - - - - - - - Protein-ligand interactions - - true - 1.8 - Protein-ligand (small molecule) interaction(s). - 1.13 - Protein-drug interactions - - - - - - - - - - Protein-drug interactions - - 1.13 - 1.8 - true - Protein-drug interaction(s). - - - - - - - - - - Genotyping experiment - - 1.8 - Genotype experiment including case control, population, and family studies. 
These might use array-based methods and re-sequencing methods. - - - - - - - - - - GWAS study - - 1.8 - Genome-wide association study experiments. - Genome-wide association study - - - - - - - - - - Microarray experiment - - ChIP-chip - Microarray experiments including conditions, protocol, sample:data relationships etc. - Microarrays - Tissue microarray - Reverse phase protein array - Methylation array - mRNA microarray - Multichannel microarray - Proprietary platform microarray - MicroRNA array - 1.8 - Two channel microarray - miRNA array - This might specify which raw data file relates to which sample and information on hybridisations, e.g. which are technical and which are biological replicates. - One channel microarray - ChIP-on-chip - Genotyping array - - - - - - - - - - PCR experiment - - 1.8 - PCR experiments, e.g. quantitative real-time PCR. - - - - - - - - - - Proteomics experiment - - Proteomics experiments. - Northern blot experiment - 2D PAGE experiment - 1.8 - This includes two-dimensional gel electrophoresis (2D PAGE) experiments, gels or spots in a gel. Also mass spectrometry - an analytical chemistry technique that measures the mass-to-charge ratio and abundance of ions in the gas phase. Also Northern blot experiments. - Mass spectrometry - - - - - - - - - - 2D PAGE experiment - - true - Two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - 1.13 - - - - - - - - - - Northern blot experiment - - Northern Blot experiments. - true - 1.13 - 1.8 - - - - - - - - - - RNAi experiment - - 1.8 - RNAi experiments. - - - - - - - - - - Simulation experiment - - 1.8 - Biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - - - - - - - - - - Protein-nucleic acid interactions - - true - 1.8 - Protein-DNA/RNA interaction(s).
- 1.13 - - - - - - - - - - Protein-protein interactions - - 1.13 - Protein-protein interaction(s), including interactions between protein domains. - 1.8 - true - - - - - - - - - - Cellular process pathways - - 1.8 - Cellular process pathways. - true - 1.13 - - - - - - - - - - Disease pathways - - 1.13 - Disease pathways, typically of human disease. - true - 1.8 - - - - - - - - - - Environmental information processing pathways - - true - Environmental information processing pathways. - 1.8 - 1.13 - - - - - - - - - - Genetic information processing pathways - - true - 1.8 - Genetic information processing pathways. - 1.13 - - - - - - - - - - Protein super-secondary structure - - Super-secondary structure of protein sequence(s). - true - 1.8 - 1.13 - - - - - - - - - - Protein active sites - - 1.8 - 1.13 - true - Catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein binding sites - - Protein functional sites - Enzyme active site - Binding sites in proteins, including cleavage sites (for a proteolytic enzyme or agent), key residues involved in protein folding, catalytic residues (active site) of an enzyme, ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids, RNA and DNA-binding proteins and binding sites etc. - Protein-nucleic acid binding sites - 1.8 - Protein cleavage sites - Protein key folding sites - - - - - - - - - - Protein-nucleic acid binding sites - - RNA and DNA-binding proteins and binding sites in protein sequences. - 1.13 - 1.8 - true - - - - - - - - - - Protein cleavage sites - - Cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. - true - 1.8 - 1.13 - - - - - - - - - - Protein chemical modifications - - true - Chemical modification of a protein. - 1.13 - 1.8 - - - - - - - - - - Protein disordered structure - - Disordered structure in a protein. 
- 1.8 - Protein features (disordered structure) - - - - - - - - - - Protein domains - - true - 1.13 - Structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - - - - - - - - - - Protein key folding sites - - 1.8 - 1.13 - true - Key residues involved in protein folding. - - - - - - - - - - Protein post-translational modifications - - true - 1.13 - Post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein secondary structure - - The location and size of the secondary structure elements and intervening loop regions is typically given. The report can include disulphide bonds and post-translationally formed peptide bonds (crosslinks). - Secondary structure (predicted or real) of a protein, including super-secondary structure. - Protein super-secondary structure - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - Protein features (secondary structure) - 1.8 - - - - - - - - - - Protein sequence repeats - - true - 1.8 - Short repetitive subsequences (repeat sequences) in a protein sequence. - 1.13 - - - - - - - - - - Protein signal peptides - - 1.13 - Signal peptides or signal peptide cleavage sites in protein sequences. - true - 1.8 - - - - - - - - - - Protein interaction experiment - - 1.12 - Yeast one-hybrid - Co-immunoprecipitation - An experiment for studying protein-protein interactions. - Yeast two-hybrid - Phage display - - - - - - - - - - Applied mathematics - - VT 1.1.1 Applied mathematics - The application of mathematics to specific problems in science, typically by the formulation and analysis of mathematical models. - 1.10 - - - - - - - - - - Pure mathematics - - VT 1.1.1 Pure mathematics - The study of abstract mathematical concepts. 
- 1.10 - - - - - - - - - - Data governance - - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - The control of data entry and maintenance to ensure the data meets defined standards, qualities or constraints. - 1.10 - Data stewardship - - - - - - - - - - Data quality management - - http://purl.bioontology.org/ontology/MSH/D030541 - 1.10 - Data quality - Data integrity - Data clean-up - Data enrichment - The quality, integrity, cleaning up and enrichment of data. - - - - - - - - - - Freshwater biology - - 1.10 - VT 1.5.18 Marine and Freshwater biology - The study of organisms in freshwater ecosystems. - - - - - - - - - - - Human genetics - - true - The study of inheritance in human beings. - VT 3.1.2 Human genetics - 1.10 - - - - - - - - - - - Tropical medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - VT 3.3.14 Tropical medicine - - - - - - - - - - - Medical biotechnology - - VT 3.4.1 Biomedical devices - 1.10 - true - VT 3.4.2 Health-related biotechnology - VT 3.4 Medical biotechnology - VT 3.3.14 Tropical medicine - Pharmaceutical biotechnology - Biotechnology applied to the medical sciences and the development of medicines. - - - - - - - - - - - Personalized medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - Molecular diagnostics - true - VT 3.4.5 Molecular diagnostics - - - - - - - - - - - Immunoprecipitation experiment - - - - Chromatin immunoprecipitation - Experimental techniques to purify a protein-DNA crosslinked complex. Usually sequencing follows e.g. in the techniques ChIP-chip, ChIP-seq and MeDIP-seq. - 1.12 - - - - - - - - - - Whole genome sequencing - - 1.12 - Laboratory technique to sequence the complete DNA sequence of an organism's genome at a single time.
- WGS - Whole genome resequencing - - - - - - - - - - Methylated DNA immunoprecipitation - - 1.12 - MeDIP-seq - Methylated DNA immunoprecipitation (MeDIP) - Methylation sequencing - Laboratory technique to sequence the methylated regions in DNA. - MeDIP-chip - Bisulfite sequencing - MeDIP - mDIP - - - - - - - - - - Exome sequencing - - 1.1 - Exome capture - Exome sequencing is considered a cheap alternative to whole genome sequencing. - Targeted exome capture - Exome sequence analysis - Laboratory technique to sequence all the protein-coding regions in a genome, i.e., the exome. - Exome analysis - - - - - - - - - - - Experimental design and studies - - Design of experiments - 1.12 - Experimental design - Studies - The design of an experiment intended to test a hypothesis, and describe or explain empirical data obtained under various experimental conditions. - true - - - - - - - - - - - Animal study - - - Challenge study - 1.12 - The design of an experiment involving non-human animals. - - - - - - - - - - Microbial ecology - - - 1.13 - The ecology of microorganisms including their relationship with one another and their environment. - Microbiome - true - Environmental microbiology - - - - - - - - - - Obsolete concept (EDAM) - - 1.2 - Needed for conversion to the OBO format. - An obsolete concept (redefined in EDAM). - true - - - - - - - - - - - - - - diff --git a/HOW_TO_CONTRIBUTE.md b/HOW_TO_CONTRIBUTE.md deleted file mode 100644 index 9b3be14..0000000 --- a/HOW_TO_CONTRIBUTE.md +++ /dev/null @@ -1,35 +0,0 @@ -EDAM is a community project, and suggestions for additions, corrections, and other improvements are always welcome! 
- - -# Mailing lists -To find the most efficient way to contribute to EDAM, request changes and additions, and for general discussions, including the use of EDAM for resource annotation and in software implementations, mail: - -edam@elixir-dk.org - -We'll make every effort to be responsive, given our limited resources, and will work with you to find the most efficient way to proceed, depending on your requirements, expertise and bandwidth. - -To receive mail from the list above, subscribe here: - -http://elixirmail.cbs.dtu.dk/mailman/listinfo/edam - -To receive low-traffic announcements, subscribe here: - -http://elixirmail.cbs.dtu.dk/mailman/listinfo/edam-announce - - -# Suggestions form -Simple requests for one or a few changes can be made using this form: - -http://tinyurl.com/EDAMChangeRequest - -If you require many changes and additions, there's probably a more efficient way to proceed: please contact edam@elixir-dk.org. - - -# GitHub issue tracker -If you have a GitHub account, you can make requests by opening a GitHub issue: -- Go to https://github.com/edamontology/edamontology/issues and click on "New issue". -- If you are not logged in, you will be asked first to log in or create an account. -- Provide a title, and a report that is concise but sufficiently detailed to be actionable. - -# Joining the team -We very much welcome you to join the EDAM team. Please first read about the [Governance of EDAM](https://github.com/edamontology/edamontology#governance-of-edam) to find the level that is right for you. Then mail edam-core@elixir-dk.org to get started. diff --git a/HOW_TO_EDIT.md b/HOW_TO_EDIT.md deleted file mode 100644 index fb9967b..0000000 --- a/HOW_TO_EDIT.md +++ /dev/null @@ -1,182 +0,0 @@ -If you’re not sure how to do something please first ask by mailing: -edam@elixir-dk.org - -# Modifications in GitHub main repository (Core Developers only) -The workflow is: - -1. 
Get the “editing token” - - Contact edam-core@elixir-dk.org and claim the “editing token” after first checking that it is not currently taken :) - - Say what you are doing, why, and about how long it will take -2. Update your local repo with the latest files from the GitHub master: - - `git pull` - - If you’ve not already done so, you will first need to clone the master repo: - - `git clone https://github.com/edamontology/edamontology.git` -3. Make and commit your local changes. You **must** be working with the latest “dev” version, _e.g._ EDAM_1.5_dev.owl. You should leave the version number unchanged, i.e. you should not need to add any new files to the repo. - - Check your changes and that the OWL file looks good in Protégé - - Ensure the `next_id` attribute is updated - - Ensure that `oboOther:date` is updated to the current GMT/BST before the commit - - Add the edited file to the commit - - `git add ` - - Commit your local changes, including a concise but complete summary of the major changes: - - `git commit -m "commit message here"` -4. Push your changes to the GitHub master: - - `git push origin` - -**Please provide a meaningful commit message so that we can easily generate the ChangeLog upon the next release** - -5. Release the editing token for the other developers: - - Contact edam-core@elixir-dk.org and release the “editing token”. - - Summarise what you actually did and why. - -# Workflow for the creation of a new official release of EDAM (Core developers only) -From January 2016, EDAM follows a monthly release cycle to this schedule: - -1. First Wed of every month: EDAM team Skype to discuss plans for this month. Announcement (to edam-announce) including short summary of plans, invitation for suggestions. -2. Last Mon of every month: Announcement (to edam-announce) saying that the release is imminent, invitation for last-minute suggestions. -3. Last Wed of every month: Complete the work for the release. Make the release.
Ensure it works in BioPortal, OLS, and in bio.tools. -4. Last Fri of every month: Announce the release, including summary of changes. - -**Before creating a new release, please make sure you have the approval of the leader of EDAM core-dev, and that the [changelog.md](https://github.com/edamontology/edamontology/blob/master/changelog.md) and [changelog-detailed.md](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) files are up-to-date with the changes of the new release**. See section below on creating the ChangeLog files. Once you're clear to go, do the following: - -1. Update your local version of the repository: - - `git pull` -2. Assuming you are releasing version n+1, n being the current version: - - you initially have EDAM\_dev.owl in the repository - - make sure to update 'oboOther:date' in this file - - copy the file EDAM\_dev.owl to releases/EDAM\_n+1.owl - - `cp EDAM_dev.owl releases/EDAM_n+1.owl` - `git add releases/EDAM_n+1.owl` - - - modify the doap:version property to **n+1** in `releases/EDAM_n+1.owl` and to **n+2\_dev** in `EDAM_dev.owl` - - - commit and push your changes - - `git commit -a` - - `git push origin` - -3. Update the file names of web/page_x.html and relations-and-properties_x.html: update the version number to n+1 (in file name, and multiple places in the contents), and also update the last update date in web/page_x.html. -4. Update the [detailed changelog](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) by running [Bubastis](http://www.ebi.ac.uk/efo/bubastis/) to compare the release against the previous version. -5. Update the [changelog](https://github.com/edamontology/edamontology/blob/master/changelog.md) with a summary of the major changes. -6. Create the release on GitHub (use the [_draft a new release_](https://github.com/edamontology/edamontology/releases/new) button of the _[releases](https://github.com/edamontology/edamontology/releases)_ tab). -7. 
Update the website, http://edamontology.org. -8. Submit this new release to BioPortal and OLS. -9. Close GitHub issues labelled "done - staged for release". -10. Announce the new release on Twitter and mailing lists (edam-announce@elixir-dk.org, edam@elixir-dk.org) including thanks and a summary of changes. -11. Help apps that implement EDAM to update to the new version, in particular [bio.tools](http://bio.tools). - -# Modifications in a GitHub fork (non-core developers) -GitHub makes it possible for any developer (even if you are not a “core developer”) to make modifications in a copy of EDAM and suggest that these modifications be included in the original. -Please note that we discourage using this mechanism for large modifications made using Protégé, because merging OWL files which have been reformatted by Protégé is notoriously unreliable (see “Best practices for edition” below). If you get an agreement from the core developers to make large modifications in Protégé, we can grant you core developer status on a temporary basis. This access will be removed once the task is accomplished. -The workflow is: -- Fork the edamontology repository in your own account. -- Make the modifications you want to suggest for inclusion in EDAM in this forked repository. -- Open a pull request for each modification you make. -Please make sure to: -- Keep your forked repository synchronized with the core repository, to avoid inconsistencies. -- Follow the “Best practices for edition” below. - -# Editing the ChangeLog -The ChangeLog includes: -1. [changelog](https://github.com/edamontology/edamontology/blob/master/changelog.md) - a summary of the major changes and what motivated them -2. [detailed changelog](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) - fine-grained details obtained using [Bubastis](http://www.ebi.ac.uk/efo/bubastis/) - -The changelog should include: -1. 
(as 1st paragraph) an "executive summary" suitable for consumption by technical managers, describing the motivation for major changes, including e.g. requests at recent hackathons, requests via GitHub, strategic directions etc. -2. summary of changes distilled from the output of [Bubastis](http://www.ebi.ac.uk/efo/bubastis/) (see below). -3. summary of GitHub commit messages. **PLEASE ensure meaningful commit messages are provided on every commit** - -Some hacking of Bubastis output is needed to identify (at least): - - number of new concepts - - number of deprecations - - summary of activity, i.e. in which branches was most work focussed? - - -# Best practices for edition - -## General guidelines - -1. As much as you can, try to make atomic changes and commit them independently. This greatly improves traceability in the long term -2. Make trivial modifications using a text editor if possible, rather than Protégé, so that the actual modification is not hidden in a haystack of Protégé reformattings -3. Check and double-check your changes: errors are hard to track and fix later - -## Adding concepts - -When adding new terms, you _**MUST**_ specify the following (attributes are in parentheses): - -1. Correct concept URI, i.e. in the right namespace and with the latest ID -2. Preferred term (`rdfs:label`) -3. Definition (`oboInOwl:hasDefinition`) -4. Parent concept (`rdfs:subClassOf`) -5. Current dev version into `created_in` : type a value e.g. “1.5” -6. The ‘edam’ subset (`oboInOwl:inSubset`): in Protege, pick (don’t type!) the value of `'edam'` -7. The branch subset (`oboInOwl:inSubset`): pick one of ‘topic’, ‘data’, ‘format’ or ‘operation’ -8. Any specialised subset (pick as above, only if required) -9. The next ID ontology attribute (`next_id`) - -Note that: -- The **preferred label** should be a short name or phrase in common use. 
-- Consider providing common **synonyms** of the term: - - Exact synonym (`oboInOwl:hasExactSynonym`) - bog-standard synonyms - - Narrow synonym (`oboInOwl:hasNarrowSynonym`) - specialisms of the term - - Broad synonym (`oboInOwl:hasBroadSynonym`) - generalisations of the term -- The **definition** should be a concise and lucid description of the concept, without acronyms, and avoiding jargon. -- Peripheral but important information can go in the **comment** (`rdfs:comment`). - -In addition, for **Format** concepts, please specify: - -1. The Data concept which the format applies to : define this relation in Protege using the pattern 'Format is_format_of some Data' -2. The URL of the format documentation, if available ('Documentation' attribute) : in Protege, type a URL using the Protege IRI editor. - -In addition, for **Identifier** concepts, specify: - -1. The Data concept which the identifier applies to : define this relation in Protege using the pattern 'Identifier is_identifier_of some Data' -2. The regular expression defining valid values of that identifier ('Regular expression') : type the regex into the Protege 'Constant' editor - -In addition, for **Topic** concepts, specify: - -1. The corresponding Wikipedia page that exactly matches the term ('Documentation' attribute) : in Protege, type a URL using the IRI editor. This method will change when we eventually link via Wikidata. - - - - -## Deprecating concepts - -When deprecating concepts, you _**MUST**_ specify the following: - -1. Current dev version into `obsolete_since`. -2. The ‘obsolete’ subset (`oboInOwl:inSubset`): pick ‘obsolete’. -3. The ‘deprecated’ attribute (`owl:deprecated`): type the value of ‘true’. -4. The alternative ‘replacement’ term to firmly use (`oboInOwl:replacedBy`), or to consider when less certain (`oboInOwl:consider`): pick a concept. -5. Set the parent concept (`rdfs:subClassOf`) to the `ObsoleteClass`. -6. Remove all other class annotations (subsets, comments, synonyms etc.) 
and axioms (including parent concepts). - - -## Ensuring logical consistency - -Before committing changes, to ensure logical consistency of EDAM, please do the following within Protege: - -1. Click "Reasoner->Hermit" -2. Click "Reasoner->Start reasoner" (it may take a few seconds) -3. In the "Entities" tab, select the "Class hierarchy (inferred)" tab -4. Select the "nothing" branch. - -If no classes are shown under the "nothing" branch, then all is well. If one or more classes are shown, then there is a logical inconsistency which must be fixed. You might see lots of classes, but usually the problem is in one or a few classes. - -Common problems include: -- classes assigned as a subClass of some deprecated term -- end-points of relations in the wrong branch, e.g. 'class has_topic some operation'. These can easily occur if you use the "Class expression editor" in Protege to define such axioms: this is NOT EDAM namespace aware, and in cases where a concept with the same preferred label exists in both branches, can easily pick the wrong one. - -The problems are easily fixed within Protege: ask on the mailing list if you're not sure how. Finally, do not be tempted to click "Reasoner->Synchronise reasoner" between changes: it tends to hang Protege. Instead, use "Reasoner->Stop reasoner" then "Reasoner->Start reasoner". - -## Continuous Integration - -Every modification of the ontology pushed to GitHub triggers an automated test in Travis CI. For now, it only checks a few rules using the edamxpathvalidator tool (https://github.com/edamontology/edamxpathvalidator). The Travis CI website shows you the current status here: https://travis-ci.org/edamontology/edamontology. The fact that the continuous integration task succeeds does not guarantee that there are no remaining bugs, but a failure means that you must take action to correct the problem: either fix it, fix the edamxpathvalidator program, or ask the mailing list if you're unsure. 
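As an illustration of the kind of rule an automated check could enforce, here is a minimal Python sketch that tests two of the "Deprecating concepts" requirements above on an RDF/XML snippet. It is not the edamxpathvalidator implementation and does not use its API. The function name `check_deprecated_classes`, the sample class URIs, and the assumption that `obsolete_since` lives in the `http://edamontology.org/` namespace are all hypothetical choices made for this example.

```python
import xml.etree.ElementTree as ET

# Namespace prefixes used in the lookups below.
# The "edam" entry (namespace of obsolete_since) is an assumption.
NS = {
    "owl": "http://www.w3.org/2002/07/owl#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "oboInOwl": "http://www.geneontology.org/formats/oboInOwl#",
    "edam": "http://edamontology.org/",
}
RDF_ABOUT = "{%s}about" % NS["rdf"]


def check_deprecated_classes(owl_xml):
    """Return (class URI, problem) pairs for deprecated classes that
    lack obsolete_since or lack both replacedBy and consider."""
    root = ET.fromstring(owl_xml)
    problems = []
    for cls in root.iter("{%s}Class" % NS["owl"]):
        deprecated = cls.find("owl:deprecated", NS)
        if deprecated is None or (deprecated.text or "").strip() != "true":
            continue  # only deprecated classes are checked here
        uri = cls.get(RDF_ABOUT, "<anonymous>")
        if cls.find("edam:obsolete_since", NS) is None:
            problems.append((uri, "missing obsolete_since"))
        has_replacement = (
            cls.find("oboInOwl:replacedBy", NS) is not None
            or cls.find("oboInOwl:consider", NS) is not None
        )
        if not has_replacement:
            problems.append((uri, "missing replacedBy/consider"))
    return problems


# Hypothetical sample data: the first class follows the rules, the second does not.
SAMPLE = """<rdf:RDF
    xmlns="http://edamontology.org/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:owl="http://www.w3.org/2002/07/owl#"
    xmlns:oboInOwl="http://www.geneontology.org/formats/oboInOwl#">
  <owl:Class rdf:about="http://edamontology.org/example_0001">
    <owl:deprecated>true</owl:deprecated>
    <obsolete_since>1.14</obsolete_since>
    <oboInOwl:consider rdf:resource="http://edamontology.org/example_0003"/>
  </owl:Class>
  <owl:Class rdf:about="http://edamontology.org/example_0002">
    <owl:deprecated>true</owl:deprecated>
  </owl:Class>
</rdf:RDF>"""

if __name__ == "__main__":
    for uri, problem in check_deprecated_classes(SAMPLE):
        print(uri, "-", problem)
```

A check like this could be run locally before pushing, so that obvious omissions are caught before the continuous integration task reports them.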
diff --git a/README.md b/README.md deleted file mode 100644 index f014366..0000000 --- a/README.md +++ /dev/null @@ -1,199 +0,0 @@ -# What is EDAM? -EDAM is a simple ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM provides a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use. - -You can browse EDAM at the new [OLS beta](http://www.ebi.ac.uk/ols/beta/ontologies/edam) and the [NCBO BioPortal](http://bioportal.bioontology.org/ontologies/EDAM/). - -Twitter: [@edamontology](http://twitter.com/edamontology) ([follow](https://twitter.com/intent/follow?original_referer=https%3A%2F%2Fgithub.com%2Fedamontology%2Fedamontology&region=follow_link&screen_name=edamontology&tw_p=followbutton)). - -# Motivation -Bioinformaticians handle an increasingly large and diverse set of tools and data. Meanwhile, researchers demand ever more powerful and convenient means to organise, find, understand, compare, select, use and connect the available resources. These tasks often rely on consistent, machine-understandable descriptions of the underlying components, but these have been generally lacking in _ad hoc_ resource descriptions. The urgent need - filled by EDAM - is for an ontology that unifies semantically the bioinformatics concepts in common use, provides the curator with a comprehensive controlled vocabulary that is broadly applicable, and supports new and powerful search, browse and query functions. 
- -# Applications -EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources, including: - -- Web services including REST and SOAP APIs -- Application software -- Tool collections and packages -- Workflows / pipelines -- Databases -- XML Schemata and data objects -- Data syntax and file formats -- Web portals and pages -- Resource catalogues -- Training materials -- Courses, tutorials, and other events -- Areas of scientific interest -- Documents, such as scientific publications - -EDAM is also suitable for diverse applications, including, for example, within workbenches and workflow-management systems, software distributions, and resource registries. - -# Scope - -EDAM includes 4 main sub-ontologies or 'branches' of concepts: - -- _**Data**_ - “Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output.” -- _**Format**_ - “A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere.” -- _**Operation**_ - “A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs).” -- _**Topic**_ - “A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. 
Topics have no clearly defined borders between each other.” - -Noteworthy within the Data sub-ontology is: -- _**Identifier**_ - “A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things).” - -![EDAM concepts figure](https://raw.githubusercontent.com/edamontology/edamontology/master/web/EDAMconcepts.png) - -As a general rule, the _**Data**_, _**Format**_, and _**Operation**_ branches include concepts strictly in the domain of bioinformatics and computational biology: concepts purely concerning biology, computer science, _etc._ are not included. The _**Topic**_ branch, however, includes broader inter-disciplinary concepts from the biological and medical domains. - -EDAM provides different semantic 'axes' for annotation. For example, annotation of a software tool might include: - -- _Topic_ - general scientific domain the software serves, _e.g._ “Structural biology” -- _Operation_ - the precise function of the tool, _e.g._ “Homology modelling” -- _Data_ - the primary input and output, _e.g._ “Protein structure” -- _Format_ - the supported format(s) of the input and output, _e.g._ “PDB format” - -# Principles - -EDAM strives to uphold a few founding principles including: - -- **Quality** - a controlled vocabulary that is moderated -- **Openness** - development in collaboration with the community -- **Relevance** - prioritising use-case-driven development towards comprehensive but practical coverage -- **Practicality** - practical utility is valued over ontological “strictness” or any metaphysical doctrine -- **Clear scope** - respecting the scope of other complementary, well-developed ontologies -- **Familiarity** - including only concepts that are well established, familiar and prevalent; jargon is discouraged -- **Usability** - conceptual hierarchy with sufficient richness but only necessary complexity -- **Maintainability** - development must be 
efficient and sustainably up to date in the long term - -EDAM is working towards implementing these principles fully and is open to suggestions. - -# Architecture -EDAM has 3 components: - -- _**Concepts**_ - All concepts have a name (the term or label) and definition. Further, a concept may have simple relations (see below) to other EDAM concepts, as well as other intrinsic properties, _e.g._ an identifier may have a regular expression defining its syntax. -- _**Hierarchy**_ - Every concept (excluding top-level concepts) is related to one or more other concepts within the same branch by an _**is a**_ (specialisation) relation. Hence EDAM has 4 primary hierarchies (for _Data_, _Format_, _Operation_, and _Topic_). -- _**Relations**_ - Concepts are related by defined relation types (see figure below), which reflect well established or self-evident principles, and are used primarily to define internal consistency of EDAM. These have external applications too, e.g. annotations on the Semantic Web. - -![EDAM relations figure](https://raw.githubusercontent.com/edamontology/edamontology/master/web/EDAMrelations.png) - -# Priorities - -Our core priority is to be responsive to users of EDAM. Furthermore, we aim to establish a more sustainable footing for essential EDAM maintenance and developments, including: -- Content review and refactoring to ensure structural and semantic simplicity ensuring high usability -- Community build-up and development including more formal, but agile, governance and maintenance models and mechanisms -- Agile and responsive development of content in close collaboration with end-users and serving concrete use-cases -- Technical refactoring to minimise the cost of routine housekeeping and content development -- Implementation of tooling for routine maintenance to serve the needs of end-users, _e.g._ harvesting change requests and mappings between concepts - -# Governance of EDAM - -EDAM follows a model with five tiers of governance: - -1. 
**EDAM Advisory Group** advises the EDAM Core Developers on how best to uphold the EDAM principles and achieve its current aims. It represents the broad life science community, especially scientist end-users, and adopters of EDAM. Advisory Group members have no formal responsibilities, but are expected to advocate EDAM and actively offer constructive advice based on their practical experience, requirements and expertise. The EDAM Core Developers will respect this advice and give quarterly progress reports by email. The Core Developers aim to assemble with the Advisory Group virtually 2 or 3 times a year or as circumstances dictate, in meetings with open agenda and followed up with actions and notes on key recommendations. The Advisory Group will be reconstituted each year and the Steering Group (below) reserves the right to replace inactive members. - -2. **EDAM Steering Group** includes representatives of institutes that are committing significant resources to EDAM. Members of the Steering Group have four primary responsibilities: - - * Agree strategy and set priorities in consultation with the Core Developers - * Verify whether stated aims are coherent and wise - * Monitor progress and provide feedback - * Help arrange funding for EDAM - -3. **EDAM Core Developers** are funded to develop EDAM and have GitHub commit rights. Responsible for agreeing aims and general good practice, overseeing and approving developments and routine maintenance. The model is quasi-democratic with a leader (currently Jon Ison) having the final say where necessary. The leader ensures the Advisory Group, and all editors and contributors, are listened to and informed. The leader may be temporarily appointed from the core developers as necessary, e.g. during holidays. Core Developers must have the intent and some bandwidth to develop EDAM in the long-term. 
They have 3 primary responsibilities: - * Understand and uphold the EDAM principles - * Advocate EDAM - * Develop EDAM as bandwidth permits - -4. **EDAM Editors** would not normally have GitHub commit rights long-term. They include anyone who makes significant contributions to EDAM scientific content, by whatever means, but have none of the commitments or responsibilities of the core developers. - -5. **Other contributors** do not have GitHub commit rights, but can still make comments, contribute suggestions for new terms and other changes. - -We very much welcome new editors and contributors. Representatives of projects who plan to adopt EDAM are welcome to join the EDAM Advisory Group. For further information please [read about how to contribute](https://github.com/edamontology/edamontology/blob/master/HOW_TO_CONTRIBUTE.md) or mail [edam-core@elixir-dk.org](mailto:edam-core@elixir-dk.org). - -# People - -## EDAM Core Developers -* Jon Ison (DTU, DK) *- lead developer* -* Matúš Kalaš (University of Bergen, NO) -* Hervé Ménager (Institut Pasteur, FR) -* Veit Schwämmle (SDU, DK) - -## EDAM Editors -* David Sehnal (MU, CZ) - General bioinformatics -* Ivan Mičetić (University of Padova, IT) - Protein structure -* Kristian Davidsen (DTU, DK) - Sequencing -* Laura Emery (EMBL-EBI, UK) - EBI tools and training -* Lukáš Pravda (MU, CZ) - Structural bioinformatics -* Stanislav Geidl (MU, CZ) - Chemoinformatics -* Wouter Touw (CMBI, NL) - Protein structure - -## EDAM Steering Group -* Alfonso Valencia (ELIXIR ES) -* Cath Brooksbank (ELIXIR EMBL-EBI) -* Christophe Blanchet (ELIXIR FR) -* Heinz Stockinger (ELIXIR CH) -* Inge Jonassen (ELIXIR NO) -* Karel Berka (ELIXIR CZ) -* Søren Brunak (ELIXIR DK) -* Steven Newhouse (ELIXIR EMBL-EBI) - -## EDAM Advisory Group -* Anna-Lena Lamprecht (University of Potsdam, DE) -* Dan Bolser (EMBL-EBI, UK) -* Frederik Coppens (ELIXIR BE) -* Hedi Peterson (ELIXIR EE) -* Jane Lomax (Sanger Institute, UK) -* Melissa Haendel (Oregon Health & Science 
University, USA) -* Michael Crusoe (University of California) -* Niclas Jareborg (ELIXIR SE) -* Radka Svobodová (MU, CZ) -* Rafael Jimenez (ELIXIR HUB) - - - - -## Contributors -Thanks to the many people who have contributed - if you're not listed below, please let us know! - -* Marie Grosjean (IFB, FR) -* Nathalie Conte (EMBL-EBI, UK) -* Victor de la Torre (ELIXIR-ES) -* Ray Fergerson (Stanford University, USA) -* Carole Goble (ELIXIR-UK) -* Simon Jupp (EMBL-EBI, UK) -* Peter Løngreen (CBS-DTU, DK) -* Allyson Lister (Newcastle University, UK) -* Rodrigo Lopez (EMBL-EBI, UK) -* James Malone (EMBL-EBI, UK) -* Julie McMurry (EMBL-EBI, UK) -* Hamish McWilliam (formerly EMBL-EBI, UK) -* Helen Parkinson (EMBL-EBI, UK) -* Steve Pettifer (University of Manchester, UK) -* Kristoffer Rapacki (CBS-DTU, DK) -* Peter Rice (Imperial College, UK) -* Mahmut Uludag (EMBL-EBI, UK) -* Jiří Vondrášek (IOCB AS, CZ) -* Gert Vriend (CMBI, NL) -* Trish Whetzel (University of California, USA) - - - -# Recent workshops (2014 - ) -Thank you to all of the participants of various meetings and workshops organised by ELIXIR, BioMedBridges and others. See the complete list of past and forthcoming [workshops](https://bio.tools/events). - -# Publication - -If you use EDAM or any part of it, please cite: - -Ison, J., Kalaš, M., Jonassen, I., Bolser, D., Uludag, M., McWilliam, H., Malone, J., Lopez, R., Pettifer, S. and Rice, P. (2013). [EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats](http://bioinformatics.oxfordjournals.org/content/29/10/1325.full). _Bioinformatics_, **29**(10): 1325-1332. - -doi: [10.1093/bioinformatics/btt113](http://doi.org/10.1093/bioinformatics/btt113) PMID: [23479348](http://www.ncbi.nlm.nih.gov/pubmed/23479348) - -This article is freely available (Open Access). - -# Documentation and website - -Full user documentation of the EDAM ontology is available at http://edamontology.org. 
- -The _edamontology.org_ site provides content negotiation with respect to the desired media type (_i.e._ format, _e.g._ HTML, OWL, _etc._). This applies also to the URIs of EDAM concepts that are in this way dereferencable, concise, and stable. Alternatively to requesting the format in the HTTP header, users can retrieve the desired content from a web browser by inserting _?format=\_ query into the URL. - -# Current development status - -[![Build Status](https://travis-ci.org/edamontology/edamontology.svg?branch=master)](https://travis-ci.org/edamontology/edamontology) diff --git a/changelog-detailed.md b/changelog-detailed.md deleted file mode 100644 index 4638c75..0000000 --- a/changelog-detailed.md +++ /dev/null @@ -1,6910 +0,0 @@ -# Detailed Changelog for EDAM -Data were generated using the [Bubastis](http://www.ebi.ac.uk/efo/bubastis/) ontology-diff tool. - -# EDAM\_1.14.owl - -## Classes modified: - -Class: http://edamontology.org/topic_0821 -Label: Enzymes -- 'Enzymes' SubClassOf 'Protein families' -+ 'Enzymes' SubClassOf 'Protein analysis' - -Class: http://edamontology.org/topic_0820 -Label: Membrane and lipoproteins -- 'Membrane and lipoproteins' SubClassOf 'Protein families' -+ 'Membrane and lipoproteins' SubClassOf 'Protein analysis' - -Class: http://edamontology.org/operation_0527 -Label: Tag mapping -- 'Tag mapping' SubClassOf 'has output' some 'Sequence tag profile (with gene assignment)' - -Class: http://edamontology.org/data_2966 -Label: Oligonucleotide probe sets annotation -- 'Oligonucleotide probe sets annotation' SubClassOf 'Oligonucleotide probe annotation' -+ 'Oligonucleotide probe sets annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1235 -Label: Sequence cluster -- 'Sequence cluster' SubClassOf 'has topic' some 'Protein families' -+ 'Sequence cluster' SubClassOf 'has topic' some 'Gene families' - -Class: http://edamontology.org/data_0907 -Label: Protein family report -- 'Protein 
family report' SubClassOf 'has topic' some 'Protein families' -+ 'Protein family report' SubClassOf 'has topic' some 'Gene families' - -Class: http://edamontology.org/data_0936 -Label: Sequence tag profile (with gene assignment) -- 'Sequence tag profile (with gene assignment)' SubClassOf 'Sequence tag profile' -+ 'Sequence tag profile (with gene assignment)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3096 -Label: Editing -- 'Editing' SubClassOf 'Operation' -+ 'Editing' SubClassOf 'File handling' - -Class: http://edamontology.org/topic_2830 -Label: Immunoproteins, genes and antigens -- 'Immunoproteins, genes and antigens' SubClassOf 'Protein families' -+ 'Immunoproteins, genes and antigens' SubClassOf 'Gene families' - -Class: http://edamontology.org/topic_0623 -Label: Gene families -+ 'Gene families' SubClassOf 'Protein analysis' - -Class: http://edamontology.org/topic_0749 -Label: Transcription factors and regulatory sites -- 'Transcription factors and regulatory sites' SubClassOf 'Protein families' -+ 'Transcription factors and regulatory sites' SubClassOf 'Protein analysis' - -Class: http://edamontology.org/format_2055 -Label: Sequence assembly format -- 'Sequence assembly format' SubClassOf 'is format of' some 'Data' -+ 'Sequence assembly format' SubClassOf 'is format of' some 'Sequence assembly' - -Class: http://edamontology.org/topic_0724 -Label: Protein families -- 'Protein families' SubClassOf 'Protein analysis' -+ 'Protein families' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0248 -Label: Residue interaction calculation -- 'Residue interaction calculation' SubClassOf 'Residue interaction calculation' - -## New classes: - -Class: http://edamontology.org/format_3702 -Label: MSF -+ 'MSF' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3709 -Label: GCT/Res format -+ 'GCT/Res format' SubClassOf 'Textual format' 
-+ 'GCT/Res format' SubClassOf 'Gene expression report format' - -Class: http://edamontology.org/format_3708 -Label: ABCD format -+ 'ABCD format' SubClassOf 'is format of' some 'Biodiversity report' -+ 'ABCD format' SubClassOf 'Biodiversity data format' - -Class: http://edamontology.org/format_3706 -Label: Biodiversity data format -+ 'Biodiversity data format' SubClassOf 'is format of' some 'Biodiversity report' -+ 'Biodiversity data format' SubClassOf 'Format (typed)' - -Class: http://edamontology.org/format_3710 -Label: WIFF format -+ 'WIFF format' SubClassOf 'Mass spectrometry data format' -+ 'WIFF format' SubClassOf 'Binary format' - -Class: http://edamontology.org/format_3712 -Label: Thermo RAW -+ 'Thermo RAW' SubClassOf 'Binary format' -+ 'Thermo RAW' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3711 -Label: X!Tandem XML -+ 'X!Tandem XML' SubClassOf 'Mass spectrometry data format' -+ 'X!Tandem XML' SubClassOf 'Binary format' -+ 'X!Tandem XML' SubClassOf 'Textual format' - -Class: http://edamontology.org/format_3714 -Label: MaxQuant APL peaklist format -+ 'MaxQuant APL peaklist format' SubClassOf 'Mass spectrometry data format' -+ 'MaxQuant APL peaklist format' SubClassOf 'Textual format' - -Class: http://edamontology.org/format_3713 -Label: Mascot .dat file -+ 'Mascot .dat file' SubClassOf 'Textual format' -+ 'Mascot .dat file' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3727 -Label: OME-TIFF -+ 'OME-TIFF' SubClassOf 'Binary format' -+ 'OME-TIFF' SubClassOf 'Image format' - -Class: http://edamontology.org/format_3726 -Label: PMML -+ 'PMML' SubClassOf 'XML' - -Class: http://edamontology.org/format_3725 -Label: SBOL -+ 'SBOL' SubClassOf 'XML' - -Class: http://edamontology.org/format_3728 -Label: LocARNA PP -+ 'LocARNA PP' SubClassOf 'Textual format' - -Class: http://edamontology.org/format_3729 -Label: dbGaP format -+ 'dbGaP format' SubClassOf 'Textual format' - -Class: 
http://edamontology.org/data_3724 -Label: Cultivation parameter -+ 'Cultivation parameter' SubClassOf 'Experimental measurement' - -Class: http://edamontology.org/data_3723 -Label: Morphology parameter -+ 'Morphology parameter' SubClassOf 'Experimental measurement' - -Class: http://edamontology.org/data_3722 -Label: Physiology parameter -+ 'Physiology parameter' SubClassOf 'Experimental measurement' - -Class: http://edamontology.org/data_3721 -Label: Isolation source -+ 'Isolation source' SubClassOf 'Isolation report' - -Class: http://edamontology.org/data_3720 -Label: Geographic location -+ 'Geographic location' SubClassOf 'Isolation report' - -Class: http://edamontology.org/data_3707 -Label: Biodiversity report -+ 'Biodiversity report' SubClassOf 'Report' - -Class: http://edamontology.org/data_3717 -Label: Isolation report -+ 'Isolation report' SubClassOf 'Report' - -Class: http://edamontology.org/data_3716 -Label: Biosafety report -+ 'Biosafety report' SubClassOf 'Report' - -Class: http://edamontology.org/data_3719 -Label: Biosafety classification -+ 'Biosafety classification' SubClassOf 'Biosafety report' - -Class: http://edamontology.org/data_3718 -Label: Pathogenicity report -+ 'Pathogenicity report' SubClassOf 'Biosafety report' - -Class: http://edamontology.org/operation_3703 -Label: Reference identification -+ 'Reference identification' SubClassOf 'Genetic variation analysis' - -Class: http://edamontology.org/operation_3705 -Label: Isotope-coded protein label -+ 'Isotope-coded protein label' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3704 -Label: Ion counting -+ 'Ion counting' SubClassOf 'Label-free quantification' - -Class: http://edamontology.org/operation_3715 -Label: Metabolic labeling -+ 'Metabolic labeling' SubClassOf 'Labeled quantification' - -# EDAM\_1.13.owl - -## Classes modified: - -Class: http://edamontology.org/data_0899 -Label: Protein structural motifs and surfaces -- 'Protein structural motifs and 
surfaces' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structural motifs and surfaces' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0894 -Label: Amino acid annotation -- 'Amino acid annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Amino acid annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0895 -Label: Peptide annotation -- 'Peptide annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Peptide annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0891 -Label: Sequence-3D profile alignment -- 'Sequence-3D profile alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence-3D profile alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0876 -Label: Protein features report (secondary structure) -- 'Protein features report (secondary structure)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (secondary structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0877 -Label: Protein features report (super-secondary) -- 'Protein features report (super-secondary)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (super-secondary)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0875 -Label: Protein topology -- 'Protein topology' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein topology' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0879 -Label: Secondary structure alignment metadata (protein) -- 'Secondary structure alignment metadata (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure alignment metadata (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0885 -Label: Structure database search 
results -- 'Structure database search results' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure database search results' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0889 -Label: Structural profile -- 'Structural profile' SubClassOf 'has topic' some 'Structure comparison' -+ 'Structural profile' SubClassOf 'has topic' some 'Structure analysis' - -Class: http://edamontology.org/data_0882 -Label: Secondary structure alignment metadata (RNA) -- 'Secondary structure alignment metadata (RNA)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure alignment metadata (RNA)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0884 -Label: Tertiary structure record -- 'Tertiary structure record' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tertiary structure record' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1703 -Label: ChEBI entry format -- 'ChEBI entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ChEBI entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1702 -Label: ChemSpider entry format -- 'ChemSpider entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ChemSpider entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1701 -Label: PubChem entry format -- 'PubChem entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PubChem entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1700 -Label: KEGG GLYCAN entry format -- 'KEGG GLYCAN entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG GLYCAN entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1565 -Label: Protein-protein interaction report -- 'Protein-protein interaction report' SubClassOf 'Obsolete concept (EDAM)' -+ 
'Protein-protein interaction report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1564 -Label: Protein fold recognition report -- 'Protein fold recognition report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein fold recognition report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1561 -Label: CATH functional category -- 'CATH functional category' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH functional category' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1560 -Label: CATH structurally similar group -- 'CATH structurally similar group' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH structurally similar group' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1706 -Label: KEGG DRUG entry format -- 'KEGG DRUG entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG DRUG entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1567 -Label: Protein-nucleic acid interactions report -- 'Protein-nucleic acid interactions report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-nucleic acid interactions report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1704 -Label: MSDchem ligand dictionary entry format -- 'MSDchem ligand dictionary entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'MSDchem ligand dictionary entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1591 -Label: Vienna RNA parameters -- 'Vienna RNA parameters' SubClassOf 'Obsolete concept (EDAM)' -+ 'Vienna RNA parameters' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1592 -Label: Vienna RNA structure constraints -- 'Vienna RNA structure constraints' SubClassOf 'Obsolete concept (EDAM)' -+ 
'Vienna RNA structure constraints' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1593 -Label: Vienna RNA concentration data -- 'Vienna RNA concentration data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Vienna RNA concentration data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1594 -Label: Vienna RNA calculated energy -- 'Vienna RNA calculated energy' SubClassOf 'Obsolete concept (EDAM)' -+ 'Vienna RNA calculated energy' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1599 -Label: Codon adaptation index -- 'Codon adaptation index' SubClassOf 'Obsolete concept (EDAM)' -+ 'Codon adaptation index' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1586 -Label: Nucleic acid melting temperature -- 'Nucleic acid melting temperature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid melting temperature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0005 -Label: Resource type -- 'Resource type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Resource type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0007 -Label: Tool -- 'Tool' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tool' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2197 -Label: OWL format -- 'OWL format' SubClassOf 'Ontology format' - -Class: http://edamontology.org/format_2188 -Label: UniProt format -- 'UniProt format' SubClassOf 'Obsolete concept (EDAM)' -+ 'UniProt format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2189 -Label: ipi -- 'ipi' SubClassOf 'Obsolete concept (EDAM)' -+ 'ipi' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1747 -Label: PDB atom record format -- 'PDB 
atom record format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PDB atom record format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1760 -Label: CATH chain report format -- 'CATH chain report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH chain report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1761 -Label: CATH PDB report format -- 'CATH PDB report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH PDB report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0851 -Label: Sequence mask character -- 'Sequence mask character' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence mask character' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0859 -Label: Sequence signature model -- 'Sequence signature model' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence signature model' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0852 -Label: Sequence mask type -- 'Sequence mask type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence mask type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0855 -Label: Sequence metadata -- 'Sequence metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0854 -Label: Sequence length specification -- 'Sequence length specification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence length specification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2159 -Label: Gene features (coding region) format -- 'Gene features (coding region) format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features (coding region) format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass 
- -Class: http://edamontology.org/data_0861 -Label: Sequence alignment (words) -- 'Sequence alignment (words)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment (words)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0866 -Label: Sequence alignment metadata -- 'Sequence alignment metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0864 -Label: Sequence alignment parameter -- 'Sequence alignment parameter' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment parameter' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1782 -Label: NCBI gene report format -- 'NCBI gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'NCBI gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0831 -Label: MeSH vocabulary -- 'MeSH vocabulary' SubClassOf 'Obsolete concept (EDAM)' -+ 'MeSH vocabulary' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0832 -Label: HGNC vocabulary -- 'HGNC vocabulary' SubClassOf 'Obsolete concept (EDAM)' -+ 'HGNC vocabulary' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0835 -Label: UMLS vocabulary -- 'UMLS vocabulary' SubClassOf 'Obsolete concept (EDAM)' -+ 'UMLS vocabulary' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2175 -Label: Gene cluster format -- 'Gene cluster format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene cluster format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0843 -Label: Database entry -- 'Database entry' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database entry' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/operation_2993 -Label: Molecular interaction data processing -- 'Molecular interaction data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular interaction data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1431 -Label: Phylogenetic property values format -- 'Phylogenetic property values format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic property values format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0922 -Label: Primers -- 'Primers' SubClassOf 'Gene transcription features' -+ 'Primers' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2948 -Label: Molecular interaction analysis -- 'Molecular interaction analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular interaction analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2364 -Label: 2D PAGE report -- '2D PAGE report' SubClassOf 'Obsolete concept (EDAM)' -+ '2D PAGE report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2363 -Label: 2D PAGE data -- '2D PAGE data' SubClassOf 'Obsolete concept (EDAM)' -+ '2D PAGE data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2360 -Label: Domain-domain interaction (indirect) -- 'Domain-domain interaction (indirect)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Domain-domain interaction (indirect)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2952 -Label: Structure alignment processing -- 'Structure alignment processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2951 -Label: Alignment processing -- 'Alignment processing' 
SubClassOf 'Obsolete concept (EDAM)' -+ 'Alignment processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2359 -Label: Domain-domain interactions -- 'Domain-domain interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Domain-domain interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2358 -Label: Domain-nucleic acid interaction report -- 'Domain-nucleic acid interaction report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Domain-nucleic acid interaction report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2357 -Label: Protein signature type -- 'Protein signature type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein signature type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3563 -Label: RNA-seq read count analysis -- 'RNA-seq read count analysis' SubClassOf 'Nucleic acid sequence analysis' -+ 'RNA-seq read count analysis' SubClassOf http://edamontology.org/operation_3680 - -Class: http://edamontology.org/operation_3565 -Label: RNA-seq time series data analysis -- 'RNA-seq time series data analysis' SubClassOf 'Nucleic acid sequence analysis' -+ 'RNA-seq time series data analysis' SubClassOf http://edamontology.org/operation_3680 - -Class: http://edamontology.org/operation_2946 -Label: Alignment analysis -- 'Alignment analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Alignment analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2300 -Label: Gene name (NCBI) -- 'Gene name (NCBI)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene name (NCBI)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2307 -Label: Virus annotation -- 'Virus annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Virus annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - 
-Class: http://edamontology.org/data_2308 -Label: Virus annotation (taxonomy) -- 'Virus annotation (taxonomy)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Virus annotation (taxonomy)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2932 -Label: Hopp and Woods plotting -- 'Hopp and Woods plotting' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hopp and Woods plotting' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2931 -Label: Secondary structure comparison -- 'Secondary structure comparison' SubClassOf 'has topic' some 'Structure comparison' -+ 'Secondary structure comparison' SubClassOf 'has topic' some 'Structure analysis' - -Class: http://edamontology.org/data_0901 -Label: Protein features report (domains) -- 'Protein features report (domains)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (domains)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0900 -Label: Protein domain classification -- 'Protein domain classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein domain classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0903 -Label: Protein folding report -- 'Protein folding report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein folding report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0902 -Label: Protein architecture report -- 'Protein architecture report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein architecture report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1536 -Label: MHC peptide immunogenicity report -- 'MHC peptide immunogenicity report' SubClassOf 'Obsolete concept (EDAM)' -+ 'MHC peptide immunogenicity report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1533 -Label: Protein subcellular localization -- 'Protein subcellular localization' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein subcellular localization' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1541 -Label: Protein flexibility or motion report -- 'Protein flexibility or motion report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein flexibility or motion report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1543 -Label: Protein surface report -- 'Protein surface report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein surface report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1540 -Label: Protein non-covalent interactions report -- 'Protein non-covalent interactions report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein non-covalent interactions report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1559 -Label: CATH homologous superfamily -- 'CATH homologous superfamily' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH homologous superfamily' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1558 -Label: CATH topology -- 'CATH topology' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH topology' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1557 -Label: CATH architecture -- 'CATH architecture' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH architecture' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1556 -Label: CATH class -- 'CATH class' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH class' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1555 -Label: EMBASSY domain classification -- 'EMBASSY domain classification' SubClassOf 'Obsolete concept 
(EDAM)' -+ 'EMBASSY domain classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1554 -Label: SCOP node -- 'SCOP node' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP node' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1553 -Label: CATH node -- 'CATH node' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH node' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1550 -Label: Protein non-canonical interactions -- 'Protein non-canonical interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein non-canonical interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2381 -Label: Experiment report (genotyping) -- 'Experiment report (genotyping)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Experiment report (genotyping)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3545 -Label: Mathematical modelling -- 'Mathematical modelling' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mathematical modelling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2372 -Label: 2D PAGE spot report -- '2D PAGE spot report' SubClassOf 'Obsolete concept (EDAM)' -+ '2D PAGE spot report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2378 -Label: Protein-motif interaction -- 'Protein-motif interaction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-motif interaction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3427 -Label: RNAi report -- 'RNAi report' SubClassOf 'Obsolete concept (EDAM)' -+ 'RNAi report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3426 -Label: Proteomics experiment report -- 'Proteomics experiment report' SubClassOf 'Obsolete concept 
(EDAM)' -+ 'Proteomics experiment report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3428 -Label: Simulation experiment report -- 'Simulation experiment report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Simulation experiment report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1517 -Label: Restriction enzyme report -- 'Restriction enzyme report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Restriction enzyme report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2399 -Label: Gene transcriptional features report -- 'Gene transcriptional features report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene transcriptional features report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2397 -Label: Gene features report (exon) -- 'Gene features report (exon)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features report (exon)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2395 -Label: Fungi annotation -- 'Fungi annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Fungi annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2396 -Label: Fungi annotation (anamorph) -- 'Fungi annotation (anamorph)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Fungi annotation (anamorph)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1509 -Label: Enzyme report -- 'Enzyme report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Enzyme report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2230 -Label: Classification -- 'Classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2232 -Label: Lipoproteins -- 
'Lipoproteins' SubClassOf 'Obsolete concept (EDAM)' -+ 'Lipoproteins' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2243 -Label: phylip property values -- 'phylip property values' SubClassOf 'Obsolete concept (EDAM)' -+ 'phylip property values' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2225 -Label: Protein databases -- 'Protein databases' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein databases' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2226 -Label: Structure determination -- 'Structure determination' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure determination' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1490 -Label: Multiple protein tertiary structure alignment (C-alpha atoms) -- 'Multiple protein tertiary structure alignment (C-alpha atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple protein tertiary structure alignment (C-alpha atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1491 -Label: Structure alignment (nucleic acid pair) -- 'Structure alignment (nucleic acid pair)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (nucleic acid pair)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1492 -Label: Multiple nucleic acid tertiary structure alignment -- 'Multiple nucleic acid tertiary structure alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple nucleic acid tertiary structure alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1496 -Label: Molecular similarity score -- 'Molecular similarity score' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular similarity score' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1495 -Label: DaliLite hit table -- 'DaliLite hit table' SubClassOf 'Obsolete concept (EDAM)' -+ 'DaliLite hit table' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1480 -Label: Structure alignment (multiple) -- 'Structure alignment (multiple)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (multiple)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1489 -Label: Multiple protein tertiary structure alignment (all atoms) -- 'Multiple protein tertiary structure alignment (all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple protein tertiary structure alignment (all atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1488 -Label: Pairwise protein tertiary structure alignment (C-alpha atoms) -- 'Pairwise protein tertiary structure alignment (C-alpha atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pairwise protein tertiary structure alignment (C-alpha atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1487 -Label: Pairwise protein tertiary structure alignment (all atoms) -- 'Pairwise protein tertiary structure alignment (all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pairwise protein tertiary structure alignment (all atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1486 -Label: Structure alignment (protein C-alpha atoms) -- 'Structure alignment (protein C-alpha atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (protein C-alpha atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1485 -Label: Structure alignment (protein all atoms) -- 'Structure alignment (protein all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (protein all atoms)' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1484 -Label: Multiple protein tertiary structure alignment -- 'Multiple protein tertiary structure alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple protein tertiary structure alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1483 -Label: Structure alignment (protein pair) -- 'Structure alignment (protein pair)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (protein pair)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2210 -Label: Strain data format -- 'Strain data format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Strain data format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2211 -Label: CIP strain data format -- 'CIP strain data format' SubClassOf 'Obsolete concept (EDAM)' -+ 'CIP strain data format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2271 -Label: Structure database search -- 'Structure database search' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2278 -Label: Transmembrane protein prediction -- 'Transmembrane protein prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transmembrane protein prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2277 -Label: SNP -- 'SNP' SubClassOf 'DNA polymorphism' -+ 'SNP' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2276 -Label: Protein function prediction -- 'Protein function prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein function prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1471 -Label: Protein 
chain (all atoms) -- 'Protein chain (all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein chain (all atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1472 -Label: Protein chain (C-alpha atoms) -- 'Protein chain (C-alpha atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein chain (C-alpha atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1473 -Label: Protein domain (all atoms) -- 'Protein domain (all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein domain (all atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1474 -Label: Protein domain (C-alpha atoms) -- 'Protein domain (C-alpha atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein domain (C-alpha atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2202 -Label: Sequence record full format -- 'Sequence record full format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence record full format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2203 -Label: Sequence record lite format -- 'Sequence record lite format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence record lite format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1604 -Label: DictyBase gene report format -- 'DictyBase gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'DictyBase gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1603 -Label: Ensembl gene report format -- 'Ensembl gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1607 -Label: EcoCyc gene report format -- 'EcoCyc gene report format' SubClassOf 
'Obsolete concept (EDAM)' -+ 'EcoCyc gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1608 -Label: FlyBase gene report format -- 'FlyBase gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'FlyBase gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1605 -Label: CGD gene report format -- 'CGD gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'CGD gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1469 -Label: Protein structure (all atoms) -- 'Protein structure (all atoms)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structure (all atoms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1606 -Label: DragonDB gene report format -- 'DragonDB gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'DragonDB gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1609 -Label: Gramene gene report format -- 'Gramene gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gramene gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2257 -Label: Phylogeny visualisation -- 'Phylogeny visualisation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogeny visualisation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1453 -Label: Amino acid comparison matrix (floats) -- 'Amino acid comparison matrix (floats)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Amino acid comparison matrix (floats)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1456 -Label: Protein features report (membrane regions) -- 'Protein features report (membrane regions)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein 
features report (membrane regions)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1450 -Label: Nucleotide comparison matrix (integers) -- 'Nucleotide comparison matrix (integers)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleotide comparison matrix (integers)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1451 -Label: Nucleotide comparison matrix (floats) -- 'Nucleotide comparison matrix (floats)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleotide comparison matrix (floats)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1452 -Label: Amino acid comparison matrix (integers) -- 'Amino acid comparison matrix (integers)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Amino acid comparison matrix (integers)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1446 -Label: Comparison matrix (integers) -- 'Comparison matrix (integers)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Comparison matrix (integers)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1447 -Label: Comparison matrix (floats) -- 'Comparison matrix (floats)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Comparison matrix (floats)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1443 -Label: Phylogenetic tree report (tree stratigraphic) -- 'Phylogenetic tree report (tree stratigraphic)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic tree report (tree stratigraphic)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1440 -Label: Phylogenetic tree report (tree shape) -- 'Phylogenetic tree report (tree shape)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic tree report (tree shape)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1441 -Label: Phylogenetic tree report (tree evaluation) -- 'Phylogenetic tree report (tree evaluation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic tree report (tree evaluation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1649 -Label: HumanCyc entry format -- 'HumanCyc entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'HumanCyc entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1513 -Label: KEGG REACTION enzyme report format -- 'KEGG REACTION enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG REACTION enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1514 -Label: KEGG ENZYME enzyme report format -- 'KEGG ENZYME enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG ENZYME enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1515 -Label: REBASE proto enzyme report format -- 'REBASE proto enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'REBASE proto enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1516 -Label: REBASE withrefm enzyme report format -- 'REBASE withrefm enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'REBASE withrefm enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2400 -Label: Toxin annotation -- 'Toxin annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Toxin annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1645 -Label: EMDB entry format -- 'EMDB entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMDB entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_2402 -Label: Protein-drug interaction report -- 'Protein-drug interaction report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-drug interaction report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1511 -Label: IntEnz enzyme report format -- 'IntEnz enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'IntEnz enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1647 -Label: KEGG PATHWAY entry format -- 'KEGG PATHWAY entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG PATHWAY entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2401 -Label: Protein report (membrane protein) -- 'Protein report (membrane protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein report (membrane protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1512 -Label: BRENDA enzyme report format -- 'BRENDA enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'BRENDA enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1648 -Label: MetaCyc entry format -- 'MetaCyc entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'MetaCyc entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1640 -Label: ArrayExpress entry format -- 'ArrayExpress entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ArrayExpress entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1619 -Label: TIGR gene report format -- 'TIGR gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'TIGR gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1618 -Label: ZFIN gene report format -- 'ZFIN gene 
report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ZFIN gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1617 -Label: WormBase gene report format -- 'WormBase gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'WormBase gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1616 -Label: TAIR gene report format -- 'TAIR gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'TAIR gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1500 -Label: Domainatrix 3D-1D scoring matrix format -- 'Domainatrix 3D-1D scoring matrix format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Domainatrix 3D-1D scoring matrix format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1614 -Label: SGD gene report format -- 'SGD gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'SGD gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1615 -Label: GeneDB gene report format -- 'GeneDB gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'GeneDB gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1612 -Label: MGD gene report format -- 'MGD gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'MGD gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1613 -Label: RGD gene report format -- 'RGD gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'RGD gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1610 -Label: KEGG GENES gene report format -- 'KEGG GENES gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG GENES gene report format' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1611 -Label: MaizeGDB gene report format -- 'MaizeGDB gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'MaizeGDB gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1623 -Label: OMIM entry format -- 'OMIM entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'OMIM entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1624 -Label: HGVbase entry format -- 'HGVbase entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'HGVbase entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1625 -Label: HIVDB entry format -- 'HIVDB entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'HIVDB entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1626 -Label: KEGG DISEASE entry format -- 'KEGG DISEASE entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG DISEASE entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1620 -Label: dbSNP polymorphism report format -- 'dbSNP polymorphism report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'dbSNP polymorphism report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2280 -Label: Nucleic acid structure comparison -- 'Nucleic acid structure comparison' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid structure comparison' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1563 -Label: SMART domain assignment report format -- 'SMART domain assignment report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'SMART domain assignment report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/format_1569 -Label: IntAct entry format -- 'IntAct entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'IntAct entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1568 -Label: BIND entry format -- 'BIND entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'BIND entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0079 -Label: Metabolites -- 'Metabolites' SubClassOf 'Small molecules' -+ 'Metabolites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0078 -Label: Proteins -- 'Proteins' SubClassOf 'Biochemistry' -+ 'Proteins' SubClassOf 'Computational biology' - -Class: http://edamontology.org/topic_0077 -Label: Nucleic acids -- 'Nucleic acids' SubClassOf 'Biochemistry' -+ 'Nucleic acids' SubClassOf 'Computational biology' - -Class: http://edamontology.org/format_1651 -Label: PATIKA entry format -- 'PATIKA entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PATIKA entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1650 -Label: INOH entry format -- 'INOH entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'INOH entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0084 -Label: Phylogeny -+ 'Phylogeny' SubClassOf 'Computational biology' - -Class: http://edamontology.org/topic_0083 -Label: Alignment -- 'Alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1655 -Label: Panther Pathways entry format -- 'Panther Pathways entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Panther Pathways entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1654 -Label: CPDB entry format -- 'CPDB entry format' 
SubClassOf 'Obsolete concept (EDAM)' -+ 'CPDB entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1653 -Label: aMAZE entry format -- 'aMAZE entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'aMAZE entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1652 -Label: Reactome entry format -- 'Reactome entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Reactome entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0090 -Label: Information retrieval -- 'Information retrieval' SubClassOf 'Data management' -+ 'Information retrieval' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0094 -Label: Nucleic acid thermodynamics -- 'Nucleic acid thermodynamics' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid thermodynamics' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0097 -Label: Nucleic acid structure analysis -+ 'Nucleic acid structure analysis' SubClassOf 'Nucleic acids' - -Class: http://edamontology.org/format_1666 -Label: BioModel mathematical model format -- 'BioModel mathematical model format' SubClassOf 'Obsolete concept (EDAM)' -+ 'BioModel mathematical model format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3440 -Label: Genome assembly -- 'Genome assembly' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genome assembly' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1579 -Label: TIGRFam entry format -- 'TIGRFam entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'TIGRFam entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1577 -Label: SMART entry format -- 'SMART entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'SMART 
entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1578 -Label: Superfamily entry format -- 'Superfamily entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Superfamily entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1575 -Label: Panther Families and HMMs entry format -- 'Panther Families and HMMs entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Panther Families and HMMs entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1576 -Label: Pfam entry format -- 'Pfam entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pfam entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1573 -Label: PIRSF entry format -- 'PIRSF entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PIRSF entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1574 -Label: PRINTS entry format -- 'PRINTS entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PRINTS entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1571 -Label: InterPro entry abstract format -- 'InterPro entry abstract format' SubClassOf 'Obsolete concept (EDAM)' -+ 'InterPro entry abstract format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1572 -Label: Gene3D entry format -- 'Gene3D entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene3D entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1570 -Label: InterPro entry format -- 'InterPro entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'InterPro entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0803 -Label: Human disease -- 'Human 
disease' SubClassOf 'Obsolete concept (EDAM)' -+ 'Human disease' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1697 -Label: KEGG LIGAND entry format -- 'KEGG LIGAND entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG LIGAND entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1698 -Label: KEGG COMPOUND entry format -- 'KEGG COMPOUND entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG COMPOUND entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1699 -Label: KEGG PLANT entry format -- 'KEGG PLANT entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'KEGG PLANT entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1580 -Label: ProDom entry format -- 'ProDom entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ProDom entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1581 -Label: FSSP entry format -- 'FSSP entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'FSSP entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0111 -Label: Promoters -- 'Promoters' SubClassOf 'Obsolete concept (EDAM)' -+ 'Promoters' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0110 -Label: Transcription -- 'Transcription' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcription' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0114 -Label: Gene structure -- 'Gene structure' SubClassOf 'Nucleic acid sites, features and motifs' - -Class: http://edamontology.org/topic_0112 -Label: Nucleic acid folding -- 'Nucleic acid folding' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid folding' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0100 -Label: Nucleic acid restriction -- 'Nucleic acid restriction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid restriction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0107 -Label: Genetic codes and codon usage -- 'Genetic codes and codon usage' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genetic codes and codon usage' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0109 -Label: Gene finding -- 'Gene finding' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene finding' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0133 -Label: Two-dimensional gel electrophoresis -- 'Two-dimensional gel electrophoresis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Two-dimensional gel electrophoresis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0137 -Label: Protein hydropathy -- 'Protein hydropathy' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein hydropathy' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0134 -Label: Mass spectrometry -- 'Mass spectrometry' SubClassOf 'Proteomics experiment' -+ 'Mass spectrometry' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0135 -Label: Protein microarrays -- 'Protein microarrays' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein microarrays' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1420 -Label: Sequence-profile alignment (fingerprint) -- 'Sequence-profile alignment (fingerprint)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence-profile alignment (fingerprint)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1438 -Label: Phylogenetic report -- 'Phylogenetic 
report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1400 -Label: Terminal gap penalty -- 'Terminal gap penalty' SubClassOf 'Obsolete concept (EDAM)' -+ 'Terminal gap penalty' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1405 -Label: Gap opening penalty (float) -- 'Gap opening penalty (float)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap opening penalty (float)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1404 -Label: Gap opening penalty (integer) -- 'Gap opening penalty (integer)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap opening penalty (integer)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1407 -Label: Gap extension penalty (float) -- 'Gap extension penalty (float)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap extension penalty (float)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1406 -Label: Gap extension penalty (integer) -- 'Gap extension penalty (integer)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap extension penalty (integer)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1409 -Label: Gap separation penalty (float) -- 'Gap separation penalty (float)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap separation penalty (float)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1408 -Label: Gap separation penalty (integer) -- 'Gap separation penalty (integer)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gap separation penalty (integer)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1416 -Label: Sequence alignment report (site correlation) -- 'Sequence alignment report (site correlation)' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Sequence alignment report (site correlation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1415 -Label: Sequence alignment report (site conservation) -- 'Sequence alignment report (site conservation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment report (site conservation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1414 -Label: Sequence alignment metadata (quality report) -- 'Sequence alignment metadata (quality report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment metadata (quality report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1418 -Label: Sequence-profile alignment (HMM) -- 'Sequence-profile alignment (HMM)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence-profile alignment (HMM)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1417 -Label: Sequence-profile alignment (Domainatrix signature) -- 'Sequence-profile alignment (Domainatrix signature)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence-profile alignment (Domainatrix signature)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1330 -Label: MHC Class II epitopes report -- 'MHC Class II epitopes report' SubClassOf 'Obsolete concept (EDAM)' -+ 'MHC Class II epitopes report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1331 -Label: Protein features (PEST sites) -- 'Protein features (PEST sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features (PEST sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2090 -Label: Database entry version information -- 'Database entry version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database entry version information' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1338 -Label: Sequence database hits scores list -- 'Sequence database hits scores list' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence database hits scores list' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2092 -Label: SNP -- 'SNP' SubClassOf 'Obsolete concept (EDAM)' -+ 'SNP' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1339 -Label: Sequence database hits alignments list -- 'Sequence database hits alignments list' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence database hits alignments list' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1323 -Label: Protein features report (cleavage sites) -- 'Protein features report (cleavage sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (cleavage sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1324 -Label: Protein features (post-translation modifications) -- 'Protein features (post-translation modifications)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features (post-translation modifications)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1321 -Label: Protein features (sites) -- 'Protein features (sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features (sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1322 -Label: Protein features report (signal peptides) -- 'Protein features report (signal peptides)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (signal peptides)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1329 -Label: MHC Class I epitopes report -- 'MHC Class I epitopes report' SubClassOf 'Obsolete concept (EDAM)' -+ 
'MHC Class I epitopes report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1327 -Label: Protein features (epitopes) -- 'Protein features (epitopes)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features (epitopes)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1328 -Label: Protein features report (nucleic acid binding sites) -- 'Protein features report (nucleic acid binding sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (nucleic acid binding sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1325 -Label: Protein features report (active sites) -- 'Protein features report (active sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (active sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1326 -Label: Protein features report (binding sites) -- 'Protein features report (binding sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (binding sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2303 -Label: STRING entry format (HTML) -- 'STRING entry format (HTML)' SubClassOf 'Obsolete concept (EDAM)' -+ 'STRING entry format (HTML)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1358 -Label: Prosite nucleotide pattern -- 'Prosite nucleotide pattern' SubClassOf 'Obsolete concept (EDAM)' -+ 'Prosite nucleotide pattern' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1359 -Label: Prosite protein pattern -- 'Prosite protein pattern' SubClassOf 'Obsolete concept (EDAM)' -+ 'Prosite protein pattern' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1959 -Label: selex sequence format -- 'selex sequence format' 
SubClassOf 'Obsolete concept (EDAM)' -+ 'selex sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2334 -Label: URI format -- 'URI format' SubClassOf 'Obsolete concept (EDAM)' -+ 'URI format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2399 -Label: Gene transcription features -- 'Gene transcription features' SubClassOf 'Gene structure' -+ 'Gene transcription features' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1956 -Label: phylipnon sequence format -- 'phylipnon sequence format' SubClassOf 'Obsolete concept (EDAM)' -+ 'phylipnon sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2397 -Label: Exons -- 'Exons' SubClassOf 'Coding RNA' -+ 'Exons' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1955 -Label: phylip sequence format -- 'phylip sequence format' SubClassOf 'Obsolete concept (EDAM)' -+ 'phylip sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1340 -Label: Sequence database hits evaluation data -- 'Sequence database hits evaluation data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence database hits evaluation data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1345 -Label: MEME background frequencies file -- 'MEME background frequencies file' SubClassOf 'Obsolete concept (EDAM)' -+ 'MEME background frequencies file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1346 -Label: MEME motifs directive file -- 'MEME motifs directive file' SubClassOf 'Obsolete concept (EDAM)' -+ 'MEME motifs directive file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1344 -Label: MEME motif 
alphabet -- 'MEME motif alphabet' SubClassOf 'Obsolete concept (EDAM)' -+ 'MEME motif alphabet' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1348 -Label: HMM emission and transition counts -- 'HMM emission and transition counts' SubClassOf 'Obsolete concept (EDAM)' -+ 'HMM emission and transition counts' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2323 -Label: ENZYME enzyme report format -- 'ENZYME enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ENZYME enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2329 -Label: GeneCards gene report format -- 'GeneCards gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'GeneCards gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1965 -Label: treecon sequence format -- 'treecon sequence format' SubClassOf 'Obsolete concept (EDAM)' -+ 'treecon sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2328 -Label: PseudoCAP gene report format -- 'PseudoCAP gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'PseudoCAP gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2322 -Label: BioCyc enzyme report format -- 'BioCyc enzyme report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'BioCyc enzyme report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1377 -Label: Protein conserved site signature -- 'Protein conserved site signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein conserved site signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1376 -Label: Protein site signature -- 'Protein site signature' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Protein site signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1379 -Label: Protein binding site signature -- 'Protein binding site signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein binding site signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1378 -Label: Protein active site signature -- 'Protein active site signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein active site signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1373 -Label: Protein domain signature -- 'Protein domain signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein domain signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1372 -Label: Protein family signature -- 'Protein family signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein family signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1375 -Label: Protein repeat signature -- 'Protein repeat signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein repeat signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1374 -Label: Protein region signature -- 'Protein region signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein region signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1371 -Label: HMMER NULL hidden Markov model -- 'HMMER NULL hidden Markov model' SubClassOf 'Obsolete concept (EDAM)' -+ 'HMMER NULL hidden Markov model' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1368 -Label: Domainatrix signature -- 'Domainatrix signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Domainatrix signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - 
-Class: http://edamontology.org/format_3476 -Label: Gene expression data format -- 'Gene expression data format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene expression data format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2341 -Label: NCI-Nature pathway entry format -- 'NCI-Nature pathway entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'NCI-Nature pathway entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1918 -Label: Atomic data format -- 'Atomic data format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Atomic data format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1395 -Label: Score end gaps control -- 'Score end gaps control' SubClassOf 'Obsolete concept (EDAM)' -+ 'Score end gaps control' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1396 -Label: Aligned sequence order -- 'Aligned sequence order' SubClassOf 'Obsolete concept (EDAM)' -+ 'Aligned sequence order' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1390 -Label: Multiple protein sequence alignment -- 'Multiple protein sequence alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple protein sequence alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1924 -Label: clustal sequence format -- 'clustal sequence format' SubClassOf 'Obsolete concept (EDAM)' -+ 'clustal sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1386 -Label: Sequence alignment (nucleic acid pair) -- 'Sequence alignment (nucleic acid pair)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment (nucleic acid pair)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1389 -Label: Multiple nucleotide 
sequence alignment -- 'Multiple nucleotide sequence alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple nucleotide sequence alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1388 -Label: Hybrid sequence alignment (pair) -- 'Hybrid sequence alignment (pair)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hybrid sequence alignment (pair)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1387 -Label: Sequence alignment (protein pair) -- 'Sequence alignment (protein pair)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment (protein pair)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1382 -Label: Sequence alignment (multiple) -- 'Sequence alignment (multiple)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment (multiple)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1380 -Label: Protein post-translational modification signature -- 'Protein post-translational modification signature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein post-translational modification signature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0504 -Label: Multiple structure alignment construction -- 'Multiple structure alignment construction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Multiple structure alignment construction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0500 -Label: Secondary structure alignment generation -- 'Secondary structure alignment generation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure alignment generation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0507 -Label: Pairwise structure alignment generation (local) -- 'Pairwise structure alignment generation (local)' 
SubClassOf 'Obsolete concept (EDAM)' -+ 'Pairwise structure alignment generation (local)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0508 -Label: Pairwise structure alignment generation (global) -- 'Pairwise structure alignment generation (global)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pairwise structure alignment generation (global)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0505 -Label: Structure alignment (protein) -- 'Structure alignment (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0506 -Label: Structure alignment (RNA) -- 'Structure alignment (RNA)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment (RNA)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0515 -Label: Data retrieval (tool metadata) -- 'Data retrieval (tool metadata)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (tool metadata)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0514 -Label: Structural profile alignment generation (multiple) -- 'Structural profile alignment generation (multiple)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structural profile alignment generation (multiple)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0512 -Label: Sequence alignment generation (multiple profile) -- 'Sequence alignment generation (multiple profile)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment generation (multiple profile)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3413 -Label: Infectious tropical disease -- 'Infectious tropical disease' SubClassOf 'Infectious disease' -+ 'Infectious tropical disease' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0188 -Label: Sequence profiles and HMMs -- 'Sequence profiles and HMMs' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence profiles and HMMs' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0184 -Label: Threading -- 'Threading' SubClassOf 'Obsolete concept (EDAM)' -+ 'Threading' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0183 -Label: Structure alignment -- 'Structure alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0182 -Label: Sequence alignment -- 'Sequence alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0180 -Label: Protein fold recognition -- 'Protein fold recognition' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein fold recognition' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_3466 -Label: EPS -- 'EPS' SubClassOf 'Textual format' -+ 'EPS' SubClassOf http://edamontology.org/format_3696 - -Class: http://edamontology.org/topic_0199 -Label: Genetic variation -- 'Genetic variation' SubClassOf 'Genetics' -- 'Genetic variation' SubClassOf 'Nucleic acid sites, features and motifs' -+ 'Genetic variation' SubClassOf 'Molecular genetics' - -Class: http://edamontology.org/topic_0195 -Label: Virtual PCR -- 'Virtual PCR' SubClassOf 'Obsolete concept (EDAM)' -+ 'Virtual PCR' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0191 -Label: Phylogeny reconstruction -- 'Phylogeny reconstruction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogeny reconstruction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/format_1977 -Label: swiss feature -- 'swiss feature' SubClassOf 'Obsolete concept (EDAM)' -+ 'swiss feature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1976 -Label: pir -- 'pir' SubClassOf 'Obsolete concept (EDAM)' -+ 'pir' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0166 -Label: Protein structural motifs and surfaces -- 'Protein structural motifs and surfaces' SubClassOf 'Protein sites, features and motifs' - -Class: http://edamontology.org/format_1971 -Label: meganon sequence format -- 'meganon sequence format' SubClassOf 'Obsolete concept (EDAM)' -+ 'meganon sequence format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0163 -Label: Sequence database search -- 'Sequence database search' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0164 -Label: Sequence clustering -- 'Sequence clustering' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence clustering' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0167 -Label: Structural (3D) profiles -- 'Structural (3D) profiles' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structural (3D) profiles' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1980 -Label: EMBL feature -- 'EMBL feature' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBL feature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1981 -Label: GenBank feature -- 'GenBank feature' SubClassOf 'Obsolete concept (EDAM)' -+ 'GenBank feature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0172 -Label: Protein structure prediction -- 'Protein structure prediction' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Protein structure prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0173 -Label: Nucleic acid structure prediction -- 'Nucleic acid structure prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid structure prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0174 -Label: Ab initio structure prediction -- 'Ab initio structure prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ab initio structure prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0175 -Label: Homology modelling -- 'Homology modelling' SubClassOf 'Obsolete concept (EDAM)' -+ 'Homology modelling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0177 -Label: Molecular docking -- 'Molecular docking' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular docking' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0179 -Label: Protein tertiary structure prediction -- 'Protein tertiary structure prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein tertiary structure prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0178 -Label: Protein secondary structure prediction -- 'Protein secondary structure prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein secondary structure prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0559 -Label: Immunogenicity prediction -- 'Immunogenicity prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Immunogenicity prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0517 -Label: PCR primer design (for large scale sequencing) -- 'PCR primer design (for large scale sequencing)' SubClassOf 'has 
topic' some 'Sequencing' -- 'PCR primer design (for large scale sequencing)' SubClassOf 'PCR primer design' -+ 'PCR primer design (for large scale sequencing)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0516 -Label: Data retrieval (database metadata) -- 'Data retrieval (database metadata)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (database metadata)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0519 -Label: PCR primer design (for gene transcription profiling) -- 'PCR primer design (for gene transcription profiling)' SubClassOf 'has topic' some 'Gene expression' -- 'PCR primer design (for gene transcription profiling)' SubClassOf 'PCR primer design' -+ 'PCR primer design (for gene transcription profiling)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0518 -Label: PCR primer design (for genotyping polymorphisms) -- 'PCR primer design (for genotyping polymorphisms)' SubClassOf 'PCR primer design' -+ 'PCR primer design (for genotyping polymorphisms)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1995 -Label: nexusnon alignment format -- 'nexusnon alignment format' SubClassOf 'Obsolete concept (EDAM)' -+ 'nexusnon alignment format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0143 -Label: Protein structure comparison -- 'Protein structure comparison' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structure comparison' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0144 -Label: Protein residue interactions -- 'Protein residue interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1993 -Label: msf alignment 
format -- 'msf alignment format' SubClassOf 'Obsolete concept (EDAM)' -+ 'msf alignment format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0141 -Label: Protein cleavage sites and proteolysis -- 'Protein cleavage sites and proteolysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein cleavage sites and proteolysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1994 -Label: nexus alignment format -- 'nexus alignment format' SubClassOf 'Obsolete concept (EDAM)' -+ 'nexus alignment format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0149 -Label: Protein-nucleic acid interactions -- 'Protein-nucleic acid interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-nucleic acid interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0148 -Label: Protein-ligand interactions -- 'Protein-ligand interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-ligand interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0147 -Label: Protein-protein interactions -- 'Protein-protein interactions' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-protein interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0520 -Label: PCR primer design (for conserved primers) -- 'PCR primer design (for conserved primers)' SubClassOf 'PCR primer design' -+ 'PCR primer design (for conserved primers)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0521 -Label: PCR primer design (based on gene structure) -- 'PCR primer design (based on gene structure)' SubClassOf 'has topic' some 'Gene structure' -- 'PCR primer design (based on gene structure)' SubClassOf 'PCR primer design' -+ 'PCR primer design (based on gene 
structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0522 -Label: PCR primer design (for methylation PCRs) -- 'PCR primer design (for methylation PCRs)' SubClassOf 'PCR primer design' -+ 'PCR primer design (for methylation PCRs)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0529 -Label: MPSS data processing -- 'MPSS data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'MPSS data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0528 -Label: SAGE data processing -- 'SAGE data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'SAGE data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0152 -Label: Carbohydrates -- 'Carbohydrates' SubClassOf 'Biochemistry' -+ 'Carbohydrates' SubClassOf 'Structure analysis' - -Class: http://edamontology.org/topic_0153 -Label: Lipids -- 'Lipids' SubClassOf 'Biochemistry' -+ 'Lipids' SubClassOf 'Structure analysis' - -Class: http://edamontology.org/topic_0154 -Label: Small molecules -- 'Small molecules' SubClassOf 'Biochemistry' -+ 'Small molecules' SubClassOf 'Structure analysis' - -Class: http://edamontology.org/topic_0150 -Label: Protein design -- 'Protein design' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein design' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0151 -Label: G protein-coupled receptors (GPCR) -- 'G protein-coupled receptors (GPCR)' SubClassOf 'Obsolete concept (EDAM)' -+ 'G protein-coupled receptors (GPCR)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0156 -Label: Sequence editing -- 'Sequence editing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence editing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0159 -Label: 
Sequence comparison -- 'Sequence comparison' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence comparison' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0158 -Label: Sequence motifs -- 'Sequence motifs' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motifs' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0532 -Label: Gene expression profile analysis -- 'Gene expression profile analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene expression profile analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0530 -Label: SBS data processing -- 'SBS data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'SBS data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0536 -Label: Protein structure assignment (from X-ray crystallographic data) -- 'Protein structure assignment (from X-ray crystallographic data)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structure assignment (from X-ray crystallographic data)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0537 -Label: Protein structure assignment (from NMR data) -- 'Protein structure assignment (from NMR data)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structure assignment (from NMR data)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0574 -Label: Sequence motif rendering -- 'Sequence motif rendering' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motif rendering' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0577 -Label: DNA linear map rendering -- 'DNA linear map rendering' SubClassOf 'Obsolete concept (EDAM)' -+ 'DNA linear map rendering' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/operation_3353 -Label: Ontology comparison -- 'Ontology comparison' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ontology comparison' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0563 -Label: Codon usage table formatting -- 'Codon usage table formatting' SubClassOf 'Obsolete concept (EDAM)' -+ 'Codon usage table formatting' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0562 -Label: Sequence alignment formatting -- 'Sequence alignment formatting' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment formatting' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0222 -Label: Genome annotation -- 'Genome annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genome annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0561 -Label: Sequence formatting -- 'Sequence formatting' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence formatting' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0221 -Label: Sequence annotation -- 'Sequence annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0220 -Label: Document, record and content management -- 'Document, record and content management' SubClassOf 'Data management' -+ 'Document, record and content management' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0217 -Label: Literature analysis -- 'Literature analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Literature analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0215 -Label: Worms -- 'Worms' SubClassOf 'Obsolete concept (EDAM)' -+ 'Worms' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0211 -Label: Flies -- 'Flies' SubClassOf 'Obsolete concept (EDAM)' -+ 'Flies' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2100 -Label: Type -- 'Type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0210 -Label: Fish -- 'Fish' SubClassOf 'Obsolete concept (EDAM)' -+ 'Fish' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2103 -Label: Gene name (KEGG GENES) -- 'Gene name (KEGG GENES)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene name (KEGG GENES)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0200 -Label: Microarrays -- 'Microarrays' SubClassOf 'Obsolete concept (EDAM)' -+ 'Microarrays' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0203 -Label: Gene expression -- 'Gene expression' SubClassOf 'Genetics' -+ 'Gene expression' SubClassOf 'Molecular genetics' - -Class: http://edamontology.org/data_2925 -Label: Sequence data -- 'Sequence data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2927 -Label: Codon usage -- 'Codon usage' SubClassOf 'Obsolete concept (EDAM)' -+ 'Codon usage' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1228 -Label: UniGene entry format -- 'UniGene entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'UniGene entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3515 -Label: Protein-drug interactions -- 'Protein-drug interactions' SubClassOf 'Protein-ligand interactions' -+ 'Protein-drug interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - 
-Class: http://edamontology.org/topic_3514 -Label: Protein-ligand interactions -- 'Protein-ligand interactions' SubClassOf 'Protein interactions' -+ 'Protein-ligand interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3511 -Label: Nucleic acid sites, features and motifs -+ 'Nucleic acid sites, features and motifs' SubClassOf 'Nucleic acids' - -Class: http://edamontology.org/topic_3510 -Label: Protein sites, features and motifs -+ 'Protein sites, features and motifs' SubClassOf 'Proteins' - -Class: http://edamontology.org/topic_3526 -Label: Protein-protein interactions -- 'Protein-protein interactions' SubClassOf 'Protein interactions' -+ 'Protein-protein interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3527 -Label: Cellular process pathways -- 'Cellular process pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Cellular process pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3528 -Label: Disease pathways -- 'Disease pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Disease pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3529 -Label: Environmental information processing pathways -- 'Environmental information processing pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Environmental information processing pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3522 -Label: Northern blot experiment -- 'Northern blot experiment' SubClassOf 'Proteomics experiment' -+ 'Northern blot experiment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3525 -Label: Protein-nucleic acid interactions -- 'Protein-nucleic acid interactions' SubClassOf 'Protein interactions' -+ 
'Protein-nucleic acid interactions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3521 -Label: 2D PAGE experiment -- '2D PAGE experiment' SubClassOf 'Proteomics experiment' -+ '2D PAGE experiment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1247 -Label: COG sequence cluster format -- 'COG sequence cluster format' SubClassOf 'Obsolete concept (EDAM)' -+ 'COG sequence cluster format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2116 -Label: Nucleic acid features (codon) -- 'Nucleic acid features (codon)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (codon)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2142 -Label: EMBOSS graph -- 'EMBOSS graph' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS graph' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2143 -Label: EMBOSS report -- 'EMBOSS report' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2145 -Label: Sequence offset -- 'Sequence offset' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence offset' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2146 -Label: Threshold -- 'Threshold' SubClassOf 'Obsolete concept (EDAM)' -+ 'Threshold' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2147 -Label: Protein report (transcription factor) -- 'Protein report (transcription factor)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein report (transcription factor)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2149 -Label: Database category name -- 'Database category name' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database 
category name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2954 -Label: Article report -- 'Article report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Article report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2246 -Label: Demonstration -- 'Demonstration' SubClassOf 'Obsolete concept (EDAM)' -+ 'Demonstration' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1308 -Label: Matrix/scaffold attachment sites -- 'Matrix/scaffold attachment sites' SubClassOf 'Gene transcription features' -+ 'Matrix/scaffold attachment sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1307 -Label: Splice sites -- 'Splice sites' SubClassOf 'Gene transcript features' -- 'Splice sites' SubClassOf 'RNA splicing' -+ 'Splice sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1305 -Label: Restriction sites -- 'Restriction sites' SubClassOf 'DNA binding sites' -+ 'Restriction sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1304 -Label: CpG island and isochores -- 'CpG island and isochores' SubClassOf 'Gene transcription features' -+ 'CpG island and isochores' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1302 -Label: PolyA signal or sites -- 'PolyA signal or sites' SubClassOf 'Gene transcript features' -+ 'PolyA signal or sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2141 -Label: Window step size -- 'Window step size' SubClassOf 'Obsolete concept (EDAM)' -+ 'Window step size' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2134 -Label: Results sort order -- 'Results sort order' SubClassOf 'Obsolete concept (EDAM)' -+ 'Results sort 
order' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2132 -Label: Mutation type -- 'Mutation type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mutation type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2135 -Label: Toggle -- 'Toggle' SubClassOf 'Obsolete concept (EDAM)' -+ 'Toggle' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2961 -Label: Gene regulatory network report -- 'Gene regulatory network report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene regulatory network report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2136 -Label: Sequence width -- 'Sequence width' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence width' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2967 -Label: Microarray image -- 'Microarray image' SubClassOf 'Obsolete concept (EDAM)' -+ 'Microarray image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2965 -Label: 2D PAGE gel report -- '2D PAGE gel report' SubClassOf 'Obsolete concept (EDAM)' -+ '2D PAGE gel report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2130 -Label: Sequence profile type -- 'Sequence profile type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence profile type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2169 -Label: Nucleic acid features (siRNA) -- 'Nucleic acid features (siRNA)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (siRNA)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2164 -Label: Protein sequence properties plot -- 'Protein sequence properties plot' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence properties plot' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2224 -Label: Data retrieval (ontology concept) -- 'Data retrieval (ontology concept)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (ontology concept)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2222 -Label: Data retrieval (ontology annotation) -- 'Data retrieval (ontology annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (ontology annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2157 -Label: Word composition -- 'Word composition' SubClassOf 'Obsolete concept (EDAM)' -+ 'Word composition' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2156 -Label: Date -- 'Date' SubClassOf 'Obsolete concept (EDAM)' -+ 'Date' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2237 -Label: Data retrieval (sequence profile) -- 'Data retrieval (sequence profile)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (sequence profile)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2239 -Label: 3D-1D scoring matrix generation -- '3D-1D scoring matrix generation' SubClassOf 'has topic' some 'Structure comparison' -+ '3D-1D scoring matrix generation' SubClassOf 'has topic' some 'Structure analysis' - -Class: http://edamontology.org/data_2152 -Label: Rendering parameter -- 'Rendering parameter' SubClassOf 'Obsolete concept (EDAM)' -+ 'Rendering parameter' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2151 -Label: Color -- 'Color' SubClassOf 'Obsolete concept (EDAM)' -+ 'Color' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2150 -Label: Sequence profile name -- 'Sequence profile name' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Sequence profile name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2234 -Label: Structure file processing -- 'Structure file processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure file processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2180 -Label: 2 or more -- '2 or more' SubClassOf 'Obsolete concept (EDAM)' -+ '2 or more' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2178 -Label: 1 or more -- '1 or more' SubClassOf 'Obsolete concept (EDAM)' -+ '1 or more' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2177 -Label: Exactly 1 -- 'Exactly 1' SubClassOf 'Obsolete concept (EDAM)' -+ 'Exactly 1' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2176 -Label: Cardinality -- 'Cardinality' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cardinality' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2179 -Label: Exactly 2 -- 'Exactly 2' SubClassOf 'Obsolete concept (EDAM)' -+ 'Exactly 2' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2173 -Label: Sequence set (stream) -- 'Sequence set (stream)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence set (stream)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2972 -Label: Workflow -- 'Workflow' SubClassOf 'Obsolete concept (EDAM)' -+ 'Workflow' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2971 -Label: Workflow data -- 'Workflow data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Workflow data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2973 -Label: Secondary structure data -- 'Secondary structure data' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Secondary structure data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1308 -Label: Nucleic acid features report (matrix/scaffold attachment sites) -- 'Nucleic acid features report (matrix/scaffold attachment sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (matrix/scaffold attachment sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1307 -Label: Nucleic acid features report (splice sites) -- 'Nucleic acid features report (splice sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (splice sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1309 -Label: Gene features (exonic splicing enhancer) -- 'Gene features (exonic splicing enhancer)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features (exonic splicing enhancer)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1304 -Label: Nucleic acid features report (CpG island and isochore) -- 'Nucleic acid features report (CpG island and isochore)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (CpG island and isochore)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1303 -Label: Nucleic acid features (quadruplexes) -- 'Nucleic acid features (quadruplexes)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (quadruplexes)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1306 -Label: Nucleosome exclusion sequences -- 'Nucleosome exclusion sequences' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleosome exclusion sequences' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2264 -Label: Data retrieval (pathway or network) -- 'Data retrieval (pathway or network)' 
SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (pathway or network)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1305 -Label: Nucleic acid features report (restriction sites) -- 'Nucleic acid features report (restriction sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (restriction sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2265 -Label: Data retrieval (identifier) -- 'Data retrieval (identifier)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (identifier)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1300 -Label: Gene and transcript structure (report) -- 'Gene and transcript structure (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene and transcript structure (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1302 -Label: Nucleic acid features report (PolyA signal or site) -- 'Nucleic acid features report (PolyA signal or site)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (PolyA signal or site)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1301 -Label: Mobile genetic elements -- 'Mobile genetic elements' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mobile genetic elements' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2989 -Label: Protein features report (key folding sites) -- 'Protein features report (key folding sites)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (key folding sites)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2987 -Label: Classification report -- 'Classification report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Classification report' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2986 -Label: Nucleic acid classification -- 'Nucleic acid classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2983 -Label: Pathway or network data -- 'Pathway or network data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pathway or network data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2982 -Label: Sequence profile data -- 'Sequence profile data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence profile data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2981 -Label: Sequence motif data -- 'Sequence motif data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motif data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2980 -Label: Protein classification -- 'Protein classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2198 -Label: Gene cluster -- 'Gene cluster' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene cluster' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1311 -Label: Operon -- 'Operon' SubClassOf 'Gene structure' -+ 'Operon' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1312 -Label: Promoters -- 'Promoters' SubClassOf 'Transcription factors and regulatory sites' -+ 'Promoters' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2191 -Label: Protein features report (chemical modifications) -- 'Protein features report (chemical modifications)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (chemical modifications)' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2192 -Label: Error -- 'Error' SubClassOf 'Obsolete concept (EDAM)' -+ 'Error' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1315 -Label: Transcription factor binding sites -- 'Transcription factor binding sites' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcription factor binding sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1314 -Label: Gene features (SECIS element) -- 'Gene features (SECIS element)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features (SECIS element)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1313 -Label: Coding region -- 'Coding region' SubClassOf 'Obsolete concept (EDAM)' -+ 'Coding region' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1312 -Label: Nucleic acid features report (promoters) -- 'Nucleic acid features report (promoters)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (promoters)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1311 -Label: Gene features report (operon) -- 'Gene features report (operon)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features report (operon)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1310 -Label: Nucleic acid features (microRNA) -- 'Nucleic acid features (microRNA)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (microRNA)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1237 -Label: HMMER synthetic sequences set -- 'HMMER synthetic sequences set' SubClassOf 'Obsolete concept (EDAM)' -+ 'HMMER synthetic sequences set' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1236 -Label: Psiblast checkpoint file -- 'Psiblast checkpoint file' SubClassOf 'Obsolete concept (EDAM)' -+ 'Psiblast checkpoint file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1810 -Label: ColiCard report format -- 'ColiCard report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'ColiCard report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1774 -Label: Annotation retrieval -- 'Annotation retrieval' SubClassOf 'Obsolete concept (EDAM)' -+ 'Annotation retrieval' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0992 -Label: Ligand identifier -- 'Ligand identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ligand identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1264 -Label: Sequence composition table -- 'Sequence composition table' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence composition table' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1269 -Label: DAS sequence feature annotation -- 'DAS sequence feature annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'DAS sequence feature annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1780 -Label: Sequence submission -- 'Sequence submission' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence submission' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1809 -Label: BacMap gene card format -- 'BacMap gene card format' SubClassOf 'Obsolete concept (EDAM)' -+ 'BacMap gene card format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_1808 -Label: GeneIlluminator gene report format -- 'GeneIlluminator gene report format' SubClassOf 'Obsolete concept (EDAM)' -+ 
'GeneIlluminator gene report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1256 -Label: Sequence features (comparative) -- 'Sequence features (comparative)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence features (comparative)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1258 -Label: Sequence property (nucleic acid) -- 'Sequence property (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence property (nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1257 -Label: Sequence property (protein) -- 'Sequence property (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence property (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1252 -Label: Sequence length range -- 'Sequence length range' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence length range' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1251 -Label: Window size -- 'Window size' SubClassOf 'Obsolete concept (EDAM)' -+ 'Window size' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1253 -Label: Sequence information report -- 'Sequence information report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence information report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1250 -Label: Word size -- 'Word size' SubClassOf 'Obsolete concept (EDAM)' -+ 'Word size' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1244 -Label: primersearch primer pairs sequence record -- 'primersearch primer pairs sequence record' SubClassOf 'Obsolete concept (EDAM)' -+ 'primersearch primer pairs sequence record' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1243 -Label: Primer3 mispriming library file -- 'Primer3 mispriming library file' SubClassOf 'Obsolete concept (EDAM)' -+ 'Primer3 mispriming library file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1242 -Label: Primer3 internal oligo mishybridizing library -- 'Primer3 internal oligo mishybridizing library' SubClassOf 'Obsolete concept (EDAM)' -+ 'Primer3 internal oligo mishybridizing library' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1241 -Label: vectorstrip cloning vector definition file -- 'vectorstrip cloning vector definition file' SubClassOf 'Obsolete concept (EDAM)' -+ 'vectorstrip cloning vector definition file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_3582 -Label: afg -- 'afg' SubClassOf 'Sequence assembly format' -+ 'afg' SubClassOf 'Sequence assembly format (text)' - -Class: http://edamontology.org/data_0948 -Label: Data resource definition -- 'Data resource definition' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data resource definition' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0952 -Label: EMBOSS database resource definition -- 'EMBOSS database resource definition' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS database resource definition' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0953 -Label: Version information -- 'Version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'Version information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3191 -Label: Trim to reference -- 'Trim to reference' SubClassOf 'Obsolete concept (EDAM)' -+ 'Trim to reference' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0959 -Label: Job metadata -- 'Job metadata' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Job metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3190 -Label: Trim vector -- 'Trim vector' SubClassOf 'Obsolete concept (EDAM)' -+ 'Trim vector' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0964 -Label: Scent annotation -- 'Scent annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Scent annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1294 -Label: GlobPlot domain image -- 'GlobPlot domain image' SubClassOf 'Obsolete concept (EDAM)' -+ 'GlobPlot domain image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1293 -Label: SMART protein schematic -- 'SMART protein schematic' SubClassOf 'Obsolete concept (EDAM)' -+ 'SMART protein schematic' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1292 -Label: InterPro architecture image -- 'InterPro architecture image' SubClassOf 'Obsolete concept (EDAM)' -+ 'InterPro architecture image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1291 -Label: InterPro detailed match image -- 'InterPro detailed match image' SubClassOf 'Obsolete concept (EDAM)' -+ 'InterPro detailed match image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1290 -Label: InterPro compact match image -- 'InterPro compact match image' SubClassOf 'Obsolete concept (EDAM)' -+ 'InterPro compact match image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3542 -Label: Protein secondary structure -- 'Protein secondary structure' SubClassOf 'Protein sites, features and motifs' - -Class: http://edamontology.org/topic_3543 -Label: Protein sequence repeats -- 'Protein sequence repeats' SubClassOf 'Protein sites, features and 
motifs' -+ 'Protein sequence repeats' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3540 -Label: Protein key folding sites -- 'Protein key folding sites' SubClassOf 'Protein sites, features and motifs' -- 'Protein key folding sites' SubClassOf 'Protein folding, stability and design' -+ 'Protein key folding sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3541 -Label: Protein post-translational modifications -- 'Protein post-translational modifications' SubClassOf 'Protein sites, features and motifs' -+ 'Protein post-translational modifications' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0978 -Label: Discrete entity identifier -- 'Discrete entity identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Discrete entity identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0979 -Label: Entity feature identifier -- 'Entity feature identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Entity feature identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0974 -Label: Entity identifier -- 'Entity identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Entity identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3544 -Label: Protein signal peptides -- 'Protein signal peptides' SubClassOf 'Protein sites, features and motifs' -+ 'Protein signal peptides' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0975 -Label: Data resource identifier -- 'Data resource identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data resource identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1299 -Label: Sequence features (repeats) -- 'Sequence features (repeats)' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Sequence features (repeats)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1298 -Label: Sequence motif matches -- 'Sequence motif matches' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motif matches' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1281 -Label: Sequence signature map -- 'Sequence signature map' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence signature map' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3530 -Label: Genetic information processing pathways -- 'Genetic information processing pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Genetic information processing pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3531 -Label: Protein super-secondary structure -- 'Protein super-secondary structure' SubClassOf 'Protein structural motifs and surfaces' -+ 'Protein super-secondary structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3538 -Label: Protein disordered structure -- 'Protein disordered structure' SubClassOf 'Protein sites, features and motifs' -+ 'Protein disordered structure' SubClassOf 'Protein structure analysis' - -Class: http://edamontology.org/topic_3537 -Label: Protein chemical modifications -- 'Protein chemical modifications' SubClassOf 'Protein sites, features and motifs' -+ 'Protein chemical modifications' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3539 -Label: Protein domains -- 'Protein domains' SubClassOf 'Protein sites, features and motifs' -- 'Protein domains' SubClassOf 'Protein domains and folds' -+ 'Protein domains' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3534 -Label: Protein binding sites -- 'Protein binding 
sites' SubClassOf 'Protein structural motifs and surfaces' -+ 'Protein binding sites' SubClassOf 'Protein sites, features and motifs' - -Class: http://edamontology.org/topic_3533 -Label: Protein active sites -- 'Protein active sites' SubClassOf 'Protein structural motifs and surfaces' -+ 'Protein active sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0986 -Label: Chemical identifier -- 'Chemical identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Chemical identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3536 -Label: Protein cleavage sites -- 'Protein cleavage sites' SubClassOf 'Protein sites, features and motifs' -+ 'Protein cleavage sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0985 -Label: Molecule type -- 'Molecule type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecule type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3535 -Label: Protein-nucleic acid binding sites -- 'Protein-nucleic acid binding sites' SubClassOf 'Protein structural motifs and surfaces' -+ 'Protein-nucleic acid binding sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0981 -Label: Phenomenon identifier -- 'Phenomenon identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phenomenon identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0980 -Label: Entity collection identifier -- 'Entity collection identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Entity collection identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0904 -Label: Protein features (mutation) -- 'Protein features (mutation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features (mutation)' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0911 -Label: Nucleotide base annotation -- 'Nucleotide base annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleotide base annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0918 -Label: DNA variation -- 'DNA variation' SubClassOf 'Obsolete concept (EDAM)' -+ 'DNA variation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0917 -Label: Gene classification -- 'Gene classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0919 -Label: Chromosome report -- 'Chromosome report' SubClassOf 'has topic' some 'Chromosomes' - -Class: http://edamontology.org/data_0922 -Label: Nucleic acid features report (primers) -- 'Nucleic acid features report (primers)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (primers)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0923 -Label: PCR experiment report -- 'PCR experiment report' SubClassOf 'Obsolete concept (EDAM)' -+ 'PCR experiment report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0933 -Label: SAGE experimental data -- 'SAGE experimental data' SubClassOf 'Obsolete concept (EDAM)' -+ 'SAGE experimental data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0934 -Label: MPSS experimental data -- 'MPSS experimental data' SubClassOf 'Obsolete concept (EDAM)' -+ 'MPSS experimental data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0935 -Label: SBS experimental data -- 'SBS experimental data' SubClassOf 'Obsolete concept (EDAM)' -+ 'SBS experimental data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass 
- -Class: http://edamontology.org/data_0931 -Label: Microarray experiment report -- 'Microarray experiment report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Microarray experiment report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3189 -Label: Trim ends -- 'Trim ends' SubClassOf 'Obsolete concept (EDAM)' -+ 'Trim ends' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0932 -Label: Oligonucleotide probe data -- 'Oligonucleotide probe data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Oligonucleotide probe data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0946 -Label: Pathway or network annotation -- 'Pathway or network annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pathway or network annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0947 -Label: Biological pathway map -- 'Biological pathway map' SubClassOf 'Obsolete concept (EDAM)' -+ 'Biological pathway map' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2201 -Label: Sequence record full -- 'Sequence record full' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence record full' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3259 -Label: Transcriptome assembly (de novo) -- 'Transcriptome assembly (de novo)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcriptome assembly (de novo)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3260 -Label: Transcriptome assembly (mapping) -- 'Transcriptome assembly (mapping)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcriptome assembly (mapping)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2240 -Label: Heterogen annotation -- 'Heterogen annotation' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Heterogen annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2248 -Label: Schema -- 'Schema' SubClassOf 'Obsolete concept (EDAM)' -+ 'Schema' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2247 -Label: Phylogenetic consensus tree -- 'Phylogenetic consensus tree' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic consensus tree' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2245 -Label: Sequence set (bootstrapped) -- 'Sequence set (bootstrapped)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence set (bootstrapped)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2242 -Label: Phylogenetic property values -- 'Phylogenetic property values' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic property values' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2249 -Label: DTD -- 'DTD' SubClassOf 'Obsolete concept (EDAM)' -+ 'DTD' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3200 -Label: DNA barcoding -+ 'DNA barcoding' SubClassOf 'Taxonomic classification' - -Class: http://edamontology.org/data_2235 -Label: Raw SCOP domain classification -- 'Raw SCOP domain classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Raw SCOP domain classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2236 -Label: Raw CATH domain classification -- 'Raw CATH domain classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Raw CATH domain classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3213 -Label: Genome indexing (suffix arrays) -- 'Genome indexing (suffix arrays)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genome indexing (suffix 
arrays)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3212 -Label: Genome indexing (Burrows-Wheeler) -- 'Genome indexing (Burrows-Wheeler)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genome indexing (Burrows-Wheeler)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://www.geneontology.org/formats/oboInOwl#ObsoleteClass -Label: Obsolete concept (EDAM) -- 'Obsolete concept (EDAM)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_2213 -Label: Mutation annotation (prevalence) -- 'Mutation annotation (prevalence)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mutation annotation (prevalence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2212 -Label: Mutation annotation (basic) -- 'Mutation annotation (basic)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mutation annotation (basic)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2215 -Label: Mutation annotation (functional) -- 'Mutation annotation (functional)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mutation annotation (functional)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2214 -Label: Mutation annotation (prognostic) -- 'Mutation annotation (prognostic)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mutation annotation (prognostic)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2217 -Label: Tumor annotation -- 'Tumor annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tumor annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2218 -Label: Server metadata -- 'Server metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Server metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2289 -Label: Sequence identifier (nucleic acid) 
-- 'Sequence identifier (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence identifier (nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2288 -Label: Sequence identifier (protein) -- 'Sequence identifier (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence identifier (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2123 -Label: Small molecule data processing -- 'Small molecule data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Small molecule data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2122 -Label: Sequence alignment file processing -- 'Sequence alignment file processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment file processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2120 -Label: Listfile processing -- 'Listfile processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Listfile processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1768 -Label: Nucleic acid folding family identification -- 'Nucleic acid folding family identification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid folding family identification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2252 -Label: XSLT stylesheet -- 'XSLT stylesheet' SubClassOf 'Obsolete concept (EDAM)' -+ 'XSLT stylesheet' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2251 -Label: Relax-NG schema -- 'Relax-NG schema' SubClassOf 'Obsolete concept (EDAM)' -+ 'Relax-NG schema' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2250 -Label: XML Schema -- 'XML Schema' SubClassOf 'Obsolete concept (EDAM)' -+ 'XML Schema' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2296 -Label: Gene name (AceView) -- 'Gene name (AceView)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene name (AceView)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1456 -Label: Protein membrane regions -- 'Protein membrane regions' SubClassOf 'Protein domains and folds' -- 'Protein membrane regions' SubClassOf 'Protein domains' -+ 'Protein membrane regions' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0317 -Label: EST and cDNA sequence analysis -- 'EST and cDNA sequence analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'EST and cDNA sequence analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0316 -Label: Functional profiling -- 'Functional profiling' SubClassOf 'Obsolete concept (EDAM)' -+ 'Functional profiling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0312 -Label: Sequencing-based expression profile data processing -- 'Sequencing-based expression profile data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequencing-based expression profile data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2748 -Label: Database name (Osteogenesis) -- 'Database name (Osteogenesis)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database name (Osteogenesis)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2018 -Label: Annotation -- 'Annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2747 -Label: Database name (CMD) -- 'Database name (CMD)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database name (CMD)' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2740 -Label: Gene name (Genolist) -- 'Gene name (Genolist)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene name (Genolist)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2743 -Label: Gene name (HUGO) -- 'Gene name (HUGO)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene name (HUGO)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2735 -Label: Database name (SwissRegulon) -- 'Database name (SwissRegulon)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database name (SwissRegulon)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2733 -Label: Genus name (virus) -- 'Genus name (virus)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genus name (virus)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2734 -Label: Family name (virus) -- 'Family name (virus)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Family name (virus)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2028 -Label: Experimental data -- 'Experimental data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Experimental data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2023 -Label: Sequence mask parameter -- 'Sequence mask parameter' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence mask parameter' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2022 -Label: Vienna RNA structural data -- 'Vienna RNA structural data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Vienna RNA structural data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2767 -Label: Identifier with metadata -- 'Identifier with metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Identifier with metadata' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2768 -Label: Gene symbol annotation -- 'Gene symbol annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene symbol annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2763 -Label: Locus annotation -- 'Locus annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Locus annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2765 -Label: Term ID list -- 'Term ID list' SubClassOf 'Obsolete concept (EDAM)' -+ 'Term ID list' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2041 -Label: Genome version information -- 'Genome version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genome version information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2046 -Label: Nucleic acid sequence record (lite) -- 'Nucleic acid sequence record (lite)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sequence record (lite)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2043 -Label: Sequence record lite -- 'Sequence record lite' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence record lite' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2754 -Label: Gene features report (intron) -- 'Gene features report (intron)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene features report (intron)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2047 -Label: Protein sequence record (lite) -- 'Protein sequence record (lite)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence record (lite)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2053 -Label: Structural data -- 'Structural data' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Structural data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2661 -Label: Toxins and targets -- 'Toxins and targets' SubClassOf 'Toxicology' -- 'Toxins and targets' SubClassOf 'Small molecules' -+ 'Toxins and targets' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2079 -Label: Search parameter -- 'Search parameter' SubClassOf 'Obsolete concept (EDAM)' -+ 'Search parameter' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2722 -Label: Protein features report (disordered structure) -- 'Protein features report (disordered structure)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (disordered structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2724 -Label: Embryo report -- 'Embryo report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Embryo report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2726 -Label: Inhibitor annotation -- 'Inhibitor annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Inhibitor annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2081 -Label: Secondary structure -- 'Secondary structure' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2083 -Label: Alignment data -- 'Alignment data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Alignment data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3147 -Label: Mass spectrometry experiment -- 'Mass spectrometry experiment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mass spectrometry experiment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3143 -Label: SCOP 
superfamily -- 'SCOP superfamily' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP superfamily' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3144 -Label: SCOP family -- 'SCOP family' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP family' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3145 -Label: SCOP protein -- 'SCOP protein' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP protein' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3146 -Label: SCOP species -- 'SCOP species' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP species' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3295 -Label: Epigenetics -- 'Epigenetics' SubClassOf 'Molecular genetics' -+ 'Epigenetics' SubClassOf 'Genetics' - -Class: http://edamontology.org/data_3140 -Label: Nucleic acid features (immunoglobulin gene structure) -- 'Nucleic acid features (immunoglobulin gene structure)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (immunoglobulin gene structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3142 -Label: SCOP fold -- 'SCOP fold' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP fold' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3141 -Label: SCOP class -- 'SCOP class' SubClassOf 'Obsolete concept (EDAM)' -+ 'SCOP class' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1125 -Label: Comparison matrix type -- 'Comparison matrix type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Comparison matrix type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1121 -Label: BLAST sequence alignment type -- 'BLAST sequence alignment type' SubClassOf 'Obsolete concept (EDAM)' -+ 'BLAST sequence alignment type' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1122 -Label: Phylogenetic tree type -- 'Phylogenetic tree type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Phylogenetic tree type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1120 -Label: Sequence alignment type -- 'Sequence alignment type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3165 -Label: NGS experiment -- 'NGS experiment' SubClassOf 'Obsolete concept (EDAM)' -+ 'NGS experiment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1152 -Label: HIVDB identifier -- 'HIVDB identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'HIVDB identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1156 -Label: Pathway ID (aMAZE) -- 'Pathway ID (aMAZE)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Pathway ID (aMAZE)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3105 -Label: Geotemporal metadata -- 'Geotemporal metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Geotemporal metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3101 -Label: Protein domain classification node -- 'Protein domain classification node' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein domain classification node' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1906 -Label: Quantitative trait locus -- 'Quantitative trait locus' SubClassOf 'Obsolete concept (EDAM)' -+ 'Quantitative trait locus' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2542 -Label: Protein features (domains) format -- 'Protein features (domains) format' SubClassOf 'Obsolete concept 
(EDAM)' -+ 'Protein features (domains) format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3116 -Label: Microarray protocol annotation -- 'Microarray protocol annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Microarray protocol annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3119 -Label: Sequence features (compositionally-biased regions) -- 'Sequence features (compositionally-biased regions)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence features (compositionally-biased regions)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3118 -Label: Protein features report (topological domains) -- 'Protein features report (topological domains)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (topological domains)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3122 -Label: Nucleic acid features (difference and change) -- 'Nucleic acid features (difference and change)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (difference and change)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3123 -Label: Nucleic acid features report (expression signal) -- 'Nucleic acid features report (expression signal)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (expression signal)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3126 -Label: Nucleic acid repeats (report) -- 'Nucleic acid repeats (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid repeats (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3125 -Label: Nucleic acid features report (binding) -- 'Nucleic acid features report (binding)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features 
report (binding)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3127 -Label: Nucleic acid features report (replication and recombination) -- 'Nucleic acid features report (replication and recombination)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (replication and recombination)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2560 -Label: STRING entry format -- 'STRING entry format' SubClassOf 'Obsolete concept (EDAM)' -+ 'STRING entry format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3129 -Label: Protein features report (repeats) -- 'Protein features report (repeats)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein features report (repeats)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2562 -Label: Amino acid identifier format -- 'Amino acid identifier format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Amino acid identifier format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1101 -Label: TREMBL accession -- 'TREMBL accession' SubClassOf 'Obsolete concept (EDAM)' -+ 'TREMBL accession' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3135 -Label: Nucleic acid features report (signal or transit peptide) -- 'Nucleic acid features report (signal or transit peptide)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (signal or transit peptide)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3133 -Label: Nucleic acid features (stem loop) -- 'Nucleic acid features (stem loop)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (stem loop)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3132 -Label: Nucleic acid 
features (d-loop) -- 'Nucleic acid features (d-loop)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features (d-loop)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3139 -Label: Nucleic acid features report (STS) -- 'Nucleic acid features report (STS)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (STS)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3138 -Label: Transcriptional features (report) -- 'Transcriptional features (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcriptional features (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3137 -Label: Non-coding RNA -- 'Non-coding RNA' SubClassOf 'Obsolete concept (EDAM)' -+ 'Non-coding RNA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2009 -Label: Ordered locus name -- 'Ordered locus name' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ordered locus name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1111 -Label: EMBOSS listfile -- 'EMBOSS listfile' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS listfile' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1110 -Label: EMBOSS sequence type -- 'EMBOSS sequence type' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS sequence type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3130 -Label: Sequence motif matches (protein) -- 'Sequence motif matches (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motif matches (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3131 -Label: Sequence motif matches (nucleic acid) -- 'Sequence motif matches (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence motif matches 
(nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1832 -Label: Residue contact calculation (residue-nucleic acid) -- 'Residue contact calculation (residue-nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Residue contact calculation (residue-nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1835 -Label: Residue contact calculation (residue-negative ion) -- 'Residue contact calculation (residue-negative ion)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Residue contact calculation (residue-negative ion)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1826 -Label: Full torsion angle calculation -- 'Full torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Full torsion angle calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1825 -Label: Backbone torsion angle calculation -- 'Backbone torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Backbone torsion angle calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1828 -Label: Tau angle calculation -- 'Tau angle calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tau angle calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1827 -Label: Cysteine torsion angle calculation -- 'Cysteine torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cysteine torsion angle calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1845 -Label: PDB file sequence retrieval -- 'PDB file sequence retrieval' SubClassOf 'Obsolete concept (EDAM)' -+ 'PDB file sequence retrieval' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/operation_1846 -Label: HET group detection -- 'HET group detection' SubClassOf 'Obsolete concept (EDAM)' -+ 'HET group detection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1841 -Label: Rotamer likelihood prediction -- 'Rotamer likelihood prediction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Rotamer likelihood prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1842 -Label: Proline mutation value calculation -- 'Proline mutation value calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Proline mutation value calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1838 -Label: Residue contact calculation (residue-ligand) -- 'Residue contact calculation (residue-ligand)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Residue contact calculation (residue-ligand)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1837 -Label: Residue symmetry contact calculation -- 'Residue symmetry contact calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Residue symmetry contact calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1813 -Label: Sequence retrieval -- 'Sequence retrieval' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence retrieval' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1820 -Label: Protein residue surface calculation (vacuum accessible) -- 'Protein residue surface calculation (vacuum accessible)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue surface calculation (vacuum accessible)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1823 -Label: Protein surface calculation (accessible molecular) -- 'Protein surface calculation (accessible 
molecular)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein surface calculation (accessible molecular)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1824 -Label: Protein surface calculation (accessible) -- 'Protein surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein surface calculation (accessible)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1821 -Label: Protein residue surface calculation (accessible molecular) -- 'Protein residue surface calculation (accessible molecular)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue surface calculation (accessible molecular)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1822 -Label: Protein residue surface calculation (vacuum molecular) -- 'Protein residue surface calculation (vacuum molecular)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue surface calculation (vacuum molecular)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1817 -Label: Protein atom surface calculation (accessible) -- 'Protein atom surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein atom surface calculation (accessible)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1814 -Label: Structure retrieval -- 'Structure retrieval' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure retrieval' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1819 -Label: Protein residue surface calculation (accessible) -- 'Protein residue surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue surface calculation (accessible)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1818 -Label: 
Protein atom surface calculation (accessible molecular) -- 'Protein atom surface calculation (accessible molecular)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein atom surface calculation (accessible molecular)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2496 -Label: Gene regulatory network processing -- 'Gene regulatory network processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene regulatory network processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2498 -Label: Sequencing-based expression profile data analysis -- 'Sequencing-based expression profile data analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequencing-based expression profile data analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2491 -Label: Hydrogen bond calculation (inter-residue) -- 'Hydrogen bond calculation (inter-residue)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hydrogen bond calculation (inter-residue)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2493 -Label: Codon usage data processing -- 'Codon usage data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Codon usage data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2490 -Label: Residue contact calculation (residue-residue) -- 'Residue contact calculation (residue-residue)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Residue contact calculation (residue-residue)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2482 -Label: Secondary structure processing -- 'Secondary structure processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2483 -Label: 
Structure comparison -- 'Structure comparison' SubClassOf 'has topic' some 'Structure comparison' -+ 'Structure comparison' SubClassOf 'has topic' some 'Structure analysis' - -Class: http://edamontology.org/operation_3084 -Label: Protein function prediction (from sequence) -- 'Protein function prediction (from sequence)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein function prediction (from sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2470 -Label: Data retrieval (protein family annotation) -- 'Data retrieval (protein family annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (protein family annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2471 -Label: Data retrieval (RNA family annotation) -- 'Data retrieval (RNA family annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (RNA family annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2472 -Label: Data retrieval (gene annotation) -- 'Data retrieval (gene annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (gene annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2473 -Label: Data retrieval (genotype and phenotype annotation) -- 'Data retrieval (genotype and phenotype annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (genotype and phenotype annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3090 -Label: Protein feature prediction (from structure) -- 'Protein feature prediction (from structure)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein feature prediction (from structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_3093 -Label: Database search (by sequence) -- 
'Database search (by sequence)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database search (by sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1847 -Label: DSSP secondary structure assignment -- 'DSSP secondary structure assignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'DSSP secondary structure assignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_1848 -Label: Structure formatting -- 'Structure formatting' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure formatting' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2468 -Label: Data retrieval (phylogenetic tree) -- 'Data retrieval (phylogenetic tree)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (phylogenetic tree)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2469 -Label: Data retrieval (protein interaction annotation) -- 'Data retrieval (protein interaction annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (protein interaction annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2466 -Label: Map annotation -- 'Map annotation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Map annotation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2467 -Label: Data retrieval (protein annotation) -- 'Data retrieval (protein annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (protein annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3086 -Label: Nucleic acid sequence composition (report) -- 'Nucleic acid sequence composition (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sequence composition (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/operation_2460 -Label: Protein atom surface calculation -- 'Protein atom surface calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein atom surface calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2461 -Label: Protein residue surface calculation -- 'Protein residue surface calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein residue surface calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3085 -Label: Protein sequence composition -- 'Protein sequence composition' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence composition' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2465 -Label: Structure processing -- 'Structure processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2462 -Label: Protein surface calculation -- 'Protein surface calculation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein surface calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2463 -Label: Sequence alignment processing -- 'Sequence alignment processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence alignment processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2453 -Label: Feature table processing -- 'Feature table processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Feature table processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2452 -Label: Sequence cluster processing -- 'Sequence cluster processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence cluster processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/operation_0385
-Label: Protein hydropathy cluster calculation
-- 'Protein hydropathy cluster calculation' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein hydropathy cluster calculation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0383
-Label: Protein hydropathy calculation (from structure)
-- 'Protein hydropathy calculation (from structure)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein hydropathy calculation (from structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0388
-Label: Protein binding site prediction (from structure)
-- 'Protein binding site prediction (from structure)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein binding site prediction (from structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2459
-Label: Structure processing (protein)
-- 'Structure processing (protein)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure processing (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2440
-Label: Structure processing (RNA)
-- 'Structure processing (RNA)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure processing (RNA)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2443
-Label: Phylogenetic tree processing
-- 'Phylogenetic tree processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Phylogenetic tree processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0395
-Label: Residue non-canonical interaction detection
-- 'Residue non-canonical interaction detection' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Residue non-canonical interaction detection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2448
-Label: Sequence processing (nucleic acid)
-- 'Sequence processing (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence processing (nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2445
-Label: Protein interaction network processing
-- 'Protein interaction network processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein interaction network processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2444
-Label: Protein secondary structure processing
-- 'Protein secondary structure processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein secondary structure processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2447
-Label: Sequence processing (protein)
-- 'Sequence processing (protein)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence processing (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2446
-Label: Sequence processing
-- 'Sequence processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0360
-Label: Structural similarity search
-- 'Structural similarity search' SubClassOf 'has topic' some 'Structure comparison'
-
-Class: http://edamontology.org/operation_2432
-Label: Microarray data processing
-- 'Microarray data processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Microarray data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2435
-Label: Gene expression profile processing
-- 'Gene expression profile processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene expression profile processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2434
-Label: Data retrieval (codon usage table)
-- 'Data retrieval (codon usage table)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Data retrieval (codon usage table)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2438
-Label: Pathway or network processing
-- 'Pathway or network processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Pathway or network processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2420
-Label: Operation (typed)
-- 'Operation (typed)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Operation (typed)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0377
-Label: Sequence composition calculation (nucleic acid)
-- 'Sequence composition calculation (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence composition calculation (nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2427
-Label: Data handling
-- 'Data handling' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Data handling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0378
-Label: Sequence composition calculation (protein)
-- 'Sequence composition calculation (protein)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence composition calculation (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2409
-Label: File handling
-- 'File handling' SubClassOf 'Analysis'
-+ 'File handling' SubClassOf 'Operation'
-
-Class: http://edamontology.org/operation_2408
-Label: Sequence feature analysis
-- 'Sequence feature analysis' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence feature analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0345
-Label: Sequence retrieval (by keyword)
-- 'Sequence retrieval (by keyword)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence retrieval (by keyword)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2417
-Label: Physicochemical property data processing
-- 'Physicochemical property data processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Physicochemical property data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0343
-Label: Transmembrane protein database search
-- 'Transmembrane protein database search' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Transmembrane protein database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0344
-Label: Sequence retrieval (by code)
-- 'Sequence retrieval (by code)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence retrieval (by code)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2413
-Label: Sequence profile processing
-- 'Sequence profile processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence profile processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0347
-Label: Sequence database search (by motif or pattern)
-- 'Sequence database search (by motif or pattern)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by motif or pattern)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2411
-Label: Structural profile processing
-- 'Structural profile processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structural profile processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0348
-Label: Sequence database search (by amino acid composition)
-- 'Sequence database search (by amino acid composition)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by amino acid composition)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2412
-Label: Data index processing
-- 'Data index processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Data index processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2410
-Label: Gene expression analysis
-- 'Gene expression analysis' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene expression analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0341
-Label: Motif database search
-- 'Motif database search' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Motif database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0342
-Label: Sequence profile database search
-- 'Sequence profile database search' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence profile database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0340
-Label: Protein secondary database search
-- 'Protein secondary database search' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein secondary database search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2788
-Label: Sequence profile data
-- 'Sequence profile data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence profile data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0354
-Label: Sequence database search (by sequence for primer sequences)
-- 'Sequence database search (by sequence for primer sequences)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by sequence for primer sequences)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2404
-Label: Sequence motif processing
-- 'Sequence motif processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence motif processing' SubClassOf 'Sequence analysis'
-
-Class: http://edamontology.org/operation_0355
-Label: Sequence database search (by molecular weight)
-- 'Sequence database search (by molecular weight)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by molecular weight)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2405
-Label: Protein interaction data processing
-- 'Protein interaction data processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein interaction data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0356
-Label: Sequence database search (by isoelectric point)
-- 'Sequence database search (by isoelectric point)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by isoelectric point)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0357
-Label: Structure retrieval (by code)
-- 'Structure retrieval (by code)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure retrieval (by code)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_2407
-Label: Annotation processing
-- 'Annotation processing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Annotation processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0358
-Label: Structure retrieval (by keyword)
-- 'Structure retrieval (by keyword)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure retrieval (by keyword)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0359
-Label: Structure database search (by sequence)
-- 'Structure database search (by sequence)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure database search (by sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0350
-Label: Sequence database search (by sequence using word-based methods)
-- 'Sequence database search (by sequence using word-based methods)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by sequence using word-based methods)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0351
-Label: Sequence database search (by sequence using profile-based methods)
-- 'Sequence database search (by sequence using profile-based methods)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by sequence using profile-based methods)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0352
-Label: Sequence database search (by sequence using local alignment-based methods)
-- 'Sequence database search (by sequence using local alignment-based methods)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by sequence using local alignment-based methods)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0353
-Label: Sequence database search (by sequence using global alignment-based methods)
-- 'Sequence database search (by sequence using global alignment-based methods)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database search (by sequence using global alignment-based methods)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0318
-Label: Structural genomics target selection
-- 'Structural genomics target selection' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structural genomics target selection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0332
-Label: Immunogen design
-- 'Immunogen design' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Immunogen design' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0330
-Label: Protein SNP mapping
-- 'Protein SNP mapping' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein SNP mapping' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1890
-Label: Gene name (Arabidopsis)
-- 'Gene name (Arabidopsis)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (Arabidopsis)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1892
-Label: Gene name (GeneFarm)
-- 'Gene name (GeneFarm)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (GeneFarm)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0414
-Label: Protein feature prediction (from sequence)
-- 'Protein feature prediction (from sequence)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein feature prediction (from sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0413
-Label: MHC peptide immunogenicity prediction
-- 'MHC peptide immunogenicity prediction' SubClassOf 'Obsolete concept (EDAM)'
-+ 'MHC peptide immunogenicity prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2880
-Label: Secondary structure image
-- 'Secondary structure image' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Secondary structure image' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2881
-Label: Secondary structure report
-- 'Secondary structure report' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Secondary structure report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2882
-Label: DNA features
-- 'DNA features' SubClassOf 'Obsolete concept (EDAM)'
-+ 'DNA features' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2883
-Label: RNA features report
-- 'RNA features report' SubClassOf 'Obsolete concept (EDAM)'
-+ 'RNA features report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2885
-Label: Nucleic acid features report (polymorphism)
-- 'Nucleic acid features report (polymorphism)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid features report (polymorphism)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2888
-Label: Protein sequence record (full)
-- 'Protein sequence record (full)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein sequence record (full)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2889
-Label: Nucleic acid sequence record (full)
-- 'Nucleic acid sequence record (full) ' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid sequence record (full) ' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1880
-Label: Misnomer
-- 'Misnomer' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Misnomer' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0401
-Label: Protein hydropathy calculation (from sequence)
-- 'Protein hydropathy calculation (from sequence)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein hydropathy calculation (from sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1884
-Label: UniProt keywords
-- 'UniProt keywords' SubClassOf 'Obsolete concept (EDAM)'
-+ 'UniProt keywords' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1888
-Label: Gene ID (MIPS Medicago)
-- 'Gene ID (MIPS Medicago)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (MIPS Medicago)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1889
-Label: Gene name (DragonDB)
-- 'Gene name (DragonDB)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (DragonDB)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1887
-Label: Gene ID (MIPS Maize)
-- 'Gene ID (MIPS Maize)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (MIPS Maize)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2874
-Label: Sequence set (polymorphic)
-- 'Sequence set (polymorphic)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence set (polymorphic)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2875
-Label: DRCAT resource
-- 'DRCAT resource' SubClassOf 'Obsolete concept (EDAM)'
-+ 'DRCAT resource' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2754
-Label: Introns
-- 'Introns' SubClassOf 'Gene transcript features'
-+ 'Introns' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2869
-Label: Nucleic acid features report (RFLP)
-- 'Nucleic acid features report (RFLP)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid features report (RFLP)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2866
-Label: Northern blot report
-- 'Northern blot report' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Northern blot report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2867
-Label: Nucleic acid features report (VNTR)
-- 'Nucleic acid features report (VNTR)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid features report (VNTR)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2868
-Label: Nucleic acid features report (microsatellite)
-- 'Nucleic acid features report (microsatellite)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid features report (microsatellite)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0425
-Label: Whole gene prediction
-- 'Whole gene prediction' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Whole gene prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0424
-Label: Epitope mapping (MHC Class II)
-- 'Epitope mapping (MHC Class II)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Epitope mapping (MHC Class II)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0426
-Label: Gene component prediction
-- 'Gene component prediction' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene component prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1099
-Label: UniProt accession (extended)
-- 'UniProt accession (extended)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'UniProt accession (extended)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0423
-Label: Epitope mapping (MHC Class I)
-- 'Epitope mapping (MHC Class I)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Epitope mapping (MHC Class I)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_3346
-Label: Sequence search
-- 'Sequence search' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence search' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1094
-Label: Sequence type
-- 'Sequence type' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2857
-Label: Article metadata
-- 'Article metadata' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Article metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0419
-Label: Protein binding site prediction (from sequence)
-- 'Protein binding site prediction (from sequence)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein binding site prediction (from sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1853
-Label: Chromosome annotation (aberration)
-- 'Chromosome annotation (aberration)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Chromosome annotation (aberration)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1065
-Label: Sequence signature identifier
-- 'Sequence signature identifier' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence signature identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1067
-Label: Phylogenetic distance matrix identifier
-- 'Phylogenetic distance matrix identifier' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Phylogenetic distance matrix identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1852
-Label: Sequence assembly component
-- 'Sequence assembly component' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence assembly component' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1062
-Label: Database entry identifier
-- 'Database entry identifier' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Database entry identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2832
-Label: Web portal
-- 'Web portal' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Web portal' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2838
-Label: Experimental data (proteomics)
-- 'Experimental data (proteomics)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Experimental data (proteomics)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2831
-Label: Databank
-- 'Databank' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Databank' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_3323
-Label: Metabolic disease
-- 'Metabolic disease' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Metabolic disease' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_3321
-Label: Molecular genetics
-+ 'Molecular genetics' SubClassOf 'Computational biology'
-
-Class: http://edamontology.org/topic_3320
-Label: RNA splicing
-- 'RNA splicing' SubClassOf 'Molecular genetics'
-+ 'RNA splicing' SubClassOf 'Gene expression'
-
-Class: http://edamontology.org/data_1879
-Label: Acronym
-- 'Acronym' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Acronym' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1878
-Label: Misspelling
-- 'Misspelling' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Misspelling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1877
-Label: Synonym
-- 'Synonym' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Synonym' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1866
-Label: Map type
-- 'Map type' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Map type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1865
-Label: Map feature
-- 'Map feature' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Map feature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1057
-Label: Sequence database name
-- 'Sequence database name' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence database name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1806
-Label: Gene synonym
-- 'Gene synonym' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene synonym' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1024
-Label: Codon name
-- 'Codon name' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Codon name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1028
-Label: Gene identifier (NCBI RefSeq)
-- 'Gene identifier (NCBI RefSeq)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene identifier (NCBI RefSeq)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1029
-Label: Gene identifier (NCBI UniGene)
-- 'Gene identifier (NCBI UniGene)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene identifier (NCBI UniGene)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1030
-Label: Gene identifier (Entrez)
-- 'Gene identifier (Entrez)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene identifier (Entrez)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1800
-Label: Gene ID (GeneDB Schizosaccharomyces pombe)
-- 'Gene ID (GeneDB Schizosaccharomyces pombe)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (GeneDB Schizosaccharomyces pombe)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1801
-Label: Gene ID (GeneDB Trypanosoma brucei)
-- 'Gene ID (GeneDB Trypanosoma brucei)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (GeneDB Trypanosoma brucei)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_3026
-Label: GO concept name (biological process)
-- 'GO concept name (biological process)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'GO concept name (biological process)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_3027
-Label: GO concept name (molecular function)
-- 'GO concept name (molecular function)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'GO concept name (molecular function)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1014
-Label: Sequence position specification
-- 'Sequence position specification' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence position specification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_3031
-Label: Core data
-- 'Core data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Core data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1018
-Label: Nucleic acid feature identifier
-- 'Nucleic acid feature identifier' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid feature identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1019
-Label: Protein feature identifier
-- 'Protein feature identifier' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein feature identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0486
-Label: Functional mapping
-- 'Functional mapping' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Functional mapping' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0494
-Label: Pairwise sequence alignment generation (global)
-- 'Pairwise sequence alignment generation (global)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Pairwise sequence alignment generation (global)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0493
-Label: Pairwise sequence alignment generation (local)
-- 'Pairwise sequence alignment generation (local)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Pairwise sequence alignment generation (local)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_0594
-Label: Sequence classification
-- 'Sequence classification' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_0595
-Label: Protein classification
-- 'Protein classification' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_0598
-Label: Sequence motif or profile
-- 'Sequence motif or profile' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Sequence motif or profile' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_0452
-Label: Indel detection
-+ 'Indel detection' SubClassOf 'Mutation detection'
-
-Class: http://edamontology.org/operation_0453
-Label: Nucleosome formation potential prediction
-- 'Nucleosome formation potential prediction' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleosome formation potential prediction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_1913
-Label: Residue validation
-- 'Residue validation' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Residue validation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/operation_1914
-Label: Structure retrieval (water)
-- 'Structure retrieval (water)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structure retrieval (water)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1729
-Label: GO (cellular component)
-- 'GO (cellular component)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'GO (cellular component)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1730
-Label: Ontology relation type
-- 'Ontology relation type' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Ontology relation type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1733
-Label: Ontology concept reference
-- 'Ontology concept reference' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Ontology concept reference' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1732
-Label: Ontology concept comment
-- 'Ontology concept comment' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Ontology concept comment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1738
-Label: doc2loc document information
-- 'doc2loc document information' SubClassOf 'Obsolete concept (EDAM)'
-+ 'doc2loc document information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1718
-Label: HGNC
-- 'HGNC' SubClassOf 'Obsolete concept (EDAM)'
-+ 'HGNC' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1719
-Label: NCBI taxonomy vocabulary
-- 'NCBI taxonomy vocabulary' SubClassOf 'Obsolete concept (EDAM)'
-+ 'NCBI taxonomy vocabulary' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1720
-Label: Plant ontology term
-- 'Plant ontology term' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Plant ontology term' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1724
-Label: ChEBI
-- 'ChEBI' SubClassOf 'Obsolete concept (EDAM)'
-+ 'ChEBI' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1723
-Label: EMAP
-- 'EMAP' SubClassOf 'Obsolete concept (EDAM)'
-+ 'EMAP' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1722
-Label: FMA
-- 'FMA' SubClassOf 'Obsolete concept (EDAM)'
-+ 'FMA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1721
-Label: UMLS
-- 'UMLS' SubClassOf 'Obsolete concept (EDAM)'
-+ 'UMLS' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1728
-Label: GO (molecular function)
-- 'GO (molecular function)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'GO (molecular function)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1727
-Label: GO (biological process)
-- 'GO (biological process)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'GO (biological process)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1726
-Label: myGrid
-- 'myGrid' SubClassOf 'Obsolete concept (EDAM)'
-+ 'myGrid' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1725
-Label: MGED
-- 'MGED' SubClassOf 'Obsolete concept (EDAM)'
-+ 'MGED' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1798
-Label: Gene ID (GeneDB Leishmania major)
-- 'Gene ID (GeneDB Leishmania major)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (GeneDB Leishmania major)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1799
-Label: Gene ID (GeneDB Plasmodium falciparum)
-- 'Gene ID (GeneDB Plasmodium falciparum)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (GeneDB Plasmodium falciparum)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1797
-Label: Gene ID (GeneDB Glossina morsitans)
-- 'Gene ID (GeneDB Glossina morsitans)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene ID (GeneDB Glossina morsitans)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1790
-Label: Gene name (CGSC)
-- 'Gene name (CGSC)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (CGSC)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1791
-Label: Gene name (HGNC)
-- 'Gene name (HGNC)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (HGNC)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1792
-Label: Gene name (MGD)
-- 'Gene name (MGD)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (MGD)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1793
-Label: Gene name (Bacillus subtilis)
-- 'Gene name (Bacillus subtilis)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (Bacillus subtilis)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2867
-Label: VNTR
-- 'VNTR' SubClassOf 'DNA polymorphism'
-+ 'VNTR' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2868
-Label: Microsatellites
-- 'Microsatellites' SubClassOf 'DNA polymorphism'
-+ 'Microsatellites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2869
-Label: RFLP
-- 'RFLP' SubClassOf 'DNA polymorphism'
-+ 'RFLP' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1789
-Label: Gene name (TGD)
-- 'Gene name (TGD)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (TGD)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1787
-Label: Gene name (MaizeGDB)
-- 'Gene name (MaizeGDB)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (MaizeGDB)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1788
-Label: Gene name (SGD)
-- 'Gene name (SGD)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (SGD)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1785
-Label: Gene name (dictyBase)
-- 'Gene name (dictyBase)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (dictyBase)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1786
-Label: Gene name (EcoGene primary)
-- 'Gene name (EcoGene primary)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (EcoGene primary)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1783
-Label: Gene name (ASPGD)
-- 'Gene name (ASPGD)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (ASPGD)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1784
-Label: Gene name (CGD)
-- 'Gene name (CGD)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Gene name (CGD)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2524
-Label: Protein data
-- 'Protein data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2525
-Label: Nucleic acid data
-- 'Nucleic acid data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Nucleic acid data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2528
-Label: Molecular data
-- 'Molecular data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Molecular data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2529
-Label: Molecule report
-- 'Molecule report' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Molecule report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_2522
-Label: Map data
-- 'Map data' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Map data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1776
-Label: Protein report (function)
-- 'Protein report (function)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Protein report (function)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1767
-Label: CATH domain sequences (COMBS)
-- 'CATH domain sequences (COMBS)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'CATH domain sequences (COMBS)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1765
-Label: CATH representative domain sequences (COMBS)
-- 'CATH representative domain sequences (COMBS)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'CATH representative domain sequences (COMBS)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2842
-Label: High-throughput sequencing
-- 'High-throughput sequencing' SubClassOf 'Obsolete concept (EDAM)'
-+ 'High-throughput sequencing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1766
-Label: CATH domain sequences (ATOM)
-- 'CATH domain sequences (ATOM)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'CATH domain sequences (ATOM)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2844
-Label: Structural clustering
-- 'Structural clustering' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Structural clustering' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2846
-Label: Gene regulatory networks
-- 'Gene regulatory networks' SubClassOf 'Molecular interactions, pathways and networks'
-- 'Gene regulatory networks' SubClassOf 'Gene regulation'
-+ 'Gene regulatory networks' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2847
-Label: Disease (specific)
-- 'Disease (specific)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Disease (specific)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1764
-Label: CATH representative domain sequences (ATOM)
-- 'CATH representative domain sequences (ATOM)' SubClassOf 'Obsolete concept (EDAM)'
-+ 'CATH representative domain sequences (ATOM)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/data_1762
-Label: CATH domain report
-- 'CATH domain report' SubClassOf 'Obsolete concept (EDAM)'
-+ 'CATH domain report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2839
-Label: Molecules
-- 'Molecules' SubClassOf 'Obsolete concept (EDAM)'
-+ 'Molecules' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass
-
-Class: http://edamontology.org/topic_2829
-Label: Ontologies, nomenclature and classification
-- 'Ontologies, nomenclature and classification' SubClassOf 'Obsolete concept
(EDAM)' -+ 'Ontologies, nomenclature and classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2826 -Label: Protein structure alignment -- 'Protein structure alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein structure alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2816 -Label: Gene resources -- 'Gene resources' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene resources' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2817 -Label: Yeast -- 'Yeast' SubClassOf 'Obsolete concept (EDAM)' -+ 'Yeast' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3038 -Label: Structure databases -- 'Structure databases' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure databases' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3039 -Label: Nucleic acid structure -- 'Nucleic acid structure' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2811 -Label: Nomenclature -- 'Nomenclature' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nomenclature' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2814 -Label: Protein structure analysis -+ 'Protein structure analysis' SubClassOf 'Proteins' - -Class: http://edamontology.org/topic_2813 -Label: Disease genes and proteins -- 'Disease genes and proteins' SubClassOf 'Obsolete concept (EDAM)' -+ 'Disease genes and proteins' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3032 -Label: Primer or probe design -- 'Primer or probe design' SubClassOf 'Obsolete concept (EDAM)' -+ 'Primer or probe design' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/topic_2807 -Label: Tool topic -- 'Tool topic' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tool topic' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2809 -Label: Study topic -- 'Study topic' SubClassOf 'Obsolete concept (EDAM)' -+ 'Study topic' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3042 -Label: Nucleic acid sequences -- 'Nucleic acid sequences' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sequences' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3041 -Label: Sequence databases -- 'Sequence databases' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence databases' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3044 -Label: Protein interaction networks -- 'Protein interaction networks' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein interaction networks' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3043 -Label: Protein sequences -- 'Protein sequences' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequences' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3048 -Label: Mammals -- 'Mammals' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mammals' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1716 -Label: GO -- 'GO' SubClassOf 'Obsolete concept (EDAM)' -+ 'GO' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1717 -Label: MeSH -- 'MeSH' SubClassOf 'Obsolete concept (EDAM)' -+ 'MeSH' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1715 -Label: BioPax term -- 'BioPax term' SubClassOf 'Obsolete concept (EDAM)' -+ 'BioPax term' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/topic_3052 -Label: Sequence clusters and classification -- 'Sequence clusters and classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence clusters and classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3060 -Label: Regulatory RNA -- 'Regulatory RNA' SubClassOf 'Obsolete concept (EDAM)' -+ 'Regulatory RNA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3061 -Label: Documentation and help -- 'Documentation and help' SubClassOf 'Literature and reference' -+ 'Documentation and help' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3062 -Label: Genetic organisation -- 'Genetic organisation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genetic organisation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3072 -Label: Sequence feature detection -- 'Sequence feature detection' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence feature detection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3073 -Label: Nucleic acid feature detection -- 'Nucleic acid feature detection' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid feature detection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3074 -Label: Protein feature detection -- 'Protein feature detection' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein feature detection' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3075 -Label: Biological system modelling -- 'Biological system modelling' SubClassOf 'Obsolete concept (EDAM)' -+ 'Biological system modelling' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3078 -Label: Genes and proteins resources -- 'Genes and proteins resources' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Genes and proteins resources' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_1770 -Label: Structure comparison -- 'Structure comparison' SubClassOf 'Structure analysis' -+ 'Structure comparison' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0644 -Label: Proteome -- 'Proteome' SubClassOf 'Obsolete concept (EDAM)' -+ 'Proteome' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0642 -Label: Low complexity sequences -- 'Low complexity sequences' SubClassOf 'Obsolete concept (EDAM)' -+ 'Low complexity sequences' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0641 -Label: Repeat sequences -- 'Repeat sequences' SubClassOf 'Obsolete concept (EDAM)' -+ 'Repeat sequences' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0640 -Label: Nucleic acid sequence analysis -- 'Nucleic acid sequence analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sequence analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0655 -Label: Coding RNA -- 'Coding RNA' SubClassOf 'Gene transcript features' -+ 'Coding RNA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0629 -Label: Gene expression and microarray -- 'Gene expression and microarray' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene expression and microarray' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0620 -Label: Drugs and target structures -- 'Drugs and target structures' SubClassOf 'Drug discovery' -- 'Drugs and target structures' SubClassOf 'Small molecules' -+ 'Drugs and target structures' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0624 -Label: 
Chromosomes -- 'Chromosomes' SubClassOf 'DNA' -+ 'Chromosomes' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0623 -Label: Gene families -- 'Gene families' SubClassOf 'Genetics' -+ 'Gene families' SubClassOf 'Molecular genetics' - -Class: http://edamontology.org/topic_0639 -Label: Protein sequence analysis -- 'Protein sequence analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0635 -Label: Specific protein resources -- 'Specific protein resources' SubClassOf 'Obsolete concept (EDAM)' -+ 'Specific protein resources' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0602 -Label: Molecular interactions, pathways and networks -- 'Molecular interactions, pathways and networks' SubClassOf 'Biology' -+ 'Molecular interactions, pathways and networks' SubClassOf 'Computational biology' - -Class: http://edamontology.org/topic_0608 -Label: Cell and tissue culture -- 'Cell and tissue culture' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cell and tissue culture' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0606 -Label: Literature data resources -- 'Literature data resources' SubClassOf 'Obsolete concept (EDAM)' -+ 'Literature data resources' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0612 -Label: Cell cycle -- 'Cell cycle' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cell cycle' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0613 -Label: Peptides and amino acids -- 'Peptides and amino acids' SubClassOf 'Small molecules' -+ 'Peptides and amino acids' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0616 -Label: Organelles -- 'Organelles' SubClassOf 'Obsolete concept 
(EDAM)' -+ 'Organelles' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0617 -Label: Ribosomes -- 'Ribosomes' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ribosomes' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0618 -Label: Scents -- 'Scents' SubClassOf 'Obsolete concept (EDAM)' -+ 'Scents' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2533 -Label: Nucleic acid features report (mutation) -- 'Nucleic acid features report (mutation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid features report (mutation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2539 -Label: Alignment data -- 'Alignment data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Alignment data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2540 -Label: Data index data -- 'Data index data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data index data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2579 -Label: Expressed gene list -- 'Expressed gene list' SubClassOf 'Obsolete concept (EDAM)' -+ 'Expressed gene list' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2581 -Label: GO concept name -- 'GO concept name' SubClassOf 'Obsolete concept (EDAM)' -+ 'GO concept name' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0697 -Label: RNA structure -- 'RNA structure' SubClassOf 'Obsolete concept (EDAM)' -+ 'RNA structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0698 -Label: Protein tertiary structure -- 'Protein tertiary structure' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein tertiary structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_2584 -Label: GO concept name (cellular component) -- 'GO concept name (cellular component)' SubClassOf 'Obsolete concept (EDAM)' -+ 'GO concept name (cellular component)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0694 -Label: Protein secondary structure -- 'Protein secondary structure' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein secondary structure' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2590 -Label: Hierarchy identifier -- 'Hierarchy identifier' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hierarchy identifier' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0663 -Label: tRNA -- 'tRNA' SubClassOf 'Obsolete concept (EDAM)' -+ 'tRNA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2592 -Label: Cancer type -- 'Cancer type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cancer type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2598 -Label: Secondary structure alignment metadata -- 'Secondary structure alignment metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Secondary structure alignment metadata' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2599 -Label: Molecule interaction report -- 'Molecule interaction report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecule interaction report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0660 -Label: rRNA -- 'rRNA' SubClassOf 'Obsolete concept (EDAM)' -+ 'rRNA' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2602 -Label: Genotype and phenotype data -- 'Genotype and phenotype data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genotype and phenotype data' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2601 -Label: Small molecule data -- 'Small molecule data' SubClassOf 'Obsolete concept (EDAM)' -+ 'Small molecule data' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1601 -Label: Nc statistic -- 'Nc statistic' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nc statistic' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1634 -Label: Linkage disequilibrium (report) -- 'Linkage disequilibrium (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Linkage disequilibrium (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1656 -Label: Metabolic pathway report -- 'Metabolic pathway report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Metabolic pathway report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1657 -Label: Genetic information processing pathway report -- 'Genetic information processing pathway report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genetic information processing pathway report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3118 -Label: Protein topological domains -- 'Protein topological domains' SubClassOf 'Protein domains and folds' -- 'Protein topological domains' SubClassOf 'Protein domains' -+ 'Protein topological domains' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1658 -Label: Environmental information processing pathway report -- 'Environmental information processing pathway report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Environmental information processing pathway report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1659 -Label: Signal transduction pathway report -- 'Signal transduction pathway report' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Signal transduction pathway report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3120 -Label: Protein variants -- 'Protein variants' SubClassOf 'Protein structure analysis' -+ 'Protein variants' SubClassOf 'Protein expression' - -Class: http://edamontology.org/topic_3123 -Label: Expression signals -- 'Expression signals' SubClassOf 'Obsolete concept (EDAM)' -+ 'Expression signals' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1642 -Label: Affymetrix probe sets library file -- 'Affymetrix probe sets library file' SubClassOf 'Obsolete concept (EDAM)' -+ 'Affymetrix probe sets library file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1643 -Label: Affymetrix probe sets information library file -- 'Affymetrix probe sets information library file' SubClassOf 'Obsolete concept (EDAM)' -+ 'Affymetrix probe sets information library file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1646 -Label: Molecular weights standard fingerprint -- 'Molecular weights standard fingerprint' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular weights standard fingerprint' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3127 -Label: DNA replication and recombination -- 'DNA replication and recombination' SubClassOf 'DNA binding sites' -+ 'DNA replication and recombination' SubClassOf 'DNA' - -Class: http://edamontology.org/topic_3126 -Label: Nucleic acid repeats -- 'Nucleic acid repeats' SubClassOf 'Nucleic acid sites, features and motifs' -+ 'Nucleic acid repeats' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2627 -Label: Molecular interaction ID -- 'Molecular interaction ID' SubClassOf 'Obsolete concept (EDAM)' -+ 'Molecular interaction ID' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1673 -Label: Swiss-Prot to PDB mapping -- 'Swiss-Prot to PDB mapping' SubClassOf 'Obsolete concept (EDAM)' -+ 'Swiss-Prot to PDB mapping' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1674 -Label: Sequence database cross-references -- 'Sequence database cross-references' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence database cross-references' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1675 -Label: Job status -- 'Job status' SubClassOf 'Obsolete concept (EDAM)' -+ 'Job status' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1676 -Label: Job ID -- 'Job ID' SubClassOf 'Obsolete concept (EDAM)' -+ 'Job ID' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1670 -Label: Database version information -- 'Database version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database version information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1671 -Label: Tool version information -- 'Tool version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tool version information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1672 -Label: CATH version information -- 'CATH version information' SubClassOf 'Obsolete concept (EDAM)' -+ 'CATH version information' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1677 -Label: Job type -- 'Job type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Job type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1678 -Label: Tool log -- 'Tool log' SubClassOf 'Obsolete concept (EDAM)' -+ 'Tool log' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1679 -Label: DaliLite log file -- 'DaliLite log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'DaliLite log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1664 -Label: MIRIAM datatype -- 'MIRIAM datatype' SubClassOf 'Obsolete concept (EDAM)' -+ 'MIRIAM datatype' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1663 -Label: Protein interaction networks -- 'Protein interaction networks' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein interaction networks' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1660 -Label: Cellular process pathways report -- 'Cellular process pathways report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Cellular process pathways report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1661 -Label: Disease pathway or network report -- 'Disease pathway or network report' SubClassOf 'Obsolete concept (EDAM)' -+ 'Disease pathway or network report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3139 -Label: Sequence tagged sites -- 'Sequence tagged sites' SubClassOf 'Nucleic acid sites, features and motifs' -+ 'Sequence tagged sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_2953 -Label: Nucleic acid design -- 'Nucleic acid design' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid design' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0581 -Label: Database -- 'Database' SubClassOf 'Obsolete concept (EDAM)' -+ 'Database' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_0583 -Label: Directory metadata -- 'Directory metadata' SubClassOf 'Obsolete concept (EDAM)' -+ 'Directory metadata' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3135 -Label: Signal or transit peptide -- 'Signal or transit peptide' SubClassOf 'Gene transcript features' -+ 'Signal or transit peptide' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3231 -Label: GWAS report -- 'GWAS report' SubClassOf 'Obsolete concept (EDAM)' -+ 'GWAS report' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_3268 -Label: Sequence feature type -- 'Sequence feature type' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence feature type' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3179 -Label: ChIP-on-chip -- 'ChIP-on-chip' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_3269 -Label: Gene homology (report) -- 'Gene homology (report)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Gene homology (report)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3177 -Label: DNA-Seq -- 'DNA-Seq' SubClassOf 'Obsolete concept (EDAM)' -+ 'DNA-Seq' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3178 -Label: RNA-Seq alignment -- 'RNA-Seq alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'RNA-Seq alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_3175 -Label: DNA structural variation -- 'DNA structural variation' SubClassOf 'Chromosomes' -+ 'DNA structural variation' SubClassOf 'DNA' - -Class: http://edamontology.org/topic_3171 -Label: DNA methylation -- 'DNA methylation' SubClassOf 'Obsolete concept (EDAM)' -+ 'DNA methylation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0741 -Label: Protein sequence alignment -- 'Protein sequence alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence alignment' 
SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0740 -Label: Nucleic acid sequence alignment -- 'Nucleic acid sequence alignment' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sequence alignment' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0280 -Label: Data retrieval (restriction enzyme annotation) -- 'Data retrieval (restriction enzyme annotation)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (restriction enzyme annotation)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0281 -Label: Genetic marker identification -- 'Genetic marker identification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genetic marker identification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0749 -Label: Transcription factors and regulatory sites -- 'Transcription factors and regulatory sites' SubClassOf 'Gene transcription features' -+ 'Transcription factors and regulatory sites' SubClassOf 'Gene expression' - -Class: http://edamontology.org/topic_0748 -Label: Protein sites and features -- 'Protein sites and features' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sites and features' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0747 -Label: Nucleic acid sites and features -- 'Nucleic acid sites and features' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid sites and features' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2034 -Label: Biological model format -- 'Biological model format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Biological model format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0751 -Label: Phosphorylation sites -- 'Phosphorylation sites' SubClassOf 'Obsolete concept (EDAM)' -+ 
'Phosphorylation sites' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0753 -Label: Metabolic pathways -- 'Metabolic pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Metabolic pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0293 -Label: Hybrid sequence alignment construction -- 'Hybrid sequence alignment construction' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hybrid sequence alignment construction' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0754 -Label: Signaling pathways -- 'Signaling pathways' SubClassOf 'Molecular interactions, pathways and networks' -+ 'Signaling pathways' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2045 -Label: Electron microscopy model format -- 'Electron microscopy model format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Electron microscopy model format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0767 -Label: Protein and peptide identification -- 'Protein and peptide identification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein and peptide identification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2051 -Label: Polymorphism report format -- 'Polymorphism report format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Polymorphism report format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2059 -Label: Genotype and phenotype annotation format -- 'Genotype and phenotype annotation format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genotype and phenotype annotation format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0770 -Label: Data types and objects -- 'Data types and objects' SubClassOf 'Obsolete 
concept (EDAM)' -+ 'Data types and objects' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0771 -Label: Theoretical biology -- 'Theoretical biology' SubClassOf 'Obsolete concept (EDAM)' -+ 'Theoretical biology' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0779 -Label: Mitochondria -- 'Mitochondria' SubClassOf 'Obsolete concept (EDAM)' -+ 'Mitochondria' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2063 -Label: Protein report (enzyme) format -- 'Protein report (enzyme) format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein report (enzyme) format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0722 -Label: Nucleic acid classification -- 'Nucleic acid classification' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid classification' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1688 -Label: EMBOSS vectorstrip log file -- 'EMBOSS vectorstrip log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS vectorstrip log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1685 -Label: EMBOSS supermatcher error file -- 'EMBOSS supermatcher error file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS supermatcher error file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2512 -Label: Sequence editing (protein) -- 'Sequence editing (protein)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence editing (protein)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1684 -Label: EMBOSS sites log file -- 'EMBOSS sites log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS sites log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_1687 -Label: EMBOSS whichdb log file -- 'EMBOSS whichdb log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS whichdb log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1686 -Label: EMBOSS megamerger log file -- 'EMBOSS megamerger log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS megamerger log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2511 -Label: Sequence editing (nucleic acid) -- 'Sequence editing (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence editing (nucleic acid)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1681 -Label: NACCESS log file -- 'NACCESS log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'NACCESS log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2516 -Label: Protein sequence visualisation -- 'Protein sequence visualisation' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein sequence visualisation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1680 -Label: STRIDE log file -- 'STRIDE log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'STRIDE log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1683 -Label: EMBOSS domainatrix log file -- 'EMBOSS domainatrix log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS domainatrix log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1682 -Label: EMBOSS wordfinder log file -- 'EMBOSS wordfinder log file' SubClassOf 'Obsolete concept (EDAM)' -+ 'EMBOSS wordfinder log file' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2515 -Label: Nucleic acid sequence visualisation -- 'Nucleic acid sequence visualisation' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Nucleic acid sequence visualisation' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2679 -Label: Ensembl ID ('Echinops telfairi') -- 'Ensembl ID ('Echinops telfairi')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Echinops telfairi')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2678 -Label: Ensembl ID ('Dasypus novemcinctus') -- 'Ensembl ID ('Dasypus novemcinctus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Dasypus novemcinctus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2675 -Label: Ensembl ID ('Ciona intestinalis') -- 'Ensembl ID ('Ciona intestinalis')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Ciona intestinalis')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2674 -Label: Ensembl ID ('Cavia porcellus') -- 'Ensembl ID ('Cavia porcellus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Cavia porcellus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2677 -Label: Ensembl ID ('Danio rerio') -- 'Ensembl ID ('Danio rerio')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Danio rerio')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2676 -Label: Ensembl ID ('Ciona savignyi') -- 'Ensembl ID ('Ciona savignyi')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Ciona savignyi')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2671 -Label: Ensembl ID (Homo sapiens) -- 'Ensembl ID (Homo sapiens)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID (Homo sapiens)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2673 -Label: Ensembl ID ('Canis familiaris') -- 'Ensembl ID ('Canis familiaris')' SubClassOf 
'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Canis familiaris')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2672 -Label: Ensembl ID ('Bos taurus') -- 'Ensembl ID ('Bos taurus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Bos taurus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2680 -Label: Ensembl ID ('Erinaceus europaeus') -- 'Ensembl ID ('Erinaceus europaeus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Erinaceus europaeus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2500 -Label: Microarray raw data analysis -- 'Microarray raw data analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Microarray raw data analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1695 -Label: Hit sort order -- 'Hit sort order' SubClassOf 'Obsolete concept (EDAM)' -+ 'Hit sort order' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1694 -Label: Number of output entities -- 'Number of output entities' SubClassOf 'Obsolete concept (EDAM)' -+ 'Number of output entities' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2503 -Label: Sequence data processing -- 'Sequence data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_1693 -Label: Number of iterations -- 'Number of iterations' SubClassOf 'Obsolete concept (EDAM)' -+ 'Number of iterations' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2504 -Label: Structural data processing -- 'Structural data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structural data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - 
-Class: http://edamontology.org/operation_2505 -Label: Text processing -- 'Text processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Text processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2689 -Label: Ensembl ID ('Myotis lucifugus') -- 'Ensembl ID ('Myotis lucifugus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Myotis lucifugus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2688 -Label: Ensembl ID ('Mus musculus') -- 'Ensembl ID ('Mus musculus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Mus musculus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2687 -Label: Ensembl ID ('Monodelphis domestica') -- 'Ensembl ID ('Monodelphis domestica')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Monodelphis domestica')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2686 -Label: Ensembl ID ('Macaca mulatta') -- 'Ensembl ID ('Macaca mulatta')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Macaca mulatta')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2685 -Label: Ensembl ID ('Loxodonta africana') -- 'Ensembl ID ('Loxodonta africana')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Loxodonta africana')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2684 -Label: Ensembl ID ('Homo sapiens') -- 'Ensembl ID ('Homo sapiens')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Homo sapiens')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2683 -Label: Ensembl ID ('Gasterosteus aculeatus') -- 'Ensembl ID ('Gasterosteus aculeatus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Gasterosteus aculeatus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: 
http://edamontology.org/data_2682 -Label: Ensembl ID ('Gallus gallus') -- 'Ensembl ID ('Gallus gallus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Gallus gallus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2681 -Label: Ensembl ID ('Felis catus') -- 'Ensembl ID ('Felis catus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Felis catus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0225 -Label: Data retrieval (database cross-reference) -- 'Data retrieval (database cross-reference)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (database cross-reference)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0228 -Label: Data index analysis -- 'Data index analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data index analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0229 -Label: Annotation retrieval (sequence) -- 'Annotation retrieval (sequence)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Annotation retrieval (sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_2521 -Label: Map data processing -- 'Map data processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Map data processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0238 -Label: Sequence motif discovery -+ 'Sequence motif discovery' SubClassOf 'Sequence motif processing' - -Class: http://edamontology.org/operation_0239 -Label: Sequence motif recognition -+ 'Sequence motif recognition' SubClassOf 'Sequence motif processing' - -Class: http://edamontology.org/operation_2519 -Label: Structure processing (nucleic acid) -- 'Structure processing (nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Structure processing (nucleic acid)' SubClassOf 
http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0241 -Label: Transcription regulatory sequence analysis -- 'Transcription regulatory sequence analysis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Transcription regulatory sequence analysis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0240 -Label: Sequence motif comparison -+ 'Sequence motif comparison' SubClassOf 'Sequence motif processing' - -Class: http://edamontology.org/topic_0786 -Label: Arabidopsis -- 'Arabidopsis' SubClassOf 'Obsolete concept (EDAM)' -+ 'Arabidopsis' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0787 -Label: Rice -- 'Rice' SubClassOf 'Obsolete concept (EDAM)' -+ 'Rice' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0255 -Label: Feature table query -- 'Feature table query' SubClassOf 'Obsolete concept (EDAM)' -+ 'Feature table query' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0257 -Label: Data retrieval (sequence alignment) -- 'Data retrieval (sequence alignment)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (sequence alignment)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0796 -Label: Genetic mapping and linkage -- 'Genetic mapping and linkage' SubClassOf 'Obsolete concept (EDAM)' -+ 'Genetic mapping and linkage' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0254 -Label: Data retrieval (feature table) -- 'Data retrieval (feature table)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Data retrieval (feature table)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/topic_0798 -Label: Mobile genetic elements -- 'Mobile genetic elements' SubClassOf 'Nucleic acid sites, features and motifs' 
-- 'Mobile genetic elements' SubClassOf 'Genetics' -+ 'Mobile genetic elements' SubClassOf 'Gene structure' - -Class: http://edamontology.org/operation_0261 -Label: Nucleic acid property processing -- 'Nucleic acid property processing' SubClassOf 'Obsolete concept (EDAM)' -+ 'Nucleic acid property processing' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2691 -Label: Ensembl ID ('Oryctolagus cuniculus') -- 'Ensembl ID ('Oryctolagus cuniculus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Oryctolagus cuniculus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2690 -Label: Ensembl ID ("Ornithorhynchus anatinus") -- 'Ensembl ID ("Ornithorhynchus anatinus")' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ("Ornithorhynchus anatinus")' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2692 -Label: Ensembl ID ('Oryzias latipes') -- 'Ensembl ID ('Oryzias latipes')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Oryzias latipes')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2693 -Label: Ensembl ID ('Otolemur garnettii') -- 'Ensembl ID ('Otolemur garnettii')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Otolemur garnettii')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2694 -Label: Ensembl ID ('Pan troglodytes') -- 'Ensembl ID ('Pan troglodytes')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Pan troglodytes')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2695 -Label: Ensembl ID ('Rattus norvegicus') -- 'Ensembl ID ('Rattus norvegicus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Rattus norvegicus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2696 -Label: Ensembl ID ('Spermophilus 
tridecemlineatus') -- 'Ensembl ID ('Spermophilus tridecemlineatus')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Spermophilus tridecemlineatus')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2697 -Label: Ensembl ID ('Takifugu rubripes') -- 'Ensembl ID ('Takifugu rubripes')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Takifugu rubripes')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2698 -Label: Ensembl ID ('Tupaia belangeri') -- 'Ensembl ID ('Tupaia belangeri')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Tupaia belangeri')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/data_2699 -Label: Ensembl ID ('Xenopus tropicalis') -- 'Ensembl ID ('Xenopus tropicalis')' SubClassOf 'Obsolete concept (EDAM)' -+ 'Ensembl ID ('Xenopus tropicalis')' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0274 -Label: Protein-protein interaction prediction (from protein sequence) -- 'Protein-protein interaction prediction (from protein sequence)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-protein interaction prediction (from protein sequence)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/operation_0275 -Label: Protein-protein interaction prediction (from protein structure) -- 'Protein-protein interaction prediction (from protein structure)' SubClassOf 'Obsolete concept (EDAM)' -+ 'Protein-protein interaction prediction (from protein structure)' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -Class: http://edamontology.org/format_2015 -Label: Sequence-profile alignment (HMM) format -- 'Sequence-profile alignment (HMM) format' SubClassOf 'Obsolete concept (EDAM)' -+ 'Sequence-profile alignment (HMM) format' SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -## New classes: - -Class: 
http://edamontology.org/topic_3697
-Label: Microbial ecology
-+ 'Microbial ecology' SubClassOf 'Ecology'
-+ 'Microbial ecology' SubClassOf 'Microbiology'
-
-Class: http://edamontology.org/format_3688
-Label: SBtab
-+ 'SBtab' SubClassOf 'Textual format'
-+ 'SBtab' SubClassOf 'Biological pathway or network format'
-
-Class: http://edamontology.org/format_3687
-Label: ISA-TAB
-+ 'ISA-TAB' SubClassOf 'Experiment annotation format'
-+ 'ISA-TAB' SubClassOf 'Textual format'
-+ 'ISA-TAB' SubClassOf 'Gene expression report format'
-
-Class: http://edamontology.org/format_3689
-Label: BCML
-+ 'BCML' SubClassOf 'Biological pathway or network format'
-+ 'BCML' SubClassOf 'XML'
-
-Class: http://edamontology.org/format_3684
-Label: PRIDE XML
-+ 'PRIDE XML' SubClassOf 'Experiment annotation format'
-+ 'PRIDE XML' SubClassOf 'Mass spectrometry data format'
-+ 'PRIDE XML' SubClassOf 'XML'
-
-Class: http://edamontology.org/format_3683
-Label: qcML
-+ 'qcML' SubClassOf 'Mass spectrometry data format'
-+ 'qcML' SubClassOf 'Experiment annotation format'
-+ 'qcML' SubClassOf 'XML'
-
-Class: http://edamontology.org/format_3686
-Label: COMBINE OMEX
-+ 'COMBINE OMEX' SubClassOf 'Biological pathway or network format'
-+ 'COMBINE OMEX' SubClassOf 'Binary format'
-+ 'COMBINE OMEX' SubClassOf 'Experiment annotation format'
-
-Class: http://edamontology.org/format_3685
-Label: SED-ML
-+ 'SED-ML' SubClassOf 'XML'
-+ 'SED-ML' SubClassOf 'Experiment annotation format'
-
-Class: http://edamontology.org/format_3682
-Label: imzML
-+ 'imzML' SubClassOf 'Mass spectrometry data format'
-+ 'imzML' SubClassOf 'Binary format'
-+ 'imzML' SubClassOf 'XML'
-
-Class: http://edamontology.org/format_3681
-Label: mzTab
-+ 'mzTab' SubClassOf 'Mass spectrometry data format'
-+ 'mzTab' SubClassOf 'Textual format'
-
-Class: http://edamontology.org/format_3699
-Label: VDB
-+ 'VDB' SubClassOf 'Binary format'
-
-Class: http://edamontology.org/format_3698
-Label: SRA format
-+ 'SRA format' SubClassOf 'Binary format'
-
-Class: http://edamontology.org/format_3693
-Label: AGP
-+ 'AGP' SubClassOf 'Textual format'
-+ 'AGP' SubClassOf 'Sequence assembly format'
-
-Class: http://edamontology.org/format_3692
-Label: SBGN-ML
-+ 'SBGN-ML' SubClassOf 'XML'
-+ 'SBGN-ML' SubClassOf 'Biological pathway or network format'
-
-Class: http://edamontology.org/format_3691
-Label: BEL
-+ 'BEL' SubClassOf 'Textual format'
-
-Class: http://edamontology.org/format_3690
-Label: BDML
-+ 'BDML' SubClassOf 'XML'
-
-Class: http://edamontology.org/format_3696
-Label: PS
-+ 'PS' SubClassOf 'Textual format'
-
-Class: http://edamontology.org/format_3700
-Label: Tabix index file format
-+ 'Tabix index file format' SubClassOf 'Data index format'
-+ 'Tabix index file format' SubClassOf 'is format of' some 'Data index'
-
-Class: http://edamontology.org/format_3701
-Label: sequin
-+ 'sequin' SubClassOf 'Sequence feature table format (text)'
-
-Class: http://edamontology.org/operation_3680
-Label: RNA-Seq analysis
-+ 'RNA-Seq analysis' SubClassOf 'Nucleic acid sequence analysis'
-
-Class: http://edamontology.org/operation_3694
-Label: Mass spectrum visualisation
-+ 'Mass spectrum visualisation' SubClassOf 'Visualisation'
-
-Class: http://edamontology.org/operation_3695
-Label: Filtering
-+ 'Filtering' SubClassOf 'File handling'
-
-# EDAM\_1.12.owl
-
-Class: http://edamontology.org/data_0872
-Label: Phylogenetic tree
-- 'Phylogenetic tree' SubClassOf 'Data'
-+ 'Phylogenetic tree' SubClassOf 'Phylogenetic data'
-
-Class: http://edamontology.org/data_1597
-Label: Codon usage table
-- 'Codon usage table' SubClassOf 'Data'
-+ 'Codon usage table' SubClassOf 'Codon usage data'
-
-Class: http://edamontology.org/operation_2990
-Label: Classification
-- 'Classification' SubClassOf 'Analysis'
-+ 'Classification' SubClassOf 'Operation'
-
-Class: http://edamontology.org/data_2337
-Label: Resource metadata
-- 'Resource metadata' SubClassOf 'Data'
-+ 'Resource metadata' SubClassOf 'Report'
-
-Class:
http://edamontology.org/operation_2928 -Label: Alignment -- 'Alignment' SubClassOf 'Analysis' -+ 'Alignment' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2932 -Label: Hopp and Woods plotting -- 'Hopp and Woods plotting' SubClassOf 'Peptide immunogenicity prediction' -- 'Hopp and Woods plotting' SubClassOf 'Plotting' -+ 'Hopp and Woods plotting' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/format_3604 -Label: svg -- 'svg' SubClassOf 'Binary format' -+ 'svg' SubClassOf 'XML' - -Class: http://edamontology.org/operation_3501 -Label: Enrichment -- 'Enrichment' SubClassOf 'Analysis' -+ 'Enrichment' SubClassOf 'Operation' - -Class: http://edamontology.org/data_1549 -Label: Protein hydrogen bonds -- 'Protein hydrogen bonds' SubClassOf 'Protein residue interactions' -+ 'Protein hydrogen bonds' SubClassOf 'Protein interaction report' - -Class: http://edamontology.org/data_1546 -Label: Protein distance matrix -- 'Protein distance matrix' SubClassOf 'Protein residue interactions' -+ 'Protein distance matrix' SubClassOf 'Protein interaction report' -+ 'Protein distance matrix' SubClassOf 'Distance matrix' - -Class: http://edamontology.org/data_1548 -Label: Protein residue 3D cluster -- 'Protein residue 3D cluster' SubClassOf 'Protein residue interactions' -+ 'Protein residue 3D cluster' SubClassOf 'Protein interaction report' - -Class: http://edamontology.org/data_1547 -Label: Protein contact map -- 'Protein contact map' SubClassOf 'Protein residue interactions' -+ 'Protein contact map' SubClassOf 'Protein interaction report' - -Class: http://edamontology.org/data_1542 -Label: Protein solvent accessibility -- 'Protein solvent accessibility' SubClassOf 'Protein residue interactions' -+ 'Protein solvent accessibility' SubClassOf 'Protein structure report' - -Class: http://edamontology.org/data_1540 -Label: Protein residue interactions -- 'Protein residue interactions' SubClassOf 'Protein property' -- 'Protein residue interactions' 
SubClassOf 'has topic' some 'Protein folding, stability and design' -+ 'Protein residue interactions' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3545 -Label: Mathematical modelling -- 'Mathematical modelling' SubClassOf 'Modelling and simulation' -+ 'Mathematical modelling' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_1491 -Label: Structure alignment (nucleic acid pair) -- 'Structure alignment (nucleic acid pair)' SubClassOf 'Structure alignment (pair)' -- 'Structure alignment (nucleic acid pair)' SubClassOf 'Structure alignment (nucleic acid)' -+ 'Structure alignment (nucleic acid pair)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_1483 -Label: Structure alignment (protein pair) -- 'Structure alignment (protein pair)' SubClassOf 'Structure alignment (protein)' -- 'Structure alignment (protein pair)' SubClassOf 'Structure alignment (pair)' -+ 'Structure alignment (protein pair)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_2402 -Label: Protein-drug interaction report -- 'Protein-drug interaction report' SubClassOf 'Protein structure report' -- 'Protein-drug interaction report' SubClassOf 'Protein-ligand interaction report' -- 'Protein-drug interaction report' SubClassOf 'Drug report' -+ 'Protein-drug interaction report' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3435 -Label: Standardization and normalization -- 'Standardization and normalization' SubClassOf 'Analysis' -+ 'Standardization and normalization' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3434 -Label: Conversion -- 'Conversion' SubClassOf 'Utility operation' -+ 'Conversion' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3439 -Label: Pathway or network prediction -- 'Pathway or network prediction' SubClassOf 'Pathway or network processing' -+ 'Pathway or network prediction' SubClassOf 'Pathway or 
network analysis' - -Class: http://edamontology.org/operation_3438 -Label: Calculation -- 'Calculation' SubClassOf 'Analysis' -+ 'Calculation' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3433 -Label: Assembly -- 'Assembly' SubClassOf 'Analysis' -+ 'Assembly' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3432 -Label: Clustering -- 'Clustering' SubClassOf 'Analysis' -+ 'Clustering' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3440 -Label: Genome assembly -- 'Genome assembly' SubClassOf 'Sequence assembly' -+ 'Genome assembly' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3441 -Label: Plotting -- 'Plotting' SubClassOf 'Analysis' -+ 'Plotting' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3443 -Label: Image analysis -- 'Image analysis' SubClassOf 'Operation (typed)' -+ 'Image analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_3465 -Label: Correlation -- 'Correlation' SubClassOf 'Analysis' -+ 'Correlation' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_3429 -Label: Generation -- 'Generation' SubClassOf 'Analysis' -+ 'Generation' SubClassOf 'Operation' - -Class: http://edamontology.org/data_3546 -Label: Image metadata -- 'Image metadata' SubClassOf 'Data' -+ 'Image metadata' SubClassOf 'Report' - -Class: http://edamontology.org/format_1964 -Label: plain text format (unformatted) -- 'plain text format (unformatted)' SubClassOf 'Sequence record format (text)' - -Class: http://edamontology.org/data_1386 -Label: Sequence alignment (nucleic acid pair) -- 'Sequence alignment (nucleic acid pair)' SubClassOf 'Sequence alignment (pair)' -- 'Sequence alignment (nucleic acid pair)' SubClassOf 'Sequence alignment (nucleic acid)' -+ 'Sequence alignment (nucleic acid pair)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_1387 -Label: Sequence alignment (protein pair) -- 'Sequence alignment 
(protein pair)' SubClassOf 'Sequence alignment (pair)' -- 'Sequence alignment (protein pair)' SubClassOf 'Sequence alignment (protein)' -+ 'Sequence alignment (protein pair)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_0180 -Label: Protein fold recognition -- 'Protein fold recognition' SubClassOf 'Structure prediction' -+ 'Protein fold recognition' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_0199 -Label: Genetic variation -- 'Genetic variation' SubClassOf 'Genotype and phenotype' -+ 'Genetic variation' SubClassOf 'Genetics' - -Class: http://edamontology.org/topic_0160 -Label: Sequence sites, features and motifs -- 'Sequence sites, features and motifs' SubClassOf 'Sequence analysis' -+ 'Sequence sites, features and motifs' SubClassOf 'Computational biology' - -Class: http://edamontology.org/topic_0172 -Label: Protein structure prediction -- 'Protein structure prediction' SubClassOf 'Protein structure analysis' -- 'Protein structure prediction' SubClassOf 'Structure prediction' -+ 'Protein structure prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_0173 -Label: Nucleic acid structure prediction -- 'Nucleic acid structure prediction' SubClassOf 'Structure prediction' -- 'Nucleic acid structure prediction' SubClassOf 'Nucleic acid structure analysis' -+ 'Nucleic acid structure prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_0177 -Label: Molecular docking -- 'Molecular docking' SubClassOf 'Structure prediction' -+ 'Molecular docking' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0559 -Label: Immunogenicity prediction -- 'Immunogenicity prediction' SubClassOf 'has output' some 'Protein structure' -- 'Immunogenicity prediction' SubClassOf 'Peptide immunogenicity prediction' -- 'Immunogenicity prediction' SubClassOf 'has topic' some 'Protein folding, stability and design' -- 'Immunogenicity 
prediction' SubClassOf 'has topic' some 'Immunology' -+ 'Immunogenicity prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3283 -Label: Anonymisation -- 'Anonymisation' SubClassOf 'Utility operation' -+ 'Anonymisation' SubClassOf 'Operation' - -Class: http://edamontology.org/topic_0159 -Label: Sequence comparison -- 'Sequence comparison' SubClassOf 'Sequence analysis' -+ 'Sequence comparison' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3351 -Label: Protein surface analysis -- 'Protein surface analysis' SubClassOf 'Protein property calculation (from structure)' -+ 'Protein surface analysis' SubClassOf 'has topic' some 'Protein structural motifs and surfaces' -+ 'Protein surface analysis' SubClassOf 'Structure analysis' -+ 'Protein surface analysis' SubClassOf 'has topic' some 'Protein properties' - -Class: http://edamontology.org/operation_0567 -Label: Phylogenetic tree visualisation -- 'Phylogenetic tree visualisation' SubClassOf 'Phylogenetic tree processing' -+ 'Phylogenetic tree visualisation' SubClassOf 'Phylogenetic tree analysis' - -Class: http://edamontology.org/operation_0566 -Label: Sequence cluster visualisation -- 'Sequence cluster visualisation' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/operation_0563 -Label: Codon usage table formatting -- 'Codon usage table formatting' SubClassOf 'Formatting' -- 'Codon usage table formatting' SubClassOf 'Codon usage table processing' -- 'Codon usage table formatting' SubClassOf 'has output' some 'Codon usage table' -- 'Codon usage table formatting' SubClassOf 'has input' some 'Codon usage table' -+ 'Codon usage table formatting' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0562 -Label: Sequence alignment formatting -- 'Sequence alignment formatting' SubClassOf 'has input' some 'Sequence alignment' -- 'Sequence alignment formatting' SubClassOf 'has output' some 
'Sequence alignment' -- 'Sequence alignment formatting' SubClassOf 'Formatting' -+ 'Sequence alignment formatting' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0561 -Label: Sequence formatting -- 'Sequence formatting' SubClassOf 'has input' some 'Sequence' -- 'Sequence formatting' SubClassOf 'Formatting' -- 'Sequence formatting' SubClassOf 'has output' some 'Sequence' -+ 'Sequence formatting' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0560 -Label: DNA vaccine design -- 'DNA vaccine design' SubClassOf 'Design' -- 'DNA vaccine design' SubClassOf 'has topic' some 'Nucleic acid structure prediction' - -Class: http://edamontology.org/topic_3517 -Label: GWAS study -- 'GWAS study' SubClassOf 'Laboratory techniques' -+ 'GWAS study' SubClassOf http://edamontology.org/topic_3678 - -Class: http://edamontology.org/topic_3523 -Label: RNAi experiment -- 'RNAi experiment' SubClassOf 'Sequencing' -+ 'RNAi experiment' SubClassOf 'Laboratory techniques' - -Class: http://edamontology.org/operation_2233 -Label: Representative sequence identification -- 'Representative sequence identification' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/data_2991 -Label: Protein torsion angle data -- 'Protein torsion angle data' SubClassOf 'Protein property' -+ 'Protein torsion angle data' SubClassOf 'Protein structure report' - -Class: http://edamontology.org/topic_1312 -Label: Promoters -- 'Promoters' SubClassOf 'Gene transcription features' -+ 'Promoters' SubClassOf 'Transcription factors and regulatory sites' - -Class: http://edamontology.org/data_1235 -Label: Sequence cluster -- 'Sequence cluster' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/data_1274 -Label: Map -- 'Map' SubClassOf 'Data' -+ 'Map' SubClassOf 'Map data' - -Class: http://edamontology.org/operation_3191 -Label: Trim to reference -- 'Trim to reference' SubClassOf 'Sequence trimming' 
-+ 'Trim to reference' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3190 -Label: Trim vector -- 'Trim vector' SubClassOf 'Sequence trimming' -+ 'Trim vector' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_0971 -Label: Article -- 'Article' SubClassOf 'Data' -+ 'Article' SubClassOf 'Article data' - -Class: http://edamontology.org/data_0906 -Label: Protein interaction report -- 'Protein interaction report' SubClassOf 'Protein report' -+ 'Protein interaction report' SubClassOf 'Protein structure report' - -Class: http://edamontology.org/operation_3189 -Label: Trim ends -- 'Trim ends' SubClassOf 'Sequence trimming' -+ 'Trim ends' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3219 -Label: Read pre-processing -- 'Read pre-processing' SubClassOf 'Validation' -+ 'Read pre-processing' SubClassOf 'Sequencing quality control' - -Class: http://edamontology.org/operation_3214 -Label: Spectral analysis -+ 'Spectral analysis' SubClassOf 'has topic' some 'Proteomics' - -Class: http://edamontology.org/operation_3213 -Label: Genome indexing (suffix arrays) -- 'Genome indexing (suffix arrays)' SubClassOf 'Genome indexing' -+ 'Genome indexing (suffix arrays)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3212 -Label: Genome indexing (Burrows-Wheeler) -- 'Genome indexing (Burrows-Wheeler)' SubClassOf 'Genome indexing' -+ 'Genome indexing (Burrows-Wheeler)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_3229 -Label: Exome analysis -- 'Exome analysis' SubClassOf 'Nucleic acid sequence analysis' -+ 'Exome analysis' SubClassOf 'Sequence assembly' - -Class: http://edamontology.org/operation_0303 -Label: Protein fold recognition -- 'Protein fold recognition' SubClassOf 'has topic' some 'Protein fold recognition' - -Class: http://edamontology.org/data_2767 -Label: Identifier with metadata -- 'Identifier with metadata' 
SubClassOf 'Identifier' -+ 'Identifier with metadata' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_2048 -Label: Report -- 'Report' SubClassOf 'Resource metadata' -+ 'Report' SubClassOf 'Data' - -Class: http://edamontology.org/topic_3298 -Label: Phenomics -+ 'Phenomics' SubClassOf 'Omics' - -Class: http://edamontology.org/data_1150 -Label: Disease ID -- 'Disease ID' SubClassOf 'is identifier of' some 'Disease report' -- 'Disease ID' SubClassOf 'Identifier (typed)' -+ 'Disease ID' SubClassOf http://edamontology.org/data_3667 - -Class: http://edamontology.org/operation_1832 -Label: Residue contact calculation (residue-nucleic acid) -- 'Residue contact calculation (residue-nucleic acid)' SubClassOf 'Protein-nucleic acid binding site analysis' -- 'Residue contact calculation (residue-nucleic acid)' SubClassOf 'Residue contact calculation' -- 'Residue contact calculation (residue-nucleic acid)' SubClassOf 'Protein structural motif recognition' -+ 'Residue contact calculation (residue-nucleic acid)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1834 -Label: Residue contact calculation (residue-metal) -- 'Residue contact calculation (residue-metal)' SubClassOf 'Residue contact calculation' -- 'Residue contact calculation (residue-metal)' SubClassOf 'Protein binding site prediction (from structure)' -+ 'Residue contact calculation (residue-metal)' SubClassOf 'Residue interaction calculation' - -Class: http://edamontology.org/operation_1835 -Label: Residue contact calculation (residue-negative ion) -- 'Residue contact calculation (residue-negative ion)' SubClassOf 'Residue contact calculation' -+ 'Residue contact calculation (residue-negative ion)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1831 -Label: Metal-bound cysteine detection -+ 'Metal-bound cysteine detection' SubClassOf 'Residue interaction calculation' - -Class: http://edamontology.org/operation_1826 -Label: 
Full torsion angle calculation -- 'Full torsion angle calculation' SubClassOf 'Torsion angle calculation' -+ 'Full torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1825 -Label: Backbone torsion angle calculation -- 'Backbone torsion angle calculation' SubClassOf 'Torsion angle calculation' -+ 'Backbone torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1828 -Label: Tau angle calculation -- 'Tau angle calculation' SubClassOf 'Torsion angle calculation' -+ 'Tau angle calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1827 -Label: Cysteine torsion angle calculation -- 'Cysteine torsion angle calculation' SubClassOf 'Torsion angle calculation' -+ 'Cysteine torsion angle calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1846 -Label: HET group detection -- 'HET group detection' SubClassOf 'Residue contact calculation (residue-ligand)' -+ 'HET group detection' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1843 -Label: Residue packing validation -- 'Residue packing validation' SubClassOf 'Residue non-canonical interaction detection' -+ 'Residue packing validation' SubClassOf 'Protein model validation' - -Class: http://edamontology.org/operation_1841 -Label: Rotamer likelihood prediction -- 'Rotamer likelihood prediction' SubClassOf 'Protein modelling (side chains)' -+ 'Rotamer likelihood prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1842 -Label: Proline mutation value calculation -- 'Proline mutation value calculation' SubClassOf 'Protein modelling (mutation)' -+ 'Proline mutation value calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1839 -Label: Salt bridge calculation -- 'Salt bridge calculation' SubClassOf 'Residue contact calculation 
(residue-residue)' -+ 'Salt bridge calculation' SubClassOf 'Residue interaction calculation' - -Class: http://edamontology.org/operation_1838 -Label: Residue contact calculation (residue-ligand) -- 'Residue contact calculation (residue-ligand)' SubClassOf 'Protein binding site prediction (from structure)' -- 'Residue contact calculation (residue-ligand)' SubClassOf 'Residue contact calculation' -+ 'Residue contact calculation (residue-ligand)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1837 -Label: Residue symmetry contact calculation -- 'Residue symmetry contact calculation' SubClassOf 'Residue contact calculation (residue-residue)' -+ 'Residue symmetry contact calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1836 -Label: Residue bump detection -- 'Residue bump detection' SubClassOf 'Residue non-canonical interaction detection' -+ 'Residue bump detection' SubClassOf 'Protein model validation' - -Class: http://edamontology.org/operation_1820 -Label: Protein residue surface calculation (vacuum accessible) -- 'Protein residue surface calculation (vacuum accessible)' SubClassOf 'Protein residue surface calculation' -+ 'Protein residue surface calculation (vacuum accessible)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1823 -Label: Protein surface calculation (accessible molecular) -- 'Protein surface calculation (accessible molecular)' SubClassOf 'Protein surface calculation' -+ 'Protein surface calculation (accessible molecular)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1824 -Label: Protein surface calculation (accessible) -- 'Protein surface calculation (accessible)' SubClassOf 'Protein surface calculation' -+ 'Protein surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1821 -Label: Protein residue surface calculation (accessible molecular) -- 
'Protein residue surface calculation (accessible molecular)' SubClassOf 'Protein residue surface calculation' -+ 'Protein residue surface calculation (accessible molecular)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1822 -Label: Protein residue surface calculation (vacuum molecular) -- 'Protein residue surface calculation (vacuum molecular)' SubClassOf 'Protein residue surface calculation' -+ 'Protein residue surface calculation (vacuum molecular)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1817 -Label: Protein atom surface calculation (accessible) -- 'Protein atom surface calculation (accessible)' SubClassOf 'Protein atom surface calculation' -+ 'Protein atom surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1816 -Label: Surface rendering -- 'Surface rendering' SubClassOf 'Protein surface calculation' -+ 'Surface rendering' SubClassOf 'Protein surface analysis' - -Class: http://edamontology.org/operation_1819 -Label: Protein residue surface calculation (accessible) -- 'Protein residue surface calculation (accessible)' SubClassOf 'Protein residue surface calculation' -+ 'Protein residue surface calculation (accessible)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_1818 -Label: Protein atom surface calculation (accessible molecular) -- 'Protein atom surface calculation (accessible molecular)' SubClassOf 'Protein atom surface calculation' -+ 'Protein atom surface calculation (accessible molecular)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2495 -Label: Gene expression data analysis -- 'Gene expression data analysis' SubClassOf 'Operation (typed)' -+ 'Gene expression data analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_2497 -Label: Pathway or network analysis -- 'Pathway or network analysis' SubClassOf 'has input' some 'Pathway 
or network' -- 'Pathway or network analysis' SubClassOf 'Pathway or network processing' -+ 'Pathway or network analysis' SubClassOf 'Analysis' -+ 'Pathway or network analysis' SubClassOf 'has topic' some 'Molecular interactions, pathways and networks' - -Class: http://edamontology.org/operation_2491 -Label: Hydrogen bond calculation (inter-residue) -- 'Hydrogen bond calculation (inter-residue)' SubClassOf 'Hydrogen bond calculation' -- 'Hydrogen bond calculation (inter-residue)' SubClassOf 'Residue contact calculation (residue-residue)' -+ 'Hydrogen bond calculation (inter-residue)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2490 -Label: Residue contact calculation (residue-residue) -- 'Residue contact calculation (residue-residue)' SubClassOf 'Residue contact calculation' -+ 'Residue contact calculation (residue-residue)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2480 -Label: Structure analysis -- 'Structure analysis' SubClassOf 'Operation (typed)' -+ 'Structure analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_3083 -Label: Pathway or network visualisation -- 'Pathway or network visualisation' SubClassOf 'Pathway or network processing' -+ 'Pathway or network visualisation' SubClassOf 'Pathway or network analysis' - -Class: http://edamontology.org/operation_3096 -Label: Editing -- 'Editing' SubClassOf 'Utility operation' -+ 'Editing' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_1848 -Label: Structure formatting -- 'Structure formatting' SubClassOf 'Formatting' -+ 'Structure formatting' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2460 -Label: Protein atom surface calculation -- 'Protein atom surface calculation' SubClassOf 'Protein surface and interior calculation' -+ 'Protein atom surface calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2461 -Label: Protein 
residue surface calculation -- 'Protein residue surface calculation' SubClassOf 'Protein surface and interior calculation' -+ 'Protein residue surface calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2464 -Label: Protein-protein interaction prediction -+ 'Protein-protein interaction prediction' SubClassOf 'Protein function prediction' - -Class: http://edamontology.org/operation_2462 -Label: Protein surface calculation -- 'Protein surface calculation' SubClassOf 'Protein surface and interior calculation' -+ 'Protein surface calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2451 -Label: Sequence comparison -- 'Sequence comparison' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/operation_0385 -Label: Protein hydropathy cluster calculation -- 'Protein hydropathy cluster calculation' SubClassOf 'Protein hydropathy calculation (from structure)' -- 'Protein hydropathy cluster calculation' SubClassOf 'Protein residue cluster calculation' -+ 'Protein hydropathy cluster calculation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0384 -Label: Protein solvent accessibility calculation -- 'Protein solvent accessibility calculation' SubClassOf 'Protein property calculation (from structure)' -- 'Protein solvent accessibility calculation' SubClassOf 'has topic' some 'Protein properties' -+ 'Protein solvent accessibility calculation' SubClassOf 'Protein surface analysis' - -Class: http://edamontology.org/operation_0383 -Label: Protein hydropathy calculation (from structure) -- 'Protein hydropathy calculation (from structure)' SubClassOf 'Protein hydropathy calculation' -- 'Protein hydropathy calculation (from structure)' SubClassOf 'Protein property calculation (from structure)' -- 'Protein hydropathy calculation (from structure)' SubClassOf 'Protein structure analysis' -+ 'Protein hydropathy calculation (from structure)' 
SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0388 -Label: Protein binding site prediction (from structure) -- 'Protein binding site prediction (from structure)' SubClassOf 'Protein structural motif recognition' -- 'Protein binding site prediction (from structure)' SubClassOf 'Protein binding site prediction' -+ 'Protein binding site prediction (from structure)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0387 -Label: Protein surface and interior calculation -- 'Protein surface and interior calculation' SubClassOf 'has topic' some 'Protein structural motifs and surfaces' -- 'Protein surface and interior calculation' SubClassOf 'Protein solvent accessibility calculation' -+ 'Protein surface and interior calculation' SubClassOf 'Protein surface analysis' - -Class: http://edamontology.org/operation_0391 -Label: Protein distance matrix calculation -- 'Protein distance matrix calculation' SubClassOf 'Residue interaction calculation' -+ 'Protein distance matrix calculation' SubClassOf 'Residue contact calculation' - -Class: http://edamontology.org/operation_2443 -Label: Phylogenetic tree processing -- 'Phylogenetic tree processing' SubClassOf 'has topic' some 'Phylogeny' -- 'Phylogenetic tree processing' SubClassOf 'Operation (typed)' -+ 'Phylogenetic tree processing' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0393 -Label: Protein residue cluster calculation -- 'Protein residue cluster calculation' SubClassOf 'Residue contact calculation (residue-residue)' -+ 'Protein residue cluster calculation' SubClassOf 'Residue contact calculation' - -Class: http://edamontology.org/operation_0392 -Label: Protein contact map calculation -- 'Protein contact map calculation' SubClassOf 'Residue contact calculation (residue-residue)' -+ 'Protein contact map calculation' SubClassOf 'Protein distance matrix calculation' - -Class: http://edamontology.org/operation_0395 -Label: 
Residue non-canonical interaction detection -- 'Residue non-canonical interaction detection' SubClassOf 'Protein model validation' -- 'Residue non-canonical interaction detection' SubClassOf 'Residue interaction calculation' -+ 'Residue non-canonical interaction detection' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2430 -Label: Design -- 'Design' SubClassOf 'Analysis' -+ 'Design' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2438 -Label: Pathway or network processing -- 'Pathway or network processing' SubClassOf 'has topic' some 'Molecular interactions, pathways and networks' -- 'Pathway or network processing' SubClassOf 'Operation (typed)' -+ 'Pathway or network processing' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2420 -Label: Operation (typed) -- 'Operation (typed)' SubClassOf 'Operation' -+ 'Operation (typed)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_2423 -Label: Prediction and recognition -- 'Prediction and recognition' SubClassOf 'Analysis' -+ 'Prediction and recognition' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2425 -Label: Optimisation and refinement -- 'Optimisation and refinement' SubClassOf 'Analysis' -+ 'Optimisation and refinement' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2424 -Label: Comparison -- 'Comparison' SubClassOf 'Analysis' -+ 'Comparison' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2426 -Label: Modelling and simulation -- 'Modelling and simulation' SubClassOf 'Analysis' -+ 'Modelling and simulation' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2429 -Label: Mapping -- 'Mapping' SubClassOf 'Analysis' -+ 'Mapping' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_2428 -Label: Validation -- 'Validation' SubClassOf 'Analysis' -+ 'Validation' SubClassOf 'Operation' - -Class: 
http://edamontology.org/operation_2409 -Label: Utility operation -- 'Utility operation' SubClassOf 'Operation' -+ 'Utility operation' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_2403 -Label: Sequence analysis -- 'Sequence analysis' SubClassOf 'Operation (typed)' -+ 'Sequence analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_0325 -Label: Phylogenetic tree comparison -- 'Phylogenetic tree comparison' SubClassOf 'Phylogenetic tree processing' -+ 'Phylogenetic tree comparison' SubClassOf 'Phylogenetic tree analysis' - -Class: http://edamontology.org/operation_0326 -Label: Phylogenetic tree editing -- 'Phylogenetic tree editing' SubClassOf 'Phylogenetic tree processing' -+ 'Phylogenetic tree editing' SubClassOf 'Phylogenetic tree analysis' - -Class: http://edamontology.org/operation_0323 -Label: Phylogenetic tree generation -- 'Phylogenetic tree generation' SubClassOf 'Phylogenetic tree processing' -+ 'Phylogenetic tree generation' SubClassOf 'Phylogenetic tree analysis' - -Class: http://edamontology.org/operation_0324 -Label: Phylogenetic tree analysis -- 'Phylogenetic tree analysis' SubClassOf 'has output' some 'Phylogenetic data' -- 'Phylogenetic tree analysis' SubClassOf 'Phylogenetic tree processing' -- 'Phylogenetic tree analysis' SubClassOf 'has input' some 'Phylogenetic tree' -+ 'Phylogenetic tree analysis' SubClassOf 'has topic' some 'Phylogeny' -+ 'Phylogenetic tree analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_0321 -Label: Protein model validation -- 'Protein model validation' SubClassOf 'has topic' some 'Protein structure prediction' - -Class: http://edamontology.org/operation_0336 -Label: Format validation -- 'Format validation' SubClassOf 'Utility operation' - -Class: http://edamontology.org/operation_0338 -Label: Sequence database search -- 'Sequence database search' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/operation_0330 -Label: 
Protein SNP mapping -- 'Protein SNP mapping' SubClassOf 'Mapping' -- 'Protein SNP mapping' SubClassOf 'Protein modelling (mutation)' -- 'Protein SNP mapping' SubClassOf 'has topic' some 'SNP' -+ 'Protein SNP mapping' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0410 -Label: Protein crystallizability prediction -- 'Protein crystallizability prediction' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein crystallizability prediction' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0413 -Label: MHC peptide immunogenicity prediction -- 'MHC peptide immunogenicity prediction' SubClassOf 'Peptide immunogenicity prediction' -+ 'MHC peptide immunogenicity prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0408 -Label: Protein globularity prediction -- 'Protein globularity prediction' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein globularity prediction' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0409 -Label: Protein solubility prediction -- 'Protein solubility prediction' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein solubility prediction' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0406 -Label: Protein aliphatic index calculation -- 'Protein aliphatic index calculation' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein aliphatic index calculation' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0407 -Label: Protein hydrophobic moment plotting -- 'Protein hydrophobic moment plotting' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein hydrophobic moment plotting' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0401 -Label: Protein hydropathy calculation (from sequence) -- 'Protein 
hydropathy calculation (from sequence)' SubClassOf 'Protein hydropathy calculation' -- 'Protein hydropathy calculation (from sequence)' SubClassOf 'Protein sequence analysis' -+ 'Protein hydropathy calculation (from sequence)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0405 -Label: Protein hydrophobic region calculation -- 'Protein hydrophobic region calculation' SubClassOf 'Protein hydropathy calculation (from sequence)' -+ 'Protein hydrophobic region calculation' SubClassOf 'Protein hydropathy calculation' - -Class: http://edamontology.org/operation_0438 -Label: Regulatory element prediction -- 'Regulatory element prediction' SubClassOf 'Gene component prediction' -+ 'Regulatory element prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_0437 -Label: Selenocysteine insertion sequence (SECIS) prediction -- 'Selenocysteine insertion sequence (SECIS) prediction' SubClassOf 'Gene component prediction' -+ 'Selenocysteine insertion sequence (SECIS) prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_0436 -Label: Coding region prediction -- 'Coding region prediction' SubClassOf 'Gene component prediction' -+ 'Coding region prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_0435 -Label: Operon prediction -- 'Operon prediction' SubClassOf 'Whole gene prediction' -+ 'Operon prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_0434 -Label: Integrated gene prediction -- 'Integrated gene prediction' SubClassOf 'Whole gene prediction' -+ 'Integrated gene prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_0429 -Label: Quadruplex formation site detection -- 'Quadruplex formation site detection' SubClassOf 'has topic' some 'Nucleic acid structure prediction' - -Class: http://edamontology.org/operation_0425 -Label: Whole gene prediction -- 'Whole gene prediction' SubClassOf 'Gene 
prediction' -+ 'Whole gene prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0426 -Label: Gene component prediction -- 'Gene component prediction' SubClassOf 'Gene prediction' -+ 'Gene component prediction' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0420 -Label: Protein-nucleic acid binding prediction -- 'Protein-nucleic acid binding prediction' SubClassOf 'Protein binding site prediction (from sequence)' -+ 'Protein-nucleic acid binding prediction' SubClassOf 'Protein binding site prediction' - -Class: http://edamontology.org/topic_3346 -Label: Sequence search -- 'Sequence search' SubClassOf 'Sequence analysis' -+ 'Sequence search' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_3344 -Label: Biomedical science -- 'Biomedical science' SubClassOf 'Biology' -- 'Biomedical science' SubClassOf 'Medicine' -+ 'Biomedical science' SubClassOf 'Topic' - -Class: http://edamontology.org/operation_0419 -Label: Protein binding site prediction (from sequence) -- 'Protein binding site prediction (from sequence)' SubClassOf 'Protein sequence feature detection' -- 'Protein binding site prediction (from sequence)' SubClassOf 'Protein binding site prediction' -+ 'Protein binding site prediction (from sequence)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_3307 -Label: Computational biology -- 'Computational biology' SubClassOf 'Biology' -- 'Computational biology' SubClassOf 'Computer science' -+ 'Computational biology' SubClassOf 'Topic' - -Class: http://edamontology.org/topic_3391 -Label: Omics -- 'Omics' SubClassOf 'Biology' -+ 'Omics' SubClassOf 'Topic' - -Class: http://edamontology.org/topic_3379 -Label: Preclinical and clinical studies -+ 'Preclinical and clinical studies' SubClassOf http://edamontology.org/topic_3678 - -Class: http://edamontology.org/operation_0483 -Label: Structured RNA prediction and optimisation -- 'Structured RNA 
prediction and optimisation' SubClassOf 'has topic' some 'Nucleic acid structure prediction' - -Class: http://edamontology.org/topic_0594 -Label: Sequence classification -- 'Sequence classification' SubClassOf 'Sequence analysis' -+ 'Sequence classification' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0464 -Label: tRNA gene prediction -- 'tRNA gene prediction' SubClassOf 'Whole gene prediction' -+ 'tRNA gene prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_1913 -Label: Residue validation -- 'Residue validation' SubClassOf 'Protein model validation' -+ 'Residue validation' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0474 -Label: Protein structure prediction -- 'Protein structure prediction' SubClassOf 'has topic' some 'Protein structure prediction' - -Class: http://edamontology.org/operation_0475 -Label: Nucleic acid structure prediction -- 'Nucleic acid structure prediction' SubClassOf 'has topic' some 'Nucleic acid structure analysis' - -Class: http://edamontology.org/operation_0478 -Label: Molecular docking -- 'Molecular docking' SubClassOf 'has topic' some 'Molecular docking' - -Class: http://edamontology.org/topic_3123 -Label: Expression signals -- 'Expression signals' SubClassOf 'Gene structure' -- 'Expression signals' SubClassOf 'Gene expression' -+ 'Expression signals' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/data_1646 -Label: Molecular weights standard fingerprint -- 'Molecular weights standard fingerprint' SubClassOf 'Peptide mass fingerprint' -+ 'Molecular weights standard fingerprint' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/topic_3169 -Label: ChIP-seq -- 'ChIP-seq' SubClassOf 'Obsolete concept (EDAM)' -+ 'ChIP-seq' SubClassOf 'Sequencing' -+ 'ChIP-seq' SubClassOf http://edamontology.org/topic_3656 - -Class: http://edamontology.org/data_0582 -Label: Ontology -- 'Ontology' SubClassOf 
'Data' -+ 'Ontology' SubClassOf 'Ontology data' - -Class: http://edamontology.org/topic_3179 -Label: ChIP-on-chip -+ 'ChIP-on-chip' SubClassOf http://edamontology.org/topic_3656 - -Class: http://edamontology.org/topic_3170 -Label: RNA-Seq -- 'RNA-Seq' SubClassOf 'Obsolete concept (EDAM)' -+ 'RNA-Seq' SubClassOf 'Sequencing' - -Class: http://edamontology.org/topic_0749 -Label: Transcription factors and regulatory sites -- 'Transcription factors and regulatory sites' SubClassOf 'Promoters' -+ 'Transcription factors and regulatory sites' SubClassOf 'Gene transcription features' - -Class: http://edamontology.org/operation_0291 -Label: Sequence clustering -- 'Sequence clustering' SubClassOf 'has topic' some 'Sequence comparison' - -Class: http://edamontology.org/operation_0299 -Label: 3D profile-to-3D profile alignment -- '3D profile-to-3D profile alignment' SubClassOf 'Alignment' - -Class: http://edamontology.org/topic_0769 -Label: Workflows -- 'Workflows' SubClassOf 'Obsolete concept (EDAM)' -+ 'Workflows' SubClassOf 'Data management' - -Class: http://edamontology.org/operation_2501 -Label: Nucleic acid analysis -- 'Nucleic acid analysis' SubClassOf 'Operation (typed)' -+ 'Nucleic acid analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_2502 -Label: Protein analysis -- 'Protein analysis' SubClassOf 'has topic' some 'Proteins' -- 'Protein analysis' SubClassOf 'Operation (typed)' -+ 'Protein analysis' SubClassOf 'has topic' some 'Proteomics' -+ 'Protein analysis' SubClassOf 'Analysis' - -Class: http://edamontology.org/operation_0226 -Label: Annotation -- 'Annotation' SubClassOf 'Analysis' -+ 'Annotation' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_0227 -Label: Indexing -- 'Indexing' SubClassOf 'Utility operation' -+ 'Indexing' SubClassOf 'Operation' - -Class: http://edamontology.org/operation_0248 -Label: Residue interaction calculation -- 'Residue interaction calculation' SubClassOf 'has output' some 'Protein residue 
interactions' -+ 'Residue interaction calculation' SubClassOf 'Residue interaction calculation' - -Class: http://edamontology.org/operation_0250 -Label: Protein property calculation -- 'Protein property calculation' SubClassOf 'has output' some 'Protein property' -- 'Protein property calculation' SubClassOf 'has topic' some 'Protein properties' - -Class: http://edamontology.org/operation_0267 -Label: Protein secondary structure prediction -- 'Protein secondary structure prediction' SubClassOf 'has topic' some 'Protein structure prediction' - -Class: http://edamontology.org/operation_2575 -Label: Protein binding site prediction -+ 'Protein binding site prediction' SubClassOf 'Protein feature detection' - -Class: http://edamontology.org/operation_0278 -Label: RNA secondary structure prediction -- 'RNA secondary structure prediction' SubClassOf 'has topic' some 'Nucleic acid structure prediction' - -Class: http://edamontology.org/operation_0279 -Label: Nucleic acid folding prediction -- 'Nucleic acid folding prediction' SubClassOf 'has topic' some 'Nucleic acid structure prediction' - -Class: http://edamontology.org/operation_0274 -Label: Protein-protein interaction prediction (from protein sequence) -- 'Protein-protein interaction prediction (from protein sequence)' SubClassOf 'Protein function prediction' -- 'Protein-protein interaction prediction (from protein sequence)' SubClassOf 'Protein-protein interaction prediction' -+ 'Protein-protein interaction prediction (from protein sequence)' SubClassOf 'Obsolete concept (EDAM)' - -Class: http://edamontology.org/operation_0275 -Label: Protein-protein interaction prediction (from protein structure) -- 'Protein-protein interaction prediction (from protein structure)' SubClassOf 'Protein-protein interaction prediction' -- 'Protein-protein interaction prediction (from protein structure)' SubClassOf 'Protein structure analysis' -- 'Protein-protein interaction prediction (from protein structure)' SubClassOf 'Protein feature 
detection' -+ 'Protein-protein interaction prediction (from protein structure)' SubClassOf 'Obsolete concept (EDAM)' -New classes: - -Class: http://edamontology.org/format_3665 -Label: K-mer countgraph -+ 'K-mer countgraph' SubClassOf 'Graph format' -+ 'K-mer countgraph' SubClassOf 'Binary format' - -Class: http://edamontology.org/format_3652 -Label: dta -+ 'dta' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3653 -Label: pkl -+ 'pkl' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3650 -Label: netCDF -+ 'netCDF' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3651 -Label: MGF -+ 'MGF' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3657 -Label: GPML -+ 'GPML' SubClassOf 'Biological pathway or network format' -+ 'GPML' SubClassOf 'XML' - -Class: http://edamontology.org/format_3654 -Label: mzXML -+ 'mzXML' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/format_3655 -Label: pepXML -+ 'pepXML' SubClassOf 'Mass spectrometry data format' - -Class: http://edamontology.org/topic_3656 -Label: Immunoprecipitation experiment -+ 'Immunoprecipitation experiment' SubClassOf 'Laboratory techniques' - -Class: http://edamontology.org/topic_3676 -Label: Exome sequencing -+ 'Exome sequencing' SubClassOf 'Sequencing' - -Class: http://edamontology.org/topic_3679 -Label: Animal study -+ 'Animal study' SubClassOf 'Experimental design and studies' -+ 'Animal study' SubClassOf 'Laboratory animal science' - -Class: http://edamontology.org/topic_3678 -Label: Experimental design and studies -+ 'Experimental design and studies' SubClassOf 'Topic' - -Class: http://edamontology.org/topic_3673 -Label: Whole genome sequencing -+ 'Whole genome sequencing' SubClassOf 'Sequencing' - -Class: http://edamontology.org/topic_3674 -Label: Methylated DNA immunoprecipitation -+ 'Methylated DNA immunoprecipitation' 
SubClassOf 'Immunoprecipitation experiment' - -Class: http://edamontology.org/format_3626 -Label: MAT -+ 'MAT' SubClassOf 'is format of' some '3D-1D scoring matrix' -+ 'MAT' SubClassOf 'Matrix format' - -Class: http://edamontology.org/data_3671 -Label: Text -+ 'Text' SubClassOf 'Parameter' -+ 'Text' SubClassOf 'Article data' - -Class: http://edamontology.org/data_3670 -Label: Online course -+ 'Online course' SubClassOf 'Training material' - -Class: http://edamontology.org/data_3667 -Label: Disease identifier -+ 'Disease identifier' SubClassOf 'is identifier of' some 'Disease report' -+ 'Disease identifier' SubClassOf 'Accession' -+ 'Disease identifier' SubClassOf 'Identifier (typed)' - -Class: http://edamontology.org/data_3669 -Label: Training material -+ 'Training material' SubClassOf 'Data' - -Class: http://edamontology.org/data_3668 -Label: Disease name -+ 'Disease name' SubClassOf 'Name' -+ 'Disease name' SubClassOf 'Disease identifier' - -Class: http://edamontology.org/topic_3557 -Label: Protein interaction experiment -+ 'Protein interaction experiment' SubClassOf 'Laboratory techniques' - -Class: http://edamontology.org/operation_3642 -Label: Dimethyl -+ 'Dimethyl' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3641 -Label: TMT-tag -+ 'TMT-tag' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3640 -Label: 18O labeling -+ '18O labeling' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3646 -Label: Peptide database search -+ 'Peptide database search' SubClassOf 'Peptide identification' -+ 'Peptide database search' SubClassOf 'Database search' - -Class: http://edamontology.org/operation_3645 -Label: PTM identification -+ 'PTM identification' SubClassOf 'Peptide identification' - -Class: http://edamontology.org/operation_3644 -Label: de Novo sequencing -+ 'de Novo sequencing' SubClassOf 'Sequence generation (protein)' -+ 'de Novo sequencing' SubClassOf 'Peptide 
identification' - -Class: http://edamontology.org/operation_3643 -Label: Tag-based peptide identification -+ 'Tag-based peptide identification' SubClassOf 'Peptide identification' - -Class: http://edamontology.org/operation_3649 -Label: Target-Decoy -+ 'Target-Decoy' SubClassOf 'Validation of peptide-spectrum matches' - -Class: http://edamontology.org/operation_3648 -Label: Validation of peptide-spectrum matches -+ 'Validation of peptide-spectrum matches' SubClassOf 'Peptide database search' -+ 'Validation of peptide-spectrum matches' SubClassOf 'Validation' - -Class: http://edamontology.org/operation_3647 -Label: Blind peptide database search -+ 'Blind peptide database search' SubClassOf 'Peptide database search' - -Class: http://edamontology.org/operation_3629 -Label: Deisotoping -+ 'Deisotoping' SubClassOf 'has input' some 'Mass spectrometry spectra' -+ 'Deisotoping' SubClassOf 'Spectral analysis' - -Class: http://edamontology.org/operation_3631 -Label: Peptide identification -+ 'Peptide identification' SubClassOf 'has input' some 'Mass spectrometry spectra' -+ 'Peptide identification' SubClassOf 'Spectral analysis' - -Class: http://edamontology.org/operation_3630 -Label: Quantification -+ 'Quantification' SubClassOf 'has input' some 'Mass spectrometry spectra' -+ 'Quantification' SubClassOf 'Spectral analysis' - -Class: http://edamontology.org/operation_3633 -Label: Retention times calculation -+ 'Retention times calculation' SubClassOf 'Calculation' - -Class: http://edamontology.org/operation_3632 -Label: Isotopic distributions calculation -+ 'Isotopic distributions calculation' SubClassOf 'Calculation' -+ 'Isotopic distributions calculation' SubClassOf 'has input' some 'Mass spectrometry spectra' -+ 'Isotopic distributions calculation' SubClassOf 'has topic' some 'Proteomics' - -Class: http://edamontology.org/operation_3635 -Label: Labeled quantification -+ 'Labeled quantification' SubClassOf 'Quantification' - -Class: http://edamontology.org/operation_3634 
-Label: Label-free quantification -+ 'Label-free quantification' SubClassOf 'Quantification' - -Class: http://edamontology.org/operation_3637 -Label: Spectral counting -+ 'Spectral counting' SubClassOf 'Label-free quantification' - -Class: http://edamontology.org/operation_3636 -Label: MRM/SRM -+ 'MRM/SRM' SubClassOf 'Quantification' - -Class: http://edamontology.org/operation_3639 -Label: iTRAQ -+ 'iTRAQ' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3638 -Label: SILAC -+ 'SILAC' SubClassOf 'Labeled quantification' - -Class: http://edamontology.org/operation_3664 -Label: Statistical modelling -+ 'Statistical modelling' SubClassOf 'Statistical calculation' - -Class: http://edamontology.org/operation_3663 -Label: Homology-based gene prediction -+ 'Homology-based gene prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_3662 -Label: Ab-initio gene prediction -+ 'Ab-initio gene prediction' SubClassOf 'Gene prediction' - -Class: http://edamontology.org/operation_3661 -Label: SNP annotation -+ 'SNP annotation' SubClassOf 'Sequence annotation' - -Class: http://edamontology.org/operation_3660 -Label: Metabolic network modelling -+ 'Metabolic network modelling' SubClassOf 'Network simulation' -+ 'Metabolic network modelling' SubClassOf 'has topic' some 'Systems biology' - -Class: http://edamontology.org/operation_3666 -Label: Molecular surface comparison -+ 'Molecular surface comparison' SubClassOf 'Molecular surface analysis' -+ 'Molecular surface comparison' SubClassOf 'Structure comparison' - -Class: http://edamontology.org/operation_3659 -Label: Regression analysis -+ 'Regression analysis' SubClassOf 'Statistical calculation' - -Class: http://edamontology.org/operation_3658 -Label: Statistical inference -+ 'Statistical inference' SubClassOf 'Statistical calculation' - -Class: http://edamontology.org/operation_3627 -Label: Mass spectra calibration -+ 'Mass spectra calibration' SubClassOf 'Spectral 
analysis' -+ 'Mass spectra calibration' SubClassOf 'has input' some 'Mass spectrometry spectra' - -Class: http://edamontology.org/operation_3628 -Label: Chromatographic alignment -+ 'Chromatographic alignment' SubClassOf 'Spectral analysis' -+ 'Chromatographic alignment' SubClassOf 'has input' some 'Mass spectrometry spectra' - -Class: http://edamontology.org/operation_3625 -Label: Relationship inference -+ 'Relationship inference' SubClassOf 'has output' some 'Article data' -+ 'Relationship inference' SubClassOf 'has input' some 'Article' -+ 'Relationship inference' SubClassOf 'Text mining' -+ 'Relationship inference' SubClassOf 'has topic' some 'Literature and reference' - -Class: http://edamontology.org/operation_3677 -Label: Differential binding analysis -+ 'Differential binding analysis' SubClassOf 'Nucleic acid sequence analysis' - -Class: http://edamontology.org/operation_3675 -Label: Variant filtering -+ 'Variant filtering' SubClassOf 'Sequencing quality control' -+ 'Variant filtering' SubClassOf 'Nucleic acid sequence analysis' - -Class: http://edamontology.org/operation_3672 -Label: Gene functional annotation -+ 'Gene functional annotation' SubClassOf 'Sequence annotation' diff --git a/changelog.md b/changelog.md deleted file mode 100644 index 80abd7b..0000000 --- a/changelog.md +++ /dev/null @@ -1,333 +0,0 @@ -# Changelog for EDAM -Description of changes are grouped as follows: -* **Added:** new features -* **Changed:** changes to existing functionality -* **Deprecated:** a once-stable feature that has been removed -* **Removed:** a deprecated feature that has been removed -* **Fixed:** a bug fix -* **Misc:** some miscellaneous other change - -# EDAM\_1.14.owl -See the [detailed change log](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) for exact details of changes. - -EDAM\_14 includes: -* many new terms or term corrections requested by the community (directly on github, or during the last hackathons). 
-* a new CI process that will be extended over time to monitor and improve the quality of the ontology. - -## Added -* 14 classes changed - -## Changed -* 28 classes added, mainly new data and formats. - -# EDAM\_1.13.owl -See the [detailed change log](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) for exact details of changes. - -The main focus of EDAM\_1.13.owl is: -* a Topic branch simplification in response to requests for a smaller, more usable and thus also more sustainable set of topics -* addition of new concepts requested via GitHub, prioritising addition of new formats from recent [de.NBI/EDAM](http://tinyurl.com/registryhackathon7) hackathon -* additions and changes for NGS tools packages within Debian Med but not included in SEQanswers Wiki (SEQwiki) (work in progress) - -## Added -* 23 new concepts (mostly in Format branch) added - -## Changed -* 105 concepts changed (excluding changes/additions to synonyms) -* topic branch restructured for easier navigation -* all deprecated classes are now child of SubClassOf http://www.w3.org/2002/07/owl#DeprecatedClass - -## Deprecated -* 60 concepts were deprecated, mostly to greatly simplify the Topics branch -* removal of some overly specialised Operation concepts (work in progress) -* NB: terms, synonyms and comments on deprecated concepts were generally preserved in the parent concepts - -## Fixed -* all deprecated concepts now have a suggestion (either consider or replacedBy) for an alternative -* all suggested alternatives for deprecated concepts are now to active (i.e. 
non-deprecated) concepts -* various other miscellaneous fixes as requested via GitHub - -## Misc -* new 'isdebtags' annotation defined on concepts to annotate a concept is a candidate for tagging Debian Med packages, following the recent [Debian Med sprint](https://wiki.debian.org/Sprints/2016/DebianMed2016) - - - -# EDAM\_1.12.owl -See the [detailed change log](https://github.com/edamontology/edamontology/blob/master/changelog-detailed.md) for exact details of changes. - -56 new concepts were added and 190 concepts changed. - -## Added -* 56 new concepts added -* new concepts for mass spec from analysis of msutils.org -* new concepts for NGS from analysis of SEQanswers Wiki -* misc. additions arising from the recent hackathons in [Brno, CZ](http://tinyurl.com/registryhackathon3) and [Amsterdam, NL](http://tinyurl.com/registryhackathon5) -* multiple new synonyms - -## Changed -* reorganisation of top-level Operation concepts to make this branch more usable -* reorganisation of top-level Data concepts to make this branch more usable - -## Deprecated -* 72 concepts were deprecated -* removal of overly-specific Topic concepts that were overlapping with operations -* removal of overly-specific Data and Operation concepts -* removal of some obscure organisational classes (e.g. ``) - - - -# EDAM\_1.11.owl -## Added -* 44 new formats have been added, based on the needs of the Galaxy (http://usegalaxy.org), ReGaTE (https://github.com/bioinfo-center-pasteur-fr/ReGaTE), and Common Workflow Language (https://github.com/common-workflow-language) projects, as part of the BOSC Codefest 2015 (http://open-bio.org/wiki/Codefest-2015.html). - -# EDAM\_1.10.owl -## Added -* hasDBXref class annotations added to Topic concepts to provide mapping to all VT Scientific Disciplines in branches 1.1 Mathematics, 1.2 Computer sciences, 1.3 Information sciences, 1.5 Biological sciences, 1.7 Chemical sciences, 3. 
Medical and Health Sciences, 3.2 Clinical medicine, 3.3 Health sciences and 3.4 Medical biotechnology. -* 9 new Topic concepts from mapping to VT Scientific Disciplines. -* 3 new Format concepts and 2 new Data concepts. - -## Changed -* 'Topic:Informatics' undeprecated and used as placeholder for various information science-related terms. -* 'Topic:Data management' and 'Topic:Computer science" siblings rearranged for conceptual clarity. - -## Fixed -* Multiple duplications of synonyms and labels in Topics branch. - -## Misc -Style of Topic concept definitions changed, removing "Topic concerning ...", to make them more usable. - -# EDAM\_1.9.owl - -- 20 new concepts in preparation for the ELIXIR Tools and Data Services Registry -- 1 concept deprecation -- Various minor changes (synonyms etc.) - - -# EDAM\_1.8.owl - -- Revision to provide comprehensive coverage of EBI Tool Topics, Data and Operations -- Removal of fine-grained report (human-readable data) concepts from the Data branch -- Rooting all report concepts under "Data->Report" -- Removal of operation-like concepts from the Topics branch -- Biological concepts (sequence feature-related, pathways and networks, experimental techniques) that were previously modeled under as reports within Data, are now given under Topic -- Simplification of key Data concepts concerning sequences, alignments and signatures (motifs/profiles) -- Many other additions and minor changes -- 107 concept deprecations -- 53 new concepts - - - -# EDAM\_1.7.owl - -- Additions and changes following from the recent ELIXIR Registry Hackathon (tinyurl.com/RegistryHackathon). -- About 50 new concepts added -- 9 concept deprecations -- Many minor changes (new synonyms, minor structural changes etc.) - - **Bug fixes** -- Fixed synonyms that had URIs as values (1) - -(1) for any synonyms that had a URI as value, that URI is now given as a seeAlso annotation instead. 
It was also necessary to remove all statements that defined a synyonm, from all "annotations on annotations", i.e. where comments had been added to an annotation on a class, via an owl:Axiom statement. - - -# EDAM\_1.6.owl - -- A major revision of the EDAM Operation branch to simplify it and improve usability. -- 64 EDAM Operation concept deprecations. -- Top-level Operations now correspond to tool types in the ELIXIR Tools & Data Services Registry: Analysis, Query and retrieval, Visualisation, Deposition, Utility operation. -- Removal of excessively fine-grained Operation concepts. -- Removed "bioinformatics" subset and all corresponding annotations -- Removal of unnecessary "organisational" classes. -- Renaming of concepts (terms) to reflect the common terms in use. - - -# EDAM\_1.5.owl - -- A major revision of the EDAM Data branch aiming for simplification and ease of use. -- 117 EDAM Data concept deprecations -- simplification of Data hierarchy -- removal of excessively fine-grained Dat concepts -- removal of out-of-scope Data concepts -- removal of unnecessary "organisational" classes (near top of Data hierarchy) -- renaming of concepts (terms) to reflect the common terms in use -- addition of Data synonyms - - **Bug fixes** -- fixed many references to deprecated concepts - - - -# EDAM\_1.4.owl - -- A major revision of the "Topic" sub-ontology expanding this into medical concepts (~60 new topics), following an effort led by Cath Brooksbank with major input from partners from EMTRAIN (European Medicines research TRAINing network) and partners from related ESFRI (European Strategy Forum on Research Infrastructures) projects. -- Fixing many minor bugs (mostly overlapping or bad synonyms) within topics, and other clean-ups. -- Removed the lowest tier of the "Topic" branch (mostly by moving terms up a level). -- Removed all `oboOther:namespace` and some subsets; removed most `oboInOwl:inSubset` for deprecated concepts and added subset 'obsolete'. 
-- New forms of UniProt identifiers added (regex). -- Examples of IANA and chemical media types added. -- A couple of file-/data-handling concepts added (operations and an identifier). -- An OBO-format version of EDAM has been omitted. We will only resume providing OBO format in case of substantial demand or full automation of the conversion. -- Documentation files have been substantially updated, _e.g._ specifying channels for the most welcome community contributions. - - **And most importantly:** -- EDAM is now being developed at GitHub!!! - - - -# EDAM\_1.3.owl - -Highlights of changes: -- Greatly simplified "Topic" branch. -- Many new terms added for annotating tools in the [BioToolsRegistry](bioregistry.cbs.dtu.dk). - - - -# EDAM\_1.2.owl - -This is the first version of EDAM now that is maintained in OWL format. The OBO-format version is generated from it by processing the OWL file. - -Highlights of changes: -- New references to MeSH. -- Edits to synonyms. -- About a dozen new formats. - -- Clean-ups for cleaner viewing in Protege and OLS: -- Removed problematic "has input" and "has output" axioms. -- Cleaner annotations on the ontology itself. - - - - -# EDAM\_1.1.obo - -Many additions (mostly in "Operation" and some in "Topic" branches) for "next generation" sequencing analysis. -EDAM now provides complete coverage of biological domains and bioinformatics methods from [SeqWiki](http://seqanswers.com/wiki/SEQanswers). -SeqWiki "biological domains" map to EDAM "Topic", SeqWiki "bioinformatics methods" map to EDAM "Operation". - - -# EDAM\_1.0.obo -The first release proper. - -General changes - - New style for concept IDs: 4 digit number, subontology namespace / subset("operation", "topic" etc) _e.g._ - "EDAM\_operation:0004" (new style) instead of "EDAM:0000004" (old style). - - - New relations ("has function", "is function of") are defined for use by annotators (they are not used in EDAM itself). 
- - - Synonyms are defined that define related or relevant concepts in many other ontologies and systems. Synonyms are added throughout but especially on top-level concepts ("Operation", "Data", "Format" and "Topic") and relations ("has input", "is input of", "has output", "is output of", "has topic", "is topic of", "has format", "is format of", "has function", "is function of"). - - - New concept attributes and modifiers have been added, most importantly: - - "{note}" for comments on synonyms and other attributes, _e.g._ -`synonym: "assembly" NARROW [SO:0001248] {note="Perhaps surprisingly, the definition of 'SO:assembly' is narrower than the 'SO:sequence\_assembly'."}`. - - "{since}" for annotation of version information, _e.g._ data of creation or obsoletion of a concept id: -`id: EDAM\_data:3165 {since=1.0}` or -`is\_obsolete: true {since=1.0}`. - -"Format" branch - - 10 new formats. - - - - -# EDAM\_beta13.obo -General changes - - "Identifier" branch moved from top-level to beneath "Data". The "identifier" namespace / definitions have been kept! - - Extensive revision of "Data", "Operation" and "Topic" branches to reduce clutter and ease navigation. - - Bottom-up clean up removing terms that are too fine-grained. Top-down clean up to add or remove terms to aid navigation. - - has\_topic (defined on "Data" and "Operation") replaces in\_topic. - - Duplicated relationships (child terms erroneously restating the inherited relationships of their parents) have been removed. - -"Data" branch - - All "Data" concepts now organised into 4 sub-concepts: - - "Core data" - Data that typically are the primary input or output of a tool or which correspond to entries from the primary (_e.g._ sequence or structural) biological databases. - - "Identifier" - A short numerical or textual label that identifies (typically uniquely) something such as data, a resource or a biological entity. 
- - "Parameter" - Typically a simple numerical or string value that controls the operation of a tool. - - "Report" - A human-readable collection of information that is distinct from primary (_e.g._ sequence or structural) biological data, including free text, annotation about biological entities and phenomena, computer-generated reports of analysis of primary data and metadata. - - "Report" concepts for sequences correspond better (without duplicating) established sequence feature keys. - -"Operation" branch - - Fewer concepts, simpler is\_a hierarchy - - "has\_input" and "has\_input" relations defined (on nearly all terms) - -"Format" branch - - "is\_format\_of" relations defined (for nearly all terms) - -"Topic" branch - - Improved term names and is_a hierarchy, reflecting whether topics concern a type of data, operation or are more general. - - New "Biological data resources" sub-branch includes common data resource concepts. - - Major revision! Too much to mention, so take a look :) - - - - - -# EDAM\_beta12.obo -General changes - - OBO subset definitions added - - Sub-ontologies / namespaces / subsets now are "topic", "data", "format", "identifier", "operation" - - Relation types now are "in\_topic", "has\_input", "has\_output", "is\_format\_of", "is\_identifier\_of" - - Many edits (to concepts and "is\_a" relations) to improve navigability in all sub-ontologies - -New "Identifier" sub-ontology - - Containing concepts which were under Data<-Identifier - - For fine-grained annotation of identifiers of data - -"Resource" sub-ontology obsoleted - - Most concepts merged into "Topic" sub-ontology (see below) - - All remaining concepts in "resource" namespace obsoleted - -Major revisions to "Topic" sub-ontology - - Concepts redefined as "...general bioinformatics subject or category, such as a field of study, data, processing, analysis or technology." 
- - For coarse-grained annotation of diverse resources - - Subsumes concepts from old "resource" sub-ontology (see above) - -EDAM-specific relations - - Many new relations added (most term statements which should define relations now do) - - Relations defined on parent only (not duplicated in children) - -"Format" sub-ontology - - About 50 new formats added - - - - - -# EDAM\_beta11.obo -- Entire "Entity" branch (all terms) made obsolete - -- Root term of "resource" namespace ("Data resource") renamed to "Resource" - -- Root term of "format" namespace ("Data format") renamed to "Format" - -- Corrections (2) removing duplicate IDs - - - - - -# EDAM\_beta10.obo -Major revision of "Operation" branch - - immensely simplified top level - - better hierarchy - -Major revision of "Data" branch - - simpler top-level - - better hierarchy - - new branches for "Protein data", "Nucleic acid data" - - new terms to aid navigation - - clean up "annotation" and "metadata" concepts - -Major revision of "Data format" branch - - better hierarchy - - children of "HTML format" are now (mostly) obsolete - - many new formats added - -Simplification of "Topic" branch - - concepts are now more strictly "fields of study" - -General changes - - term relations are now defined in one direction only - - more consistent usage of words in term names - - more intuitive term names (child names follow parent in style where possible) - - many term additions and deletions diff --git a/debtags.txt b/debtags.txt deleted file mode 100644 index cee5107..0000000 --- a/debtags.txt +++ /dev/null @@ -1,137 +0,0 @@ -# Candidate Deb Tags -# -# Comments in this file begin with a '#' -# Each line contains one EDAM term (the 'preferred label' of an EDAM concept) -# Suggested hierarchy ('is_a' relationships) are indicated by two spaces -# -# Biology topics -Molecular interactions, pathways and networks -Model organisms -Genotype and phenotype -Literature and reference -Biophysics -Imaging -Experimental design and 
studies -Taxonomy -Ecology - Biodiversity - Microbial ecology -Informatics - Bioinformatics - Cheminformatics - Medical informatics - Data management - Laboratory information management - Ontology and terminology - Data mining - Data visualisation -Computational biology - Function analysis - Sequence analysis - Sequence composition, complexity and repeats - Sequence sites, features and motifs - Sequence assembly - Probes and primers - Mapping - Sequencing - Structure analysis - Structure prediction - Molecular dynamics - Molecular docking - Molecular modelling - Phylogeny - Phylogenetics -Biochemistry - Carbohydrates - Lipids - Small molecules - Proteins - Protein properties - Protein interactions - Protein folding, stability and design - Protein structural motifs and surfaces - Protein modifications - Protein families - Membrane and lipoproteins - Enzymes - Protein folds and structural domains - Protein variants - Protein structure analysis - Nucleic acids - RNA splicing - Functional, regulatory and non-coding RNA - DNA replication and recombination - DNA polymorphism -Genetics - Epigenetics - Population genetics - Human genetics - Molecular genetics - Quantitative genetics - Gene structure - Genetic variation - Gene expression - Gene regulation - Gene families -Biology - Structural biology - Cell biology - Systems biology - Molecular biology - Evolutionary biology - Microbiology - Marine biology - Developmental biology - Neurobiology - Chemical biology -Omics - Functional genomics - Proteomics - Structural genomics - Phylogenomics - Genomics - Comparative genomics - Metabolomics - Epigenomics - Metagenomics - Transcriptomics - Pharmacogenomics - Phenomics -# Biomedical topics -Biomedical science -Pharmacology -Medicinal chemistry -Pathology -Immunology -Oncology -Toxicology -Embryology -Anatomy -Biotechnology -Physiology -Medicine -Public health and epidemiology -Respiratory medicine -Computational chemistry -Neurology -Cardiology -Biobank -Translational medicine 
-Biomarkers -Drug discovery -Drug development -Pharmacokinetics and pharmacodynamics -Molecular medicine -Regnerative medicine -Systems medicine -Geriatric medicine -Allergy, clinical immunology and immunotherapeutics. -Pain medicine -Haematology -Gastroenterology -Gynaecology and obstetrics -Hepatic and biliary medicine -Medical toxicology -Medical biotechnology -Personalized medicine - diff --git a/images/body-bg.jpg b/images/body-bg.jpg new file mode 100644 index 0000000..719fb88 Binary files /dev/null and b/images/body-bg.jpg differ diff --git a/images/download-button.png b/images/download-button.png new file mode 100644 index 0000000..c5ffb3a Binary files /dev/null and b/images/download-button.png differ diff --git a/images/github-button.png b/images/github-button.png new file mode 100644 index 0000000..cd41580 Binary files /dev/null and b/images/github-button.png differ diff --git a/images/header-bg.jpg b/images/header-bg.jpg new file mode 100644 index 0000000..d16497a Binary files /dev/null and b/images/header-bg.jpg differ diff --git a/images/highlight-bg.jpg b/images/highlight-bg.jpg new file mode 100644 index 0000000..355e089 Binary files /dev/null and b/images/highlight-bg.jpg differ diff --git a/images/sidebar-bg.jpg b/images/sidebar-bg.jpg new file mode 100644 index 0000000..536ead9 Binary files /dev/null and b/images/sidebar-bg.jpg differ diff --git a/index.html b/index.html new file mode 100644 index 0000000..619837c --- /dev/null +++ b/index.html @@ -0,0 +1,315 @@ + + + + + + + + + + + + + + edamontology by edamontology + + + +
+
+

edamontology

+

EDAM is an ontology of bioinformatics types of data, data identifiers, data formats, operations and topics.

+ View project on GitHub +
+
+ +
+
+
+

+What is EDAM?

+ +

EDAM is a simple ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM provides a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use.

+ +

You can browse EDAM at BioPortal.

+ +

Follow EDAM on Twitter at http://twitter.com/edamontology, and please use the #edamontology hashtag.

+ +

+Motivation

+ +

Bioinformaticians handle an increasingly large and diverse set of tools and data. Meanwhile, researchers demand ever more powerful and convenient means to organise, find, understand, compare, select, use and connect the available resources. These tasks often rely on consistent, machine-understandable descriptions of the underlying components, but these have been generally lacking in ad hoc resource descriptions. The urgent need - filled by EDAM - is for an ontology that unifies semantically the bioinformatics concepts in common use, provides the curator with a comprehensive controlled vocabulary that is broadly applicable, and supports new and powerful search, browse and query functions.

+ +

+Applications

+ +

EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources, including:

+ +
    +
  • Web services including REST and SOAP APIs
  • +
  • Application software
  • +
  • Tool collections and packages
  • +
  • Workflows / pipelines
  • +
  • Databases
  • +
  • XML Schemata and data objects
  • +
  • Data syntax and file formats
  • +
  • Web portals and pages
  • +
  • Resource catalogues
  • +
  • Training materials
  • +
  • Courses, tutorials, and other events
  • +
  • Areas of scientific interest
  • +
  • Documents, such as scientific publications
  • +
+ +

EDAM is also suitable for diverse applications, for example within workbenches and workflow-management systems, software distributions, and resource registries.

+ +

+Scope

+ +

EDAM includes 4 main sub-ontologies or 'branches' of concepts:

+ +
    +
  • +Data - “Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output.”
  • +
  • +Format - “A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere.”
  • +
  • +Operation - “A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs).”
  • +
  • +Topic - “A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. Topics have no clearly defined borders between each other.”
  • +
+ +

Noteworthy within the Data sub-ontology is:

+ +
    +
  • +Identifier - “A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things).”
  • +
+ +
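The branch of a concept can be read directly from its URI, because EDAM concept URIs in this repository take the form `http://edamontology.org/<branch>_<number>` (for example `http://edamontology.org/topic_3673`). The following is a minimal sketch of that idea; the function name `split_concept_uri` and the strict host check are illustrative assumptions, not part of EDAM itself.

```python
from typing import Tuple
from urllib.parse import urlparse

# The four EDAM branches; identifiers are concepts within the Data branch.
BRANCHES = {"data", "format", "operation", "topic"}


def split_concept_uri(uri: str) -> Tuple[str, int]:
    """Split an EDAM concept URI into (branch, numeric id).

    Hypothetical helper: raises ValueError for anything that does not
    look like http://edamontology.org/<branch>_<number>.
    """
    parsed = urlparse(uri)
    branch, sep, number = parsed.path.lstrip("/").partition("_")
    if (
        parsed.netloc != "edamontology.org"
        or not sep
        or branch not in BRANCHES
        or not number.isdigit()
    ):
        raise ValueError(f"not an EDAM concept URI: {uri!r}")
    return branch, int(number)
```

For instance, `split_concept_uri("http://edamontology.org/format_3665")` yields the branch `"format"` and the number `3665`.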

EDAM concepts figure

+ +

As a general rule, the Data, Format, and Operation branches include concepts strictly in the domain of bioinformatics and computational biology: concepts purely concerning biology, computer science, etc. are not included. The Topic branch, however, includes broader inter-disciplinary concepts from the biological and medical domains.

+ +

EDAM provides different semantic 'axes' for annotation. For example, annotation of a software tool might include:

+ +
    +
  • +Topic - general scientific domain the software serves, e.g. “Structural biology”
  • +
  • +Operation - the precise function of the tool, e.g. “Homology modelling”
  • +
  • +Data - the primary input and output, e.g. “Protein structure”
  • +
  • +Format - the supported format(s) of the input and output, e.g. “PDB format”
  • +
+ +
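The four annotation axes listed above can be sketched as a small record type. This is only an illustration: the class `ToolAnnotation`, its field names, and the tool name `example-homology-modeller` are hypothetical, and the labels are the example terms quoted in the list rather than a real registry entry.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToolAnnotation:
    """Hypothetical container for one tool's EDAM annotation."""

    name: str
    topics: List[str] = field(default_factory=list)      # Topic axis
    operations: List[str] = field(default_factory=list)  # Operation axis
    data: List[str] = field(default_factory=list)        # Data axis
    formats: List[str] = field(default_factory=list)     # Format axis

    def axes(self) -> Dict[str, List[str]]:
        """Return the annotation keyed by EDAM branch name."""
        return {
            "Topic": self.topics,
            "Operation": self.operations,
            "Data": self.data,
            "Format": self.formats,
        }

    def missing_axes(self) -> List[str]:
        """List the branches for which no term has been supplied."""
        return [axis for axis, terms in self.axes().items() if not terms]


# Example terms taken from the list above; the tool itself is made up.
modeller = ToolAnnotation(
    name="example-homology-modeller",
    topics=["Structural biology"],
    operations=["Homology modelling"],
    data=["Protein structure"],
    formats=["PDB format"],
)
```

A curation script could call `missing_axes()` to flag tools that have not yet been annotated on every axis.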

+Principles

+ +

EDAM strives to uphold a few founding principles including:

+ +
    +
  • +Quality - a controlled vocabulary that is moderated
  • +
  • +Openness - development in collaboration with the community
  • +
  • +Relevance - prioritising use-case-driven development towards comprehensive but practical coverage
  • +
  • +Practicality - practical utility is valued over ontological “strictness” or any metaphysical doctrine
  • +
  • +Clear scope - respecting the scope of other complementary, well-developed ontologies
  • +
  • +Familiarity - including only concepts that are well established, familiar and prevalent; jargon is discouraged
  • +
  • +Usability - conceptual hierarchy with sufficient richness but only necessary complexity
  • +
  • +Maintainability - development must be efficient, and EDAM must be kept up to date sustainably in the long term
  • +
+ +

EDAM is working towards implementing these principles fully and is open to suggestions.

+ +

+Architecture

+ +

EDAM has 3 components:

+ +
    +
  • +Concepts - All concepts have a name (the term or label) and definition. Further, a concept may have simple relations (see below) to other EDAM concepts, as well as other intrinsic properties, e.g. an identifier may have a regular expression defining its syntax.
  • +
  • +Hierarchy - Every concept (excluding top-level concepts) is related to one or more other concepts within the same branch by an is a (specialisation) relation. Hence EDAM has 4 primary hierarchies (for Data, Format, Operation, and Topic).
  • +
  • +Relations - Concepts are related by defined relation types (see figure below), which reflect well established or self-evident principles, and are used primarily to define internal consistency of EDAM. These have external applications too, e.g. annotations on the Semantic Web.
  • +
+ +
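The "is a" hierarchy described above can be sketched as a mapping from each concept to its parents, which allows a concept to have more than one parent. The entries below are a small excerpt of the Topic branch, copied from `SubClassOf` statements in the detailed change log; the names `IS_A` and `ancestors` are illustrative assumptions, and a real application would load the relations from the OWL file instead.

```python
from typing import Dict, List, Set

# child label -> parent labels ("is a" relations), excerpt of the Topic branch
IS_A: Dict[str, List[str]] = {
    "Exome sequencing": ["Sequencing"],
    "Whole genome sequencing": ["Sequencing"],
    "Methylated DNA immunoprecipitation": ["Immunoprecipitation experiment"],
    "Immunoprecipitation experiment": ["Laboratory techniques"],
    "Animal study": ["Experimental design and studies", "Laboratory animal science"],
    "Experimental design and studies": ["Topic"],
}


def ancestors(label: str, is_a: Dict[str, List[str]] = IS_A) -> Set[str]:
    """Collect every concept reachable from `label` via "is a" relations."""
    seen: Set[str] = set()
    stack = list(is_a.get(label, []))
    while stack:
        parent = stack.pop()
        if parent not in seen:  # guards against revisiting shared parents
            seen.add(parent)
            stack.extend(is_a.get(parent, []))
    return seen
```

Because only this excerpt is loaded, the walk stops at concepts whose own parents are not in the mapping (for example "Sequencing").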

EDAM relations figure

+ +

+Priorities

+ +

Our core priority is to be responsive to users of EDAM. Beyond that, we aim to establish a more sustainable footing for essential EDAM maintenance and development, including:

+ +
    +
  • Content review and refactoring to ensure structural and semantic simplicity and high usability
  • +
  • Community build-up and development including more formal, but agile, governance and maintenance models and mechanisms
  • +
  • Agile and responsive development of content in close collaboration with end-users and serving concrete use-cases
  • +
  • Technical refactoring to minimise the cost of routine housekeeping and content development
  • +
  • Implementation of tooling for routine maintenance to serve the needs of end-users, e.g. harvesting change requests and mappings between concepts
  • +
+ +

+Governance of EDAM

+ +

EDAM follows a model with five tiers of governance:

+ +
    +
  1. +EDAM Advisory Group advises the EDAM Core Developers on how best to uphold the EDAM principles and achieve its current aims. It represents the broad life science community, especially scientist end-users. Advisory Group members have no formal responsibilities, but are expected to advocate EDAM and actively offer constructive advice based on their practical experience, requirements and expertise. The EDAM Core Developers will respect this advice and give quarterly progress reports by email. The Core Developers aim to assemble with the Advisory Group virtually 2 or 3 times a year or as circumstances dictate, in meetings with open agenda and followed up with actions and notes on key recommendations. The Advisory Group will be reconstituted each year and the Steering Group (below) reserves the right to replace inactive members.
  2. +
  3. +

    EDAM Steering Group includes representatives of institutes that are committing significant resources to EDAM. Members of the Steering Group have four primary responsibilities:

    + +
      +
    • Agree strategy and set priorities in consultation with the Core Developers
    • +
    • Verify whether stated aims are coherent and wise
    • +
    • Monitor progress and provide feedback
    • +
    • Help arrange funding for EDAM
    • +
    +
  4. +
  5. +

    EDAM Core Developers are funded to develop EDAM and have GitHub commit rights. They are responsible for agreeing aims and general good practice, and for overseeing and approving developments and routine maintenance. The model is quasi-democratic with a leader (currently Jon Ison) having the final say where necessary. The leader ensures the Advisory Group, and all developers and contributors, are listened to and informed. The leader may be temporarily appointed from the core developers as necessary, e.g. during holidays. Core Developers must have the intent and some bandwidth to develop EDAM in the long term. They have 3 primary responsibilities:

    + +
      +
    • Understand and uphold the EDAM principles
    • +
    • Advocate EDAM
    • +
    • Develop EDAM as bandwidth permits
    • +
    +
  6. +
  7. Developers would not normally have GitHub commit rights long-term. They include anyone who makes significant technical or scientific contributions, by whatever means, but have none of the commitments or responsibilities of the core developers.

  8. +
  9. +Other contributors do not have GitHub commit rights, but can still make comments, contribute suggestions for new terms and other changes.
  10. +
+ +

+People

+ +

+EDAM Core Developers

+ +
    +
  • Jon Ison (CBS-DTU, DK) - lead developer +
  • +
  • Matúš Kalaš (University of Bergen, NO)
  • +
  • Hervé Ménager (Institut Pasteur, FR)
  • +
  • Marie Grosjean (IFB, FR)
  • +
+ +

+EDAM Steering Group

+ +
    +
  • Karel Berka (ELIXIR CZ)
  • +
  • Christophe Blanchet (ELIXIR FR)
  • +
  • Cath Brooksbank (ELIXIR EMBL-EBI)
  • +
  • Søren Brunak (ELIXIR DK)
  • +
  • Inge Jonassen (ELIXIR NO)
  • +
  • Steven Newhouse (ELIXIR EMBL-EBI)
  • +
  • Heinz Stockinger (ELIXIR CH)
  • +
  • Alfonso Valencia (ELIXIR ES)
  • +
+ +

+EDAM Advisory Group

+ +
    +
  • Frederik Coppens (ELIXIR BE)
  • +
  • Melissa Haendel (Oregon Health & Science University, USA)
  • +
  • Hans-Ioan Ienasescu (University of Copenhagen, DK)
  • +
  • Niclas Jareborg (ELIXIR SE)
  • +
  • Rafael Jimenez (ELIXIR HUB)
  • +
  • Anna-Lena Lamprecht (University of Potsdam, DE)
  • +
  • Jane Lomax (Sanger Institute, UK)
  • +
  • Hedi Peterson (ELIXIR EE)
  • +
+ +

+Contributors

+ +

Thanks to the many people who have contributed - if you're not listed below, please let us know!

+ +
    +
  • Dan Bolser (EMBL-EBI, UK)
  • +
  • Nathalie Conte (EMBL-EBI, UK)
  • +
  • Victor de la Torre (ELIXIR-ES)
  • +
  • Ray Fergerson (Stanford University, USA)
  • +
  • Carole Goble (ELIXIR-UK)
  • +
  • Simon Jupp (EMBL-EBI, UK)
  • +
  • Peter Løngreen (CBS-DTU, DK)
  • +
  • Allyson Lister (Newcastle University, UK)
  • +
  • Rodrigo Lopez (EMBL-EBI, UK)
  • +
  • James Malone (EMBL-EBI, UK)
  • +
  • Julie McMurry (EMBL-EBI, UK)
  • +
  • Hamish McWilliam (formerly EMBL-EBI, UK)
  • +
  • Helen Parkinson (EMBL-EBI, UK)
  • +
  • Steve Pettifer (University of Manchester, UK)
  • +
  • Kristoffer Rapacki (CBS-DTU, DK)
  • +
  • Peter Rice (Imperial College, UK)
  • +
  • Radka Svobodova (ELIXIR-CZ)
  • +
  • Mahmut Uludag (EMBL-EBI, UK)
  • +
  • Jiří Vondrášek (ELIXIR-CZ)
  • +
  • Gert Vriend (CMBI, NL)
  • +
  • Trish Whetzel (University of California, USA)
  • +
+ +

+Recent workshops (2014 - )

+ +

Thank you to all of the participants of various meetings and workshops organised by ELIXIR, BioMedBridges and others.

+ + + +

+Publication

+ +

If you use EDAM or any part of it, please cite:

+ +

Ison, J., Kalaš, M., Jonassen, I., Bolser, D., Uludag, M., McWilliam, H., Malone, J., Lopez, R., Pettifer, S. and Rice, P. (2013). EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats. Bioinformatics, 29, 1325-1332.

+ +

doi: 10.1093/bioinformatics/btt113 PMID: 23479348

+ +

This article is freely available (Open Access).

+ +

+Documentation and website

+ +

Full user documentation of the EDAM ontology is available at http://edamontology.org.

+ +

The edamontology.org site provides content negotiation with respect to the desired media type (i.e. format, e.g. HTML, OWL, etc.). This also applies to the URIs of EDAM concepts, which are in this way dereferenceable, concise, and stable. As an alternative to requesting the format in the HTTP header, users can retrieve the desired content from a web browser by appending a ?format=<desiredformat> query to the URL.
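
The two retrieval routes described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the site's documented client: the helper names `negotiated_request` and `format_query_url` are hypothetical, the concept URI is used only as an example, and the format value `owl` is an assumed placeholder for whatever values the site accepts. The media type `application/rdf+xml` is the one declared in the EDAM OWL header. The code only builds the request and the URL; it does not contact the server.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit
from urllib.request import Request


def negotiated_request(uri, media_type="application/rdf+xml"):
    """Build (but do not send) a request that asks for `media_type`
    through the HTTP Accept header, i.e. ordinary content negotiation."""
    return Request(uri, headers={"Accept": media_type})


def format_query_url(uri, fmt):
    """Return `uri` with a `format=<fmt>` query parameter appended,
    keeping any query parameters that are already present."""
    parts = urlsplit(uri)
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append(("format", fmt))
    return urlunsplit(parts._replace(query=urlencode(query)))


# Illustrative concept URI; "owl" is an assumed format value.
concept_uri = "http://edamontology.org/topic_0003"
request = negotiated_request(concept_uri)
browser_url = format_query_url(concept_uri, "owl")
```

Passing `request` to `urllib.request.urlopen` would perform the header-based retrieval, while `browser_url` is the kind of address one would paste into a browser.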

+
+ + +
+
+ + + + diff --git a/javascripts/main.js b/javascripts/main.js new file mode 100644 index 0000000..d8135d3 --- /dev/null +++ b/javascripts/main.js @@ -0,0 +1 @@ +console.log('This would be the main JS file.'); diff --git a/params.json b/params.json new file mode 100644 index 0000000..0b69238 --- /dev/null +++ b/params.json @@ -0,0 +1 @@ +{"name":"edamontology","tagline":"EDAM is an ontology of bioinformatics types of data, data identifiers, data formats, operations and topics.","body":"# What is EDAM?\r\nEDAM is a simple ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM provides a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use.\r\n\r\nYou can browse [EDAM at BioPortal](http://bioportal.bioontology.org/ontologies/EDAM/).\r\n\r\nSee http://twitter.com/edamontology ([follow](https://twitter.com/intent/follow?original_referer=https%3A%2F%2Fgithub.com%2Fedamontology%2Fedamontology®ion=follow_link&screen_name=edamontology&tw_p=followbutton)), please use [#edamontology](https://twitter.com/search?q=%23edamontology)\r\n\r\n# Motivation\r\nBioinformaticians handle an increasingly large and diverse set of tools and data. Meanwhile, researchers demand ever more powerful and convenient means to organise, find, understand, compare, select, use and connect the available resources. These tasks often rely on consistent, machine-understandable descriptions of the underlying components, but these have been generally lacking in _ad hoc_ resource descriptions. 
The urgent need - filled by EDAM - is for an ontology that unifies semantically the bioinformatics concepts in common use, provides the curator with a comprehensive controlled vocabulary that is broadly applicable, and supports new and powerful search, browse and query functions.\r\n\r\n# Applications \r\nEDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources, including:\r\n\r\n- Web services including REST and SOAP APIs\r\n- Application software\r\n- Tool collections and packages\r\n- Workflows / pipelines\r\n- Databases\r\n- XML Schemata and data objects\r\n- Data syntax and file formats\r\n- Web portals and pages\r\n- Resource catalogues\r\n- Training materials \r\n- Courses, tutorials, and other events\r\n- Areas of scientific interest\r\n- Documents, such as scientific publications\r\n\r\nEDAM is also suitable for diverse application including for example within workbenches and workflow-management systems, software distributions, and resource registries.\r\n\r\n# Scope\r\n\r\nEDAM includes 4 main sub-ontologies or 'branches' of concepts:\r\n\r\n- _**Data**_ - “Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output.”\r\n- _**Format**_ - “A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere.”\r\n- _**Operation**_ - “A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs).” \r\n- _**Topic**_ - “A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. 
Topics have no clearly defined borders between each other.”\r\n\r\nNoteworthy within the Data sub-ontology is:\r\n- _**Identifier**_ - “A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things).”\r\n\r\n![EDAM concepts figure](https://raw.githubusercontent.com/edamontology/edamontology/master/web/EDAMconcepts.png)\r\n\r\nAs a general rule, the _**Data**_, _**Format**_, and _**Operation**_ branches include concepts strictly in the domain of bioinformatics and computational biology: concepts purely concerning biology, computer science, _etc._ are not included. The _**Topic**_ branch, however, includes broader inter-disciplinary concepts from the biological and medical domains.\r\n\r\nEDAM provides different semantic 'axes' for annotation. For example, annotation of a software tool might include:\r\n\r\n- _Topic_ - general scientific domain the software serves, _e.g._ “Structural biology”\r\n- _Operation_ - the precise function of the tool, _e.g._ “Homology modelling”\r\n- _Data_ - the primary input and output, _e.g._ “Protein structure”\r\n- _Format_ - the supported format(s) of the input and output, _e.g._ “PDB format”\r\n\r\n# Principles\r\n\r\nEDAM strives to uphold a few founding principles including:\r\n\r\n- **Quality** - a controlled vocabulary that is moderated\r\n- **Openness** - development in collaboration with the community\r\n- **Relevance** - prioritising use-case-driven development towards comprehensive but practical coverage\r\n- **Practicality** - practical utility is valued over ontological “strictness” or any metaphysical doctrine\r\n- **Clear scope** - respecting the scope of other complementary, well-developed ontologies\r\n- **Familiarity** - including only concepts that are well established; familiar are prevalent and jargon is discouraged\r\n- **Usability** - conceptual hierarchy with sufficient richness but only necessary 
complexity\r\n- **Maintainability** - development must be efficient and sustainably up to date in the long term\r\n\r\nEDAM is working towards implementing these principles fully and is open to suggestions.\r\n\r\n# Architecture\r\nEDAM has 3 components:\r\n\r\n- _**Concepts**_ - All concepts have a name (the term or label) and definition. Further, a concept may have simple relations (see below) to other EDAM concepts, as well other intrinsic properties, _e.g._ an identifier may have a regular expression defining its syntax.\r\n- _**Hierarchy**_ - Every concept (excluding top-level concepts) is related to one or more other concepts within the same branch by an _**is a**_ (specialisation) relation. Hence EDAM has 4 primary hierarchies (for _Data_, _Format_, _Operation_, and _Topic_).\r\n- _**Relations**_ - Concepts are related by defined relation types (see figure below), which reflect well established or self-evident principles, and are used primarily to define internal consistency of EDAM. These have external applications too, e.g. annotations on the Semantic Web.\r\n\r\n![EDAM relations figure](https://raw.githubusercontent.com/edamontology/edamontology/master/web/EDAMrelations.png)\r\n\r\n# Priorities\r\n\r\nOur core priority is to be responsive to users of EDAM. 
Furthermore, to establish a more sustainable footing for essential EDAM maintenance and developments, including:\r\n- Content review and refactoring to ensure structural and semantic simplicity ensuring high usability\r\n- Community build-up and development including more formal, but agile, governance and maintenance models and mechanisms\r\n- Agile and responsive development of content in close collaboration with end-users and serving concrete use-cases\r\n- Technical refactoring to minimise the cost of routine housekeeping and content development \r\n- Implementation of tooling for routine maintenance to serve the needs of end-users, _e.g._ harvesting change requests and mappings between concepts\r\n\r\n# Governance of EDAM\r\n\r\nEDAM follows a model with five tiers of governance:\r\n\r\n1. **EDAM Advisory Group** advises the EDAM Core Developers on how best to uphold the EDAM principles and achieve its current aims. It represents the broad life science community, especially scientist end-users. Advisory Group members have no formal responsibilities, but are expected to advocate EDAM and actively offer constructive advice based on their practical experience, requirements and expertise. The EDAM Core Developers will respect this advice and give quarterly progress reports by email. The Core Developers aim to assemble with the Advisory Group virtually 2 or 3 times a year or as circumstances dictate, in meetings with open agenda and followed up with actions and notes on key recommendations. The Advisory Group will be reconstituted each year and the Steering Group (below) reserves the right to replace inactive members.\r\n2. **EDAM Steering Group** includes representatives of institutes that are committing significant resources to EDAM. 
Members of the Steering Group have four primary responsibilities:\r\n\r\n * Agree strategy and set priorities in consultation with the Core Developers\r\n * Verify whether stated aims are coherent and wise\r\n * Monitor progress and provide feedback\r\n * Help arrange funding for EDAM\r\n3. **EDAM Core Developers** are funded to develop EDAM and have GitHub commit rights. Responsible for agreeing aims and general good practice, overseeing and approving developments and routine maintenance. The model is quasi-democratic with a leader (currently Jon Ison) having the final say where necessary. The leader ensures the Advisory Group, and all developers and contributors, are listened to and informed. The leader may be temporarily appointed from the core developers as necessary, e.g. during holidays. Core Developers must have the intent and some bandwidth to develop EDAM in the long-term. They have 3 primary responsibilities: \r\n * Understand and uphold the EDAM principles\r\n * Advocate EDAM\r\n * Develop EDAM as bandwidth permits\r\n\r\n4. **Developers** would not normally have GitHub commit rights long-term. They include anyone who makes significant technical or scientific contributions, by whatever means, but have none of the commitments or responsibilities of the core developers.\r\n5. **Other contributors** do not have GitHub commit rights, but can still make comments, contribute suggestions for new terms and other changes. 
\r\n\r\n\r\n# People\r\n\r\n## EDAM Core Developers\r\n* Jon Ison (CBS-DTU, DK) *- lead developer*\r\n* Matúš Kalaš (University of Bergen, NO)\r\n* Hervé Ménager (Institut Pasteur, FR)\r\n* Marie Grosjean (IFB, FR)\r\n\r\n## EDAM Steering Group\r\n* Karel Berka (ELIXIR CZ)\r\n* Christophe Blanchet (ELIXIR FR)\r\n* Cath Brooksbank (ELIXIR EMBL-EBI)\r\n* Søren Brunak (ELIXIR DK)\r\n* Inge Jonassen (ELIXIR NO)\r\n* Steven Newhouse (ELIXIR EMBL-EBI)\r\n* Heinz Stockinger (ELIXIR CH)\r\n* Alfonso Valencia (ELIXIR ES)\r\n\r\n\r\n## EDAM Advisory Group\r\n* Frederik Coppens (ELIXIR BE)\r\n* Melissa Haendel (Oregon Health & Science University, USA)\r\n* Hans-Ioan Ienasescu (University of Copenhagen, DK)\r\n* Niclas Jareborg (ELIXIR SE)\r\n* Rafael Jimenez (ELIXIR HUB)\r\n* Anna-Lena Lamprecht (University of Potsdam, DE)\r\n* Jane Lomax (Sanger Institute, UK)\r\n* Hedi Peterson (ELIXIR EE)\r\n\r\n\r\n## Contributors\r\nThanks to the many people who have contributed - if you're not listed below, please let us know!\r\n\r\n* Dan Bolser (EMBL-EBI, UK)\r\n* Nathalie Conte (EMBL-EBI, UK)\r\n* Victor de la Torre (ELIXIR-ES)\r\n* Ray Fergerson (Stanford University, USA)\r\n* Carole Goble (ELIXIR-UK)\r\n* Simon Jupp (EMBL-EBI, UK)\r\n* Peter Løngreen (CBS-DTU, DK)\r\n* Allyson Lister (Newcastle University, UK)\r\n* Rodrigo Lopez (EMBL-EBI, UK)\r\n* James Malone (EMBL-EBI, UK)\r\n* Julie McMurry (EMBL-EBI, UK)\r\n* Hamish McWilliam (formerly EMBL-EBI, UK)\r\n* Helen Parkinson (EMBL-EBI, UK)\r\n* Steve Pettifer (University of Manchester, UK)\r\n* Kristoffer Rapacki (CBS-DTU, DK)\r\n* Peter Rice (Imperial College, UK)\r\n* Radka Svobodova (ELIXIR-CZ)\r\n* Mahmut Uludag (EMBL-EBI, UK)\r\n* Jiří Vondrášek (ELIXIR-CZ)\r\n* Gert Vriend (CMBI, NL)\r\n* Trish Whetzel (University of California, USA)\r\n\r\n\r\n\r\n# Recent workshops (2014 - )\r\nThank you to all of the participants of various meetings and workshops organised by ELIXIR, BioMedBridges and others.\r\n\r\n* [ELIXIR Curation 
Hackathon I : Registration of Tools & Data Services](https://docs.google.com/document/d/1s3J8msba1jHv18Ywz1wTH8UAjedD01IW-YfJNt17X_k/edit#heading=h.k2c28vnbr5jw)\r\n\r\n* [ELIXIR Technical Hackathon I: EDAM Development & Governance](https://docs.google.com/document/d/1CoDvzq6o9J4g5agEj6b9CugGGjWw8QzSU89FLeTjVww/edit#heading=h.k2c28vnbr5jw)\r\n\r\n* [ELIXIR, BioMedBridges & RDA Workshop: A common vocabulary to classify resources in the life sciences](http://www.biomedbridges.eu/news/workshop-common-vocabulary-classify-resources-life-sciences)\r\n\r\n\r\n\r\n# Publication\r\n\r\nIf you use EDAM or any part of it, please cite:\r\n\r\nIson, J., Kalaš, M., Jonassen, I., Bolser, D., Uludag, M., McWilliam, H., Malone, J., Lopez, R., Pettifer, S. and Rice, P. (2013). [EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.](http://bioinformatics.oxfordjournals.org/content/29/10/1325.full) _Bioinformatics_, **29**, 1325-1332.\r\n\r\ndoi: [10.1093/bioinformatics/btt113](http://dx.doi.org/10.1093/bioinformatics/btt113) PMID: [23479348](http://www.ncbi.nlm.nih.gov/pubmed/23479348)\r\n\r\nThis article is freely available (Open Access).\r\n\r\n# Documentation and website\r\n\r\nFull user documentation of the EDAM ontology is available at http://edamontology.org.\r\n\r\nThe _edamontology.org_ site provides content negotiation with respect to the desired media type (_i.e._ format, _e.g._ HTML, OWL, _etc._). This applies also to the URIs of EDAM concepts that are in this way dereferenceable, concise, and stable. Alternatively to requesting the format in the HTTP header, users can retrieve the desired content from a web browser by inserting _?format=\\<desiredformat\\>_ query into the URL.\r\n","google":"","note":"Don't delete this file! 
It's used internally to help with page regeneration."} \ No newline at end of file diff --git a/releases/EDAM_1.13.owl b/releases/EDAM_1.13.owl deleted file mode 100644 index 4941ab6..0000000 --- a/releases/EDAM_1.13.owl +++ /dev/null @@ -1,52818 +0,0 @@ - - - - - - - - - - - - - -]> - - - - - EDAM_topic http://edamontology.org/topic_ "EDAM topics" - 08:02:2016 22:15GMT - EDAM_operation http://edamontology.org/operation_ "EDAM operations" - formats "EDAM data formats" - EDAM - An ontology of bioinformatics topics, operations, types of data including identifiers, and data formats - Jon Ison, Matus Kalas, Hervé Ménager - identifiers "EDAM types of identifiers" - data "EDAM types of data" - relations "EDAM relations" - edam "EDAM" - EDAM editors: Jon Ison, Matus Kalas, and Herve Menager. Contributors: Inge Jonassen, Dan Bolser, Hamish McWilliam, Mahmut Uludag, James Malone, Rodrigo Lopez, Steve Pettifer, and Peter Rice. Contibutions from these projects: EMBRACE, ELIXIR, and BioMedBridges (EU); EMBOSS (BBSRC, UK); eSysbio, FUGE Bioinformatics Platform, and ELIXIR.NO/Norwegian Bioinformatics Platform (Research Council of Norway). See http://edamontology.org for documentation and licence. - 3702 - operations "EDAM operations" - EDAM http://edamontology.org/ "EDAM relations and concept properties" - application/rdf+xml - EDAM_data http://edamontology.org/data_ "EDAM types of data" - concept_properties "EDAM concept properties" - Jon Ison - Matúš Kalaš - EDAM_format http://edamontology.org/format_ "EDAM data formats" - 1.13 - topics "EDAM topics" - Hervé Ménager - EDAM is an ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM is a simple ontology - essentially a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use by curators, software developers and end-users. 
EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources. EDAM is also suitable for diverse application including for example within workbenches and workflow-management systems, software distributions, and resource registries. - - - - - - - - - - - - - - - Citation - concept_properties - 1.13 - Publication reference - Publication - 'Citation' concept property ('citation' metadata tag) contains a dereferenceable URI, preferrably including a DOI, pointing to a citeable publication of the given data format. - true - - - - - - - - Created in - Version in which a concept was created. - true - concept_properties - - - - - - - - Documentation - Specification - 'Documentation' trailing modifier (qualifier, 'documentation') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to a page with explanation, description, documentation, or specification of the given data format. - true - concept_properties - - - - - - - - Example - 'Example' concept property ('example' metadata tag) lists examples of valid values of types of identifiers (accessions). Applicable to some other types of data, too. - true - Separated by bar ('|'). - concept_properties - - - - - - - - File extension - 'File extension' concept property ('file_extension' metadata tag) lists examples of usual file extensions of formats. - Separated by bar ('|'), without a dot ('.') prefix, preferrably not all capital characters. - concept_properties - true - - - - - - - - isdebtag - When 'true', the term has been proposed or is supported within Debian Med as a tag. - concept_properties - true - - - - - - - - Media type - MIME type - 'Media type' trailing modifier (qualifier, 'media_type') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to a page specifying a media type of the given data format. 
- true - concept_properties - - - - - - - - - - - - - - Obsolete since - true - concept_properties - Version in which a concept was made obsolete. - - - - - - - - Regular expression - 'Regular expression' concept property ('regex' metadata tag) specifies the allowed values of types of identifiers (accessions). Applicable to some other types of data, too. - concept_properties - true - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - has format - "http://purl.obolibrary.org/obo/OBI_0000298" - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. Object B can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology that is a 'Format' or is in the role of a 'Format'. In EDAM, 'has_format' is not explicitly defined between EDAM concepts, only the inverse 'is_format_of'. - false - OBO_REL:is_a - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - false - false - edam - 'A has_format B' defines for the subject A, that it has the object B as its data format. - false - - - - - - - - - - has function - http://wsio.org/has_function - false - OBO_REL:is_a - OBO_REL:bearer_of - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is (or is in a role of) a function, or an entity outside of an ontology that is (or is in a role of) a function specification. 
In the scope of EDAM, 'has_function' serves only for relating annotated entities outside of EDAM with 'Operation' concepts. - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - true - 'A has_function B' defines for the subject A, that it has the object B as its function. - "http://purl.obolibrary.org/obo/OBI_0000306" - relations - false - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:bearer_of' is narrower in the sense that it only relates ontological categories (concepts) that are an 'independent_continuant' (snap:IndependentContinuant) with ontological categories that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant), and broader in the sense that it relates with any borne objects not just functions of the subject. - OBO_REL:bearer_of - - - - - true - In very unusual cases. - - - - - - - - - - has identifier - false - false - relations - OBO_REL:is_a - edam - 'A has_identifier B' defines for the subject A, that it has the object B as its identifier. - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is an 'Identifier', or an entity outside of an ontology that is an 'Identifier' or is in the role of an 'Identifier'. In EDAM, 'has_identifier' is not explicitly defined between EDAM concepts, only the inverse 'is_identifier_of'. - false - false - - - - - - - - - - has input - OBO_REL:has_participant - "http://purl.obolibrary.org/obo/OBI_0000293" - false - http://wsio.org/has_input - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_input' is explicitly defined between EDAM concepts ('Operation' 'has_input' 'Data'). 
The inverse, 'is_input_of', is not explicitly defined. - relations - OBO_REL:is_a - false - 'A has_input B' defines for the subject A, that it has the object B as a necessary or actual input or input argument. - false - true - edam - - - - - In very unusual cases. - true - - - - - OBO_REL:has_participant - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just inputs or input arguments of the subject. - - - - - - - - - - has output - http://wsio.org/has_output - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_output' is explicitly defined between EDAM concepts ('Operation' 'has_output' 'Data'). The inverse, 'is_output_of', is not explicitly defined. - edam - "http://purl.obolibrary.org/obo/OBI_0000299" - OBO_REL:is_a - relations - OBO_REL:has_participant - true - 'A has_output B' defines for the subject A, that it has the object B as a necessary or actual output or output argument. - false - false - false - - - - - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just outputs or output arguments of the subject. It is also not clear whether an output (result) actually participates in the process that generates it. - OBO_REL:has_participant - - - - - In very unusual cases. 
- true - - - - - - - - - - has topic - relations - true - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology that is a 'Topic' or is in the role of a 'Topic'. In EDAM, only 'has_topic' is explicitly defined between EDAM concepts ('Operation' or 'Data' 'has_topic' 'Topic'). The inverse, 'is_topic_of', is not explicitly defined. - false - 'A has_topic B' defines for the subject A, that it has the object B as its topic (A is in the scope of a topic B). - edam - OBO_REL:is_a - http://annotation-ontology.googlecode.com/svn/trunk/annotation-core.owl#hasTopic - false - "http://purl.obolibrary.org/obo/IAO_0000136" - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality - "http://purl.obolibrary.org/obo/OBI_0000298" - - - - - - - - - - - - In very unusual cases. - true - - - - - - - - - - is format of - false - OBO_REL:is_a - false - false - false - 'A is_format_of B' defines for the subject A, that it is a data format of the object B. - edam - relations - Subject A can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Format' or is in the role of a 'Format'. Object B can be any concept or entity outside of an ontology that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. In EDAM, only 'is_format_of' is explicitly defined between EDAM concepts ('Format' 'is_format_of' 'Data'). The inverse, 'has_format', is not explicitly defined. - OBO_REL:quality_of - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - - - - - - OBO_REL:quality_of - Is defined anywhere? Not in the 'unknown' version of RO. 
'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. - - - - - - - - - - is function of - Subject A can either be concept that is (or is in a role of) a function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) a function specification. Object B can be any concept or entity. Within EDAM itself, 'is_function_of' is not used. - OBO_REL:inheres_in - true - OBO_REL:is_a - false - 'A is_function_of B' defines for the subject A, that it is a function of the object B. - OBO_REL:function_of - edam - http://wsio.org/is_function_of - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - false - - - - - true - In very unusual cases. - - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:function_of' only relates subjects that are a 'function' (snap:Function) with objects that are an 'independent_continuant' (snap:IndependentContinuant), so for example no processes. It does not define explicitly that the subject is a function of the object. - OBO_REL:function_of - - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:inheres_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant) with ontological categories that are an 'independent_continuant' (snap:IndependentContinuant), and broader in the sense that it relates any borne subjects not just functions. 
- OBO_REL:inheres_in - - - - - - - - - - is identifier of - false - false - edam - false - relations - Subject A can either be a concept that is an 'Identifier', or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is an 'Identifier' or is in the role of an 'Identifier'. Object B can be any concept or entity outside of an ontology. In EDAM, only 'is_identifier_of' is explicitly defined between EDAM concepts (only 'Identifier' 'is_identifier_of' 'Data'). The inverse, 'has_identifier', is not explicitly defined. - 'A is_identifier_of B' defines for the subject A, that it is an identifier of the object B. - OBO_REL:is_a - false - - - - - - - - - - - is input of - false - http://wsio.org/is_input_of - relations - true - false - OBO_REL:participates_in - OBO_REL:is_a - "http://purl.obolibrary.org/obo/OBI_0000295" - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_input_of' is not explicitly defined between EDAM concepts, only the inverse 'has_input'. - false - 'A is_input_of B' defines for the subject A, that it as a necessary or actual input or input argument of the object B. - - - - - - true - In very unusual cases. - - - - - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just inputs or input arguments. 
- OBO_REL:participates_in - - - - - - - - - - is output of - OBO_REL:is_a - false - false - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_output_of' is not explicitly defined between EDAM concepts, only the inverse 'has_output'. - edam - false - 'A is_output_of B' defines for the subject A, that it is a necessary or actual output or output argument of the object B. - OBO_REL:participates_in - http://wsio.org/is_output_of - true - relations - "http://purl.obolibrary.org/obo/OBI_0000312" - - - - - - In very unusual cases. - true - - - - - OBO_REL:participates_in - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just outputs or output arguments. It is also not clear whether an output (result) actually participates in the process that generates it. - - - - - - - - - - is topic of - 'A is_topic_of B' defines for the subject A, that it is a topic of the object B (a topic A is the scope of B). - relations - OBO_REL:quality_of - false - true - false - Subject A can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Topic' or is in the role of a 'Topic'. Object B can be any concept or entity outside of an ontology. In EDAM, 'is_topic_of' is not explicitly defined between EDAM concepts, only the inverse 'has_topic'. 
- http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - OBO_REL:is_a - edam - - - - - - - - - - - - - OBO_REL:quality_of - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. - - - - - true - In very unusual cases. - - - - - - - - - - - - - - - Resource type - - beta12orEarlier - beta12orEarlier - A type of computational resource used in bioinformatics. - true - - - - - - - - - - Data - - - - - Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - http://semanticscience.org/resource/SIO_000088 - http://semanticscience.org/resource/SIO_000069 - "http://purl.obolibrary.org/obo/IAO_0000030" - "http://purl.obolibrary.org/obo/IAO_0000027" - Data set - Data record - beta12orEarlier - http://wsio.org/data_002 - http://purl.org/biotop/biotop.owl#DigitalEntity - http://www.ifomis.org/bfo/1.1/snap#Continuant - Datum - - - - - Data record - EDAM does not distinguish a data record (a tool-understandable information artefact) from data or datum (its content, the tool-understandable encoding of an information). - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Data set - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Datum - - - - - - - - - - Tool - - beta12orEarlier - A bioinformatics package or tool, e.g. a standalone application or web service. 
- beta12orEarlier - true - - - - - - - - - - Database - - A digital data archive typically based around a relational model but sometimes using an object-oriented, tree or graph-based model. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ontology - - - - - - - - beta12orEarlier - Ontologies - An ontology of biological or bioinformatics concepts and relations, a controlled vocabulary, structured glossary etc. - - - - - - - - - - Directory metadata - - 1.5 - A directory on disk from which files are read. - beta12orEarlier - true - - - - - - - - - - MeSH vocabulary - - beta12orEarlier - true - Controlled vocabulary from National Library of Medicine. The MeSH thesaurus is used to index articles in biomedical journals for the Medline/PubMED databases. - beta12orEarlier - - - - - - - - - - HGNC vocabulary - - beta12orEarlier - beta12orEarlier - Controlled vocabulary for gene names (symbols) from HUGO Gene Nomenclature Committee. - true - - - - - - - - - - UMLS vocabulary - - Compendium of controlled vocabularies for the biomedical domain (Unified Medical Language System). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Identifier - - - - - - - - - - http://semanticscience.org/resource/SIO_000115 - beta12orEarlier - ID - "http://purl.org/dc/elements/1.1/identifier" - http://wsio.org/data_005 - A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things). - - - - - - - Almost exact but limited to identifying resources. - - - - - - - - - - - Database entry - - beta12orEarlier - beta12orEarlier - An entry (retrievable via URL) from a biological database. - true - - - - - - - - - - Molecular mass - - Mass of a molecule. - beta12orEarlier - - - - - - - - - - Molecular charge - - Net charge of a molecule. 
- beta12orEarlier - PDBML:pdbx_formal_charge - - - - - - - - - - Chemical formula - - Chemical structure specification - A specification of a chemical structure. - beta12orEarlier - - - - - - - - - - QSAR descriptor - - A QSAR quantitative descriptor (name-value pair) of chemical structure. - QSAR descriptors have numeric values that quantify chemical information encoded in a symbolic representation of a molecule. They are used in quantitative structure activity relationship (QSAR) applications. Many subtypes of individual descriptors (not included in EDAM) cover various types of protein properties. - beta12orEarlier - - - - - - - - - - Raw sequence - - beta12orEarlier - A raw molecular sequence (string of characters) which might include ambiguity, unknown positions and non-sequence characters. - Non-sequence characters may be used for example for gaps and translation stop. - - - - - - - - - - Sequence record - - http://purl.bioontology.org/ontology/MSH/D058977 - beta12orEarlier - A molecular sequence and associated metadata. - SO:2000061 - - - - - - - - - - Sequence set - - A collection of multiple molecular sequences and associated metadata that do not (typically) correspond to molecular sequence database records or entries and which (typically) are derived from some analytical method. - This concept may be used for arbitrary sequence sets and associated data arising from processing. - beta12orEarlier - SO:0001260 - - - - - - - - - - Sequence mask character - - true - beta12orEarlier - 1.5 - A character used to replace (mask) other characters in a molecular sequence. - - - - - - - - - - Sequence mask type - - A label (text token) describing the type of sequence masking to perform. - Sequence masking is where specific characters or positions in a molecular sequence are masked (replaced) with an another (mask character). 
The mask type indicates what is masked, for example regions that are not of interest or which are information-poor including acidic protein regions, basic protein regions, proline-rich regions, low compositional complexity regions, short-periodicity internal repeats, simple repeats and low complexity regions. Masked sequences are used in database search to eliminate statistically significant but biologically uninteresting hits. - beta12orEarlier - 1.5 - true - - - - - - - - - - DNA sense specification - - DNA strand specification - beta12orEarlier - Strand - The strand of a DNA sequence (forward or reverse). - The forward or 'top' strand might specify a sequence is to be used as given, the reverse or 'bottom' strand specifying the reverse complement of the sequence is to be used. - - - - - - - - - - Sequence length specification - - true - A specification of sequence length(s). - beta12orEarlier - 1.5 - - - - - - - - - - Sequence metadata - - beta12orEarlier - Basic or general information concerning molecular sequences. - This is used for such things as a report including the sequence identifier, type and length. - 1.5 - true - - - - - - - - - - Sequence feature source - - This might be the name and version of a software tool, the name of a database, or 'curated' to indicate a manual annotation (made by a human). - How the annotation of a sequence feature (for example in EMBL or Swiss-Prot) was derived. - beta12orEarlier - - - - - - - - - - Sequence search results - - beta12orEarlier - Database hits (sequence) - - Sequence database hits - Sequence search hits - The score list includes the alignment score, percentage of the query sequence matched, length of the database sequence entry in this alignment, identifier of the database sequence entry, excerpt of the database sequence entry description etc. - A report of sequence hits and associated data from searching a database of sequences (for example a BLAST search). 
This will typically include a list of scores (often with statistical evaluation) and a set of alignments for the hits. - Sequence database search results - - - - - - - - - - Sequence signature matches - - Sequence motif matches - Protein secondary database search results - beta12orEarlier - Report on the location of matches in one or more sequences to profiles, motifs (conserved or functional patterns) or other signatures. - Sequence profile matches - This includes reports of hits from a search of a protein secondary or domain database. - Search results (protein secondary database) - - - - - - - - - - Sequence signature model - - Data files used by motif or profile methods. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence signature data - - - - - - - - beta12orEarlier - This can include metadata about a motif or sequence profile such as its name, length, technical details about the profile construction, and so on. - Data concerning specific or conserved pattern in molecular sequences and the classifiers used for their identification, including sequence motifs, profiles or other diagnostic element. - - - - - - - - - - Sequence alignment (words) - - 1.5 - beta12orEarlier - true - Sequence word alignment - Alignment of exact matches between subsequences (words) within two or more molecular sequences. - - - - - - - - - - Dotplot - - A dotplot of sequence similarities identified from word-matching or character comparison. - beta12orEarlier - - - - - - - - - - Sequence alignment - - - - - - - - http://en.wikipedia.org/wiki/Sequence_alignment - http://purl.bioontology.org/ontology/MSH/D016415 - http://semanticscience.org/resource/SIO_010066 - beta12orEarlier - Alignment of multiple molecular sequences. - - - - - - - - - - Sequence alignment parameter - - Some simple value controlling a sequence alignment (or similar 'match') operation. 
- true - 1.5 - beta12orEarlier - - - - - - - - - - Sequence similarity score - - A value representing molecular sequence similarity. - beta12orEarlier - - - - - - - - - - Sequence alignment metadata - - Report of general information on a sequence alignment, typically include a description, sequence identifiers and alignment score. - beta12orEarlier - true - 1.5 - - - - - - - - - - Sequence alignment report - - Use this for any computer-generated reports on sequence alignments, and for general information (metadata) on a sequence alignment, such as a description, sequence identifiers and alignment score. - An informative report of molecular sequence alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Profile-profile alignment - - beta12orEarlier - A profile-profile alignment (each profile typically representing a sequence alignment). - Sequence profile alignment - - - - - - - - - - Sequence-profile alignment - - beta12orEarlier - Alignment of one or more molecular sequence(s) to one or more sequence profile(s) (each profile typically representing a sequence alignment). - Data associated with the alignment might also be included, e.g. ranked list of best-scoring sequences and a graphical representation of scores. - - - - - - - - - - Sequence distance matrix - - beta12orEarlier - Moby:phylogenetic_distance_matrix - A matrix of estimated evolutionary distance between molecular sequences, such as is suitable for phylogenetic tree calculation. - Phylogenetic distance matrix - Methods might perform character compatibility analysis or identify patterns of similarity in an alignment or data matrix. - - - - - - - - - - Phylogenetic character data - - Basic character data from which a phylogenetic tree may be generated. 
- As defined, this concept would also include molecular sequences, microsatellites, polymorphisms (RAPDs, RFLPs, or AFLPs), restriction sites and fragments - http://www.evolutionaryontology.org/cdao.owl#Character - beta12orEarlier - - - - - - - - - - Phylogenetic tree - - - - - - - - Phylogeny - Moby:Tree - http://www.evolutionaryontology.org/cdao.owl#Tree - A phylogenetic tree is usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. See also 'Phylogenetic tree image'. - http://purl.bioontology.org/ontology/MSH/D010802 - Moby:phylogenetic_tree - The raw data (not just an image) from which a phylogenetic tree is directly generated or plotted, such as topology, lengths (in time or in expected amounts of variance) and a confidence interval for each length. - beta12orEarlier - Moby:myTree - - - - - - - - - - Comparison matrix - - beta12orEarlier - The comparison matrix might include matrix name, optional comment, height and width (or size) of matrix, an index row/column (of characters) and data rows/columns (of integers or floats). - Matrix of integer or floating point numbers for amino acid or nucleotide sequence comparison. - Substitution matrix - - - - - - - - - - Protein topology - - beta12orEarlier - beta12orEarlier - Predicted or actual protein topology represented as a string of protein secondary structure elements. - true - The location and size of the secondary structure elements and intervening loop regions is usually indicated. - - - - - - - - - - Protein features report (secondary structure) - - beta12orEarlier - 1.8 - true - Secondary structure (predicted or real) of a protein. - - - - - - - - - - Protein features report (super-secondary) - - 1.8 - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - true - beta12orEarlier - Super-secondary structure of protein sequence(s). 
- - - - - - - - - - Secondary structure alignment (protein) - - - Alignment of the (1D representations of) secondary structure of two or more proteins. - beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (protein) - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RNA secondary structure - - - - - - - - An informative report of secondary structure (predicted or real) of an RNA molecule. - This includes thermodynamically stable or evolutionarily conserved structures such as knots, pseudoknots etc. - Moby:RNAStructML - Secondary structure (RNA) - beta12orEarlier - - - - - - - - - - Secondary structure alignment (RNA) - - Moby:RNAStructAlignmentML - Alignment of the (1D representations of) secondary structure of two or more RNA molecules. - beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (RNA) - - true - beta12orEarlier - An informative report of RNA secondary structure alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Structure - - - - - - - - beta12orEarlier - Coordinate model - Structure data - The coordinate data may be predicted or real. - http://purl.bioontology.org/ontology/MSH/D015394 - 3D coordinate and associated data for a macromolecular tertiary (3D) structure or part of a structure. - - - - - - - - - - Tertiary structure record - - true - beta12orEarlier - beta12orEarlier - An entry from a molecular tertiary (3D) structure database. - - - - - - - - - - Structure database search results - - 1.8 - Results (hits) from searching a database of tertiary structure. - beta12orEarlier - true - - - - - - - - - - Structure alignment - - - - - - - - Alignment (superimposition) of molecular tertiary (3D) structures. 
- A tertiary structure alignment will include the untransformed coordinates of one macromolecule, followed by the second (or subsequent) structure(s) with all the coordinates transformed (by rotation / translation) to give a superposition. - beta12orEarlier - - - - - - - - - - Structure alignment report - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - An informative report of molecular tertiary structure alignment-derived data. - - - - - - - - - - Structure similarity score - - beta12orEarlier - A value representing molecular structure similarity, measured from structure alignment or some other type of structure comparison. - - - - - - - - - - Structural profile - - - - - - - - beta12orEarlier - 3D profile - Some type of structural (3D) profile or template (representing a structure or structure alignment). - Structural (3D) profile - - - - - - - - - - Structural (3D) profile alignment - - beta12orEarlier - Structural profile alignment - A 3D profile-3D profile alignment (each profile representing structures or a structure alignment). - - - - - - - - - - Sequence-3D profile alignment - - Sequence-structural profile alignment - 1.5 - An alignment of a sequence to a 3D profile (representing structures or a structure alignment). - beta12orEarlier - true - - - - - - - - - - Protein sequence-structure scoring matrix - - beta12orEarlier - Matrix of values used for scoring sequence-structure compatibility. - - - - - - - - - - Sequence-structure alignment - - beta12orEarlier - An alignment of molecular sequence to structure (from threading sequence(s) through 3D structure or representation of structure(s)). - - - - - - - - - - Amino acid annotation - - An informative report about a specific amino acid. - 1.4 - true - beta12orEarlier - - - - - - - - - - Peptide annotation - - 1.4 - true - An informative report about a specific peptide. 
- beta12orEarlier - - - - - - - - - - Protein report - - Gene product annotation - beta12orEarlier - An informative human-readable report about one or more specific protein molecules or protein structural domains, derived from analysis of primary (sequence or structural) data. - - - - - - - - - - Protein property - - Protein physicochemical property - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a protein molecule or model. - beta12orEarlier - Protein sequence statistics - Protein properties - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Protein structural motifs and surfaces - - true - 1.8 - 3D structural motifs in a protein. - beta12orEarlier - Protein 3D motifs - - - - - - - - - Protein domain classification - - true - Data concerning the classification of the sequences and/or structures of protein structural domain(s). - 1.5 - beta12orEarlier - - - - - - - - - - Protein features report (domains) - - true - structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - beta12orEarlier - - - - - - - - - - Protein architecture report - - 1.4 - An informative report on architecture (spatial arrangement of secondary structure) of a protein structure. - Protein property (architecture) - Protein structure report (architecture) - beta12orEarlier - true - - - - - - - - - - Protein folding report - - beta12orEarlier - A report on an analysis or model of protein folding properties, folding pathways, residues or sites that are key to protein folding, nucleation or stabilization centers etc. - true - 1.8 - - - - - - - - - - Protein features (mutation) - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
- Data on the effect of (typically point) mutation on protein folding, stability, structure and function. - true - beta12orEarlier - Protein property (mutation) - Protein structure report (mutation) - beta13 - Protein report (mutation) - - - - - - - - - - Protein interaction raw data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. - beta12orEarlier - - - - - - - - - - Protein interaction report - - - - - - - - Protein report (interaction) - beta12orEarlier - Protein interaction record - Residue interaction data - Atom interaction data - Protein non-covalent interactions report - An informative report on interactions (predicted or known) within or between a protein, structural domain or part of a protein. This includes intra- and inter-residue contacts and distances, as well as interactions with other proteins and non-protein entities such as nucleic acid, metal atoms, water, ions etc. - - - - - - - - - - - - - Protein family report - - - - - - - - beta12orEarlier - An informative report on a specific protein family or other classification or group of protein sequences or structures. - Protein family annotation - Protein classification data - - - - - - - - - - Vmax - - beta12orEarlier - The maximum initial velocity or rate of a reaction. It is the limiting velocity as substrate concentrations get very large. - - - - - - - - - - Km - - Km is the concentration (usually in Molar units) of substrate that leads to half-maximal velocity of an enzyme-catalysed reaction. - beta12orEarlier - - - - - - - - - - Nucleotide base annotation - - beta12orEarlier - true - An informative report about a specific nucleotide base. 
- 1.4 - - - - - - - - - - Nucleic acid property - - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a nucleic acid molecule. - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used as a placeholder for other, more specific types. - Nucleic acid physicochemical property - beta12orEarlier - - - - - - - - - - Codon usage data - - - - - - - - beta12orEarlier - Data derived from analysis of codon usage (typically a codon usage table) of DNA sequences. - This is a broad data type and is used as a placeholder for other, more specific types. - - - - - - - - - - Gene report - - Gene structure (report) - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - Gene and transcript structure (report) - Gene features report - Nucleic acid features (gene and transcript structure) - Moby:gene - This includes any report on a particular locus or gene. This might include the gene name, description, summary and so on. It can include details about the function of a gene, such as its encoded protein or a functional classification of the gene sequence according to the encoded protein(s). - Gene annotation - beta12orEarlier - Moby_namespace:Human_Readable_Description - Gene function (report) - Moby:GeneInfo - - - - - - - - - - Gene classification - - beta12orEarlier - true - A report on the classification of nucleic acid / gene sequences according to the functional classification of their gene products. - beta12orEarlier - - - - - - - - - - DNA variation - - stable, naturally occurring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. 
- true - 1.8 - beta12orEarlier - - - - - - - - - - Chromosome report - - beta12orEarlier - An informative report on a specific chromosome. - This includes basic information, e.g. chromosome number, length, karyotype features, chromosome sequence etc. - - - - - - - - - - Genotype/phenotype report - - An informative report on the set of genes (or allelic forms) present in an individual, organism or cell and associated with a specific physical characteristic, or a report concerning an organism's traits and phenotypes. - Genotype/phenotype annotation - beta12orEarlier - - - - - - - - - - Nucleic acid features report (primers) - - true - 1.8 - beta12orEarlier - PCR primers and hybridization oligos in a nucleic acid sequence. - - - - - - - - - - PCR experiment report - - true - beta12orEarlier - PCR experiments, e.g. quantitative real-time PCR. - 1.8 - - - - - - - - - - Sequence trace - - - Fluorescence trace data generated by an automated DNA sequencer, which can be interpreted as a molecular sequence (reads), given associated sequencing metadata such as base-call quality scores. - This is the raw data produced by a DNA sequencing machine. - beta12orEarlier - - - - - - - - - - Sequence assembly - - beta12orEarlier - An assembly of fragments of a (typically genomic) DNA sequence. - Contigs - http://en.wikipedia.org/wiki/Sequence_assembly - SO:0001248 - Typically, an assembly is a collection of contigs (for example ESTs and genomic DNA fragments) that are ordered, aligned and merged. Annotation of the assembled sequence might be included. - SO:0000353 - - - - - Perhaps surprisingly, the definition of 'SO:assembly' is narrower than the 'SO:sequence_assembly'. - SO:0001248 - - - - - - - - - - Radiation Hybrid (RH) scores - - beta12orEarlier - Radiation Hybrid (RH) scores are used in Radiation Hybrid mapping. - Radiation hybrid (RH) scores for one or more markers. 
- - - - - - - - - - Genetic linkage report - - beta12orEarlier - Gene annotation (linkage) - Linkage disequilibrium (report) - An informative report on the linkage of alleles. - This includes linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Gene expression profile - - Data quantifying the level of expression of (typically) multiple genes, derived for example from microarray experiments. - beta12orEarlier - Gene expression pattern - - - - - - - - - - Microarray experiment report - - true - microarray experiments including conditions, protocol, sample:data relationships etc. - 1.8 - beta12orEarlier - - - - - - - - - - Oligonucleotide probe data - - beta12orEarlier - beta13 - true - Data on oligonucleotide probes (typically for use with DNA microarrays). - - - - - - - - - - SAGE experimental data - - beta12orEarlier - true - Output from a serial analysis of gene expression (SAGE) experiment. - Serial analysis of gene expression (SAGE) experimental data - beta12orEarlier - - - - - - - - - - MPSS experimental data - - beta12orEarlier - Massively parallel signature sequencing (MPSS) data. - beta12orEarlier - Massively parallel signature sequencing (MPSS) experimental data - true - - - - - - - - - - SBS experimental data - - beta12orEarlier - beta12orEarlier - true - Sequencing by synthesis (SBS) experimental data - Sequencing by synthesis (SBS) data. - - - - - - - - - - Sequence tag profile (with gene assignment) - - beta12orEarlier - Tag to gene assignments (tag mapping) of SAGE, MPSS and SBS data. Typically this is the sequencing-based expression profile annotated with gene identifiers. - - - - - - - - - - Protein X-ray crystallographic data - - X-ray crystallography data. - beta12orEarlier - - - - - - - - - - Protein NMR data - - Protein nuclear magnetic resonance (NMR) raw data. 
- beta12orEarlier - - - - - - - - - - Protein circular dichroism (CD) spectroscopic data - - beta12orEarlier - Protein secondary structure from protein coordinate or circular dichroism (CD) spectroscopic data. - - - - - - - - - - Electron microscopy volume map - - - - - - - - beta12orEarlier - Volume map data from electron microscopy. - EM volume map - - - - - - - - - - Electron microscopy model - - - - - - - - beta12orEarlier - Annotation on a structural 3D model (volume map) from electron microscopy. - This might include the location in the model of the known features of a particular macromolecule. - - - - - - - - - - 2D PAGE image - - - - - - - - beta12orEarlier - Two-dimensional gel electrophoresis image - - - - - - - - - - Mass spectrometry spectra - - - - - - - - beta12orEarlier - Spectra from mass spectrometry. - - - - - - - - - - Peptide mass fingerprint - - - - - - - - - Peak list - Protein fingerprint - A molecular weight standard fingerprint is standard protonated molecular masses e.g. from trypsin (modified porcine trypsin, Promega) and keratin peptides. - A set of peptide masses (peptide mass fingerprint) from mass spectrometry. - beta12orEarlier - Molecular weights standard fingerprint - - - - - - - - - - Peptide identification - - - - - - - - Protein or peptide identifications with evidence supporting the identifications, typically from comparing a peptide mass fingerprint (from mass spectrometry) to a sequence database. - beta12orEarlier - - - - - - - - - - Pathway or network annotation - - beta12orEarlier - true - An informative report about a specific biological pathway or network, typically including a map (diagram) of the pathway. - beta12orEarlier - - - - - - - - - - Biological pathway map - - beta12orEarlier - true - A map (typically a diagram) of a biological pathway. 
- beta12orEarlier - - - - - - - - - - Data resource definition - - beta12orEarlier - true - 1.5 - A definition of a data resource serving one or more types of data, including metadata and links to the resource or data proper. - - - - - - - - - - Workflow metadata - - Basic information, annotation or documentation concerning a workflow (but not the workflow itself). - beta12orEarlier - - - - - - - - - - Mathematical model - - - - - - - - Biological model - beta12orEarlier - A biological model represented in mathematical terms. - - - - - - - - - - Statistical estimate score - - beta12orEarlier - A value representing estimated statistical significance of some observed data; typically sequence database hits. - - - - - - - - - - EMBOSS database resource definition - - beta12orEarlier - Resource definition for an EMBOSS database. - true - 1.5 - - - - - - - - - - Version information - - "http://purl.obolibrary.org/obo/IAO_0000129" - 1.5 - Development status / maturity may be part of the version information, for example in case of tools, standards, or some data records. - http://www.ebi.ac.uk/swo/maturity/SWO_9000061 - beta12orEarlier - Information on a version of software or data, for example name, version number and release date. - http://semanticscience.org/resource/SIO_000653 - true - http://usefulinc.com/ns/doap#Version - - - - - - - - - - Database cross-mapping - - beta12orEarlier - A mapping of the accession numbers (or other database identifier) of entries between (typically) two biological or biomedical databases. - The cross-mapping is typically a table where each row is an accession number and each column is a database being cross-referenced. The cells give the accession number or identifier of the corresponding entry in a database. If a cell in the table is not filled then no mapping could be found for the database. Additional information might be given on version, date etc. 
- - - - - - - - - - Data index - - - - - - - - An index of data of biological relevance. - beta12orEarlier - - - - - - - - - - Data index report - - - - - - - - A report of an analysis of an index of biological data. - Database index annotation - beta12orEarlier - - - - - - - - - - Database metadata - - Basic information on bioinformatics database(s) or other data sources such as name, type, description, URL etc. - beta12orEarlier - - - - - - - - - - Tool metadata - - beta12orEarlier - Basic information about one or more bioinformatics applications or packages, such as name, type, description, or other documentation. - - - - - - - - - - Job metadata - - beta12orEarlier - true - 1.5 - Moby:PDGJOB - Textual metadata on a submitted or completed job. - - - - - - - - - - User metadata - - beta12orEarlier - Textual metadata on a software author or end-user, for example a person or other software. - - - - - - - - - - Small molecule report - - - - - - - - Small molecule annotation - Chemical structure report - An informative report on a specific chemical compound. - beta12orEarlier - Chemical compound annotation - - - - - - - - - - Cell line report - - Organism strain data - Cell line annotation - Report on a particular strain of organism cell line including plants, virus, fungi and bacteria. The data typically includes strain number, organism type, growth conditions, source and so on. - beta12orEarlier - - - - - - - - - - Scent annotation - - beta12orEarlier - An informative report about a specific scent. - 1.4 - true - - - - - - - - - - Ontology term - - Ontology class name - beta12orEarlier - A term (name) from an ontology. - Ontology terms - - - - - - - - - - Ontology concept data - - beta12orEarlier - Ontology class metadata - Ontology term metadata - Data concerning or derived from a concept from a biological ontology. - - - - - - - - - - Keyword - - Phrases - Keyword(s) or phrase(s) used (typically) for text-searching purposes. 
- Boolean operators (AND, OR and NOT) and wildcard characters may be allowed. - Moby:QueryString - beta12orEarlier - Moby:BooleanQueryString - Moby:Wildcard_Query - Moby:Global_Keyword - Terms - Text - - - - - - - - - - Citation - - Bibliographic data that uniquely identifies a scientific article, book or other published material. - A bibliographic reference might include information such as authors, title, journal name, date and (possibly) a link to the abstract or full-text of the article if available. - Moby:GCP_SimpleCitation - Reference - Bibliographic reference - Moby:Publication - beta12orEarlier - - - - - - - - - - Article - - - - - - - - A document of scientific text, typically a full text article from a scientific journal. - beta12orEarlier - - - - - - - - - - Text mining report - - An abstract of the results of text mining. - beta12orEarlier - Text mining output - A text mining abstract will typically include an annotated a list of words or sentences extracted from one or more scientific articles. - - - - - - - - - - Entity identifier - - beta12orEarlier - true - beta12orEarlier - An identifier of a biological entity or phenomenon. - - - - - - - - - - Data resource identifier - - true - An identifier of a data resource. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Identifier (typed) - - beta12orEarlier - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - An identifier that identifies a particular type of data. - - - - - - - - - - - Tool identifier - - An identifier of a bioinformatics tool, e.g. an application or web service. 
- beta12orEarlier - - - - - - - - - - - Discrete entity identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a discrete entity (any biological thing with a distinct, discrete physical existence). - - - - - - - - - - Entity feature identifier - - true - beta12orEarlier - Name or other identifier of an entity feature (a physical part or region of a discrete biological entity, or a feature that can be mapped to such a thing). - beta12orEarlier - - - - - - - - - - Entity collection identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a collection of discrete biological entities. - - - - - - - - - - Phenomenon identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a physical, observable biological occurrence or event. - - - - - - - - - - Molecule identifier - - Name or other identifier of a molecule. - beta12orEarlier - - - - - - - - - - - Atom ID - - Atom identifier - Identifier (e.g. character symbol) of a specific atom. - beta12orEarlier - - - - - - - - - - - Molecule name - - - Name of a specific molecule. - beta12orEarlier - - - - - - - - - - - Molecule type - - For example, 'Protein', 'DNA', 'RNA' etc. - true - 1.5 - beta12orEarlier - A label (text token) describing the type a molecule. - Protein|DNA|RNA - - - - - - - - - - Chemical identifier - - true - beta12orEarlier - beta12orEarlier - Unique identifier of a chemical compound. - - - - - - - - - - Chromosome name - - - - - - - - - beta12orEarlier - Name of a chromosome. - - - - - - - - - - - Peptide identifier - - Identifier of a peptide chain. - beta12orEarlier - - - - - - - - - - - Protein identifier - - - - - - - - beta12orEarlier - Identifier of a protein. - - - - - - - - - - - Compound name - - - Chemical name - Unique name of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number - - beta12orEarlier - Unique registry number of a chemical compound. 
- - - - - - - - - - - Ligand identifier - - true - beta12orEarlier - Code word for a ligand, for example from a PDB file. - beta12orEarlier - - - - - - - - - - Drug identifier - - - - - - - - beta12orEarlier - Identifier of a drug. - - - - - - - - - - - Amino acid identifier - - - - - - - - Identifier of an amino acid. - beta12orEarlier - Residue identifier - - - - - - - - - - - Nucleotide identifier - - beta12orEarlier - Name or other identifier of a nucleotide. - - - - - - - - - - - Monosaccharide identifier - - beta12orEarlier - Identifier of a monosaccharide. - - - - - - - - - - - Chemical name (ChEBI) - - ChEBI chemical name - Unique name from Chemical Entities of Biological Interest (ChEBI) of a chemical compound. - beta12orEarlier - This is the recommended chemical name for use for example in database annotation. - - - - - - - - - - - Chemical name (IUPAC) - - IUPAC recommended name of a chemical compound. - IUPAC chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (INN) - - INN chemical name - beta12orEarlier - International Non-proprietary Name (INN or 'generic name') of a chemical compound, assigned by the World Health Organization (WHO). - - - - - - - - - - - Chemical name (brand) - - Brand name of a chemical compound. - Brand chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (synonymous) - - beta12orEarlier - Synonymous chemical name - Synonymous name of a chemical compound. - - - - - - - - - - - Chemical registry number (CAS) - - CAS chemical registry number - CAS registry number of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number (Beilstein) - - Beilstein chemical registry number - beta12orEarlier - Beilstein registry number of a chemical compound. - - - - - - - - - - - Chemical registry number (Gmelin) - - Gmelin chemical registry number - beta12orEarlier - Gmelin registry number of a chemical compound. 
- - - - - - - - - - - HET group name - - 3-letter code word for a ligand (HET group) from a PDB file, for example ATP. - Short ligand name - Component identifier code - beta12orEarlier - - - - - - - - - - - Amino acid name - - String of one or more ASCII characters representing an amino acid. - beta12orEarlier - - - - - - - - - - - Nucleotide code - - - beta12orEarlier - String of one or more ASCII characters representing a nucleotide. - - - - - - - - - - - Polypeptide chain ID - - - - - - - - beta12orEarlier - WHATIF: chain - Chain identifier - Identifier of a polypeptide chain from a protein. - PDBML:pdbx_PDB_strand_id - Protein chain identifier - PDB strand id - PDB chain identifier - This is typically a character (for the chain) appended to a PDB identifier, e.g. 1cukA - Polypeptide chain identifier - - - - - - - - - - - Protein name - - - Name of a protein. - beta12orEarlier - - - - - - - - - - - Enzyme identifier - - beta12orEarlier - Name or other identifier of an enzyme or record from a database of enzymes. - - - - - - - - - - - EC number - - [0-9]+\.-\.-\.-|[0-9]+\.[0-9]+\.-\.-|[0-9]+\.[0-9]+\.[0-9]+\.-|[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+ - EC code - Moby:EC_Number - An Enzyme Commission (EC) number of an enzyme. - EC - Moby:Annotated_EC_Number - beta12orEarlier - Enzyme Commission number - - - - - - - - - - - Enzyme name - - - Name of an enzyme. - beta12orEarlier - - - - - - - - - - - Restriction enzyme name - - Name of a restriction enzyme. - beta12orEarlier - - - - - - - - - - - Sequence position specification - - 1.5 - A specification (partial or complete) of one or more positions or regions of a molecular sequence or map. - beta12orEarlier - true - - - - - - - - - - Sequence feature ID - - - A unique identifier of molecular sequence feature, for example an ID of a feature that is unique within the scope of the GFF file. 
- beta12orEarlier - - - - - - - - - - - Sequence position - - WHATIF: number - WHATIF: PDBx_atom_site - beta12orEarlier - PDBML:_atom_site.id - SO:0000735 - A position of one or more points (base or residue) in a sequence, or part of such a specification. - - - - - - - - - - Sequence range - - beta12orEarlier - Specification of range(s) of sequence positions. - - - - - - - - - - Nucleic acid feature identifier - - beta12orEarlier - beta12orEarlier - Name or other identifier of an nucleic acid feature. - true - - - - - - - - - - Protein feature identifier - - Name or other identifier of a protein feature. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence feature key - - Sequence feature method - The type of a sequence feature, typically a term or accession from the Sequence Ontology, for example an EMBL or Swiss-Prot sequence feature key. - Sequence feature type - beta12orEarlier - A feature key indicates the biological nature of the feature or information about changes to or versions of the sequence. - - - - - - - - - - Sequence feature qualifier - - beta12orEarlier - Typically one of the EMBL or Swiss-Prot feature qualifiers. - Feature qualifiers hold information about a feature beyond that provided by the feature key and location. - - - - - - - - - - Sequence feature label - - Sequence feature name - Typically an EMBL or Swiss-Prot feature label. - A feature label identifies a feature of a sequence database entry. When used with the database name and the entry's primary accession number, it is a unique identifier of that feature. - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Feature Object - - - beta12orEarlier - UFO - The name of a sequence feature-containing entity adhering to the standard feature naming scheme used by all EMBOSS applications. - - - - - - - - - - Codon name - - beta12orEarlier - beta12orEarlier - String of one or more ASCII characters representing a codon. 
- true - - - - - - - - - - Gene identifier - - - - - - - - Moby:GeneAccessionList - An identifier of a gene, such as a name/symbol or a unique identifier of a gene in a database. - beta12orEarlier - - - - - - - - - - - Gene symbol - - Moby_namespace:Global_GeneSymbol - beta12orEarlier - Moby_namespace:Global_GeneCommonName - The short name of a gene; a single word that does not contain white space characters. It is typically derived from the gene name. - - - - - - - - - - - Gene ID (NCBI) - - - NCBI geneid - Gene identifier (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs:NCBI_Gene - Entrez gene ID - Gene identifier (Entrez) - http://www.geneontology.org/doc/GO.xrf_abbs:LocusID - An NCBI unique identifier of a gene. - NCBI gene ID - beta12orEarlier - - - - - - - - - - - Gene identifier (NCBI RefSeq) - - beta12orEarlier - true - beta12orEarlier - An NCBI RefSeq unique identifier of a gene. - - - - - - - - - - Gene identifier (NCBI UniGene) - - beta12orEarlier - An NCBI UniGene unique identifier of a gene. - beta12orEarlier - true - - - - - - - - - - Gene identifier (Entrez) - - An Entrez unique identifier of a gene. - beta12orEarlier - true - [0-9]+ - beta12orEarlier - - - - - - - - - - Gene ID (CGD) - - CGD ID - Identifier of a gene or feature from the CGD database. - beta12orEarlier - - - - - - - - - - - Gene ID (DictyBase) - - beta12orEarlier - Identifier of a gene from DictyBase. - - - - - - - - - - - Ensembl gene ID - - - beta12orEarlier - Gene ID (Ensembl) - Unique identifier for a gene (or other feature) from the Ensembl database. - - - - - - - - - - - Gene ID (SGD) - - - Identifier of an entry from the SGD database. - S[0-9]+ - SGD identifier - beta12orEarlier - - - - - - - - - - - Gene ID (GeneDB) - - Moby_namespace:GeneDB - GeneDB identifier - beta12orEarlier - [a-zA-Z_0-9\.-]* - Identifier of a gene from the GeneDB database. - - - - - - - - - - - TIGR identifier - - - beta12orEarlier - Identifier of an entry from the TIGR database. 
- - - - - - - - - - - TAIR accession (gene) - - - Gene:[0-9]{7} - beta12orEarlier - Identifier of an gene from the TAIR database. - - - - - - - - - - - Protein domain ID - - - - - - - - - beta12orEarlier - Identifier of a protein structural domain. - This is typically a character or string concatenated with a PDB identifier and a chain identifier. - - - - - - - - - - - SCOP domain identifier - - Identifier of a protein domain (or other node) from the SCOP database. - beta12orEarlier - - - - - - - - - - - CATH domain ID - - 1nr3A00 - beta12orEarlier - CATH domain identifier - Identifier of a protein domain from CATH. - - - - - - - - - - - SCOP concise classification string (sccs) - - A SCOP concise classification string (sccs) is a compact representation of a SCOP domain classification. - beta12orEarlier - An scss includes the class (alphabetical), fold, superfamily and family (all numerical) to which a given domain belongs. - - - - - - - - - - - SCOP sunid - - Unique identifier (number) of an entry in the SCOP hierarchy, for example 33229. - beta12orEarlier - A sunid uniquely identifies an entry in the SCOP hierarchy, including leaves (the SCOP domains) and higher level nodes including entries corresponding to the protein level. - sunid - SCOP unique identifier - 33229 - - - - - - - - - - - CATH node ID - - 3.30.1190.10.1.1.1.1.1 - CATH code - A code number identifying a node from the CATH database. - CATH node identifier - beta12orEarlier - - - - - - - - - - - Kingdom name - - The name of a biological kingdom (Bacteria, Archaea, or Eukaryotes). - beta12orEarlier - - - - - - - - - - - Species name - - The name of a species (typically a taxonomic group) of organism. - Organism species - beta12orEarlier - - - - - - - - - - - Strain name - - - beta12orEarlier - The name of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - URI - - A string of characters that name or otherwise identify a resource on the Internet. 
- URIs - beta12orEarlier - - - - - - - - - - Database ID - - - - - - - - An identifier of a biological or bioinformatics database. - Database identifier - beta12orEarlier - - - - - - - - - - - Directory name - - beta12orEarlier - The name of a directory. - - - - - - - - - - - File name - - The name (or part of a name) of a file (of any type). - beta12orEarlier - - - - - - - - - - - Ontology name - - - - - - - - - beta12orEarlier - Name of an ontology of biological or bioinformatics concepts and relations. - - - - - - - - - - - URL - - A Uniform Resource Locator (URL). - Moby:URL - Moby:Link - beta12orEarlier - - - - - - - - - - URN - - beta12orEarlier - A Uniform Resource Name (URN). - - - - - - - - - - LSID - - beta12orEarlier - LSIDs provide a standard way to locate and describe data. An LSID is represented as a Uniform Resource Name (URN) with the following format: URN:LSID:<Authority>:<Namespace>:<ObjectID>[:<Version>] - Life Science Identifier - A Life Science Identifier (LSID) - a unique identifier of some data. - - - - - - - - - - Database name - - - The name of a biological or bioinformatics database. - beta12orEarlier - - - - - - - - - - - Sequence database name - - The name of a molecular sequence database. - true - beta13 - beta12orEarlier - - - - - - - - - - Enumerated file name - - beta12orEarlier - The name of a file (of any type) with restricted possible values. - - - - - - - - - - - File name extension - - The extension of a file name. - A file extension is the characters appearing after the final '.' in the file name. - beta12orEarlier - - - - - - - - - - - File base name - - beta12orEarlier - The base name of a file. - A file base name is the file name stripped of its directory specification and extension. - - - - - - - - - - - QSAR descriptor name - - - - - - - - - beta12orEarlier - Name of a QSAR descriptor. - - - - - - - - - - - Database entry identifier - - true - This concept is required for completeness. It should never have child concepts. 
- beta12orEarlier - An identifier of an entry from a database where the same type of identifier is used for objects (data) of different semantic type. - beta12orEarlier - - - - - - - - - - Sequence identifier - - - - - - - - An identifier of molecular sequence(s) or entries from a molecular sequence database. - beta12orEarlier - - - - - - - - - - - Sequence set ID - - - - - - - - - An identifier of a set of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence signature identifier - - beta12orEarlier - beta12orEarlier - true - Identifier of a sequence signature (motif or profile) for example from a database of sequence patterns. - - - - - - - - - - - Sequence alignment ID - - - - - - - - - Identifier of a molecular sequence alignment, for example a record from an alignment database. - beta12orEarlier - - - - - - - - - - - Phylogenetic distance matrix identifier - - beta12orEarlier - Identifier of a phylogenetic distance matrix. - true - beta12orEarlier - - - - - - - - - - Phylogenetic tree ID - - - - - - - - - beta12orEarlier - Identifier of a phylogenetic tree for example from a phylogenetic tree database. - - - - - - - - - - - Comparison matrix identifier - - - - - - - - An identifier of a comparison matrix. - Substitution matrix identifier - beta12orEarlier - - - - - - - - - - - Structure ID - - - beta12orEarlier - A unique and persistent identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Structural (3D) profile ID - - - - - - - - - Structural profile identifier - Identifier or name of a structural (3D) profile or template (representing a structure or structure alignment). - beta12orEarlier - - - - - - - - - - - Structure alignment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of tertiary structure alignments. - - - - - - - - - - - Amino acid index ID - - - - - - - - - Identifier of an index of amino acid physicochemical and biochemical property data. 
- beta12orEarlier - - - - - - - - - - - Protein interaction ID - - - - - - - - - beta12orEarlier - Molecular interaction ID - Identifier of a report of protein interactions from a protein interaction database (typically). - - - - - - - - - - - Protein family identifier - - - - - - - - Protein secondary database record identifier - Identifier of a protein family. - beta12orEarlier - - - - - - - - - - - Codon usage table name - - - - - - - - - - - - - - - Unique name of a codon usage table. - beta12orEarlier - - - - - - - - - - - Transcription factor identifier - - - Identifier of a transcription factor (or a TF binding site). - beta12orEarlier - - - - - - - - - - - Experiment annotation ID - - - - - - - - beta12orEarlier - Identifier of an entry from a database of microarray data. - - - - - - - - - - - Electron microscopy model ID - - - - - - - - - Identifier of an entry from a database of electron microscopy data. - beta12orEarlier - - - - - - - - - - - Gene expression report ID - - - - - - - - - Accession of a report of gene expression (e.g. a gene expression profile) from a database. - beta12orEarlier - Gene expression profile identifier - - - - - - - - - - - Genotype and phenotype annotation ID - - - - - - - - - Identifier of an entry from a database of genotypes and phenotypes. - beta12orEarlier - - - - - - - - - - - Pathway or network identifier - - - - - - - - Identifier of an entry from a database of biological pathways or networks. - beta12orEarlier - - - - - - - - - - - Workflow ID - - - beta12orEarlier - Identifier of a biological or biomedical workflow, typically from a database of workflows. - - - - - - - - - - - Data resource definition ID - - beta12orEarlier - Identifier of a data type definition from some provider. - Data resource definition identifier - - - - - - - - - - - Biological model ID - - - - - - - - Biological model identifier - beta12orEarlier - Identifier of a mathematical model, typically an entry from a database. 
- - - - - - - - - - - Compound identifier - - - - - - - - beta12orEarlier - Chemical compound identifier - Identifier of an entry from a database of chemicals. - Small molecule identifier - - - - - - - - - - - Ontology concept ID - - - A unique (typically numerical) identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - Article ID - - - - - - - - - beta12orEarlier - Unique identifier of a scientific article. - Article identifier - - - - - - - - - - - FlyBase ID - - - Identifier of an object from the FlyBase database. - FB[a-zA-Z_0-9]{2}[0-9]{7} - beta12orEarlier - - - - - - - - - - - WormBase name - - - Name of an object from the WormBase database, usually a human-readable name. - beta12orEarlier - - - - - - - - - - - WormBase class - - beta12orEarlier - Class of an object from the WormBase database. - A WormBase class describes the type of object such as 'sequence' or 'protein'. - - - - - - - - - - - Sequence accession - - - beta12orEarlier - A persistent, unique identifier of a molecular sequence database entry. - Sequence accession number - - - - - - - - - - - Sequence type - - 1.5 - Sequence type might reflect the molecule (protein, nucleic acid etc) or the sequence itself (gapped, ambiguous etc). - A label (text token) describing a type of molecular sequence. - true - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Sequence Address - - - EMBOSS USA - beta12orEarlier - The name of a sequence-based entity adhering to the standard sequence naming scheme used by all EMBOSS applications. - - - - - - - - - - - Sequence accession (protein) - - - - - - - - Accession number of a protein sequence database entry. - Protein sequence accession number - beta12orEarlier - - - - - - - - - - - Sequence accession (nucleic acid) - - - - - - - - Accession number of a nucleotide sequence database entry. 
- beta12orEarlier - Nucleotide sequence accession number - - - - - - - - - - - RefSeq accession - - Accession number of a RefSeq database entry. - beta12orEarlier - RefSeq ID - (NC|AC|NG|NT|NW|NZ|NM|NR|XM|XR|NP|AP|XP|YP|ZP)_[0-9]+ - - - - - - - - - - - UniProt accession (extended) - - true - Accession number of a UniProt (protein sequence) database entry. May contain version or isoform number. - [A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9].[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9].[0-9]+|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]-[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]-[0-9]+ - beta12orEarlier - Q7M1G0|P43353-2|P01012.107 - 1.0 - - - - - - - - - - PIR identifier - - - - - - - - An identifier of PIR sequence database entry. - beta12orEarlier - PIR ID - PIR accession number - - - - - - - - - - - TREMBL accession - - beta12orEarlier - Identifier of a TREMBL sequence database entry. - true - 1.2 - - - - - - - - - - Gramene primary identifier - - beta12orEarlier - Gramene primary ID - Primary identifier of a Gramene database entry. - - - - - - - - - - - EMBL/GenBank/DDBJ ID - - Identifier of a (nucleic acid) entry from the EMBL/GenBank/DDBJ databases. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniGene) - - UniGene identifier - UniGene cluster id - UniGene ID - UniGene cluster ID - beta12orEarlier - A unique identifier of an entry (gene cluster) from the NCBI UniGene database. - - - - - - - - - - - dbEST accession - - - dbEST ID - Identifier of a dbEST database entry. - beta12orEarlier - - - - - - - - - - - dbSNP ID - - beta12orEarlier - dbSNP identifier - Identifier of a dbSNP database entry. - - - - - - - - - - - EMBOSS sequence type - - beta12orEarlier - true - See the EMBOSS documentation (http://emboss.sourceforge.net/) for a definition of what this includes. - beta12orEarlier - The EMBOSS type of a molecular sequence. 
- - - - - - - - - - EMBOSS listfile - - 1.5 - List of EMBOSS Uniform Sequence Addresses (EMBOSS listfile). - true - beta12orEarlier - - - - - - - - - - Sequence cluster ID - - - - - - - - An identifier of a cluster of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (COG) - - COG ID - beta12orEarlier - Unique identifier of an entry from the COG database. - - - - - - - - - - - Sequence motif identifier - - - - - - - - Identifier of a sequence motif, for example an entry from a motif database. - beta12orEarlier - - - - - - - - - - - Sequence profile ID - - - - - - - - - Identifier of a sequence profile. - beta12orEarlier - A sequence profile typically represents a sequence alignment. - - - - - - - - - - - ELM ID - - Identifier of an entry from the ELMdb database of protein functional sites. - beta12orEarlier - - - - - - - - - - - Prosite accession number - - beta12orEarlier - Accession number of an entry from the Prosite database. - PS[0-9]{5} - Prosite ID - - - - - - - - - - - HMMER hidden Markov model ID - - - - - - - - Unique identifier or name of a HMMER hidden Markov model. - beta12orEarlier - - - - - - - - - - - JASPAR profile ID - - beta12orEarlier - Unique identifier or name of a profile from the JASPAR database. - - - - - - - - - - - Sequence alignment type - - beta12orEarlier - 1.5 - true - Possible values include for example the EMBOSS alignment types, BLAST alignment types and so on. - A label (text token) describing the type of a sequence alignment. - - - - - - - - - - BLAST sequence alignment type - - true - beta12orEarlier - beta12orEarlier - The type of a BLAST sequence alignment. - - - - - - - - - - Phylogenetic tree type - - For example 'nj', 'upgmp' etc. - beta12orEarlier - true - A label (text token) describing the type of a phylogenetic tree. - 1.5 - nj|upgmp - - - - - - - - - - TreeBASE study accession number - - Accession number of an entry from the TreeBASE database. 
- beta12orEarlier - - - - - - - - - - - TreeFam accession number - - beta12orEarlier - Accession number of an entry from the TreeFam database. - - - - - - - - - - - Comparison matrix type - - 1.5 - true - beta12orEarlier - blosum|pam|gonnet|id - A label (text token) describing the type of a comparison matrix. - Substitution matrix type - For example 'blosum', 'pam', 'gonnet', 'id' etc. Comparison matrix type may be required where a series of matrices of a certain type are used. - - - - - - - - - - Comparison matrix name - - - - - - - - - beta12orEarlier - Substitution matrix name - See for example http://www.ebi.ac.uk/Tools/webservices/help/matrix. - Unique name or identifier of a comparison matrix. - - - - - - - - - - - PDB ID - - An identifier of an entry from the PDB database. - [a-zA-Z_0-9]{4} - PDBID - PDB identifier - beta12orEarlier - - - - - - - - - - - AAindex ID - - beta12orEarlier - Identifier of an entry from the AAindex database. - - - - - - - - - - - BIND accession number - - Accession number of an entry from the BIND database. - beta12orEarlier - - - - - - - - - - - IntAct accession number - - EBI\-[0-9]+ - beta12orEarlier - Accession number of an entry from the IntAct database. - - - - - - - - - - - Protein family name - - - beta12orEarlier - Name of a protein family. - - - - - - - - - - - InterPro entry name - - - - - - - - beta12orEarlier - Name of an InterPro entry, usually indicating the type of protein matches for that entry. - - - - - - - - - - - InterPro accession - - - - - - - - Primary accession number of an InterPro entry. - InterPro primary accession - Every InterPro entry has a unique accession number to provide a persistent citation of database records. - beta12orEarlier - InterPro primary accession number - IPR015590 - IPR[0-9]{6} - - - - - - - - - - - InterPro secondary accession - - - - - - - - Secondary accession number of an InterPro entry. 
- beta12orEarlier - InterPro secondary accession number - - - - - - - - - - - Gene3D ID - - beta12orEarlier - Unique identifier of an entry from the Gene3D database. - - - - - - - - - - - PIRSF ID - - PIRSF[0-9]{6} - beta12orEarlier - Unique identifier of an entry from the PIRSF database. - - - - - - - - - - - PRINTS code - - beta12orEarlier - PR[0-9]{5} - The unique identifier of an entry in the PRINTS database. - - - - - - - - - - - Pfam accession number - - PF[0-9]{5} - Accession number of a Pfam entry. - beta12orEarlier - - - - - - - - - - - SMART accession number - - Accession number of an entry from the SMART database. - beta12orEarlier - SM[0-9]{5} - - - - - - - - - - - Superfamily hidden Markov model number - - Unique identifier (number) of a hidden Markov model from the Superfamily database. - beta12orEarlier - - - - - - - - - - - TIGRFam ID - - TIGRFam accession number - Accession number of an entry (family) from the TIGRFam database. - beta12orEarlier - - - - - - - - - - - ProDom accession number - - A ProDom domain family accession number. - PD[0-9]+ - beta12orEarlier - ProDom is a protein domain family database. - - - - - - - - - - - TRANSFAC accession number - - beta12orEarlier - Identifier of an entry from the TRANSFAC database. - - - - - - - - - - - ArrayExpress accession number - - Accession number of an entry from the ArrayExpress database. - beta12orEarlier - [AEP]-[a-zA-Z_0-9]{4}-[0-9]+ - ArrayExpress experiment ID - - - - - - - - - - - PRIDE experiment accession number - - [0-9]+ - beta12orEarlier - PRIDE experiment accession number. - - - - - - - - - - - EMDB ID - - beta12orEarlier - Identifier of an entry from the EMDB electron microscopy database. - - - - - - - - - - - GEO accession number - - Accession number of an entry from the GEO database. - o^GDS[0-9]+ - beta12orEarlier - - - - - - - - - - - GermOnline ID - - beta12orEarlier - Identifier of an entry from the GermOnline database. 
- - - - - - - - - - - EMAGE ID - - Identifier of an entry from the EMAGE database. - beta12orEarlier - - - - - - - - - - - Disease ID - - - Accession number of an entry from a database of disease. - beta12orEarlier - - - - - - - - - - - HGVbase ID - - Identifier of an entry from the HGVbase database. - beta12orEarlier - - - - - - - - - - - HIVDB identifier - - true - beta12orEarlier - Identifier of an entry from the HIVDB database. - beta12orEarlier - - - - - - - - - - OMIM ID - - beta12orEarlier - [*#+%^]?[0-9]{6} - Identifier of an entry from the OMIM database. - - - - - - - - - - - KEGG object identifier - - - beta12orEarlier - Unique identifier of an object from one of the KEGG databases (excluding the GENES division). - - - - - - - - - - - Pathway ID (reactome) - - Identifier of an entry from the Reactome database. - Reactome ID - beta12orEarlier - REACT_[0-9]+(\.[0-9]+)? - - - - - - - - - - - Pathway ID (aMAZE) - - beta12orEarlier - aMAZE ID - true - beta12orEarlier - Identifier of an entry from the aMAZE database. - - - - - - - - - - Pathway ID (BioCyc) - - - BioCyc pathway ID - beta12orEarlier - Identifier of an pathway from the BioCyc biological pathways database. - - - - - - - - - - - Pathway ID (INOH) - - beta12orEarlier - INOH identifier - Identifier of an entry from the INOH database. - - - - - - - - - - - Pathway ID (PATIKA) - - Identifier of an entry from the PATIKA database. - PATIKA ID - beta12orEarlier - - - - - - - - - - - Pathway ID (CPDB) - - This concept refers to identifiers used by the databases collated in CPDB; CPDB identifiers are not independently defined. - CPDB ID - Identifier of an entry from the CPDB (ConsensusPathDB) biological pathways database, which is an identifier from an external database integrated into CPDB. - beta12orEarlier - - - - - - - - - - - Pathway ID (Panther) - - Identifier of a biological pathway from the Panther Pathways database. 
- beta12orEarlier - PTHR[0-9]{5} - Panther Pathways ID - - - - - - - - - - - MIRIAM identifier - - - - - - - - Unique identifier of a MIRIAM data resource. - MIR:00100005 - MIR:[0-9]{8} - beta12orEarlier - This is the identifier used internally by MIRIAM for a data type. - - - - - - - - - - - MIRIAM data type name - - - - - - - - beta12orEarlier - The name of a data type from the MIRIAM database. - - - - - - - - - - - MIRIAM URI - - - - - - - - - beta12orEarlier - The URI (URL or URN) of a data entity from the MIRIAM database. - identifiers.org synonym - urn:miriam:pubmed:16333295|urn:miriam:obo.go:GO%3A0045202 - A MIRIAM URI consists of the URI of the MIRIAM data type (PubMed, UniProt etc) followed by the identifier of an element of that data type, for example PMID for a publication or an accession number for a GO term. - - - - - - - - - - - MIRIAM data type primary name - - beta12orEarlier - The primary name of a MIRIAM data type is taken from a controlled vocabulary. - UniProt|Enzyme Nomenclature - The primary name of a data type from the MIRIAM database. - - - - - - UniProt|Enzyme Nomenclature - A protein entity has the MIRIAM data type 'UniProt', and an enzyme has the MIRIAM data type 'Enzyme Nomenclature'. - - - - - - - - - - MIRIAM data type synonymous name - - A synonymous name of a data type from the MIRIAM database. - A synonymous name for a MIRIAM data type taken from a controlled vocabulary. - beta12orEarlier - - - - - - - - - - - Taverna workflow ID - - beta12orEarlier - Unique identifier of a Taverna workflow. - - - - - - - - - - - Biological model name - - - beta12orEarlier - Name of a biological (mathematical) model. - - - - - - - - - - - BioModel ID - - Unique identifier of an entry from the BioModel database. 
- beta12orEarlier - (BIOMD|MODEL)[0-9]{10} - - - - - - - - - - - PubChem CID - - - [0-9]+ - PubChem compound accession identifier - Chemical structure specified in PubChem Compound Identification (CID), a non-zero integer identifier for a unique chemical structure. - beta12orEarlier - - - - - - - - - - - ChemSpider ID - - Identifier of an entry from the ChemSpider database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ChEBI ID - - Identifier of an entry from the ChEBI database. - ChEBI IDs - ChEBI identifier - CHEBI:[0-9]+ - beta12orEarlier - - - - - - - - - - - BioPax concept ID - - beta12orEarlier - An identifier of a concept from the BioPax ontology. - - - - - - - - - - - GO concept ID - - GO concept identifier - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a concept from The Gene Ontology. - - - - - - - - - - - MeSH concept ID - - beta12orEarlier - An identifier of a concept from the MeSH vocabulary. - - - - - - - - - - - HGNC concept ID - - beta12orEarlier - An identifier of a concept from the HGNC controlled vocabulary. - - - - - - - - - - - NCBI taxonomy ID - - - NCBI taxonomy identifier - [1-9][0-9]{0,8} - NCBI tax ID - A stable unique identifier for each taxon (for a species, a family, an order, or any other group in the NCBI taxonomy database. - 9662|3483|182682 - beta12orEarlier - - - - - - - - - - - Plant Ontology concept ID - - An identifier of a concept from the Plant Ontology (PO). - beta12orEarlier - - - - - - - - - - - UMLS concept ID - - An identifier of a concept from the UMLS vocabulary. - beta12orEarlier - - - - - - - - - - - FMA concept ID - - An identifier of a concept from Foundational Model of Anatomy. - FMA:[0-9]+ - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia). 
Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - - - - - - - - - - - EMAP concept ID - - beta12orEarlier - An identifier of a concept from the EMAP mouse ontology. - - - - - - - - - - - ChEBI concept ID - - beta12orEarlier - An identifier of a concept from the ChEBI ontology. - - - - - - - - - - - MGED concept ID - - beta12orEarlier - An identifier of a concept from the MGED ontology. - - - - - - - - - - - myGrid concept ID - - beta12orEarlier - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology acts provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - An identifier of a concept from the myGrid ontology. - - - - - - - - - - - PubMed ID - - PMID - [1-9][0-9]{0,8} - PubMed unique identifier of an article. - beta12orEarlier - 4963447 - - - - - - - - - - - DOI - - beta12orEarlier - (doi\:)?[0-9]{2}\.[0-9]{4}/.* - Digital Object Identifier - Digital Object Identifier (DOI) of a published article. - - - - - - - - - - - Medline UI - - beta12orEarlier - Medline UI (unique identifier) of an article. - The use of Medline UI has been replaced by the PubMed unique identifier. - Medline unique identifier - - - - - - - - - - - Tool name - - The name of a computer package, application, method or function. - beta12orEarlier - - - - - - - - - - - Tool name (signature) - - beta12orEarlier - The unique name of a signature (sequence classifier) method. 
- Signature methods from http://www.ebi.ac.uk/Tools/InterProScan/help.html#results include BlastProDom, FPrintScan, HMMPIR, HMMPfam, HMMSmart, HMMTigr, ProfileScan, ScanRegExp, SuperFamily and HAMAP. - - - - - - - - - - - Tool name (BLAST) - - This include 'blastn', 'blastp', 'blastx', 'tblastn' and 'tblastx'. - The name of a BLAST tool. - beta12orEarlier - BLAST name - - - - - - - - - - - Tool name (FASTA) - - beta12orEarlier - The name of a FASTA tool. - This includes 'fasta3', 'fastx3', 'fasty3', 'fastf3', 'fasts3' and 'ssearch'. - - - - - - - - - - - Tool name (EMBOSS) - - The name of an EMBOSS application. - beta12orEarlier - - - - - - - - - - - Tool name (EMBASSY package) - - The name of an EMBASSY package. - beta12orEarlier - - - - - - - - - - - QSAR descriptor (constitutional) - - A QSAR constitutional descriptor. - beta12orEarlier - QSAR constitutional descriptor - - - - - - - - - - QSAR descriptor (electronic) - - beta12orEarlier - A QSAR electronic descriptor. - QSAR electronic descriptor - - - - - - - - - - QSAR descriptor (geometrical) - - QSAR geometrical descriptor - A QSAR geometrical descriptor. - beta12orEarlier - - - - - - - - - - QSAR descriptor (topological) - - beta12orEarlier - QSAR topological descriptor - A QSAR topological descriptor. - - - - - - - - - - QSAR descriptor (molecular) - - A QSAR molecular descriptor. - QSAR molecular descriptor - beta12orEarlier - - - - - - - - - - Sequence set (protein) - - Any collection of multiple protein sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. - beta12orEarlier - - - - - - - - - - Sequence set (nucleic acid) - - beta12orEarlier - Any collection of multiple nucleotide sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. 
- - - - - - - - - - Sequence cluster - - - - - - - - A set of sequences that have been clustered or otherwise classified as belonging to a group including (typically) sequence cluster information. - The cluster might include sequences identifiers, short descriptions, alignment and summary information. - beta12orEarlier - - - - - - - - - - Psiblast checkpoint file - - beta12orEarlier - A Psiblast checkpoint file uses ASN.1 Binary Format and usually has the extension '.asn'. - beta12orEarlier - true - A file of intermediate results from a PSIBLAST search that is used for priming the search in the next PSIBLAST iteration. - - - - - - - - - - HMMER synthetic sequences set - - Sequences generated by HMMER package in FASTA-style format. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Proteolytic digest - - - - - - - - beta12orEarlier - A protein sequence cleaved into peptide fragments (by enzymatic or chemical cleavage) with fragment masses. - - - - - - - - - - Restriction digest - - Restriction digest fragments from digesting a nucleotide sequence with restriction sites using a restriction endonuclease. - SO:0000412 - beta12orEarlier - - - - - - - - - - PCR primers - - beta12orEarlier - Oligonucleotide primer(s) for PCR and DNA amplification, for example a minimal primer set. - - - - - - - - - - vectorstrip cloning vector definition file - - beta12orEarlier - true - File of sequence vectors used by EMBOSS vectorstrip application, or any file in same format. - beta12orEarlier - - - - - - - - - - Primer3 internal oligo mishybridizing library - - true - beta12orEarlier - A library of nucleotide sequences to avoid during hybridization events. Hybridization of the internal oligo to sequences in this library is avoided, rather than priming from them. The file is in a restricted FASTA format. 
- beta12orEarlier - - - - - - - - - - Primer3 mispriming library file - - true - A nucleotide sequence library of sequences to avoid during amplification (for example repetitive sequences, or possibly the sequences of genes in a gene family that should not be amplified. The file must is in a restricted FASTA format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - primersearch primer pairs sequence record - - true - beta12orEarlier - beta12orEarlier - File of one or more pairs of primer sequences, as used by EMBOSS primersearch application. - - - - - - - - - - Sequence cluster (protein) - - - Protein sequence cluster - The sequences are typically related, for example a family of sequences. - beta12orEarlier - A cluster of protein sequences. - - - - - - - - - - Sequence cluster (nucleic acid) - - - A cluster of nucleotide sequences. - Nucleotide sequence cluster - beta12orEarlier - The sequences are typically related, for example a family of sequences. - - - - - - - - - - Sequence length - - beta12orEarlier - The size (length) of a sequence, subsequence or region in a sequence, or range(s) of lengths. - - - - - - - - - - Word size - - Word size is used for example in word-based sequence database search methods. - Word length - 1.5 - Size of a sequence word. - true - beta12orEarlier - - - - - - - - - - Window size - - 1.5 - true - A window is a region of fixed size but not fixed position over a molecular sequence. It is typically moved (computationally) over a sequence during scoring. - beta12orEarlier - Size of a sequence window. - - - - - - - - - - Sequence length range - - true - Specification of range(s) of length of sequences. - beta12orEarlier - 1.5 - - - - - - - - - - Sequence information report - - Report on basic information about a molecular sequence such as name, accession number, type (nucleic or protein), length, description etc. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence property - - beta12orEarlier - An informative report about non-positional sequence features, typically a report on general molecular sequence properties derived from sequence analysis. - Sequence properties report - - - - - - - - - - Sequence features - - Sequence features report - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D058977 - SO:0000110 - This includes annotation of positional sequence features, organized into a standard feature table, or any other report of sequence features. General feature reports are a source of sequence feature table information although internal conversion would be required. - General sequence features - Annotation of positional features of molecular sequence(s), i.e. that can be mapped to position(s) in the sequence. - Features - Feature record - - - - - - - - - - Sequence features (comparative) - - Comparative data on sequence features such as statistics, intersections (and data on intersections), differences etc. - beta13 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - true - beta12orEarlier - - - - - - - - - - Sequence property (protein) - - true - A report of general sequence properties derived from protein sequence data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence property (nucleic acid) - - A report of general sequence properties derived from nucleotide sequence data. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence complexity report - - A report on sequence complexity, for example low-complexity or repeat regions in sequences. - beta12orEarlier - Sequence property (complexity) - - - - - - - - - - Sequence ambiguity report - - A report on ambiguity in molecular sequence(s). 
- Sequence property (ambiguity) - beta12orEarlier - - - - - - - - - - Sequence composition report - - beta12orEarlier - A report (typically a table) on character or word composition / frequency of a molecular sequence(s). - Sequence property (composition) - - - - - - - - - - Peptide molecular weight hits - - A report on peptide fragments of certain molecular weight(s) in one or more protein sequences. - beta12orEarlier - - - - - - - - - - Base position variability plot - - beta12orEarlier - A plot of third base position variability in a nucleotide sequence. - - - - - - - - - - Sequence composition table - - A table of character or word composition / frequency of a molecular sequence. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Base frequencies table - - - beta12orEarlier - A table of base frequencies of a nucleotide sequence. - - - - - - - - - - Base word frequencies table - - - A table of word composition of a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Amino acid frequencies table - - - Sequence composition (amino acid frequencies) - A table of amino acid frequencies of a protein sequence. - beta12orEarlier - - - - - - - - - - Amino acid word frequencies table - - - A table of amino acid word composition of a protein sequence. - Sequence composition (amino acid words) - beta12orEarlier - - - - - - - - - - DAS sequence feature annotation - - beta12orEarlier - Annotation of a molecular sequence in DAS format. - beta12orEarlier - true - - - - - - - - - - Feature table - - Sequence feature table - beta12orEarlier - Annotation of positional sequence features, organized into a standard feature table. - - - - - - - - - - Map - - - - - - - - DNA map - beta12orEarlier - A map of (typically one) DNA sequence annotated with positional or non-positional features. - - - - - - - - - - Nucleic acid features - - - An informative report on intrinsic positional features of a nucleotide sequence. 
- beta12orEarlier - Genome features - This includes nucleotide sequence feature annotation in any known sequence feature table format and any other report of nucleic acid features. - Genomic features - Nucleic acid feature table - Feature table (nucleic acid) - - - - - - - - - - Protein features - - - An informative report on intrinsic positional features of a protein sequence. - beta12orEarlier - This includes protein sequence feature annotation in any known sequence feature table format and any other report of protein features. - Feature table (protein) - Protein feature table - - - - - - - - - - Genetic map - - A map showing the relative positions of genetic markers in a nucleic acid sequence, based on estimation of non-physical distance such as recombination frequencies. - beta12orEarlier - A genetic (linkage) map indicates the proximity of two genes on a chromosome, whether two genes are linked and the frequency they are transmitted together to an offspring. They are limited to genetic markers of traits observable only in whole organisms. - Linkage map - Moby:GeneticMap - - - - - - - - - - Sequence map - - A sequence map typically includes annotation on significant subsequences such as contigs, haplotypes and genes. The contigs shown will (typically) be a set of small overlapping clones representing a complete chromosomal segment. - beta12orEarlier - A map of genetic markers in a contiguous, assembled genomic sequence, with the sizes and separation of markers measured in base pairs. - - - - - - - - - - Physical map - - A map of DNA (linear or circular) annotated with physical features or landmarks such as restriction sites, cloned DNA fragments, genes or genetic markers, along with the physical distances between them. - Distance in a physical map is measured in base pairs. A physical map might be ordered relative to a reference map (typically a genetic map) in the process of genome sequencing. 
- beta12orEarlier - - - - - - - - - - Sequence signature map - - true - Image of a sequence with matches to signatures, motifs or profiles. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cytogenetic map - - beta12orEarlier - A map showing banding patterns derived from direct observation of a stained chromosome. - Cytologic map - Chromosome map - Cytogenic map - This is the lowest-resolution physical map and can provide only rough estimates of physical (base pair) distances. Like a genetic map, they are limited to genetic markers of traits observable only in whole organisms. - - - - - - - - - - DNA transduction map - - beta12orEarlier - A gene map showing distances between loci based on relative cotransduction frequencies. - - - - - - - - - - Gene map - - Sequence map of a single gene annotated with genetic features such as introns, exons, untranslated regions, polyA signals, promoters, enhancers and (possibly) mutations defining alleles of a gene. - beta12orEarlier - - - - - - - - - - Plasmid map - - Sequence map of a plasmid (circular DNA). - beta12orEarlier - - - - - - - - - - Genome map - - beta12orEarlier - Sequence map of a whole genome. - - - - - - - - - - Restriction map - - - Image of the restriction enzyme cleavage sites (restriction sites) in a nucleic acid sequence. - beta12orEarlier - - - - - - - - - - InterPro compact match image - - beta12orEarlier - Image showing matches between protein sequence(s) and InterPro Entries. - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Each protein is represented as a scaled horizontal line with colored bars indicating the position of the matches. - beta12orEarlier - true - - - - - - - - - - InterPro detailed match image - - beta12orEarlier - beta12orEarlier - Image showing detailed information on matches between protein sequence(s) and InterPro Entries. 
- The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. - true - - - - - - - - - - InterPro architecture image - - beta12orEarlier - beta12orEarlier - true - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Domain architecture is shown as a series of non-overlapping domains in the protein. - Image showing the architecture of InterPro domains in a protein sequence. - - - - - - - - - - SMART protein schematic - - true - beta12orEarlier - beta12orEarlier - SMART protein schematic in PNG format. - - - - - - - - - - GlobPlot domain image - - beta12orEarlier - beta12orEarlier - true - Images based on GlobPlot prediction of intrinsic disordered regions and globular domains in protein sequences. - - - - - - - - - - Sequence motif matches - - beta12orEarlier - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more sequences. - 1.8 - true - - - - - - - - - - Sequence features (repeats) - - beta12orEarlier - true - 1.5 - Repeat sequence map - The report might include derived data map such as classification, annotation, organization, periodicity etc. - Location of short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. - - - - - - - - - - Gene and transcript structure (report) - - 1.5 - beta12orEarlier - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - true - - - - - - - - - - Mobile genetic elements - - true - beta12orEarlier - regions of a nucleic acid sequence containing mobile genetic elements. - 1.8 - - - - - - - - - - Nucleic acid features report (PolyA signal or site) - - true - regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. 
- 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features (quadruplexes) - - true - 1.5 - A report on quadruplex-forming motifs in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Nucleic acid features report (CpG island and isochore) - - 1.8 - CpG rich regions (isochores) in a nucleotide sequence. - beta12orEarlier - true - - - - - - - - - - Nucleic acid features report (restriction sites) - - beta12orEarlier - true - 1.8 - restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - - - - - - - - - - Nucleosome exclusion sequences - - beta12orEarlier - true - Report on nucleosome formation potential or exclusion sequence(s). - 1.8 - - - - - - - - - - Nucleic acid features report (splice sites) - - splice sites in a nucleotide sequence or alternative RNA splicing events. - beta12orEarlier - true - 1.8 - - - - - - - - - - Nucleic acid features report (matrix/scaffold attachment sites) - - 1.8 - matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Gene features (exonic splicing enhancer) - - beta12orEarlier - beta13 - true - A report on exonic splicing enhancers (ESE) in an exon. - - - - - - - - - - Nucleic acid features (microRNA) - - true - beta12orEarlier - A report on microRNA sequence (miRNA) or precursor, microRNA targets, miRNA binding sites in an RNA sequence etc. - 1.5 - - - - - - - - - - Gene features report (operon) - - true - operons (operators, promoters and genes) from a bacterial genome. - 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features report (promoters) - - 1.8 - whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Coding region - - beta12orEarlier - protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. 
- 1.8 - true - - - - - - - - - - Gene features (SECIS element) - - beta12orEarlier - beta13 - A report on selenocysteine insertion sequence (SECIS) element in a DNA sequence. - true - - - - - - - - - - Transcription factor binding sites - - transcription factor binding sites (TFBS) in a DNA sequence. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein features (sites) - - true - beta12orEarlier - Use this concept for collections of specific sites which are not necessarily contiguous, rather than contiguous stretches of amino acids. - beta12orEarlier - A report on predicted or known key residue positions (sites) in a protein sequence, such as binding or functional sites. - - - - - - - - - - Protein features report (signal peptides) - - true - signal peptides or signal peptide cleavage sites in protein sequences. - 1.8 - beta12orEarlier - - - - - - - - - - Protein features report (cleavage sites) - - true - 1.8 - cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. - beta12orEarlier - - - - - - - - - - Protein features (post-translation modifications) - - true - beta12orEarlier - post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein features report (active sites) - - 1.8 - true - beta12orEarlier - catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein features report (binding sites) - - beta12orEarlier - ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids. - true - 1.8 - - - - - - - - - - Protein features (epitopes) - - A report on antigenic determinant sites (epitopes) in proteins, from sequence and / or structural data. - beta13 - beta12orEarlier - Epitope mapping is commonly done during vaccine design. 
- true - - - - - - - - - - Protein features report (nucleic acid binding sites) - - true - beta12orEarlier - 1.8 - RNA and DNA-binding proteins and binding sites in protein sequences. - - - - - - - - - - MHC Class I epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on epitopes that bind to MHC class I molecules. - - - - - - - - - - MHC Class II epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on predicted epitopes that bind to MHC class II molecules. - - - - - - - - - - Protein features (PEST sites) - - beta12orEarlier - A report or plot of PEST sites in a protein sequence. - true - beta13 - 'PEST' motifs target proteins for proteolytic degradation and reduce the half-lives of proteins dramatically. - - - - - - - - - - Sequence database hits scores list - - Scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database hits alignments list - - beta12orEarlier - Alignments from a sequence database search (for example a BLAST search). - beta12orEarlier - true - - - - - - - - - - Sequence database hits evaluation data - - beta12orEarlier - A report on the evaluation of the significance of sequence similarity scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - - - - - - - - - - MEME motif alphabet - - Alphabet for the motifs (patterns) that MEME will search for. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - MEME background frequencies file - - MEME background frequencies file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - MEME motifs directive file - - beta12orEarlier - true - File of directives for ordering and spacing of MEME motifs. - beta12orEarlier - - - - - - - - - - Dirichlet distribution - - Dirichlet distribution used by hidden Markov model analysis programs. 
- beta12orEarlier - - - - - - - - - - HMM emission and transition counts - - Emission and transition counts of a hidden Markov model, generated once HMM has been determined, for example after residues/gaps have been assigned to match, delete and insert states. - true - 1.4 - beta12orEarlier - - - - - - - - - - - Regular expression - - Regular expression pattern. - beta12orEarlier - - - - - - - - - - Sequence motif - - - - - - - - beta12orEarlier - Any specific or conserved pattern (typically expressed as a regular expression) in a molecular sequence. - - - - - - - - - - Sequence profile - - - - - - - - Some type of statistical model representing a (typically multiple) sequence alignment. - http://semanticscience.org/resource/SIO_010531 - beta12orEarlier - - - - - - - - - - Protein signature - - An informative report about a specific or conserved protein sequence pattern. - InterPro entry - Protein repeat signature - Protein region signature - Protein site signature - beta12orEarlier - Protein family signature - Protein domain signature - - - - - - - - - - Prosite nucleotide pattern - - A nucleotide regular expression pattern from the Prosite database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Prosite protein pattern - - A protein regular expression pattern from the Prosite database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Position frequency matrix - - beta12orEarlier - PFM - A profile (typically representing a sequence alignment) that is a simple matrix of nucleotide (or amino acid) counts per position. - - - - - - - - - - Position weight matrix - - PWM - beta12orEarlier - A profile (typically representing a sequence alignment) that is weighted matrix of nucleotide (or amino acid) counts per position. - Contributions of individual sequences to the matrix might be uneven (weighted). 
- - - - - - - - - - Information content matrix - - beta12orEarlier - ICM - A profile (typically representing a sequence alignment) derived from a matrix of nucleotide (or amino acid) counts per position that reflects information content at each position. - - - - - - - - - - Hidden Markov model - - HMM - beta12orEarlier - A hidden Markov model representation of a set or alignment of sequences. - - - - - - - - - - Fingerprint - - beta12orEarlier - One or more fingerprints (sequence classifiers) as used in the PRINTS database. - - - - - - - - - - Domainatrix signature - - A protein signature of the type used in the EMBASSY Signature package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - HMMER NULL hidden Markov model - - beta12orEarlier - beta12orEarlier - true - NULL hidden Markov model representation used by the HMMER package. - - - - - - - - - - Protein family signature - - Protein family signatures cover all domains in the matching proteins and span >80% of the protein length and with no adjacent protein domain signatures or protein region signatures. - beta12orEarlier - true - 1.5 - A protein family signature (sequence classifier) from the InterPro database. - - - - - - - - - - Protein domain signature - - beta12orEarlier - 1.5 - true - A protein domain signature (sequence classifier) from the InterPro database. - Protein domain signatures identify structural or functional domains or other units with defined boundaries. - - - - - - - - - - Protein region signature - - A protein region signature (sequence classifier) from the InterPro database. - true - beta12orEarlier - 1.5 - A protein region signature defines a region which cannot be described as a protein family or domain signature. - - - - - - - - - - Protein repeat signature - - true - 1.5 - A protein repeat signature is a repeated protein motif, that is not in single copy expected to independently fold into a globular domain. 
- beta12orEarlier - A protein repeat signature (sequence classifier) from the InterPro database. - - - - - - - - - - Protein site signature - - A protein site signature is a classifier for a specific site in a protein. - beta12orEarlier - A protein site signature (sequence classifier) from the InterPro database. - true - 1.5 - - - - - - - - - - Protein conserved site signature - - 1.4 - true - A protein conserved site signature is any short sequence pattern that may contain one or more unique residues and is cannot be described as a active site, binding site or post-translational modification. - A protein conserved site signature (sequence classifier) from the InterPro database. - beta12orEarlier - - - - - - - - - - Protein active site signature - - A protein active site signature (sequence classifier) from the InterPro database. - A protein active site signature corresponds to an enzyme catalytic pocket. An active site typically includes non-contiguous residues, therefore multiple signatures may be required to describe an active site. ; residues involved in enzymatic reactions for which mutational data is typically available. - true - 1.4 - beta12orEarlier - - - - - - - - - - Protein binding site signature - - 1.4 - A protein binding site signature (sequence classifier) from the InterPro database. - true - A protein binding site signature corresponds to a site that reversibly binds chemical compounds, which are not themselves substrates of the enzymatic reaction. This includes enzyme cofactors and residues involved in electron transport or protein structure modification. - beta12orEarlier - - - - - - - - - - Protein post-translational modification signature - - A protein post-translational modification signature (sequence classifier) from the InterPro database. - A protein post-translational modification signature corresponds to sites that undergo modification of the primary structure, typically to activate or de-activate a function. 
For example, methylation, sumoylation, glycosylation etc. The modification might be permanent or reversible. - 1.4 - beta12orEarlier - true - - - - - - - - - - Sequence alignment (pair) - - http://semanticscience.org/resource/SIO_010068 - beta12orEarlier - Alignment of exactly two molecular sequences. - - - - - - - - - - Sequence alignment (multiple) - - beta12orEarlier - beta12orEarlier - Alignment of more than two molecular sequences. - true - - - - - - - - - - Sequence alignment (nucleic acid) - - beta12orEarlier - Alignment of multiple nucleotide sequences. - - - - - - - - - - Sequence alignment (protein) - - - Alignment of multiple protein sequences. - beta12orEarlier - - - - - - - - - - Sequence alignment (hybrid) - - Alignment of multiple molecular sequences of different types. - Hybrid sequence alignments include for example genomic DNA to EST, cDNA or mRNA. - beta12orEarlier - - - - - - - - - - Sequence alignment (nucleic acid pair) - - beta12orEarlier - Alignment of exactly two nucleotide sequences. - true - 1.12 - - - - - - - - - - - Sequence alignment (protein pair) - - true - 1.12 - Alignment of exactly two protein sequences. - beta12orEarlier - - - - - - - - - - - Hybrid sequence alignment (pair) - - true - beta12orEarlier - beta12orEarlier - Alignment of exactly two molecular sequences of different types. - - - - - - - - - - Multiple nucleotide sequence alignment - - beta12orEarlier - Alignment of more than two nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - Multiple protein sequence alignment - - true - beta12orEarlier - beta12orEarlier - Alignment of more than two protein sequences. - - - - - - - - - - Alignment score or penalty - - beta12orEarlier - A simple floating point number defining the penalty for opening or extending a gap in an alignment. - - - - - - - - - - Score end gaps control - - beta12orEarlier - beta12orEarlier - Whether end gaps are scored or not. 
- true - - - - - - - - - - Aligned sequence order - - beta12orEarlier - beta12orEarlier - true - Controls the order of sequences in an output sequence alignment. - - - - - - - - - - Gap opening penalty - - A penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap extension penalty - - A penalty for extending a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap separation penalty - - beta12orEarlier - A penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap penalty - - beta12orEarlier - A penalty for gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - - Match reward score - - beta12orEarlier - The score for a 'match' used in various sequence database search applications with simple scoring schemes. - - - - - - - - - - Mismatch penalty score - - beta12orEarlier - The score (penalty) for a 'mismatch' used in various alignment and sequence database search applications with simple scoring schemes. - - - - - - - - - - Drop off score - - This is the threshold drop in score at which extension of word alignment is halted. - beta12orEarlier - - - - - - - - - - Gap opening penalty (integer) - - beta12orEarlier - true - A simple floating point number defining the penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap opening penalty (float) - - beta12orEarlier - beta12orEarlier - A simple floating point number defining the penalty for opening a gap in an alignment. - true - - - - - - - - - - Gap extension penalty (integer) - - true - A simple floating point number defining the penalty for extending a gap in an alignment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gap extension penalty (float) - - beta12orEarlier - true - A simple floating point number defining the penalty for extending a gap in an alignment. 
- beta12orEarlier - - - - - - - - - - Gap separation penalty (integer) - - A simple floating point number defining the penalty for gaps that are close together in an alignment. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gap separation penalty (float) - - beta12orEarlier - true - beta12orEarlier - A simple floating point number defining the penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap opening penalty - - beta12orEarlier - A number defining the penalty for opening gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - - - - - - - - - - Terminal gap extension penalty - - A number defining the penalty for extending gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Sequence identity - - Sequence identity is the number (%) of matches (identical characters) in positions from an alignment of two molecular sequences. - beta12orEarlier - - - - - - - - - - Sequence similarity - - beta12orEarlier - Sequence similarity is the similarity (expressed as a percentage) of two molecular sequences calculated from their alignment, a scoring matrix for scoring characters substitutions and penalties for gap insertion and extension. - Data Type is float probably. - - - - - - - - - - Sequence alignment metadata (quality report) - - beta12orEarlier - true - beta12orEarlier - Data on molecular sequence alignment quality (estimated accuracy). - - - - - - - - - - Sequence alignment report (site conservation) - - beta12orEarlier - Data on character conservation in a molecular sequence alignment. - 1.4 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - true - - - - - - - - - - Sequence alignment report (site correlation) - - 1.4 - beta12orEarlier - Data on correlations between sites in a molecular sequence alignment, typically to identify possible covarying positions and predict contacts or structural constraints in protein structures. - true - - - - - - - - - - Sequence-profile alignment (Domainatrix signature) - - beta12orEarlier - Alignment of molecular sequences to a Domainatrix signature (representing a sequence alignment). - beta12orEarlier - true - - - - - - - - - - Sequence-profile alignment (HMM) - - beta12orEarlier - 1.5 - true - Alignment of molecular sequence(s) to a hidden Markov model(s). - - - - - - - - - - Sequence-profile alignment (fingerprint) - - Alignment of molecular sequences to a protein fingerprint from the PRINTS database. - 1.5 - beta12orEarlier - true - - - - - - - - - - Phylogenetic continuous quantitative data - - beta12orEarlier - Phylogenetic continuous quantitative characters - Quantitative traits - Continuous quantitative data that may be read during phylogenetic tree calculation. - - - - - - - - - - Phylogenetic discrete data - - Discrete characters - Character data with discrete states that may be read during phylogenetic tree calculation. - Phylogenetic discrete states - beta12orEarlier - Discretely coded characters - - - - - - - - - - Phylogenetic character cliques - - One or more cliques of mutually compatible characters that are generated, for example from analysis of discrete character data, and are used to generate a phylogeny. - Phylogenetic report (cliques) - beta12orEarlier - - - - - - - - - - Phylogenetic invariants - - - - - - - - Phylogenetic invariants data for testing alternative tree topologies. 
- beta12orEarlier - Phylogenetic report (invariants) - - - - - - - - - - Phylogenetic report - - Phylogenetic tree-derived report - This is a broad data type and is used for example for reports on confidence, shape or stratigraphic (age) data derived from phylogenetic tree analysis. - beta12orEarlier - A report of data concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - Phylogenetic tree report - 1.5 - true - - - - - - - - - - DNA substitution model - - Substitution model - Phylogenetic tree report (DNA substitution model) - Sequence alignment report (DNA substitution model) - beta12orEarlier - A model of DNA substitution that explains a DNA sequence alignment, derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic tree report (tree shape) - - beta12orEarlier - true - 1.4 - Data about the shape of a phylogenetic tree. - - - - - - - - - - Phylogenetic tree report (tree evaluation) - - beta12orEarlier - true - 1.4 - Data on the confidence of a phylogenetic tree. - - - - - - - - - - Phylogenetic tree distances - - beta12orEarlier - Phylogenetic tree report (tree distances) - Distances, such as Branch Score distance, between two or more phylogenetic trees. - - - - - - - - - - Phylogenetic tree report (tree stratigraphic) - - beta12orEarlier - 1.4 - true - Molecular clock and stratigraphic (age) data derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic character contrasts - - Phylogenetic report (character contrasts) - Independent contrasts for characters used in a phylogenetic tree, or covariances, regressions and correlations between characters for those contrasts. - beta12orEarlier - - - - - - - - - - Comparison matrix (integers) - - beta12orEarlier - Substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for sequence comparison. 
- true - - - - - - - - - - Comparison matrix (floats) - - beta12orEarlier - beta12orEarlier - true - Matrix of floating point numbers for sequence comparison. - Substitution matrix (floats) - - - - - - - - - - Comparison matrix (nucleotide) - - Matrix of integer or floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix - - - - - - - - - - Comparison matrix (amino acid) - - - Amino acid comparison matrix - beta12orEarlier - Matrix of integer or floating point numbers for amino acid comparison. - Amino acid substitution matrix - - - - - - - - - - Nucleotide comparison matrix (integers) - - Nucleotide substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for nucleotide comparison. - true - beta12orEarlier - - - - - - - - - - Nucleotide comparison matrix (floats) - - beta12orEarlier - true - Matrix of floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix (floats) - - - - - - - - - - Amino acid comparison matrix (integers) - - beta12orEarlier - Matrix of integer numbers for amino acid comparison. - Amino acid substitution matrix (integers) - true - beta12orEarlier - - - - - - - - - - Amino acid comparison matrix (floats) - - beta12orEarlier - Amino acid substitution matrix (floats) - beta12orEarlier - true - Matrix of floating point numbers for amino acid comparison. - - - - - - - - - - Protein features report (membrane regions) - - true - beta12orEarlier - 1.8 - trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Nucleic acid structure - - - - - - - - 3D coordinate and associated data for a nucleic acid tertiary (3D) structure. - beta12orEarlier - - - - - - - - - - Protein structure - - - - - - - - Protein structures - 3D coordinate and associated data for a protein tertiary (3D) structure. 
- beta12orEarlier - - - - - - - - - - Protein-ligand complex - - The structure of a protein in complex with a ligand, typically a small molecule such as an enzyme substrate or cofactor, but possibly another macromolecule. - beta12orEarlier - This includes interactions of proteins with atoms, ions and small molecules or macromolecules such as nucleic acids or other polypeptides. For stable inter-polypeptide interactions use 'Protein complex' instead. - - - - - - - - - - Carbohydrate structure - - - - - - - - - - - - - - beta12orEarlier - 3D coordinate and associated data for a carbohydrate (3D) structure. - - - - - - - - - - Small molecule structure - - - - - - - - 3D coordinate and associated data for the (3D) structure of a small molecule, such as any common chemical compound. - CHEBI:23367 - beta12orEarlier - - - - - - - - - - DNA structure - - beta12orEarlier - 3D coordinate and associated data for a DNA tertiary (3D) structure. - - - - - - - - - - RNA structure - - - - - - - - beta12orEarlier - 3D coordinate and associated data for an RNA tertiary (3D) structure. - - - - - - - - - - tRNA structure - - 3D coordinate and associated data for a tRNA tertiary (3D) structure, including tmRNA, snoRNAs etc. - beta12orEarlier - - - - - - - - - - Protein chain - - beta12orEarlier - 3D coordinate and associated data for the tertiary (3D) structure of a polypeptide chain. - - - - - - - - - - Protein domain - - - - - - - - 3D coordinate and associated data for the tertiary (3D) structure of a protein domain. - beta12orEarlier - - - - - - - - - - Protein structure (all atoms) - - beta12orEarlier - 1.5 - true - 3D coordinate and associated data for a protein tertiary (3D) structure (all atoms). - - - - - - - - - - C-alpha trace - - 3D coordinate and associated data for a protein tertiary (3D) structure (typically C-alpha atoms only). - C-beta atoms from amino acid side-chains may be included. 
- Protein structure (C-alpha atoms) - beta12orEarlier - - - - - - - - - - Protein chain (all atoms) - - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (all atoms). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Protein chain (C-alpha atoms) - - true - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Protein domain (all atoms) - - 3D coordinate and associated data for a protein domain tertiary (3D) structure (all atoms). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Protein domain (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - 3D coordinate and associated data for a protein domain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Structure alignment (pair) - - Alignment (superimposition) of exactly two molecular tertiary (3D) structures. - beta12orEarlier - Pair structure alignment - - - - - - - - - - Structure alignment (multiple) - - beta12orEarlier - beta12orEarlier - true - Alignment (superimposition) of more than two molecular tertiary (3D) structures. - - - - - - - - - - Structure alignment (protein) - - - Protein structure alignment - beta12orEarlier - Alignment (superimposition) of protein tertiary (3D) structures. - - - - - - - - - - Structure alignment (nucleic acid) - - beta12orEarlier - Alignment (superimposition) of nucleic acid tertiary (3D) structures. - Nucleic acid structure alignment - - - - - - - - - - Structure alignment (protein pair) - - 1.12 - Protein pair structural alignment - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures. 
- - - - - - - - - - - Multiple protein tertiary structure alignment - - Alignment (superimposition) of more than two protein tertiary (3D) structures. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structure alignment (protein all atoms) - - 1.5 - Alignment (superimposition) of protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - true - - - - - - - - - - Structure alignment (protein C-alpha atoms) - - Alignment (superimposition) of protein tertiary (3D) structures (typically C-alpha atoms only considered). - C-beta atoms from amino acid side-chains may be considered. - 1.5 - C-alpha trace - true - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (all atoms) - - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). - beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (all atoms) - - beta12orEarlier - true - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (C-alpha atoms) - - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). - true - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Structure alignment (nucleic acid pair) - - beta12orEarlier - 1.12 - true - Nucleic acid pair structure alignment - Alignment (superimposition) of exactly two nucleic acid tertiary (3D) structures. 
- - - - - - - - - - - Multiple nucleic acid tertiary structure alignment - - beta12orEarlier - Alignment (superimposition) of more than two nucleic acid tertiary (3D) structures. - true - beta12orEarlier - - - - - - - - - - Structure alignment (RNA) - - RNA structure alignment - Alignment (superimposition) of RNA tertiary (3D) structures. - beta12orEarlier - - - - - - - - - Structural transformation matrix - - Matrix to transform (rotate/translate) 3D coordinates, typically the transformation necessary to superimpose two molecular structures. - beta12orEarlier - - - - - - - - - - DaliLite hit table - - DaliLite hit table of protein chain tertiary structure alignment data. - The significant and top-scoring hits for regions of the compared structures is shown. Data such as Z-Scores, number of aligned residues, root-mean-square deviation (RMSD) of atoms and sequence identity are given. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Molecular similarity score - - beta12orEarlier - A score reflecting structural similarities of two molecules. - true - beta12orEarlier - - - - - - - - - - Root-mean-square deviation - - RMSD - beta12orEarlier - Root-mean-square deviation (RMSD) is calculated to measure the average distance between superimposed macromolecular coordinates. - - - - - - - - - - Tanimoto similarity score - - beta12orEarlier - A measure of the similarity between two ligand fingerprints. - A ligand fingerprint is derived from ligand structural data from a Protein DataBank file. It reflects the elements or groups present or absent, covalent bonds and bond orders and the bonded environment in terms of SATIS codes and BLEEP atom types. - - - - - - - - - - 3D-1D scoring matrix - - A matrix of 3D-1D scores reflecting the probability of amino acids to occur in different tertiary structural environments. - beta12orEarlier - - - - - - - - - - Amino acid index - - - beta12orEarlier - A table of 20 numerical values which quantify a property (e.g. 
physicochemical or biochemical) of the common amino acids. - - - - - - - - - - Amino acid index (chemical classes) - - Chemical classes (amino acids) - Chemical classification (small, aliphatic, aromatic, polar, charged etc) of amino acids. - beta12orEarlier - - - - - - - - - - Amino acid pair-wise contact potentials - - Contact potentials (amino acid pair-wise) - Statistical protein contact potentials. - beta12orEarlier - - - - - - - - - - Amino acid index (molecular weight) - - Molecular weights of amino acids. - Molecular weight (amino acids) - beta12orEarlier - - - - - - - - - - Amino acid index (hydropathy) - - Hydrophobic, hydrophilic or charge properties of amino acids. - beta12orEarlier - Hydropathy (amino acids) - - - - - - - - - - Amino acid index (White-Wimley data) - - beta12orEarlier - White-Wimley data (amino acids) - Experimental free energy values for the water-interface and water-octanol transitions for the amino acids. - - - - - - - - - - Amino acid index (van der Waals radii) - - van der Waals radii (amino acids) - Van der Waals radii of atoms for different amino acid residues. - beta12orEarlier - - - - - - - - - - Enzyme report - - true - 1.5 - Protein report (enzyme) - beta12orEarlier - An informative report on a specific enzyme. - - - - - - - - - - Restriction enzyme report - - An informative report on a specific restriction enzyme such as enzyme reference data. - This might include name of enzyme, organism, isoschizomers, methylation, source, suppliers, literature references, or data on restriction enzyme patterns such as name of enzyme, recognition site, length of pattern, number of cuts made by enzyme, details of blunt or sticky end cut etc. - Restriction enzyme pattern data - Protein report (restriction enzyme) - beta12orEarlier - true - 1.5 - - - - - - - - - - Peptide molecular weights - - beta12orEarlier - List of molecular weight(s) of one or more proteins or peptides, for example cut by proteolytic enzymes or reagents. 
- The report might include associated data such as frequency of peptide fragment molecular weights. - - - - - - - - - - Peptide hydrophobic moment - - beta12orEarlier - Report on the hydrophobic moment of a polypeptide sequence. - Hydrophobic moment is a peptide's hydrophobicity measured for different angles of rotation. - - - - - - - - - - Protein aliphatic index - - The aliphatic index of a protein. - beta12orEarlier - The aliphatic index is the relative protein volume occupied by aliphatic side chains. - - - - - - - - - - Protein sequence hydropathy plot - - Hydrophobic moment is a peptide's hydrophobicity measured for different angles of rotation. - A protein sequence with annotation on hydrophobic or hydrophilic / charged regions, hydrophobicity plot etc. - beta12orEarlier - - - - - - - - - - Protein charge plot - - beta12orEarlier - A plot of the mean charge of the amino acids within a window of specified length as the window is moved along a protein sequence. - - - - - - - - - - Protein solubility - - beta12orEarlier - The solubility or atomic solvation energy of a protein sequence or structure. - Protein solubility data - - - - - - - - - - Protein crystallizability - - beta12orEarlier - Protein crystallizability data - Data on the crystallizability of a protein sequence. - - - - - - - - - - Protein globularity - - Protein globularity data - beta12orEarlier - Data on the stability, intrinsic disorder or globularity of a protein sequence. - - - - - - - - - - Protein titration curve - - - The titration curve of a protein. - beta12orEarlier - - - - - - - - - - Protein isoelectric point - - beta12orEarlier - The isoelectric point of a protein. - - - - - - - - - - Protein pKa value - - The pKa value of a protein. - beta12orEarlier - - - - - - - - - - Protein hydrogen exchange rate - - beta12orEarlier - The hydrogen exchange rate of a protein. - - - - - - - - - - Protein extinction coefficient - - The extinction coefficient of a protein.
- beta12orEarlier - - - - - - - - - - Protein optical density - - The optical density of a protein. - beta12orEarlier - - - - - - - - - - Protein subcellular localization - - Protein report (subcellular localization) - An informative report on protein subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or destination (exported / extracellular proteins). - beta12orEarlier - true - beta13 - - - - - - - - - - Peptide immunogenicity data - - A report on allergenicity / immunogenicity of peptides and proteins. - Peptide immunogenicity report - beta12orEarlier - Peptide immunogenicity - This includes data on peptide ligands that elicit an immune response (immunogens), allergic cross-reactivity, predicted antigenicity (Hopp and Woods plot) etc. These data are useful in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. - - - - - - - - - - MHC peptide immunogenicity report - - A report on the immunogenicity of MHC class I or class II binding peptides. - beta13 - true - beta12orEarlier - - - - - - - - - - Protein structure report - - - Protein structural property - Protein structure-derived report - This includes for example reports on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure, protein flexibility or motion, and protein architecture (spatial arrangement of secondary structure). - Protein property (structural) - Annotation about, or structural information derived from, one or more specific protein 3D structure(s) or structural domains. - beta12orEarlier - Protein report (structure) - Protein structure report (domain) - - - - - - - - - - Protein structural quality report - - Report on the quality of a protein three-dimensional model.
- Protein structure report (quality evaluation) - Protein structure validation report - Protein property (structural quality) - Model validation might involve checks for atomic packing, steric clashes, agreement with electron density maps etc. - Protein report (structural quality) - beta12orEarlier - - - - - - - - - - Protein non-covalent interactions report - - Data on inter-atomic or inter-residue contacts, distances and interactions in protein structure(s) or on the interactions of protein atoms or residues with non-protein groups. - beta12orEarlier - true - 1.12 - - - - - - - - - - Protein flexibility or motion report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein property (flexibility or motion) - Informative report on flexibility or motion of a protein structure. - Protein flexibility or motion - beta12orEarlier - true - 1.4 - Protein structure report (flexibility or motion) - - - - - - - - - - Protein solvent accessibility report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. This concept covers definitions of the protein surface, interior and interfaces, accessible and buried residues, surface accessible pockets, interior inaccessible cavities etc. - beta12orEarlier - Data on the solvent accessible or buried surface area of a protein structure. - - - - - - - - - - Protein surface report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein structure report (surface) - 1.4 - Data on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure. 
- beta12orEarlier - true - - - - - - - - - - Ramachandran plot - - beta12orEarlier - Phi/psi angle data or a Ramachandran plot of a protein structure. - - - - - - - - - - Protein dipole moment - - Data on the net charge distribution (dipole moment) of a protein structure. - beta12orEarlier - - - - - - - - - - Protein distance matrix - - - beta12orEarlier - A matrix of distances between amino acid residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map - - An amino acid residue contact map for a protein structure. - beta12orEarlier - - - - - - - - - - Protein residue 3D cluster - - beta12orEarlier - Report on clusters of contacting residues in protein structures such as a key structural residue network. - - - - - - - - - - Protein hydrogen bonds - - Patterns of hydrogen bonding in protein structures. - beta12orEarlier - - - - - - - - - - Protein non-canonical interactions - - Protein non-canonical interactions report - true - Non-canonical atomic interactions in protein structures. - 1.4 - beta12orEarlier - - - - - - - - - - CATH node - - Information on a node from the CATH database. - The report (for example http://www.cathdb.info/cathnode/1.10.10.10) includes CATH code (of the node and upper levels in the hierarchy), classification text (of appropriate levels in hierarchy), list of child nodes, representative domain and other relevant data and links. - 1.5 - beta12orEarlier - true - CATH classification node report - - - - - - - - - - SCOP node - - true - SCOP classification node - Information on a node from the SCOP database. - 1.5 - beta12orEarlier - - - - - - - - - - EMBASSY domain classification - - beta12orEarlier - beta12orEarlier - true - An EMBASSY domain classification file (DCF) of classification and other data for domains from SCOP or CATH, in EMBL-like format. - - - - - - - - - - CATH class - - beta12orEarlier - 1.5 - Information on a protein 'class' node from the CATH database. 
- true - - - - - - - - - - CATH architecture - - beta12orEarlier - 1.5 - Information on a protein 'architecture' node from the CATH database. - true - - - - - - - - - - CATH topology - - true - 1.5 - Information on a protein 'topology' node from the CATH database. - beta12orEarlier - - - - - - - - - - CATH homologous superfamily - - 1.5 - true - beta12orEarlier - Information on a protein 'homologous superfamily' node from the CATH database. - - - - - - - - - - CATH structurally similar group - - 1.5 - true - beta12orEarlier - Information on a protein 'structurally similar group' node from the CATH database. - - - - - - - - - - CATH functional category - - Information on a protein 'functional category' node from the CATH database. - true - 1.5 - beta12orEarlier - - - - - - - - - - Protein fold recognition report - - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - beta12orEarlier - A report on known protein structural domains or folds that are recognized (identified) in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Protein-protein interaction report - - protein-protein interaction(s), including interactions between protein domains. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein-ligand interaction report - - Protein-drug interaction report - beta12orEarlier - An informative report on protein-ligand (small molecule) interaction(s). - - - - - - - - - - Protein-nucleic acid interactions report - - true - protein-DNA/RNA interaction(s). - beta12orEarlier - 1.8 - - - - - - - - - - Nucleic acid melting profile - - Nucleic acid stability profile - A melting (stability) profile calculated the free energy required to unwind and separate the nucleic acid strands, plotted for sliding windows over a sequence. 
- Data on the dissociation characteristics of a double-stranded nucleic acid molecule (DNA or a DNA/RNA hybrid) during heating. - beta12orEarlier - - - - - - - - - - Nucleic acid enthalpy - - beta12orEarlier - Enthalpy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - - - - - - - - - - Nucleic acid entropy - - Entropy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - beta12orEarlier - - - - - - - - - - Nucleic acid melting temperature - - Melting temperature of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Nucleic acid stitch profile - - beta12orEarlier - Stitch profile of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - A stitch profile diagram shows partly melted DNA conformations (with probabilities) at a range of temperatures. For example, a stitch profile might show possible loop openings with their location, size, probability and fluctuations at a given temperature. - - - - - - - - - - DNA base pair stacking energies data - - DNA base pair stacking energies data. - beta12orEarlier - - - - - - - - - - DNA base pair twist angle data - - beta12orEarlier - DNA base pair twist angle data. - - - - - - - - - - DNA base trimer roll angles data - - beta12orEarlier - DNA base trimer roll angles data. - - - - - - - - - - Vienna RNA parameters - - RNA parameters used by the Vienna package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA structure constraints - - true - Structure constraints used by the Vienna package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA concentration data - - RNA concentration data used by the Vienna package. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Vienna RNA calculated energy - - beta12orEarlier - beta12orEarlier - true - RNA calculated energy data generated by the Vienna package. 
- - - - - - - - - - Base pairing probability matrix dotplot - - - beta12orEarlier - Such as generated by the Vienna package. - Dotplot of RNA base pairing probability matrix. - - - - - - - - - - Nucleic acid folding report - - Nucleic acid report (folding) - beta12orEarlier - Nucleic acid report (folding model) - RNA secondary structure folding probablities - A report on an analysis of RNA/DNA folding, minimum folding energies for DNA or RNA sequences, energy landscape of RNA mutants etc. - RNA secondary structure folding classification - - - - - - - - - - Codon usage table - - - - - - - - Table of codon usage data calculated from one or more nucleic acid sequences. - A codon usage table might include the codon usage table name, optional comments and a table with columns for codons and corresponding codon usage data. A genetic code can be extracted from or represented by a codon usage table. - beta12orEarlier - - - - - - - - - - Genetic code - - beta12orEarlier - A genetic code for an organism. - A genetic code need not include detailed codon usage information. - - - - - - - - - - Codon adaptation index - - true - A simple measure of synonymous codon usage bias often used to predict gene expression levels. - CAI - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon usage bias plot - - Synonymous codon usage statistic plot - beta12orEarlier - A plot of the synonymous codon usage calculated for windows over a nucleotide sequence. - - - - - - - - - - Nc statistic - - true - beta12orEarlier - The effective number of codons used in a gene sequence. This reflects how far codon usage of a gene departs from equal usage of synonymous codons. - beta12orEarlier - - - - - - - - - - Codon usage fraction difference - - The differences in codon usage fractions between two codon usage tables. 
- beta12orEarlier - - - - - - - - - - Pharmacogenomic test report - - beta12orEarlier - The report might correlate gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - Data on the influence of genotype on drug response. - - - - - - - - - - Disease report - - - - - - - - An informative report on a specific disease. - For example, an informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Linkage disequilibrium (report) - - true - A report on linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - 1.8 - beta12orEarlier - - - - - - - - - - Heat map - - - A graphical 2D tabular representation of gene expression data, typically derived from a DNA microarray experiment. - beta12orEarlier - A heat map is a table where rows and columns correspond to different genes and contexts (for example, cells or samples) and the cell color represents the level of expression of a gene that context. - - - - - - - - - - Affymetrix probe sets library file - - true - Affymetrix library file of information about which probes belong to which probe set. - CDF file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Affymetrix probe sets information library file - - true - Affymetrix library file of information about the probe sets such as the gene name with which the probe set is associated. - GIN file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Molecular weights standard fingerprint - - beta12orEarlier - true - 1.12 - Standard protonated molecular masses from trypsin (modified porcine trypsin, Promega) and keratin peptides, used in EMBOSS. 
- - - - - - - - - - Metabolic pathway report - - This includes carbohydrate, energy, lipid, nucleotide, amino acid, glycan, PK/NRP, cofactor/vitamin, secondary metabolite, xenobiotics etc. - beta12orEarlier - A report typically including a map (diagram) of a metabolic pathway. - 1.8 - true - - - - - - - - - - Genetic information processing pathway report - - beta12orEarlier - 1.8 - true - genetic information processing pathways. - - - - - - - - - - Environmental information processing pathway report - - true - environmental information processing pathways. - beta12orEarlier - 1.8 - - - - - - - - - - Signal transduction pathway report - - A report typically including a map (diagram) of a signal transduction pathway. - 1.8 - true - beta12orEarlier - - - - - - - - - - Cellular process pathways report - - 1.8 - Topic concerning cellular process pathways. - true - beta12orEarlier - - - - - - - - - - Disease pathway or network report - - true - beta12orEarlier - disease pathways, typically of human disease. - 1.8 - - - - - - - - - - Drug structure relationship map - - A report typically including a map (diagram) of drug structure relationships. - beta12orEarlier - - - - - - - - - - Protein interaction networks - - 1.8 - networks of protein interactions. - true - beta12orEarlier - - - - - - - - - - MIRIAM datatype - - A MIRIAM entry describes a MIRIAM data type including the official name, synonyms, root URI, identifier pattern (regular expression applied to a unique identifier of the data type) and documentation. Each data type can be associated with several resources. Each resource is a physical location of a service (typically a database) providing information on the elements of a data type. Several resources may exist for each data type, providing the same (mirrors) or different information. MIRIAM provides a stable and persistent reference to its data types.
- An entry (data type) from the Minimal Information Requested in the Annotation of Biochemical Models (MIRIAM) database of data resources. - beta12orEarlier - true - 1.5 - - - - - - - - - - E-value - - An expectation value (E-Value) is the expected number of observations which are at least as extreme as observations expected to occur by random chance. The E-value describes the number of hits with a given score or better that are expected to occur at random when searching a database of a particular size. It decreases exponentially with the score (S) of a hit. A low E value indicates a more significant score. - beta12orEarlier - A simple floating point number defining the lower or upper limit of an expectation value (E-value). - Expectation value - - - - - - - - - - Z-value - - beta12orEarlier - The z-value is the number of standard deviations a data value is above or below a mean value. - A z-value might be specified as a threshold for reporting hits from database searches. - - - - - - - - - - P-value - - beta12orEarlier - A z-value might be specified as a threshold for reporting hits from database searches. - The P-value is the probability of obtaining by random chance a result that is at least as extreme as an observed result, assuming a NULL hypothesis is true. - - - - - - - - - - Database version information - - true - Ontology version information - 1.5 - Information on a database (or ontology) version, for example name, version number and release date. - beta12orEarlier - - - - - - - - - - Tool version information - - beta12orEarlier - Information on an application version, for example name, version number and release date. - true - 1.5 - - - - - - - - - - CATH version information - - beta12orEarlier - beta12orEarlier - true - Information on a version of the CATH database. - - - - - - - - - - Swiss-Prot to PDB mapping - - Cross-mapping of Swiss-Prot codes to PDB identifiers. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database cross-references - - Cross-references from a sequence record to other databases. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Job status - - Metadata on the status of a submitted job. - beta12orEarlier - 1.5 - true - Values for EBI services are 'DONE' (job has finished and the results can then be retrieved), 'ERROR' (the job failed or no results were found), 'NOT_FOUND' (the job id is no longer available; job results might be deleted), 'PENDING' (the job is in a queue awaiting processing), 'RUNNING' (the job is currently being processed). - - - - - - - - - - Job ID - - 1.0 - The (typically numeric) unique identifier of a submitted job. - beta12orEarlier - true - - - - - - - - - - Job type - - 1.5 - true - beta12orEarlier - A label (text token) describing the type of job, for example interactive or non-interactive. - - - - - - - - - - Tool log - - 1.5 - A report of tool-specific metadata on some analysis or process performed, for example a log of diagnostic or error messages. - true - beta12orEarlier - - - - - - - - - - DaliLite log file - - true - beta12orEarlier - DaliLite log file describing all the steps taken by a DaliLite alignment of two protein structures. - beta12orEarlier - - - - - - - - - - STRIDE log file - - STRIDE log file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - NACCESS log file - - beta12orEarlier - beta12orEarlier - true - NACCESS log file. - - - - - - - - - - EMBOSS wordfinder log file - - EMBOSS wordfinder log file. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS domainatrix log file - - beta12orEarlier - EMBOSS (EMBASSY) domainatrix application log file. - beta12orEarlier - true - - - - - - - - - - EMBOSS sites log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS (EMBASSY) sites application log file. - - - - - - - - - - EMBOSS supermatcher error file - - EMBOSS (EMBASSY) supermatcher error file.
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS megamerger log file - - beta12orEarlier - beta12orEarlier - EMBOSS megamerger log file. - true - - - - - - - - - - EMBOSS whichdb log file - - beta12orEarlier - true - EMBOSS megamerger log file. - beta12orEarlier - - - - - - - - - - EMBOSS vectorstrip log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS vectorstrip log file. - - - - - - - - - - Username - - A username on a computer system. - beta12orEarlier - - - - - - - - - - - Password - - beta12orEarlier - A password on a computer system. - - - - - - - - - - - Email address - - beta12orEarlier - Moby:Email - A valid email address of an end-user. - Moby:EmailAddress - - - - - - - - - - - Person name - - beta12orEarlier - The name of a person. - - - - - - - - - - - Number of iterations - - 1.5 - Number of iterations of an algorithm. - true - beta12orEarlier - - - - - - - - - - Number of output entities - - Number of entities (for example database hits, sequences, alignments etc) to write to an output file. - 1.5 - beta12orEarlier - true - - - - - - - - - - Hit sort order - - Controls the order of hits (reported matches) in an output file from a database search. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Drug report - - - - - - - - An informative report on a specific drug. - beta12orEarlier - Drug annotation - - - - - - - - - - - Phylogenetic tree image - - beta12orEarlier - An image (for viewing or printing) of a phylogenetic tree including (typically) a plot of rooted or unrooted phylogenies, cladograms, circular trees or phenograms and associated information. - See also 'Phylogenetic tree' - - - - - - - - - - RNA secondary structure image - - beta12orEarlier - Image of RNA secondary structure, knots, pseudoknots etc. - - - - - - - - - - Protein secondary structure image - - Image of protein secondary structure. 
- beta12orEarlier - - - - - - - - - - Structure image - - beta12orEarlier - Image of one or more molecular tertiary (3D) structures. - - - - - - - - - - Sequence alignment image - - beta12orEarlier - Image of two or more aligned molecular sequences possibly annotated with alignment features. - - - - - - - - - - Chemical structure image - - An image of the structure of a small chemical compound. - The molecular identifier and formula are typically included. - Small molecule structure image - beta12orEarlier - - - - - - - - - - Fate map - - - - - - - - - beta12orEarlier - A fate map is a plan of an early stage of an embryo such as a blastula, showing areas that are significant to development. - - - - - - - - - - Microarray spots image - - - beta12orEarlier - An image of spots from a microarray experiment. - - - - - - - - - - BioPax term - - beta12orEarlier - A term from the BioPax ontology. - beta12orEarlier - true - - - - - - - - - - GO - - beta12orEarlier - Gene Ontology term - Moby:Annotated_GO_Term - Moby:Annotated_GO_Term_With_Probability - true - A term definition from The Gene Ontology (GO). - beta12orEarlier - Moby:GO_Term - Moby:GOTerm - - - - - - - - - - MeSH - - true - A term from the MeSH vocabulary. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HGNC - - beta12orEarlier - true - A term from the HGNC controlled vocabulary. - beta12orEarlier - - - - - - - - - - NCBI taxonomy vocabulary - - beta12orEarlier - beta12orEarlier - true - A term from the NCBI taxonomy vocabulary. - - - - - - - - - - Plant ontology term - - beta12orEarlier - true - beta12orEarlier - A term from the Plant Ontology (PO). - - - - - - - - - - UMLS - - beta12orEarlier - beta12orEarlier - A term from the UMLS vocabulary. - true - - - - - - - - - - FMA - - beta12orEarlier - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia).
Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - A term from Foundational Model of Anatomy. - true - - - - - - - - - - EMAP - - A term from the EMAP mouse ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - ChEBI - - beta12orEarlier - A term from the ChEBI ontology. - true - beta12orEarlier - - - - - - - - - - MGED - - beta12orEarlier - true - A term from the MGED ontology. - beta12orEarlier - - - - - - - - - - myGrid - - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology acts provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - beta12orEarlier - true - A term from the myGrid ontology. - beta12orEarlier - - - - - - - - - - GO (biological process) - - beta12orEarlier - true - beta12orEarlier - Data Type is an enumerated string. - A term definition for a biological process from the Gene Ontology (GO). - - - - - - - - - - GO (molecular function) - - A term definition for a molecular function from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - true - beta12orEarlier - - - - - - - - - - GO (cellular component) - - beta12orEarlier - true - A term definition for a cellular component from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - - - - - - - - - - Ontology relation type - - 1.5 - beta12orEarlier - true - A relation type defined in an ontology. - - - - - - - - - - Ontology concept definition - - beta12orEarlier - Ontology class definition - The definition of a concept from an ontology. 
- - - - - - - - - - Ontology concept comment - - beta12orEarlier - 1.4 - true - A comment on a concept from an ontology. - - - - - - - - - - Ontology concept reference - - beta12orEarlier - true - Reference for a concept from an ontology. - beta12orEarlier - - - - - - - - - - doc2loc document information - - beta12orEarlier - true - The doc2loc output includes the url, format, type and availability code of a document for every service provider. - beta12orEarlier - Information on a published article provided by the doc2loc program. - - - - - - - - - - PDB residue number - - WHATIF: pdb_number - PDBML:PDB_residue_no - beta12orEarlier - A residue identifier (a string) from a PDB file. - - - - - - - - - - Atomic coordinate - - Cartesian coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian coordinate - - - - - - - - - - Atomic x coordinate - - WHATIF: PDBx_Cartn_x - Cartesian x coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_x in PDBML - Cartesian x coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic y coordinate - - WHATIF: PDBx_Cartn_y - Cartesian y coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_y in PDBML - Cartesian y coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic z coordinate - - PDBML:_atom_site.Cartn_z - WHATIF: PDBx_Cartn_z - Cartesian z coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian z coordinate - - - - - - - - - - PDB atom name - - WHATIF: PDBx_type_symbol - beta12orEarlier - WHATIF: PDBx_auth_atom_id - WHATIF: alternate_atom - PDBML:pdbx_PDB_atom_name - WHATIF: atom_type - Identifier (a string) of a specific atom from a PDB file for a molecular structure. - - - - - - - - - - - Protein atom - - Atom data - CHEBI:33250 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
- Data on a single atom from a protein structure. - beta12orEarlier - - - - - - - - - - Protein residue - - beta12orEarlier - Data on a single amino acid residue position in a protein structure. - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Residue - - - - - - - - - - Atom name - - - Name of an atom. - beta12orEarlier - - - - - - - - - - - PDB residue name - - Three-letter amino acid residue names as used in PDB files. - WHATIF: type - beta12orEarlier - - - - - - - - - - - PDB model number - - Identifier of a model structure from a PDB file. - beta12orEarlier - PDBML:pdbx_PDB_model_num - Model number - WHATIF: model_number - - - - - - - - - - - CATH domain report - - beta12orEarlier - true - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - Summary of domain classification information for a CATH domain. - - - - - - - - - - CATH representative domain sequences (ATOM) - - beta12orEarlier - beta12orEarlier - FASTA sequence database (based on ATOM records in PDB) for CATH domains (clustered at different levels of sequence identity). - true - - - - - - - - - - CATH representative domain sequences (COMBS) - - true - FASTA sequence database (based on COMBS sequence data) for CATH domains (clustered at different levels of sequence identity). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (ATOM) - - true - FASTA sequence database for all CATH domains (based on PDB ATOM records). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (COMBS) - - FASTA sequence database for all CATH domains (based on COMBS sequence data). 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence version - - beta12orEarlier - Information on a molecular sequence version. - Sequence version information - - - - - - - - - - Score - - A numerical value, that is some type of scored value arising for example from a prediction method. - beta12orEarlier - - - - - - - - - - Protein report (function) - - true - For properties that can be mapped to a sequence, use 'Sequence report' instead. - beta13 - Report on general functional properties of specific protein(s). - beta12orEarlier - - - - - - - - - - Gene name (ASPGD) - - 1.3 - beta12orEarlier - true - Name of a gene from Aspergillus Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs:ASPGD_LOCUS - - - - - - - - - - Gene name (CGD) - - Name of a gene from Candida Genome Database. - true - http://www.geneontology.org/doc/GO.xrf_abbs:CGD_LOCUS - beta12orEarlier - 1.3 - - - - - - - - - - Gene name (dictyBase) - - http://www.geneontology.org/doc/GO.xrf_abbs:dictyBase - beta12orEarlier - 1.3 - true - Name of a gene from dictyBase database. - - - - - - - - - - Gene name (EcoGene primary) - - http://www.geneontology.org/doc/GO.xrf_abbs:ECOGENE_G - Primary name of a gene from EcoGene Database. - EcoGene primary gene name - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (MaizeGDB) - - http://www.geneontology.org/doc/GO.xrf_abbs:MaizeGDB_Locus - 1.3 - Name of a gene from MaizeGDB (maize genes) database. - true - beta12orEarlier - - - - - - - - - - Gene name (SGD) - - true - 1.3 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs:SGD_LOCUS - Name of a gene from Saccharomyces Genome Database. - - - - - - - - - - Gene name (TGD) - - beta12orEarlier - 1.3 - Name of a gene from Tetrahymena Genome Database.
- true - http://www.geneontology.org/doc/GO.xrf_abbs:TGD_LOCUS - - - - - - - - - - Gene name (CGSC) - - beta12orEarlier - 1.3 - true - http://www.geneontology.org/doc/GO.xrf_abbs: CGSC - Symbol of a gene from E.coli Genetic Stock Center. - - - - - - - - - - Gene name (HGNC) - - beta12orEarlier - HUGO symbol - 1.3 - true - HGNC symbol - Official gene name - HUGO gene name - http://www.geneontology.org/doc/GO.xrf_abbs: HGNC_gene - HGNC gene name - HUGO gene symbol - HGNC:[0-9]{1,5} - Gene name (HUGO) - HGNC gene symbol - Symbol of a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - Gene name (MGD) - - MGI:[0-9]+ - Symbol of a gene from the Mouse Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs: MGD - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (Bacillus subtilis) - - http://www.geneontology.org/doc/GO.xrf_abbs: SUBTILISTG - Symbol of a gene from Bacillus subtilis Genome Sequence Project. - beta12orEarlier - 1.3 - true - - - - - - - - - - Gene ID (PlasmoDB) - - Identifier of a gene from PlasmoDB Plasmodium Genome Resource. - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ApiDB_PlasmoDB - - - - - - - - - - - Gene ID (EcoGene) - - Identifier of a gene from EcoGene Database. - EcoGene Accession - EcoGene ID - beta12orEarlier - - - - - - - - - - - Gene ID (FlyBase) - - beta12orEarlier - Gene identifier from FlyBase database. - http://www.geneontology.org/doc/GO.xrf_abbs: FB - http://www.geneontology.org/doc/GO.xrf_abbs: FlyBase - - - - - - - - - - - Gene ID (GeneDB Glossina morsitans) - - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Gmorsitans - beta13 - Gene identifier from Glossina morsitans GeneDB database. - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Leishmania major) - - Gene identifier from Leishmania major GeneDB database. 
- true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Lmajor - beta12orEarlier - beta13 - - - - - - - - - - Gene ID (GeneDB Plasmodium falciparum) - - Gene identifier from Plasmodium falciparum GeneDB database. - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Pfalciparum - beta13 - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Schizosaccharomyces pombe) - - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Spombe - beta12orEarlier - true - beta13 - Gene identifier from Schizosaccharomyces pombe GeneDB database. - - - - - - - - - - Gene ID (GeneDB Trypanosoma brucei) - - Gene identifier from Trypanosoma brucei GeneDB database. - true - beta13 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Tbrucei - - - - - - - - - - Gene ID (Gramene) - - http://www.geneontology.org/doc/GO.xrf_abbs: GR_gene - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GR_GENE - Gene identifier from Gramene database. - - - - - - - - - - - Gene ID (Virginia microbial) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: PAMGO_VMD - Gene identifier from Virginia Bioinformatics Institute microbial database. - http://www.geneontology.org/doc/GO.xrf_abbs: VMD - - - - - - - - - - - Gene ID (SGN) - - http://www.geneontology.org/doc/GO.xrf_abbs: SGN - Gene identifier from Sol Genomics Network. - beta12orEarlier - - - - - - - - - - - Gene ID (WormBase) - - - Gene identifier used by WormBase database. - WBGene[0-9]{8} - http://www.geneontology.org/doc/GO.xrf_abbs: WB - http://www.geneontology.org/doc/GO.xrf_abbs: WormBase - beta12orEarlier - - - - - - - - - - - Gene synonym - - Gene name synonym - true - Any name (other than the recommended one) for a gene. - beta12orEarlier - beta12orEarlier - - - - - - - - - - ORF name - - - beta12orEarlier - The name of an open reading frame attributed by a sequencing project. - - - - - - - - - - - Sequence assembly component - - A component of a larger sequence assembly. 
- true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Chromosome annotation (aberration) - - beta12orEarlier - beta12orEarlier - true - A report on a chromosome aberration such as abnormalities in chromosome structure. - - - - - - - - - - Clone ID - - beta12orEarlier - An identifier of a clone (cloned molecular sequence) from a database. - - - - - - - - - - - PDB insertion code - - beta12orEarlier - WHATIF: insertion_code - PDBML:pdbx_PDB_ins_code - An insertion code (part of the residue number) for an amino acid residue from a PDB file. - - - - - - - - - - Atomic occupancy - - WHATIF: PDBx_occupancy - The fraction of an atom type present at a site in a molecular structure. - beta12orEarlier - The sum of the occupancies of all the atom types at a site should not normally significantly exceed 1.0. - - - - - - - - - - Isotropic B factor - - Isotropic B factor (atomic displacement parameter) for an atom from a PDB file. - WHATIF: PDBx_B_iso_or_equiv - beta12orEarlier - - - - - - - - - - Deletion map - - A cytogenetic map is built from a set of mutant cell lines with sub-chromosomal deletions and a reference wild-type line ('genome deletion panel'). The panel is used to map markers onto the genome by comparing mutant to wild-type banding patterns. Markers are linked (occur in the same deleted region) if they share the same banding pattern (presence or absence) as the deletion panel. - beta12orEarlier - A cytogenetic map showing chromosome banding patterns in mutant cell lines relative to the wild type. - Deletion-based cytogenetic map - - - - - - - - - - QTL map - - A genetic map which shows the approximate location of quantitative trait loci (QTL) between two or more markers. - beta12orEarlier - Quantitative trait locus map - - - - - - - - - - Haplotype map - - beta12orEarlier - Moby:Haplotyping_Study_obj - A map of haplotypes in a genome or other sequence, describing common patterns of genetic variation. 
- - - - - - - - - - Map set data - - beta12orEarlier - Data describing a set of multiple genetic or physical maps, typically sharing a common set of features which are mapped. - Moby:GCP_CorrelatedLinkageMapSet - Moby:GCP_CorrelatedMapSet - - - - - - - - - - Map feature - - beta12orEarlier - true - A feature which may mapped (positioned) on a genetic or other type of map. - Moby:MapFeature - beta12orEarlier - Mappable features may be based on Gramene's notion of map features; see http://www.gramene.org/db/cmap/feature_type_info. - - - - - - - - - - - - Map type - - A designation of the type of map (genetic map, physical map, sequence map etc) or map set. - Map types may be based on Gramene's notion of a map type; see http://www.gramene.org/db/cmap/map_type_info. - 1.5 - true - beta12orEarlier - - - - - - - - - - Protein fold name - - The name of a protein fold. - beta12orEarlier - - - - - - - - - - - Taxon - - Moby:PotentialTaxon - Taxonomy rank - beta12orEarlier - Taxonomic rank - For a complete list of taxonomic ranks see https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - The name of a group of organisms belonging to the same taxonomic rank. - Moby:BriefTaxonConcept - - - - - - - - - - - Organism identifier - - - - - - - - beta12orEarlier - A unique identifier of a (group of) organisms. - - - - - - - - - - - Genus name - - beta12orEarlier - The name of a genus of organism. - - - - - - - - - - - Taxonomic classification - - Moby:TaxonName - Moby:GCP_Taxon - beta12orEarlier - The full name for a group of organisms, reflecting their biological classification and (usually) conforming to a standard nomenclature. - Moby:iANT_organism-xml - Taxonomic name - Name components correspond to levels in a taxonomic hierarchy (e.g. 'Genus', 'Species', etc.) Meta information such as a reference where the name was defined and a date might be included. 
- Taxonomic information - Moby:TaxonScientificName - Moby:TaxonTCS - - - - - - - - - - - iHOP organism ID - - beta12orEarlier - Moby_namespace:iHOPorganism - A unique identifier for an organism used in the iHOP database. - - - - - - - - - - - Genbank common name - - Common name for an organism as used in the GenBank database. - beta12orEarlier - - - - - - - - - - - NCBI taxon - - The name of a taxon from the NCBI taxonomy database. - beta12orEarlier - - - - - - - - - - - Synonym - - beta12orEarlier - Alternative name - beta12orEarlier - true - An alternative for a word. - - - - - - - - - - Misspelling - - A common misspelling of a word. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Acronym - - true - An abbreviation of a phrase or word. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Misnomer - - A term which is likely to be misleading of its meaning. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Author ID - - Information on the authors of a published work. - Moby:Author - beta12orEarlier - - - - - - - - - - - DragonDB author identifier - - An identifier representing an author in the DragonDB database. - beta12orEarlier - - - - - - - - - - - Annotated URI - - beta12orEarlier - A URI along with annotation describing the data found at the address. - Moby:DescribedLink - - - - - - - - - - UniProt keywords - - true - beta12orEarlier - beta12orEarlier - A controlled vocabulary for words and phrases that can appear in the keywords field (KW line) of entries from the UniProt database. - - - - - - - - - - Gene ID (GeneFarm) - - Moby_namespace:GENEFARM_GeneID - Identifier of a gene from the GeneFarm database. - beta12orEarlier - - - - - - - - - - - Blattner number - - beta12orEarlier - Moby_namespace:Blattner_number - The blattner identifier for a gene. - - - - - - - - - - - Gene ID (MIPS Maize) - - MIPS genetic element identifier (Maize) - Identifier for genetic elements in MIPS Maize database. 
- beta12orEarlier - Moby_namespace:MIPS_GE_Maize - beta13 - true - - - - - - - - - - Gene ID (MIPS Medicago) - - MIPS genetic element identifier (Medicago) - beta12orEarlier - beta13 - true - Moby_namespace:MIPS_GE_Medicago - Identifier for genetic elements in MIPS Medicago database. - - - - - - - - - - Gene name (DragonDB) - - true - The name of an Antirrhinum Gene from the DragonDB database. - beta12orEarlier - Moby_namespace:DragonDB_Gene - 1.3 - - - - - - - - - - Gene name (Arabidopsis) - - Moby_namespace:ArabidopsisGeneSymbol - true - A unique identifier for an Arabidopsis gene, which is an acronym or abbreviation of the gene name. - beta12orEarlier - 1.3 - - - - - - - - - - iHOP symbol - - - - A unique identifier of a protein or gene used in the iHOP database. - Moby_namespace:iHOPsymbol - beta12orEarlier - - - - - - - - - - - Gene name (GeneFarm) - - 1.3 - true - Name of a gene from the GeneFarm database. - Moby_namespace:GENEFARM_GeneName - GeneFarm gene ID - beta12orEarlier - - - - - - - - - - Locus ID - - - - - - - - - A unique name or other identifier of a genetic locus, typically conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - Locus name - beta12orEarlier - Locus identifier - - - - - - - - - - - Locus ID (AGI) - - AT[1-5]G[0-9]{5} - AGI ID - Locus identifier for Arabidopsis Genome Initiative (TAIR, TIGR and MIPS databases) - http://www.geneontology.org/doc/GO.xrf_abbs:AGI_LocusCode - Arabidopsis gene loci number - AGI locus code - beta12orEarlier - AGI identifier - - - - - - - - - - - Locus ID (ASPGD) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGD - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGDID - Identifier for loci from ASPGD (Aspergillus Genome Database). - - - - - - - - - - - Locus ID (MGG) - - Identifier for loci from Magnaporthe grisea Database at the Broad Institute. 
- http://www.geneontology.org/doc/GO.xrf_abbs: Broad_MGG - beta12orEarlier - - - - - - - - - - - Locus ID (CGD) - - Identifier for loci from CGD (Candida Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: CGDID - beta12orEarlier - CGDID - CGD locus identifier - http://www.geneontology.org/doc/GO.xrf_abbs: CGD - - - - - - - - - - - Locus ID (CMR) - - http://www.geneontology.org/doc/GO.xrf_abbs: TIGR_CMR - Locus identifier for Comprehensive Microbial Resource at the J. Craig Venter Institute. - http://www.geneontology.org/doc/GO.xrf_abbs: JCVI_CMR - beta12orEarlier - - - - - - - - - - - NCBI locus tag - - beta12orEarlier - Moby_namespace:LocusID - Locus ID (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs: NCBI_locus_tag - Identifier for loci from NCBI database. - - - - - - - - - - - Locus ID (SGD) - - - Identifier for loci from SGD (Saccharomyces Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: SGDID - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: SGD - SGDID - - - - - - - - - - - Locus ID (MMP) - - Identifier of loci from Maize Mapping Project. - Moby_namespace:MMP_Locus - beta12orEarlier - - - - - - - - - - - Locus ID (DictyBase) - - Moby_namespace:DDB_gene - Identifier of locus from DictyBase (Dictyostelium discoideum). - beta12orEarlier - - - - - - - - - - - Locus ID (EntrezGene) - - Identifier of a locus from EntrezGene database. - beta12orEarlier - Moby_namespace:EntrezGene_ID - Moby_namespace:EntrezGene_EntrezGeneID - - - - - - - - - - - Locus ID (MaizeGDB) - - Identifier of locus from MaizeGDB (Maize genome database). - Moby_namespace:MaizeGDB_Locus - beta12orEarlier - - - - - - - - - - - Quantitative trait locus - - QTL - A QTL sometimes but does not necessarily correspond to a gene. 
- true - beta12orEarlier - beta12orEarlier - A stretch of DNA that is closely linked to the genes underlying a quantitative trait (a phenotype that varies in degree and depends upon the interactions between multiple genes and their environment). - Moby:SO_QTL - - - - - - - - - - Gene ID (KOME) - - Identifier of a gene from the KOME database. - beta12orEarlier - Moby_namespace:GeneId - - - - - - - - - - - Locus ID (Tropgene) - - Identifier of a locus from the Tropgene database. - Moby:Tropgene_locus - beta12orEarlier - - - - - - - - - - - Alignment - - An alignment of molecular sequences, structures or profiles derived from them. - beta12orEarlier - - - - - - - - - - Atomic property - - General atomic property - Data for an atom (in a molecular structure). - beta12orEarlier - - - - - - - - - - UniProt keyword - - beta12orEarlier - A word or phrase that can appear in the keywords field (KW line) of entries from the UniProt database. - Moby_namespace:SP_KW - http://www.geneontology.org/doc/GO.xrf_abbs: SP_KW - - - - - - - - - - Ordered locus name - - beta12orEarlier - true - A name for a genetic locus conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - beta12orEarlier - - - - - - - - - - Sequence coordinates - - - - Map position - Moby:Position - Locus - Sequence co-ordinates - A position in a map (for example a genetic map), either a single position (point) or a region / interval. - Moby:GenePosition - This includes positions in genomes based on a reference sequence. A position may be specified for any mappable object, i.e. anything that may have positional information such as a physical position in a chromosome. Data might include sequence region name, strand, coordinate system name, assembly name, start position and end position. 
- Moby:HitPosition - beta12orEarlier - Moby:MapPosition - Moby:Locus - Moby:GCP_MapInterval - Moby:GCP_MapPosition - Moby:GCP_MapPoint - PDBML:_atom_site.id - - - - - - - - - - Amino acid property - - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all amino acids. - Amino acid data - beta12orEarlier - - - - - - - - - - Annotation - - beta12orEarlier - true - beta13 - This is a broad data type and is used a placeholder for other, more specific types. - A human-readable collection of information which (typically) is generated or collated by hand and which describes a biological entity, phenomena or associated primary (e.g. sequence or structural) data, as distinct from the primary data itself and computer-generated reports derived from it. - - - - - - - - - - Map data - - - - - - - - Map attribute - A molecular map (genetic or physical), an attribute of such a map, or data extracted from or derived from the analysis of such a map. - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. It includes concepts that are best described as scientific text or closely concerned with or derived from text. - - - - - - - - - - Vienna RNA structural data - - true - Data used by the Vienna RNA analysis package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence mask parameter - - beta12orEarlier - 1.5 - true - Data used to replace (mask) characters in a molecular sequence. - - - - - - - - - - Enzyme kinetics data - - - Data concerning chemical reaction(s) catalysed by enzyme(s). - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Michaelis Menten plot - - A plot giving an approximation of the kinetics of an enzyme-catalysed reaction, assuming simple kinetics (i.e. 
no intermediate or product inhibition, allostericity or cooperativity). It plots initial reaction rate to the substrate concentration (S) from which the maximum rate (vmax) is apparent. - beta12orEarlier - - - - - - - - - - Hanes Woolf plot - - beta12orEarlier - A plot based on the Michaelis Menten equation of enzyme kinetics plotting the ratio of the initial substrate concentration (S) against the reaction velocity (v). - - - - - - - - - - Experimental data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - true - Raw data from or annotation on laboratory experiments. - beta12orEarlier - Experimental measurement data - beta13 - - - - - - - - - - - Genome version information - - beta12orEarlier - true - Information on a genome version. - 1.5 - - - - - - - - - - Evidence - - Typically a statement about some data or results, including evidence or the source of a statement, which may include computational prediction, laboratory experiment, literature reference etc. - beta12orEarlier - - - - - - - - - - Sequence record lite - - beta12orEarlier - A molecular sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - 1.8 - - - - - - - - - - Sequence - - - - - - - - http://purl.bioontology.org/ontology/MSH/D008969 - Sequences - http://purl.org/biotop/biotop.owl#BioMolecularSequenceInformation - This concept is a placeholder of concepts for primary sequence data including raw sequences and sequence records. It should not normally be used for derivatives such as sequence alignments, motifs or profiles. - beta12orEarlier - One or more molecular sequences, possibly with associated annotation. - - - - - - - - - - Nucleic acid sequence record (lite) - - beta12orEarlier - 1.8 - true - A nucleic acid sequence and minimal metadata, typically an identifier of the sequence and/or a comment. 
- - - - - - - - - - Protein sequence record (lite) - - 1.8 - Sequence record lite (protein) - beta12orEarlier - A protein sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - - - - - - - - - - Report - - You can use this term by default for any textual report, in case you can't find another, more specific term. Reports may be generated automatically or collated by hand and can include metadata on the origin, source, history, ownership or location of some thing. - http://semanticscience.org/resource/SIO_000148 - Document - A human-readable collection of information including annotation on a biological entity or phenomena, computer-generated reports of analysis of primary data (e.g. sequence or structural), and metadata (data about primary data) or any other free (essentially unformatted) text, as distinct from the primary data itself. - beta12orEarlier - - - - - - - - - - Molecular property (general) - - General molecular property - General data for a molecule. - beta12orEarlier - - - - - - - - - - Structural data - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - true - Data concerning molecular structural data. - beta13 - - - - - - - - - - - Sequence motif (nucleic acid) - - Nucleic acid sequence motif - DNA sequence motif - A nucleotide sequence motif. - beta12orEarlier - RNA sequence motif - - - - - - - - - - Sequence motif (protein) - - beta12orEarlier - An amino acid sequence motif. - Protein sequence motif - - - - - - - - - - Search parameter - - beta12orEarlier - 1.5 - true - Some simple value controlling a search operation, typically a search of a database. - - - - - - - - - - Database search results - - beta12orEarlier - A report of hits from searching a database of some type. 
- Search results - Database hits - - - - - - - - - - Secondary structure - - 1.5 - true - beta12orEarlier - The secondary structure assignment (predicted or real) of a nucleic acid or protein. - - - - - - - - - - Matrix - - beta12orEarlier - Array - This is a broad data type and is used a placeholder for other, more specific types. - An array of numerical values. - - - - - - - - - - Alignment data - - beta12orEarlier - 1.8 - true - Data concerning, extracted from, or derived from the analysis of molecular alignment of some type. - This is a broad data type and is used a placeholder for other, more specific types. - Alignment report - - - - - - - - - - Nucleic acid report - - An informative human-readable report about one or more specific nucleic acid molecules, derived from analysis of primary (sequence or structural) data. - beta12orEarlier - - - - - - - - - - Structure report - - An informative report on general information, properties or features of one or more molecular tertiary (3D) structures. - beta12orEarlier - Structure-derived report - - - - - - - - - - Nucleic acid structure data - - Nucleic acid property (structural) - This includes reports on the stiffness, curvature, twist/roll data or other conformational parameters or properties. - Nucleic acid structural property - beta12orEarlier - A report on nucleic acid structure-derived data, describing structural properties of a DNA molecule, or any other annotation or information about specific nucleic acid 3D structure(s). - - - - - - - - - - Molecular property - - beta12orEarlier - SO:0000400 - A report on the physical (e.g. structural) or chemical properties of molecules, or parts of a molecule. - Physicochemical property - - - - - - - - - - DNA base structural data - - Structural data for DNA base pairs or runs of bases, such as energy or angle data. 
- beta12orEarlier - - - - - - - - - - Database entry version information - - true - beta12orEarlier - 1.5 - Information on a database (or ontology) entry version, such as name (or other identifier) or parent database, unique identifier of entry, data, author and so on. - - - - - - - - - - Accession - - beta12orEarlier - http://semanticscience.org/resource/SIO_000731 - A persistent (stable) and unique identifier, typically identifying an object (entry) from a database. - http://semanticscience.org/resource/SIO_000675 - - - - - - - - - - - SNP - - single nucleotide polymorphism (SNP) in a DNA sequence. - true - beta12orEarlier - 1.8 - - - - - - - - - - Data reference - - A list of database accessions or identifiers are usually included. - Reference to a dataset (or a cross-reference between two datasets), typically one or more entries in a biological database or ontology. - beta12orEarlier - - - - - - - - - - Job identifier - - http://wsio.org/data_009 - An identifier of a submitted job. - beta12orEarlier - - - - - - - - - - - Name - - http://semanticscience.org/resource/SIO_000116 - http://usefulinc.com/ns/doap#name - "http://www.w3.org/2000/01/rdf-schema#label - beta12orEarlier - A name of a thing, which need not necessarily uniquely identify it. - Symbolic name - - - - - - - Closely related, but focusing on labeling and human readability but not on identification. - - - - - - - - - - - Type - - A label (text token) describing the type of a thing, typically an enumerated string (a string with one of a limited set of values). - http://purl.org/dc/elements/1.1/type - 1.5 - beta12orEarlier - true - - - - - - - - - - User ID - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - KEGG organism code - - - A three-letter code used in the KEGG databases to uniquely identify organisms. 
- beta12orEarlier - - - - - - - - - - - Gene name (KEGG GENES) - - beta12orEarlier - KEGG GENES entry name - [a-zA-Z_0-9]+:[a-zA-Z_0-9\.-]* - Name of an entry (gene) from the KEGG GENES database. - Moby_namespace:GeneId - true - 1.3 - - - - - - - - - - BioCyc ID - - - Identifier of an object from one of the BioCyc databases. - beta12orEarlier - - - - - - - - - - - Compound ID (BioCyc) - - - BioCyc compound identifier - Identifier of a compound from the BioCyc chemical compounds database. - BioCyc compound ID - beta12orEarlier - - - - - - - - - - - Reaction ID (BioCyc) - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from the BioCyc reactions database. - - - - - - - - - - - Enzyme ID (BioCyc) - - - BioCyc enzyme ID - beta12orEarlier - Identifier of an enzyme from the BioCyc enzymes database. - - - - - - - - - - - Reaction ID - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from a database. - - - - - - - - - - - Identifier (hybrid) - - An identifier that is re-used for data objects of fundamentally different types (typically served from a single database). - beta12orEarlier - This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - - - - - - - - - - - Molecular property identifier - - - - - - - - beta12orEarlier - Identifier of a molecular property. - - - - - - - - - - - Codon usage table ID - - - - - - - - - - - - - - Identifier of a codon usage table, for example a genetic code. - Codon usage table identifier - beta12orEarlier - - - - - - - - - - - FlyBase primary identifier - - beta12orEarlier - Primary identifier of an object from the FlyBase database. - - - - - - - - - - - WormBase identifier - - beta12orEarlier - Identifier of an object from the WormBase database. - - - - - - - - - - - WormBase wormpep ID - - - Protein identifier used by WormBase database. 
- CE[0-9]{5} - beta12orEarlier - - - - - - - - - - - Nucleic acid features (codon) - - beta12orEarlier - true - An informative report on a trinucleotide sequence that encodes an amino acid including the triplet sequence, the encoded amino acid or whether it is a start or stop codon. - beta12orEarlier - - - - - - - - - - Map identifier - - - - - - - - An identifier of a map of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Person identifier - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - Nucleic acid identifier - - - - - - - - Name or other identifier of a nucleic acid molecule. - beta12orEarlier - - - - - - - - - - - Translation frame specification - - beta12orEarlier - Frame for translation of DNA (3 forward and 3 reverse frames relative to a chromosome). - - - - - - - - - - Genetic code identifier - - - - - - - - An identifier of a genetic code. - beta12orEarlier - - - - - - - - - - - Genetic code name - - - Informal name for a genetic code, typically an organism name. - beta12orEarlier - - - - - - - - - - - File format name - - - Name of a file format such as HTML, PNG, PDF, EMBL, GenBank and so on. - beta12orEarlier - - - - - - - - - - - Sequence profile type - - true - 1.5 - A label (text token) describing a type of sequence profile such as frequency matrix, Gribskov profile, hidden Markov model etc. - beta12orEarlier - - - - - - - - - - Operating system name - - beta12orEarlier - Name of a computer operating system such as Linux, PC or Mac. - - - - - - - - - - - Mutation type - - beta12orEarlier - true - beta12orEarlier - A type of point or block mutation, including insertion, deletion, change, duplication and moves. - - - - - - - - - - Logical operator - - beta12orEarlier - A logical operator such as OR, AND, XOR, and NOT. - - - - - - - - - - - Results sort order - - Possible options including sorting by score, rank, by increasing P-value (probability, i.e. 
most statistically significant hits given first) and so on. - beta12orEarlier - true - 1.5 - A control of the order of data that is output, for example the order of sequences in an alignment. - - - - - - - - - - Toggle - - beta12orEarlier - A simple parameter that is a toggle (boolean value), typically a control for a modal tool. - true - beta12orEarlier - - - - - - - - - - Sequence width - - true - beta12orEarlier - beta12orEarlier - The width of an output sequence or alignment. - - - - - - - - - - Gap penalty - - beta12orEarlier - A penalty for introducing or extending a gap in an alignment. - - - - - - - - - - Nucleic acid melting temperature - - beta12orEarlier - A temperature concerning nucleic acid denaturation, typically the temperature at which the two strands of a hybridized or double stranded nucleic acid (DNA or RNA/DNA) molecule separate. - Melting temperature - - - - - - - - - - Concentration - - beta12orEarlier - The concentration of a chemical compound. - - - - - - - - - - Window step size - - 1.5 - beta12orEarlier - true - Size of the incremental 'step' a sequence window is moved over a sequence. - - - - - - - - - - EMBOSS graph - - beta12orEarlier - true - beta12orEarlier - An image of a graph generated by the EMBOSS suite. - - - - - - - - - - EMBOSS report - - An application report generated by the EMBOSS suite. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence offset - - true - beta12orEarlier - 1.5 - An offset for a single-point sequence position. - - - - - - - - - - Threshold - - 1.5 - beta12orEarlier - true - A value that serves as a threshold for a tool (usually to control scoring or output). - - - - - - - - - - Protein report (transcription factor) - - beta13 - true - This might include conformational or physicochemical properties, as well as sequence information for transcription factor(s) binding sites. - An informative report on a transcription factor protein. 
- Transcription factor binding site data - beta12orEarlier - - - - - - - - - - Database category name - - true - The name of a category of biological or bioinformatics database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence profile name - - beta12orEarlier - Name of a sequence profile. - true - beta12orEarlier - - - - - - - - - - Color - - Specification of one or more colors. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Rendering parameter - - true - beta12orEarlier - 1.5 - A parameter that is used to control rendering (drawing) to a device or image. - Graphics parameter - Graphical parameter - - - - - - - - - - Sequence name - - - Any arbitrary name of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Date - - 1.5 - A temporal date. - beta12orEarlier - true - - - - - - - - - - Word composition - - beta12orEarlier - Word composition data for a molecular sequence. - true - beta12orEarlier - - - - - - - - - - - Fickett testcode plot - - A plot of Fickett testcode statistic (identifying protein coding regions) in a nucleotide sequences. - beta12orEarlier - - - - - - - - - - Sequence similarity plot - - - Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - beta12orEarlier - Sequence conservation report - A plot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Helical wheel - - beta12orEarlier - An image of peptide sequence sequence looking down the axis of the helix for highlighting amphipathicity and other properties. - - - - - - - - - - Helical net - - beta12orEarlier - Useful for highlighting amphipathicity and other properties. - An image of peptide sequence sequence in a simple 3,4,3,4 repeating pattern that emulates at a simple level the arrangement of residues around an alpha helix. 
- - - - - - - - - - Protein sequence properties plot - - true - beta12orEarlier - beta12orEarlier - A plot of general physicochemical properties of a protein sequence. - - - - - - - - - - Protein ionization curve - - - beta12orEarlier - A plot of pK versus pH for a protein. - - - - - - - - - - Sequence composition plot - - - beta12orEarlier - A plot of character or word composition / frequency of a molecular sequence. - - - - - - - - - - Nucleic acid density plot - - - beta12orEarlier - Density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence trace image - - Image of a sequence trace (nucleotide sequence versus probabilities of each of the 4 bases). - beta12orEarlier - - - - - - - - - - Nucleic acid features (siRNA) - - true - 1.5 - beta12orEarlier - A report on siRNA duplexes in mRNA. - - - - - - - - - - Sequence set (stream) - - beta12orEarlier - true - This concept may be used for sequence sets that are expected to be read and processed a single sequence at a time. - A collection of multiple molecular sequences and (typically) associated metadata that is intended for sequential processing. - beta12orEarlier - - - - - - - - - - FlyBase secondary identifier - - Secondary identifier of an object from the FlyBase database. - Secondary identifier are used to handle entries that were merged with or split from other entries in the database. - beta12orEarlier - - - - - - - - - - - Cardinality - - The number of a certain thing. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 1 - - beta12orEarlier - beta12orEarlier - A single thing. - true - - - - - - - - - - 1 or more - - One or more things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 2 - - Exactly two things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 2 or more - - Two or more things. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence checksum - - A fixed-size datum calculated (by using a hash function) for a molecular sequence, typically for purposes of error detection or indexing. - beta12orEarlier - Hash code - Hash sum - Hash - Hash value - - - - - - - - - - Protein features report (chemical modifications) - - 1.8 - beta12orEarlier - chemical modification of a protein. - true - - - - - - - - - - Error - - beta12orEarlier - Data on an error generated by computer system or tool. - 1.5 - true - - - - - - - - - - Database entry metadata - - beta12orEarlier - Basic information on any arbitrary database entry. - - - - - - - - - - Gene cluster - - beta13 - true - beta12orEarlier - A cluster of similar genes. - - - - - - - - - - Sequence record full - - true - beta12orEarlier - A molecular sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - - - - - - - - - - Plasmid identifier - - An identifier of a plasmid in a database. - beta12orEarlier - - - - - - - - - - - Mutation ID - - - beta12orEarlier - A unique identifier of a specific mutation catalogued in a database. - - - - - - - - - - - Mutation annotation (basic) - - Information describing the mutation itself, the organ site, tissue and type of lesion where the mutation has been identified, description of the patient origin and life-style. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Mutation annotation (prevalence) - - beta12orEarlier - true - An informative report on the prevalence of mutation(s), including data on samples and mutation prevalence (e.g. by tumour type).. - beta12orEarlier - - - - - - - - - - Mutation annotation (prognostic) - - beta12orEarlier - An informative report on mutation prognostic data, such as information on patient cohort, the study settings and the results of the study. 
- beta12orEarlier - true - - - - - - - - - - Mutation annotation (functional) - - An informative report on the functional properties of mutant proteins including transcriptional activities, promotion of cell growth and tumorigenicity, dominant negative effects, capacity to induce apoptosis, cell-cycle arrest or checkpoints in human cells and so on. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon number - - beta12orEarlier - The number of a codon, for instance, at which a mutation is located. - - - - - - - - - - Tumor annotation - - true - 1.4 - An informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Server metadata - - Basic information about a server on the web, such as an SRS server. - beta12orEarlier - 1.5 - true - - - - - - - - - - Database field name - - The name of a field in a database. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (SYSTERS) - - SYSTERS cluster ID - Unique identifier of a sequence cluster from the SYSTERS database. - beta12orEarlier - - - - - - - - - - - Ontology metadata - - - - - - - - beta12orEarlier - Data concerning a biological ontology. - - - - - - - - - - Raw SCOP domain classification - - true - beta12orEarlier - Raw SCOP domain classification data files. - beta13 - These are the parsable data files provided by SCOP. - - - - - - - - - - Raw CATH domain classification - - Raw CATH domain classification data files. - These are the parsable data files provided by CATH. - true - beta13 - beta12orEarlier - - - - - - - - - - Heterogen annotation - - 1.4 - true - beta12orEarlier - An informative report on the types of small molecules or 'heterogens' (non-protein groups) that are represented in PDB files. - - - - - - - - - - Phylogenetic property values - - beta12orEarlier - Phylogenetic property values data. 
- true - beta12orEarlier - - - - - - - - - - Sequence set (bootstrapped) - - 1.5 - beta12orEarlier - Bootstrapping is often performed in phylogenetic analysis. - true - A collection of sequences output from a bootstrapping (resampling) procedure. - - - - - - - - - - Phylogenetic consensus tree - - true - A consensus phylogenetic tree derived from comparison of multiple trees. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Schema - - beta12orEarlier - true - A data schema for organising or transforming data of some type. - 1.5 - - - - - - - - - - DTD - - A DTD (document type definition). - true - beta12orEarlier - 1.5 - - - - - - - - - - XML Schema - - beta12orEarlier - XSD - An XML Schema. - true - 1.5 - - - - - - - - - - Relax-NG schema - - beta12orEarlier - 1.5 - A relax-NG schema. - true - - - - - - - - - - XSLT stylesheet - - 1.5 - beta12orEarlier - An XSLT stylesheet. - true - - - - - - - - - - Data resource definition name - - - beta12orEarlier - The name of a data type. - - - - - - - - - - - OBO file format name - - Name of an OBO file format such as OBO-XML, plain and so on. - beta12orEarlier - - - - - - - - - - - Gene ID (MIPS) - - Identifier for genetic elements in MIPS database. - beta12orEarlier - MIPS genetic element identifier - - - - - - - - - - - Sequence identifier (protein) - - An identifier of protein sequence(s) or protein sequence database entries. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence identifier (nucleic acid) - - An identifier of nucleotide sequence(s) or nucleotide sequence database entries. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - EMBL accession - - EMBL ID - beta12orEarlier - EMBL accession number - EMBL identifier - An accession number of an entry from the EMBL sequence database. - - - - - - - - - - - UniProt ID - - - - - - - - UniProtKB identifier - An identifier of a polypeptide in the UniProt database. 
- UniProtKB entry name - beta12orEarlier - UniProt identifier - UniProt entry name - - - - - - - - - - - GenBank accession - - GenBank ID - GenBank identifier - Accession number of an entry from the GenBank sequence database. - beta12orEarlier - GenBank accession number - - - - - - - - - - - Gramene secondary identifier - - beta12orEarlier - Gramene internal identifier - Gramene internal ID - Secondary (internal) identifier of a Gramene database entry. - Gramene secondary ID - - - - - - - - - - - Sequence variation ID - - - An identifier of an entry from a database of molecular sequence variation. - beta12orEarlier - - - - - - - - - - - Gene ID - - - Gene accession - beta12orEarlier - A unique (and typically persistent) identifier of a gene in a database, that is (typically) different to the gene name/symbol. - Gene code - - - - - - - - - - - Gene name (AceView) - - AceView gene name - 1.3 - true - Name of an entry (gene) from the AceView genes database. - beta12orEarlier - - - - - - - - - - Gene ID (ECK) - - ECK accession - beta12orEarlier - E. coli K-12 gene identifier - Identifier of an E. coli K-12 gene from EcoGene Database. - http://www.geneontology.org/doc/GO.xrf_abbs: ECK - - - - - - - - - - - Gene ID (HGNC) - - HGNC ID - beta12orEarlier - Identifier for a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - - Gene name - - - The name of a gene, (typically) assigned by a person and/or according to a naming scheme. It may contain white space characters and is typically more intuitive and readable than a gene symbol. It (typically) may be used to identify similar genes in different species and to derive a gene symbol. - Allele name - beta12orEarlier - - - - - - - - - - - Gene name (NCBI) - - beta12orEarlier - 1.3 - NCBI gene name - Name of an entry (gene) from the NCBI genes database. - true - - - - - - - - - - SMILES string - - A specification of a chemical structure in SMILES format. 
- beta12orEarlier - - - - - - - - - - STRING ID - - Unique identifier of an entry from the STRING database of protein-protein interactions. - beta12orEarlier - - - - - - - - - - - Virus annotation - - An informative report on a specific virus. - true - 1.4 - beta12orEarlier - - - - - - - - - - Virus annotation (taxonomy) - - An informative report on the taxonomy of a specific virus. - beta12orEarlier - true - 1.4 - - - - - - - - - - Reaction ID (SABIO-RK) - - Identifier of a biological reaction from the SABIO-RK reactions database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - Carbohydrate report - - Annotation on or information derived from one or more specific carbohydrate 3D structure(s). - beta12orEarlier - - - - - - - - - - GI number - - beta12orEarlier - NCBI GI number - gi number - A series of digits that are assigned consecutively to each sequence record processed by NCBI. The GI number bears no resemblance to the Accession number of the sequence record. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - - - - - - - - - - - NCBI version - - beta12orEarlier - NCBI accession.version - Nucleotide sequence version contains two letters followed by six digits, a dot, and a version number (or for older nucleotide sequence records, the format is one letter followed by five digits, a dot, and a version number). Protein sequence version contains three letters followed by five digits, a dot, and a version number. - An identifier assigned to sequence records processed by NCBI, made of the accession number of the database record followed by a dot and a version number. - accession.version - - - - - - - - - - - Cell line name - - beta12orEarlier - The name of a cell line. - - - - - - - - - - - Cell line name (exact) - - beta12orEarlier - The name of a cell line. 
- - - - - - - - - - - Cell line name (truncated) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (no punctuation) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (assonant) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Enzyme ID - - - beta12orEarlier - A unique, persistent identifier of an enzyme. - Enzyme accession - - - - - - - - - - - REBASE enzyme number - - Identifier of an enzyme from the REBASE enzymes database. - beta12orEarlier - - - - - - - - - - - DrugBank ID - - beta12orEarlier - DB[0-9]{5} - Unique identifier of a drug from the DrugBank database. - - - - - - - - - - - GI number (protein) - - beta12orEarlier - protein gi number - A unique identifier assigned to NCBI protein sequence records. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - protein gi - - - - - - - - - - - Bit score - - A score derived from the alignment of two sequences, which is then normalized with respect to the scoring system. - Bit scores are normalized with respect to the scoring system and therefore can be used to compare alignment scores from different searches. - beta12orEarlier - - - - - - - - - - Translation phase specification - - beta12orEarlier - Phase for translation of DNA (0, 1 or 2) relative to a fragment of the coding sequence. - Phase - - - - - - - - - - Resource metadata - - Data concerning or describing some core computational resource, as distinct from primary data. This includes metadata on the origin, source, history, ownership or location of some thing. - This is a broad data type and is used a placeholder for other, more specific types. 
- Provenance metadata - beta12orEarlier - - - - - - - - - - Ontology identifier - - - - - - - - beta12orEarlier - Any arbitrary identifier of an ontology. - - - - - - - - - - - Ontology concept name - - - The name of a concept in an ontology. - beta12orEarlier - - - - - - - - - - - Genome build identifier - - beta12orEarlier - An identifier of a build of a particular genome. - - - - - - - - - - - Pathway or network name - - The name of a biological pathway or network. - beta12orEarlier - - - - - - - - - - - Pathway ID (KEGG) - - - Identifier of a pathway from the KEGG pathway database. - beta12orEarlier - [a-zA-Z_0-9]{2,3}[0-9]{5} - KEGG pathway ID - - - - - - - - - - - Pathway ID (NCI-Nature) - - beta12orEarlier - [a-zA-Z_0-9]+ - Identifier of a pathway from the NCI-Nature pathway database. - - - - - - - - - - - Pathway ID (ConsensusPathDB) - - - beta12orEarlier - Identifier of a pathway from the ConsensusPathDB pathway database. - - - - - - - - - - - Sequence cluster ID (UniRef) - - Unique identifier of an entry from the UniRef database. - UniRef cluster id - UniRef entry accession - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniRef100) - - UniRef100 cluster id - beta12orEarlier - UniRef100 entry accession - Unique identifier of an entry from the UniRef100 database. - - - - - - - - - - - Sequence cluster ID (UniRef90) - - UniRef90 entry accession - beta12orEarlier - UniRef90 cluster id - Unique identifier of an entry from the UniRef90 database. - - - - - - - - - - - Sequence cluster ID (UniRef50) - - beta12orEarlier - UniRef50 cluster id - UniRef50 entry accession - Unique identifier of an entry from the UniRef50 database. - - - - - - - - - - - Ontology data - - - - - - - - Data concerning or derived from an ontology. - Ontological data - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. 
- - - - - - - - - - RNA family report - - beta12orEarlier - An informative report on a specific RNA family or other group of classified RNA sequences. - RNA family annotation - - - - - - - - - - RNA family identifier - - - - - - - - beta12orEarlier - Identifier of an RNA family, typically an entry from a RNA sequence classification database. - - - - - - - - - - - RFAM accession - - - Stable accession number of an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Protein signature type - - beta12orEarlier - true - A label (text token) describing a type of protein family signature (sequence classifier) from the InterPro database. - 1.5 - - - - - - - - - - Domain-nucleic acid interaction report - - 1.5 - true - An informative report on protein domain-DNA/RNA interaction(s). - beta12orEarlier - - - - - - - - - - Domain-domain interactions - - 1.8 - An informative report on protein domain-protein domain interaction(s). - beta12orEarlier - true - - - - - - - - - - Domain-domain interaction (indirect) - - true - beta12orEarlier - beta12orEarlier - Data on indirect protein domain-protein domain interaction(s). - - - - - - - - - - Sequence accession (hybrid) - - - - - - - - Accession number of a nucleotide or protein sequence database entry. - beta12orEarlier - - - - - - - - - - - 2D PAGE data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - beta13 - beta12orEarlier - true - Data concerning two-dimensional polygel electrophoresis. - - - - - - - - - - 2D PAGE report - - beta12orEarlier - two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - true - - - - - - - - - - Pathway or network accession - - - A persistent, unique identifier of a biological pathway or network (typically a database entry). 
- beta12orEarlier - - - - - - - - - - - Secondary structure alignment - - Alignment of the (1D representations of) secondary structure of two or more molecules. - beta12orEarlier - - - - - - - - - - ASTD ID - - - beta12orEarlier - Identifier of an object from the ASTD database. - - - - - - - - - - - ASTD ID (exon) - - beta12orEarlier - Identifier of an exon from the ASTD database. - - - - - - - - - - - ASTD ID (intron) - - beta12orEarlier - Identifier of an intron from the ASTD database. - - - - - - - - - - - ASTD ID (polya) - - Identifier of a polyA signal from the ASTD database. - beta12orEarlier - - - - - - - - - - - ASTD ID (tss) - - Identifier of a transcription start site from the ASTD database. - beta12orEarlier - - - - - - - - - - - 2D PAGE spot report - - 2D PAGE spot annotation - beta12orEarlier - An informative report on individual spot(s) from a two-dimensional (2D PAGE) gel. - 1.8 - true - - - - - - - - - - Spot ID - - - beta12orEarlier - Unique identifier of a spot from a two-dimensional (protein) gel. - - - - - - - - - - - Spot serial number - - Unique identifier of a spot from a two-dimensional (protein) gel in the SWISS-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Spot ID (HSC-2DPAGE) - - Unique identifier of a spot from a two-dimensional (protein) gel from a HSC-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Protein-motif interaction - - beta13 - true - Data on the interaction of a protein (or protein domain) with specific structural (3D) and/or sequence motifs. - beta12orEarlier - - - - - - - - - - Strain identifier - - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - beta12orEarlier - - - - - - - - - - - CABRI accession - - - A unique identifier of an item from the CABRI database. - beta12orEarlier - - - - - - - - - - - Experiment report (genotyping) - - true - Report of genotype experiment including case control, population, and family studies. 
These might use array based methods and re-sequencing methods. - 1.8 - beta12orEarlier - - - - - - - - - - Genotype experiment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of genotype experiment metadata. - - - - - - - - - - - EGA accession - - beta12orEarlier - Identifier of an entry from the EGA database. - - - - - - - - - - - IPI protein ID - - Identifier of a protein entry catalogued in the International Protein Index (IPI) database. - IPI[0-9]{8} - beta12orEarlier - - - - - - - - - - - RefSeq accession (protein) - - RefSeq protein ID - Accession number of a protein from the RefSeq database. - beta12orEarlier - - - - - - - - - - - EPD ID - - beta12orEarlier - Identifier of an entry (promoter) from the EPD database. - EPD identifier - - - - - - - - - - - TAIR accession - - - beta12orEarlier - Identifier of an entry from the TAIR database. - - - - - - - - - - - TAIR accession (At gene) - - beta12orEarlier - Identifier of an Arabidopsis thaliana gene from the TAIR database. - - - - - - - - - - - UniSTS accession - - beta12orEarlier - Identifier of an entry from the UniSTS database. - - - - - - - - - - - UNITE accession - - beta12orEarlier - Identifier of an entry from the UNITE database. - - - - - - - - - - - UTR accession - - beta12orEarlier - Identifier of an entry from the UTR database. - - - - - - - - - - - UniParc accession - - beta12orEarlier - UPI[A-F0-9]{10} - Accession number of a UniParc (protein sequence) database entry. - UniParc ID - UPI - - - - - - - - - - - mFLJ/mKIAA number - - beta12orEarlier - Identifier of an entry from the Rouge or HUGE databases. - - - - - - - - - - - Fungi annotation - - true - beta12orEarlier - 1.4 - An informative report on a specific fungus. - - - - - - - - - - Fungi annotation (anamorph) - - beta12orEarlier - An informative report on a specific fungus anamorph. - 1.4 - true - - - - - - - - - - Gene features report (exon) - - true - exons in a nucleotide sequences. 
- 1.8 - beta12orEarlier - - - - - - - - - - Ensembl protein ID - - - Ensembl ID (protein) - beta12orEarlier - Protein ID (Ensembl) - Unique identifier for a protein from the Ensembl database. - - - - - - - - - - - Gene transcriptional features report - - 1.8 - beta12orEarlier - transcription of DNA into RNA including the regulation of transcription. - true - - - - - - - - - - Toxin annotation - - beta12orEarlier - An informative report on a specific toxin. - 1.4 - true - - - - - - - - - - Protein report (membrane protein) - - beta12orEarlier - true - An informative report on a membrane protein. - beta12orEarlier - - - - - - - - - - Protein-drug interaction report - - true - An informative report on tentative or known protein-drug interaction(s). - 1.12 - beta12orEarlier - - - - - - - - - - Map data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - true - beta13 - Data concerning a map of molecular sequence(s). - - - - - - - - - - - Phylogenetic data - - Data concerning phylogeny, typically of molecular sequences, including reports of information concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - - - - - - - - - - Protein data - - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning one or more protein molecules. - true - beta12orEarlier - - - - - - - - - - Nucleic acid data - - true - Data concerning one or more nucleic acid molecules. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
It includes concepts that are best described as scientific text or closely concerned with or derived from text. - Article report - Data concerning, extracted from, or derived from the analysis of a scientific text (or texts) such as a full text article from a scientific journal. - - - - - - - - - - - Parameter - - http://semanticscience.org/resource/SIO_000144 - Tool-specific parameter - beta12orEarlier - http://www.e-lico.eu/ontologies/dmo/DMOP/DMOP.owl#Parameter - Typically a simple numerical or string value that controls the operation of a tool. - Parameters - Tool parameter - - - - - - - - - - Molecular data - - Molecule-specific data - true - Data concerning a specific type of molecule. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Molecule report - - An informative report on a specific molecule. - beta12orEarlier - Molecular report - 1.5 - true - - - - - - - - - - - Organism report - - An informative report on a specific organism. - beta12orEarlier - Organism annotation - - - - - - - - - - Experiment report - - Experiment metadata - beta12orEarlier - Experiment annotation - Annotation on a wet lab experiment, such as experimental conditions. - - - - - - - - - - Nucleic acid features report (mutation) - - DNA mutation. - 1.8 - true - beta12orEarlier - - - - - - - - - - Sequence attribute - - An attribute of a molecular sequence, possibly in reference to some other sequence. - Sequence parameter - beta12orEarlier - - - - - - - - - - Sequence tag profile - - SAGE, MPSS and SBS experiments are usually performed to study gene expression. The sequence tags are typically subsequently annotated (after a database search) with the mRNA (and therefore gene) the tag was extracted from. 
- beta12orEarlier - Sequencing-based expression profile - Output from a serial analysis of gene expression (SAGE), massively parallel signature sequencing (MPSS) or sequencing by synthesis (SBS) experiment. In all cases this is a list of short sequence tags and the number of times it is observed. - - - - - - - - - - Mass spectrometry data - - beta12orEarlier - Data concerning a mass spectrometry measurement. - - - - - - - - - - Protein structure raw data - - beta12orEarlier - Raw data from experimental methods for determining protein structure. - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - - - - - - - - - - Mutation identifier - - An identifier of a mutation. - beta12orEarlier - - - - - - - - - - - Alignment data - - This is a broad data type and is used a placeholder for other, more specific types. This includes entities derived from sequences and structures such as motifs and profiles. - true - beta13 - Data concerning an alignment of two or more molecular sequences, structures or derived data. - beta12orEarlier - - - - - - - - - - - Data index data - - true - Data concerning an index of data. - beta12orEarlier - beta13 - Database index - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Amino acid name (single letter) - - beta12orEarlier - Single letter amino acid identifier, e.g. G. - - - - - - - - - - - Amino acid name (three letter) - - beta12orEarlier - Three letter amino acid identifier, e.g. GLY. - - - - - - - - - - - Amino acid name (full name) - - beta12orEarlier - Full name of an amino acid, e.g. Glycine. - - - - - - - - - - - Toxin identifier - - - - - - - - beta12orEarlier - Identifier of a toxin. - - - - - - - - - - - ArachnoServer ID - - Unique identifier of a toxin from the ArachnoServer database. 
- beta12orEarlier - - - - - - - - - - - Expressed gene list - - beta12orEarlier - true - 1.5 - Gene annotation (expressed gene list) - A simple summary of expressed genes. - - - - - - - - - - BindingDB Monomer ID - - Unique identifier of a monomer from the BindingDB database. - beta12orEarlier - - - - - - - - - - - GO concept name - - true - beta12orEarlier - beta12orEarlier - The name of a concept from the GO ontology. - - - - - - - - - - GO concept ID (biological process) - - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a 'biological process' concept from the the Gene Ontology. - - - - - - - - - - - GO concept ID (molecular function) - - beta12orEarlier - [0-9]{7}|GO:[0-9]{7} - An identifier of a 'molecular function' concept from the the Gene Ontology. - - - - - - - - - - - GO concept name (cellular component) - - The name of a concept for a cellular component from the GO ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Northern blot image - - beta12orEarlier - An image arising from a Northern Blot experiment. - - - - - - - - - - Blot ID - - - Unique identifier of a blot from a Northern Blot. - beta12orEarlier - - - - - - - - - - - BlotBase blot ID - - beta12orEarlier - Unique identifier of a blot from a Northern Blot from the BlotBase database. - - - - - - - - - - - Hierarchy - - beta12orEarlier - Raw data on a biological hierarchy, describing the hierarchy proper, hierarchy components and possibly associated annotation. - Hierarchy annotation - - - - - - - - - - Hierarchy identifier - - Identifier of an entry from a database of biological hierarchies. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Brite hierarchy ID - - beta12orEarlier - Identifier of an entry from the Brite database of biological hierarchies. - - - - - - - - - - - Cancer type - - true - A type (represented as a string) of cancer. 
- beta12orEarlier - beta12orEarlier - - - - - - - - - - BRENDA organism ID - - A unique identifier for an organism used in the BRENDA database. - beta12orEarlier - - - - - - - - - - - UniGene taxon - - The name of a taxon using the controlled vocabulary of the UniGene database. - UniGene organism abbreviation - beta12orEarlier - - - - - - - - - - - UTRdb taxon - - beta12orEarlier - The name of a taxon using the controlled vocabulary of the UTRdb database. - - - - - - - - - - - Catalogue ID - - beta12orEarlier - An identifier of a catalogue of biological resources. - Catalogue identifier - - - - - - - - - - - CABRI catalogue name - - - The name of a catalogue of biological resources from the CABRI database. - beta12orEarlier - - - - - - - - - - - Secondary structure alignment metadata - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Molecule interaction report - - An informative report on the physical, chemical or other information concerning the interaction of two or more molecules (or parts of molecules). - beta12orEarlier - Molecular interaction report - Molecular interaction data - - - - - - - - - Pathway or network - - - - - - - - Network - beta12orEarlier - Pathway - Primary data about a specific biological pathway or network (the nodes and connections within the pathway or network). - - - - - - - - - - Small molecule data - - true - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - beta13 - Data concerning one or more small molecules. - - - - - - - - - - Genotype and phenotype data - - beta12orEarlier - true - beta13 - Data concerning a particular genotype, phenotype or a genotype / phenotype relation. - - - - - - - - - - Gene expression data - - - - - - - - beta12orEarlier - Image or hybridisation data for a microarray, typically a study of gene expression. 
- Microarray data - This is a broad data type and is used a placeholder for other, more specific types. See also http://edamontology.org/data_0931 - - - - - - - - - - Compound ID (KEGG) - - - C[0-9]+ - Unique identifier of a chemical compound from the KEGG database. - beta12orEarlier - KEGG compound ID - KEGG compound identifier - - - - - - - - - - - RFAM name - - - Name (not necessarily stable) an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Reaction ID (KEGG) - - - Identifier of a biological reaction from the KEGG reactions database. - R[0-9]+ - beta12orEarlier - - - - - - - - - - - Drug ID (KEGG) - - - beta12orEarlier - Unique identifier of a drug from the KEGG Drug database. - D[0-9]+ - - - - - - - - - - - Ensembl ID - - - beta12orEarlier - ENS[A-Z]*[FPTG][0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl database. - Ensembl IDs - - - - - - - - - - - ICD identifier - - - - - - - - An identifier of a disease from the International Classification of Diseases (ICD) database. - beta12orEarlier - [A-Z][0-9]+(\.[-[0-9]+])? - - - - - - - - - - - Sequence cluster ID (CluSTr) - - Unique identifier of a sequence cluster from the CluSTr database. - [0-9A-Za-z]+:[0-9]+:[0-9]{1,5}(\.[0-9])? - CluSTr ID - beta12orEarlier - CluSTr cluster ID - - - - - - - - - - - KEGG Glycan ID - - - G[0-9]+ - Unique identifier of a glycan ligand from the KEGG GLYCAN database (a subset of KEGG LIGAND). - beta12orEarlier - - - - - - - - - - - TCDB ID - - beta12orEarlier - OBO file for regular expression. - TC number - [0-9]+\.[A-Z]\.[0-9]+\.[0-9]+\.[0-9]+ - A unique identifier of a family from the transport classification database (TCDB) of membrane transport proteins. - - - - - - - - - - - MINT ID - - MINT\-[0-9]{1,5} - Unique identifier of an entry from the MINT database of protein-protein interactions. 
- beta12orEarlier - - - - - - - - - - - DIP ID - - Unique identifier of an entry from the DIP database of protein-protein interactions. - beta12orEarlier - DIP[\:\-][0-9]{3}[EN] - - - - - - - - - - - Signaling Gateway protein ID - - beta12orEarlier - Unique identifier of a protein listed in the UCSD-Nature Signaling Gateway Molecule Pages database. - A[0-9]{6} - - - - - - - - - - - Protein modification ID - - - beta12orEarlier - Identifier of a protein modification catalogued in a database. - - - - - - - - - - - RESID ID - - Identifier of a protein modification catalogued in the RESID database. - AA[0-9]{4} - beta12orEarlier - - - - - - - - - - - RGD ID - - - [0-9]{4,7} - beta12orEarlier - Identifier of an entry from the RGD database. - - - - - - - - - - - TAIR accession (protein) - - - - - - - - - AASequence:[0-9]{10} - Identifier of a protein sequence from the TAIR database. - beta12orEarlier - - - - - - - - - - - Compound ID (HMDB) - - HMDB[0-9]{5} - beta12orEarlier - HMDB ID - Identifier of a small molecule metabolite from the Human Metabolome Database (HMDB). - - - - - - - - - - - LIPID MAPS ID - - beta12orEarlier - LM ID - Identifier of an entry from the LIPID MAPS database. - LM(FA|GL|GP|SP|ST|PR|SL|PK)[0-9]{4}([0-9a-zA-Z]{4})? - - - - - - - - - - - PeptideAtlas ID - - Identifier of a peptide from the PeptideAtlas peptide databases. - PDBML:pdbx_PDB_strand_id - beta12orEarlier - PAp[0-9]{8} - - - - - - - - - - - Molecular interaction ID - - Identifier of a report of molecular interactions from a database (typically). - true - beta12orEarlier - 1.7 - - - - - - - - - - BioGRID interaction ID - - [0-9]+ - beta12orEarlier - A unique identifier of an interaction from the BioGRID database. - - - - - - - - - - - Enzyme ID (MEROPS) - - MEROPS ID - Unique identifier of a peptidase enzyme from the MEROPS database. - beta12orEarlier - S[0-9]{2}\.[0-9]{3} - - - - - - - - - - - Mobile genetic element ID - - - An identifier of a mobile genetic element. 
- beta12orEarlier - - - - - - - - - - - ACLAME ID - - beta12orEarlier - mge:[0-9]+ - An identifier of a mobile genetic element from the Aclame database. - - - - - - - - - - - SGD ID - - - PWY[a-zA-Z_0-9]{2}\-[0-9]{3} - beta12orEarlier - Identifier of an entry from the Saccharomyces genome database (SGD). - - - - - - - - - - - Book ID - - - beta12orEarlier - Unique identifier of a book. - - - - - - - - - - - ISBN - - beta12orEarlier - (ISBN)?(-13|-10)?[:]?[ ]?([0-9]{2,3}[ -]?)?[0-9]{1,5}[ -]?[0-9]{1,7}[ -]?[0-9]{1,6}[ -]?([0-9]|X) - The International Standard Book Number (ISBN) is for identifying printed books. - - - - - - - - - - - Compound ID (3DMET) - - B[0-9]{5} - 3DMET ID - beta12orEarlier - Identifier of a metabolite from the 3DMET database. - - - - - - - - - - - MatrixDB interaction ID - - ([A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9])_.*|([OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]_.*)|(GAG_.*)|(MULT_.*)|(PFRAG_.*)|(LIP_.*)|(CAT_.*) - A unique identifier of an interaction from the MatrixDB database. - beta12orEarlier - - - - - - - - - - - cPath ID - - - [0-9]+ - These identifiers are unique within the cPath database, however, they are not stable between releases. - beta12orEarlier - A unique identifier for pathways, reactions, complexes and small molecules from the cPath (Pathway Commons) database. - - - - - - - - - - - PubChem bioassay ID - - - Identifier of an assay from the PubChem database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PubChem ID - - - PubChem identifier - beta12orEarlier - Identifier of an entry from the PubChem database. - - - - - - - - - - - Reaction ID (MACie) - - beta12orEarlier - M[0-9]{4} - MACie entry number - Identifier of an enzyme reaction mechanism from the MACie database. - - - - - - - - - - - Gene ID (miRBase) - - beta12orEarlier - miRNA name - miRNA ID - Identifier for a gene from the miRBase database. 
- MI[0-9]{7} - miRNA identifier - - - - - - - - - - - Gene ID (ZFIN) - - Identifier for a gene from the Zebrafish information network genome (ZFIN) database. - beta12orEarlier - ZDB\-GENE\-[0-9]+\-[0-9]+ - - - - - - - - - - - Reaction ID (Rhea) - - [0-9]{5} - Identifier of an enzyme-catalysed reaction from the Rhea database. - beta12orEarlier - - - - - - - - - - - Pathway ID (Unipathway) - - UPA[0-9]{5} - upaid - beta12orEarlier - Identifier of a biological pathway from the Unipathway database. - - - - - - - - - - - Compound ID (ChEMBL) - - Identifier of a small molecular from the ChEMBL database. - ChEMBL ID - beta12orEarlier - [0-9]+ - - - - - - - - - - - LGICdb identifier - - Unique identifier of an entry from the Ligand-gated ion channel (LGICdb) database. - beta12orEarlier - [a-zA-Z_0-9]+ - - - - - - - - - - - Reaction kinetics ID (SABIO-RK) - - Identifier of a biological reaction (kinetics entry) from the SABIO-RK reactions database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PharmGKB ID - - - beta12orEarlier - Identifier of an entry from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Pathway ID (PharmGKB) - - - PA[0-9]+ - Identifier of a pathway from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - - - - - - - - - - - Disease ID (PharmGKB) - - - Identifier of a disease from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - PA[0-9]+ - - - - - - - - - - - Drug ID (PharmGKB) - - - beta12orEarlier - Identifier of a drug from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Drug ID (TTD) - - DAP[0-9]+ - Identifier of a drug from the Therapeutic Target Database (TTD). - beta12orEarlier - - - - - - - - - - - Target ID (TTD) - - TTDS[0-9]+ - Identifier of a target protein from the Therapeutic Target Database (TTD). 
- beta12orEarlier - - - - - - - - - - - Cell type identifier - - beta12orEarlier - A unique identifier of a type or group of cells. - - - - - - - - - - - NeuronDB ID - - [0-9]+ - beta12orEarlier - A unique identifier of a neuron from the NeuronDB database. - - - - - - - - - - - NeuroMorpho ID - - beta12orEarlier - A unique identifier of a neuron from the NeuroMorpho database. - [a-zA-Z_0-9]+ - - - - - - - - - - - Compound ID (ChemIDplus) - - Identifier of a chemical from the ChemIDplus database. - ChemIDplus ID - [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (SMPDB) - - beta12orEarlier - Identifier of a pathway from the Small Molecule Pathway Database (SMPDB). - SMP[0-9]{5} - - - - - - - - - - - BioNumbers ID - - Identifier of an entry from the BioNumbers database of key numbers and associated data in molecular biology. - [0-9]+ - beta12orEarlier - - - - - - - - - - - T3DB ID - - beta12orEarlier - T3D[0-9]+ - Unique identifier of a toxin from the Toxin and Toxin Target Database (T3DB) database. - - - - - - - - - - - Carbohydrate identifier - - - - - - - - - - - - - - beta12orEarlier - Identifier of a carbohydrate. - - - - - - - - - - - GlycomeDB ID - - Identifier of an entry from the GlycomeDB database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - LipidBank ID - - beta12orEarlier - [a-zA-Z_0-9]+[0-9]+ - Identifier of an entry from the LipidBank database. - - - - - - - - - - - CDD ID - - beta12orEarlier - cd[0-9]{5} - Identifier of a conserved domain from the Conserved Domain Database. - - - - - - - - - - - MMDB ID - - [0-9]{1,5} - beta12orEarlier - An identifier of an entry from the MMDB database. - MMDB accession - - - - - - - - - - - iRefIndex ID - - Unique identifier of an entry from the iRefIndex database of protein-protein interactions. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ModelDB ID - - Unique identifier of an entry from the ModelDB database. 
- [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (DQCS) - - [0-9]+ - Identifier of a signaling pathway from the Database of Quantitative Cellular Signaling (DQCS). - beta12orEarlier - - - - - - - - - - - Ensembl ID (Homo sapiens) - - beta12orEarlier - true - beta12orEarlier - ENS([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database (Homo sapiens division). - - - - - - - - - - Ensembl ID ('Bos taurus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Bos taurus' division). - true - beta12orEarlier - ENSBTA([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Canis familiaris') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Canis familiaris' division). - true - ENSCAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Cavia porcellus') - - ENSCPO([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Cavia porcellus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Ciona intestinalis') - - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona intestinalis' division). - beta12orEarlier - beta12orEarlier - ENSCIN([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Ciona savignyi') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona savignyi' division). - ENSCSAV([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Ensembl ID ('Danio rerio') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Danio rerio' division). 
- true - beta12orEarlier - beta12orEarlier - ENSDAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Dasypus novemcinctus') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Dasypus novemcinctus' division). - beta12orEarlier - beta12orEarlier - ENSDNO([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Echinops telfairi') - - ENSETE([EGTP])[0-9]{11} - true - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Echinops telfairi' division). - - - - - - - - - - Ensembl ID ('Erinaceus europaeus') - - true - ENSEEU([EGTP])[0-9]{11} - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Erinaceus europaeus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Felis catus') - - beta12orEarlier - true - ENSFCA([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Felis catus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gallus gallus') - - ENSGAL([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gallus gallus' division). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gasterosteus aculeatus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gasterosteus aculeatus' division). - true - ENSGAC([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Homo sapiens') - - ENSHUM([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Homo sapiens' division). 
- true - - - - - - - - - - Ensembl ID ('Loxodonta africana') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Loxodonta africana' division). - ENSLAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Macaca mulatta') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Macaca mulatta' division). - beta12orEarlier - ENSMMU([EGTP])[0-9]{11} - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Monodelphis domestica') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Monodelphis domestica' division). - true - ENSMOD([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Mus musculus') - - ENSMUS([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Mus musculus' division). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Ensembl ID ('Myotis lucifugus') - - beta12orEarlier - ENSMLU([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Myotis lucifugus' division). - - - - - - - - - - Ensembl ID ("Ornithorhynchus anatinus") - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ornithorhynchus anatinus' division). - ENSOAN([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryctolagus cuniculus') - - beta12orEarlier - ENSOCU([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryctolagus cuniculus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryzias latipes') - - ENSORL([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryzias latipes' division). 
- beta12orEarlier - - - - - - - - - - Ensembl ID ('Otolemur garnettii') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Otolemur garnettii' division). - true - beta12orEarlier - ENSSAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Pan troglodytes') - - beta12orEarlier - beta12orEarlier - ENSPTR([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Pan troglodytes' division). - true - - - - - - - - - - Ensembl ID ('Rattus norvegicus') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Rattus norvegicus' division). - ENSRNO([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Spermophilus tridecemlineatus') - - true - beta12orEarlier - ENSSTO([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Spermophilus tridecemlineatus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Takifugu rubripes') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Takifugu rubripes' division). - ENSFRU([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Tupaia belangeri') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Tupaia belangeri' division). - true - ENSTBE([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Xenopus tropicalis') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Xenopus tropicalis' division). - beta12orEarlier - beta12orEarlier - true - ENSXET([EGTP])[0-9]{11} - - - - - - - - - - CATH identifier - - beta12orEarlier - Identifier of a protein domain (or other node) from the CATH database. 
- - - - - - - - - - - CATH node ID (family) - - beta12orEarlier - A code number identifying a family from the CATH database. - 2.10.10.10 - - - - - - - - - - - Enzyme ID (CAZy) - - Identifier of an enzyme from the CAZy enzymes database. - beta12orEarlier - CAZy ID - - - - - - - - - - - Clone ID (IMAGE) - - I.M.A.G.E. cloneID - IMAGE cloneID - A unique identifier assigned by the I.M.A.G.E. consortium to a clone (cloned molecular sequence). - beta12orEarlier - - - - - - - - - - - GO concept ID (cellular compartment) - - An identifier of a 'cellular compartment' concept from the Gene Ontology. - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - GO concept identifier (cellular compartment) - - - - - - - - - - - Chromosome name (BioCyc) - - Name of a chromosome as used in the BioCyc database. - beta12orEarlier - - - - - - - - - - - CleanEx entry name - - beta12orEarlier - An identifier of a gene expression profile from the CleanEx database. - - - - - - - - - - - CleanEx dataset code - - beta12orEarlier - An identifier of (typically a list of) gene expression experiments catalogued in the CleanEx database. - - - - - - - - - - - Genome report - - An informative report of general information concerning a genome as a whole. - beta12orEarlier - - - - - - - - - - Protein ID (CORUM) - - beta12orEarlier - CORUM complex ID - Unique identifier for a protein complex from the CORUM database. - - - - - - - - - - - CDD PSSM-ID - - beta12orEarlier - Unique identifier of a position-specific scoring matrix from the CDD database. - - - - - - - - - - - Protein ID (CuticleDB) - - CuticleDB ID - beta12orEarlier - Unique identifier for a protein from the CuticleDB database. - - - - - - - - - - - DBD ID - - Identifier of a predicted transcription factor from the DBD database. - beta12orEarlier - - - - - - - - - - - Oligonucleotide probe annotation - - - - - - - - beta12orEarlier - General annotation on an oligonucleotide probe. 
- - - - - - - - - - Oligonucleotide ID - - - Identifier of an oligonucleotide from a database. - beta12orEarlier - - - - - - - - - - - dbProbe ID - - Identifier of an oligonucleotide probe from the dbProbe database. - beta12orEarlier - - - - - - - - - - - Dinucleotide property - - beta12orEarlier - Physicochemical property data for one or more dinucleotides. - - - - - - - - - - DiProDB ID - - beta12orEarlier - Identifier of an dinucleotide property from the DiProDB database. - - - - - - - - - - - Protein features report (disordered structure) - - 1.8 - true - beta12orEarlier - disordered structure in a protein. - - - - - - - - - - Protein ID (DisProt) - - DisProt ID - beta12orEarlier - Unique identifier for a protein from the DisProt database. - - - - - - - - - - - Embryo report - - Annotation on an embryo or concerning embryological development. - true - Embryo annotation - beta12orEarlier - 1.5 - - - - - - - - - - Ensembl transcript ID - - - beta12orEarlier - Transcript ID (Ensembl) - Unique identifier for a gene transcript from the Ensembl database. - - - - - - - - - - - Inhibitor annotation - - 1.4 - beta12orEarlier - An informative report on one or more small molecules that are enzyme inhibitors. - true - - - - - - - - - - Promoter ID - - - beta12orEarlier - An identifier of a promoter of a gene that is catalogued in a database. - Moby:GeneAccessionList - - - - - - - - - - - EST accession - - Identifier of an EST sequence. - beta12orEarlier - - - - - - - - - - - COGEME EST ID - - beta12orEarlier - Identifier of an EST sequence from the COGEME database. - - - - - - - - - - - COGEME unisequence ID - - Identifier of a unisequence from the COGEME database. - A unisequence is a single sequence assembled from ESTs. - beta12orEarlier - - - - - - - - - - - Protein family ID (GeneFarm) - - GeneFarm family ID - beta12orEarlier - Accession number of an entry (family) from the TIGRFam database. 
- - - - - - - - - - - Family name - - beta12orEarlier - The name of a family of organism. - - - - - - - - - - - Genus name (virus) - - true - The name of a genus of viruses. - beta13 - beta12orEarlier - - - - - - - - - - Family name (virus) - - beta13 - The name of a family of viruses. - true - beta12orEarlier - - - - - - - - - - Database name (SwissRegulon) - - true - beta13 - The name of a SwissRegulon database. - beta12orEarlier - - - - - - - - - - Sequence feature ID (SwissRegulon) - - beta12orEarlier - A feature identifier as used in the SwissRegulon database. - This can be name of a gene, the ID of a TFBS, or genomic coordinates in form "chr:start..end". - - - - - - - - - - - FIG ID - - A FIG ID consists of four parts: a prefix, genome id, locus type and id number. - A unique identifier of gene in the NMPDR database. - beta12orEarlier - - - - - - - - - - - Gene ID (Xenbase) - - A unique identifier of gene in the Xenbase database. - beta12orEarlier - - - - - - - - - - - Gene ID (Genolist) - - beta12orEarlier - A unique identifier of gene in the Genolist database. - - - - - - - - - - - Gene name (Genolist) - - beta12orEarlier - true - Genolist gene name - 1.3 - Name of an entry (gene) from the Genolist genes database. - - - - - - - - - - ABS ID - - ABS identifier - beta12orEarlier - Identifier of an entry (promoter) from the ABS database. - - - - - - - - - - - AraC-XylS ID - - Identifier of a transcription factor from the AraC-XylS database. - beta12orEarlier - - - - - - - - - - - Gene name (HUGO) - - beta12orEarlier - beta12orEarlier - true - Name of an entry (gene) from the HUGO database. - - - - - - - - - - Locus ID (PseudoCAP) - - beta12orEarlier - Identifier of a locus from the PseudoCAP database. - - - - - - - - - - - Locus ID (UTR) - - beta12orEarlier - Identifier of a locus from the UTR database. - - - - - - - - - - - MonosaccharideDB ID - - Unique identifier of a monosaccharide from the MonosaccharideDB database. 
- beta12orEarlier - - - - - - - - - - - Database name (CMD) - - beta12orEarlier - true - The name of a subdivision of the Collagen Mutation Database (CMD) database. - beta13 - - - - - - - - - - Database name (Osteogenesis) - - beta12orEarlier - true - beta13 - The name of a subdivision of the Osteogenesis database. - - - - - - - - - - Genome identifier - - An identifier of a particular genome. - beta12orEarlier - - - - - - - - - - - GenomeReviews ID - - beta12orEarlier - An identifier of a particular genome. - - - - - - - - - - - GlycoMap ID - - [0-9]+ - beta12orEarlier - Identifier of an entry from the GlycosciencesDB database. - - - - - - - - - - - Carbohydrate conformational map - - beta12orEarlier - A conformational energy map of the glycosidic linkages in a carbohydrate molecule. - - - - - - - - - - Gene features report (intron) - - introns in a nucleotide sequences. - true - beta12orEarlier - 1.8 - - - - - - - - - - Transcription factor name - - - The name of a transcription factor. - beta12orEarlier - - - - - - - - - - - TCID - - Identifier of a membrane transport proteins from the transport classification database (TCDB). - beta12orEarlier - - - - - - - - - - - Pfam domain name - - beta12orEarlier - Name of a domain from the Pfam database. - PF[0-9]{5} - - - - - - - - - - - Pfam clan ID - - beta12orEarlier - CL[0-9]{4} - Accession number of a Pfam clan. - - - - - - - - - - - Gene ID (VectorBase) - - VectorBase ID - beta12orEarlier - Identifier for a gene from the VectorBase database. - - - - - - - - - - - UTRSite ID - - Identifier of an entry from the UTRSite database of regulatory motifs in eukaryotic UTRs. - beta12orEarlier - - - - - - - - - - - Sequence signature report - - - - - - - - Sequence motif report - Sequence profile report - An informative report about a specific or conserved pattern in a molecular sequence, such as its context in genes or proteins, its role, origin or method of construction, etc. 
- beta12orEarlier - - - - - - - - - - Locus annotation - - Locus report - true - beta12orEarlier - An informative report on a particular locus. - beta12orEarlier - - - - - - - - - - Protein name (UniProt) - - Official name of a protein as used in the UniProt database. - beta12orEarlier - - - - - - - - - - - Term ID list - - One or more terms from one or more controlled vocabularies which are annotations on an entity. - beta12orEarlier - true - The concepts are typically provided as a persistent identifier or some other link the source ontologies. Evidence of the validity of the annotation might be included. - 1.5 - - - - - - - - - - HAMAP ID - - Name of a protein family from the HAMAP database. - beta12orEarlier - - - - - - - - - - - Identifier with metadata - - Basic information concerning an identifier of data (typically including the identifier itself). For example, a gene symbol with information concerning its provenance. - beta12orEarlier - true - 1.12 - - - - - - - - - - Gene symbol annotation - - true - beta12orEarlier - Annotation about a gene symbol. - beta12orEarlier - - - - - - - - - - Transcript ID - - - - - - - - - Identifier of a RNA transcript. - beta12orEarlier - - - - - - - - - - - HIT ID - - Identifier of an RNA transcript from the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HIX ID - - A unique identifier of gene cluster in the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HPA antibody id - - beta12orEarlier - Identifier of a antibody from the HPA database. - - - - - - - - - - - IMGT/HLA ID - - Identifier of a human major histocompatibility complex (HLA) or other protein from the IMGT/HLA database. - beta12orEarlier - - - - - - - - - - - Gene ID (JCVI) - - A unique identifier of gene assigned by the J. Craig Venter Institute (JCVI). - beta12orEarlier - - - - - - - - - - - Kinase name - - beta12orEarlier - The name of a kinase protein. 
- - - - - - - - - - - ConsensusPathDB entity ID - - - Identifier of a physical entity from the ConsensusPathDB database. - beta12orEarlier - - - - - - - - - - - ConsensusPathDB entity name - - - beta12orEarlier - Name of a physical entity from the ConsensusPathDB database. - - - - - - - - - - - CCAP strain number - - The number of a strain of algae and protozoa from the CCAP database. - beta12orEarlier - - - - - - - - - - - Stock number - - - beta12orEarlier - An identifier of stock from a catalogue of biological resources. - - - - - - - - - - - Stock number (TAIR) - - beta12orEarlier - A stock number from The Arabidopsis information resource (TAIR). - - - - - - - - - - - REDIdb ID - - beta12orEarlier - Identifier of an entry from the RNA editing database (REDIdb). - - - - - - - - - - - SMART domain name - - Name of a domain from the SMART database. - beta12orEarlier - - - - - - - - - - - Protein family ID (PANTHER) - - beta12orEarlier - Panther family ID - Accession number of an entry (family) from the PANTHER database. - - - - - - - - - - - RNAVirusDB ID - - beta12orEarlier - Could list (or reference) other taxa here from https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - A unique identifier for a virus from the RNAVirusDB database. - - - - - - - - - - - Virus ID - - - beta12orEarlier - An accession of annotation on a (group of) viruses (catalogued in a database). - - - - - - - - - - - NCBI Genome Project ID - - An identifier of a genome project assigned by NCBI. - beta12orEarlier - - - - - - - - - - - NCBI genome accession - - A unique identifier of a whole genome assigned by the NCBI. - beta12orEarlier - - - - - - - - - - - Sequence profile data - - 1.8 - Data concerning, extracted from, or derived from the analysis of a sequence profile, such as its name, length, technical details about the profile or it's construction, the biological role or annotation, and so on. 
- true - beta12orEarlier - - - - - - - - - - Protein ID (TopDB) - - beta12orEarlier - TopDB ID - Unique identifier for a membrane protein from the TopDB database. - - - - - - - - - - - Gel ID - - Gel identifier - Identifier of a two-dimensional (protein) gel. - beta12orEarlier - - - - - - - - - - - Reference map name (SWISS-2DPAGE) - - - beta12orEarlier - Name of a reference map gel from the SWISS-2DPAGE database. - - - - - - - - - - - Protein ID (PeroxiBase) - - PeroxiBase ID - beta12orEarlier - Unique identifier for a peroxidase protein from the PeroxiBase database. - - - - - - - - - - - SISYPHUS ID - - beta12orEarlier - Identifier of an entry from the SISYPHUS database of tertiary structure alignments. - - - - - - - - - - - ORF ID - - - beta12orEarlier - Accession of an open reading frame (catalogued in a database). - - - - - - - - - - - ORF identifier - - An identifier of an open reading frame. - beta12orEarlier - - - - - - - - - - - Linucs ID - - Identifier of an entry from the GlycosciencesDB database. - beta12orEarlier - - - - - - - - - - - Protein ID (LGICdb) - - beta12orEarlier - LGICdb ID - Unique identifier for a ligand-gated ion channel protein from the LGICdb database. - - - - - - - - - - - MaizeDB ID - - beta12orEarlier - Identifier of an EST sequence from the MaizeDB database. - - - - - - - - - - - Gene ID (MfunGD) - - beta12orEarlier - A unique identifier of gene in the MfunGD database. - - - - - - - - - - - Orpha number - - - - - - - - beta12orEarlier - An identifier of a disease from the Orpha database. - - - - - - - - - - - Protein ID (EcID) - - beta12orEarlier - Unique identifier for a protein from the EcID database. - - - - - - - - - - - Clone ID (RefSeq) - - - A unique identifier of a cDNA molecule catalogued in the RefSeq database. - beta12orEarlier - - - - - - - - - - - Protein ID (ConoServer) - - beta12orEarlier - Unique identifier for a cone snail toxin protein from the ConoServer database. 
- - - - - - - - - - - GeneSNP ID - - Identifier of a GeneSNP database entry. - beta12orEarlier - - - - - - - - - - - Lipid identifier - - - - - - - - - - - - - - Identifier of a lipid. - beta12orEarlier - - - - - - - - - - - Databank - - true - beta12orEarlier - A flat-file (textual) data archive. - beta12orEarlier - - - - - - - - - - Web portal - - A web site providing data (web pages) on a common theme to a HTTP client. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gene ID (VBASE2) - - Identifier for a gene from the VBASE2 database. - beta12orEarlier - VBASE2 ID - - - - - - - - - - - DPVweb ID - - DPVweb virus ID - beta12orEarlier - A unique identifier for a virus from the DPVweb database. - - - - - - - - - - - Pathway ID (BioSystems) - - beta12orEarlier - Identifier of a pathway from the BioSystems pathway database. - [0-9]+ - - - - - - - - - - - Experimental data (proteomics) - - true - Data concerning a proteomics experiment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Abstract - - beta12orEarlier - An abstract of a scientific article. - - - - - - - - - - Lipid structure - - beta12orEarlier - 3D coordinate and associated data for a lipid structure. - - - - - - - - - - Drug structure - - beta12orEarlier - 3D coordinate and associated data for the (3D) structure of a drug. - - - - - - - - - - Toxin structure - - 3D coordinate and associated data for the (3D) structure of a toxin. - beta12orEarlier - - - - - - - - - - Position-specific scoring matrix - - - beta12orEarlier - PSSM - A simple matrix of numbers, where each value (or column of values) is derived derived from analysis of the corresponding position in a sequence alignment. - - - - - - - - - - Distance matrix - - A matrix of distances between molecular entities, where a value (distance) is (typically) derived from comparison of two entities and reflects their similarity. 
- beta12orEarlier - - - - - - - - - - Structural distance matrix - - Distances (values representing similarity) between a group of molecular structures. - beta12orEarlier - - - - - - - - - - Article metadata - - true - beta12orEarlier - Bibliographic data concerning scientific article(s). - 1.5 - - - - - - - - - - Ontology concept - - beta12orEarlier - This includes any fields from the concept definition such as concept name, definition, comments and so on. - A concept from a biological ontology. - - - - - - - - - - Codon usage bias - - A numerical measure of differences in the frequency of occurrence of synonymous codons in DNA sequences. - beta12orEarlier - - - - - - - - - - Northern blot report - - true - beta12orEarlier - 1.8 - Northern Blot experiments. - - - - - - - - - - Nucleic acid features report (VNTR) - - 1.8 - beta12orEarlier - true - variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - - - - - - - - - - Nucleic acid features report (microsatellite) - - true - microsatellite polymorphism in a DNA sequence. - 1.8 - beta12orEarlier - - - - - - - - - - - Nucleic acid features report (RFLP) - - beta12orEarlier - true - 1.8 - restriction fragment length polymorphisms (RFLP) in a DNA sequence. - - - - - - - - - - Radiation hybrid map - - The radiation method can break very closely linked markers providing a more detailed map. Most genetic markers and subsequences may be located to a defined map position and with a more precise estimates of distance than a linkage map. - A map showing distance between genetic markers estimated by radiation-induced breaks in a chromosome. - beta12orEarlier - RH map - - - - - - - - - - ID list - - A simple list of data identifiers (such as database accessions), possibly with additional basic information on the addressed data. - beta12orEarlier - - - - - - - - - - Phylogenetic gene frequencies data - - beta12orEarlier - Gene frequencies data that may be read during phylogenetic tree calculation. 
- - - - - - - - - - Sequence set (polymorphic) - - beta13 - beta12orEarlier - true - A set of sub-sequences displaying some type of polymorphism, typically indicating the sequence in which they occur, their position and other metadata. - - - - - - - - - - DRCAT resource - - 1.5 - An entry (resource) from the DRCAT bioinformatics resource catalogue. - beta12orEarlier - true - - - - - - - - - - Protein complex - - beta12orEarlier - 3D coordinate and associated data for a multi-protein complex; two or more polypeptides chains in a stable, functional association with one another. - - - - - - - - - - Protein structural motif - - beta12orEarlier - 3D coordinate and associated data for a protein (3D) structural motif; any group of contiguous or non-contiguous amino acid residues but typically those forming a feature with a structural or functional role. - - - - - - - - - - Lipid report - - beta12orEarlier - Annotation on or information derived from one or more specific lipid 3D structure(s). - - - - - - - - - - Secondary structure image - - 1.4 - beta12orEarlier - Image of one or more molecular secondary structures. - true - - - - - - - - - - Secondary structure report - - Secondary structure-derived report - beta12orEarlier - true - An informative report on general information, properties or features of one or more molecular secondary structures. - 1.5 - - - - - - - - - - DNA features - - beta12orEarlier - DNA sequence-specific feature annotation (not in a feature table). - true - beta12orEarlier - - - - - - - - - - RNA features report - - true - beta12orEarlier - 1.5 - Features concerning RNA or regions of DNA that encode an RNA molecule. - RNA features - Nucleic acid features (RNA features) - - - - - - - - - - Plot - - beta12orEarlier - Biological data that has been plotted as a graph of some type. - - - - - - - - - - Nucleic acid features report (polymorphism) - - true - DNA polymorphism. 
- beta12orEarlier - - - - - - - - - - Protein sequence record - - - A protein sequence and associated metadata. - beta12orEarlier - Sequence record (protein) - - - - - - - - - - Nucleic acid sequence record - - - RNA sequence record - Nucleotide sequence record - A nucleic acid sequence and associated metadata. - beta12orEarlier - DNA sequence record - Sequence record (nucleic acid) - - - - - - - - - - Protein sequence record (full) - - A protein sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - beta12orEarlier - true - - - - - - - - - - Nucleic acid sequence record (full) - - true - A nucleic acid sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - beta12orEarlier - 1.8 - - - - - - - - - - Biological model accession - - - beta12orEarlier - Accession of a mathematical model, typically an entry from a database. - - - - - - - - - - - Cell type name - - - The name of a type or group of cells. - beta12orEarlier - - - - - - - - - - - Cell type accession - - - Cell type ID - beta12orEarlier - Accession of a type or group of cells (catalogued in a database). - - - - - - - - - - - Compound accession - - - Small molecule accession - Accession of an entry from a database of chemicals. - beta12orEarlier - Chemical compound accession - - - - - - - - - - - Drug accession - - - Accession of a drug. - beta12orEarlier - - - - - - - - - - - Toxin name - - - Name of a toxin. - beta12orEarlier - - - - - - - - - - - Toxin accession - - - beta12orEarlier - Accession of a toxin (catalogued in a database). - - - - - - - - - - - Monosaccharide accession - - - Accession of a monosaccharide (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Drug name - - - beta12orEarlier - Common name of a drug. 
- - - - - - - - - - - Carbohydrate accession - - - Accession of an entry from a database of carbohydrates. - beta12orEarlier - - - - - - - - - - - Molecule accession - - - Accession of a specific molecule (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Data resource definition accession - - - beta12orEarlier - Accession of a data definition (catalogued in a database). - - - - - - - - - - - Genome accession - - - An accession of a particular genome (in a database). - beta12orEarlier - - - - - - - - - - - Map accession - - - An accession of a map of a molecular sequence (deposited in a database). - beta12orEarlier - - - - - - - - - - - Lipid accession - - - beta12orEarlier - Accession of an entry from a database of lipids. - - - - - - - - - - - Peptide ID - - - beta12orEarlier - Accession of a peptide deposited in a database. - - - - - - - - - - - Protein accession - - - Protein accessions - beta12orEarlier - Accession of a protein deposited in a database. - - - - - - - - - - - Organism accession - - - An accession of annotation on a (group of) organisms (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Organism name - - - Moby:Organism_Name - Moby:OrganismsShortName - Moby:OccurrenceRecord - Moby:BriefOccurrenceRecord - Moby:FirstEpithet - Moby:InfraspecificEpithet - beta12orEarlier - Moby:OrganismsLongName - The name of an organism (or group of organisms). - - - - - - - - - - - Protein family accession - - - beta12orEarlier - Accession of a protein family (that is deposited in a database). - - - - - - - - - - - Transcription factor accession - - - - beta12orEarlier - Accession of an entry from a database of transcription factors or binding sites. - - - - - - - - - - - Strain accession - - - - - - - - - beta12orEarlier - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - Virus identifier - - An accession of annotation on a (group of) viruses (catalogued in a database). 
- beta12orEarlier - - - - - - - - - - - Sequence features metadata - - beta12orEarlier - Metadata on sequence features. - - - - - - - - - - Gramene identifier - - beta12orEarlier - Identifier of a Gramene database entry. - - - - - - - - - - - DDBJ accession - - beta12orEarlier - DDBJ accession number - DDBJ identifier - DDBJ ID - An identifier of an entry from the DDBJ sequence database. - - - - - - - - - - - ConsensusPathDB identifier - - beta12orEarlier - An identifier of an entity from the ConsensusPathDB database. - - - - - - - - - - - Sequence data - - This is a broad data type and is used a placeholder for other, more specific types. - 1.8 - beta12orEarlier - true - Data concerning, extracted from, or derived from the analysis of molecular sequence(s). - - - - - - - - - - Codon usage - - beta12orEarlier - true - beta13 - Data concerning codon usage. - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article report - - beta12orEarlier - 1.5 - Data derived from the analysis of a scientific text such as a full text article from a scientific journal. - true - - - - - - - - - - Sequence report - - An informative report of information about molecular sequence(s), including basic information (metadata), and reports generated from molecular sequence analysis, including positional features and non-positional properties. - beta12orEarlier - Sequence-derived report - - - - - - - - - - Protein secondary structure report - - An informative report about the properties or features of one or more protein secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plot - - - A Hopp and Woods plot of predicted antigenicity of a peptide or protein. - beta12orEarlier - - - - - - - - - - Nucleic acid melting curve - - - Shows the proportion of nucleic acid which are double-stranded versus temperature. - A melting curve of a double-stranded nucleic acid molecule (DNA or DNA/RNA). 
- beta12orEarlier - - - - - - - - - - Nucleic acid probability profile - - A probability profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - beta12orEarlier - Shows the probability of a base pair not being melted (i.e. remaining as double-stranded DNA) at a specified temperature - - - - - - - - - - Nucleic acid temperature profile - - A temperature profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - Plots melting temperature versus base position. - beta12orEarlier - Melting map - - - - - - - - - - Gene regulatory network report - - 1.8 - A report typically including a map (diagram) of a gene regulatory network. - true - beta12orEarlier - - - - - - - - - - 2D PAGE gel report - - An informative report on a two-dimensional (2D PAGE) gel. - 2D PAGE image report - 1.8 - true - 2D PAGE gel annotation - beta12orEarlier - 2D PAGE image annotation - - - - - - - - - - Oligonucleotide probe sets annotation - - beta12orEarlier - General annotation on a set of oligonucleotide probes, such as the gene name with which the probe set is associated and which probes belong to the set. - - - - - - - - - - Microarray image - - 1.5 - beta12orEarlier - Gene expression image - An image from a microarray experiment which (typically) allows a visualisation of probe hybridisation and gene-expression data. - true - - - - - - - - - - Image - - http://semanticscience.org/resource/SIO_000081 - Biological or biomedical data has been rendered into an image, typically for display on screen. - http://semanticscience.org/resource/SIO_000079 - Image data - beta12orEarlier - - - - - - - - - - Sequence image - - - Image of a molecular sequence, possibly with sequence features or properties shown. - beta12orEarlier - - - - - - - - - - Protein hydropathy data - - Protein hydropathy report - A report on protein properties concerning hydropathy. - beta12orEarlier - - - - - - - - - - Workflow data - - beta12orEarlier - beta13 - Data concerning a computational workflow. 
- true - - - - - - - - - - Workflow - - true - beta12orEarlier - 1.5 - A computational workflow. - - - - - - - - - - Secondary structure data - - beta13 - true - beta12orEarlier - Data concerning molecular secondary structure data. - - - - - - - - - - Protein sequence (raw) - - - Raw protein sequence - beta12orEarlier - Raw sequence (protein) - A raw protein sequence (string of characters). - - - - - - - - - - Nucleic acid sequence (raw) - - - Nucleic acid raw sequence - beta12orEarlier - Nucleotide sequence (raw) - Raw sequence (nucleic acid) - A raw nucleic acid sequence. - - - - - - - - - - Protein sequence - - One or more protein sequences, possibly with associated annotation. - Protein sequences - beta12orEarlier - http://purl.org/biotop/biotop.owl#AminoAcidSequenceInformation - - - - - - - - - - Nucleic acid sequence - - One or more nucleic acid sequences, possibly with associated annotation. - beta12orEarlier - DNA sequence - Nucleotide sequence - Nucleotide sequences - Nucleic acid sequences - http://purl.org/biotop/biotop.owl#NucleotideSequenceInformation - - - - - - - - - - Reaction data - - Enzyme kinetics annotation - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Reaction annotation - Data concerning a biochemical reaction, typically data and more general annotation on the kinetics of enzyme-catalysed reaction. - - - - - - - - - - Peptide property - - beta12orEarlier - Peptide data - Data concerning small peptides. - - - - - - - - - - Protein classification - - This is a broad data type and is used a placeholder for other, more specific types. - Protein classification data - An informative report concerning the classification of protein sequences or structures. - beta12orEarlier - - - - - - - - - Sequence motif data - - true - 1.8 - Data concerning specific or conserved pattern in molecular sequences. 
- beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Sequence profile data - - beta12orEarlier - true - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning models representing a (typically multiple) sequence alignment. - - - - - - - - - - Pathway or network data - - Data concerning a specific biological pathway or network. - beta13 - true - beta12orEarlier - - - - - - - - - - - Pathway or network report - - - - - - - - beta12orEarlier - An informative report concerning or derived from the analysis of a biological pathway or network, such as a map (diagram) or annotation. - - - - - - - - - - Nucleic acid thermodynamic data - - Nucleic acid property (thermodynamic or kinetic) - A thermodynamic or kinetic property of a nucleic acid molecule. - Nucleic acid thermodynamic property - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification of nucleic acid sequences or structures. - Nucleic acid classification data - - - - - - - - - Classification report - - This can include an entire classification, components such as classifiers, assignments of entities to a classification and so on. - beta12orEarlier - true - Classification data - A report on a classification of molecular sequences, structures or other entities. - 1.5 - - - - - - - - - - Protein features report (key folding sites) - - beta12orEarlier - key residues involved in protein folding. - 1.8 - true - - - - - - - - - - Protein geometry report - - Torsion angle data - beta12orEarlier - Geometry data for a protein structure, for example bond lengths, bond angles, torsion angles, chiralities, planaraties etc. - - - - - - - - - - Protein structure image - - - An image of protein structure. 
- beta12orEarlier - Structure image (protein) - - - - - - - - - - Phylogenetic character weights - - Weights for sequence positions or characters in phylogenetic analysis where zero is defined as unweighted. - beta12orEarlier - - - - - - - - - - Annotation track - - beta12orEarlier - Genomic track - Annotation of one particular positional feature on a biomolecular (typically genome) sequence, suitable for import and display in a genome browser. - Genome annotation track - Genome-browser track - Genome track - Sequence annotation track - - - - - - - - - - UniProt accession - - - - - - - - UniProtKB accession number - beta12orEarlier - P43353|Q7M1G0|Q9C199|A5A6J6 - UniProt entry accession - [OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2} - Swiss-Prot entry accession - TrEMBL entry accession - Accession number of a UniProt (protein sequence) database entry. - UniProtKB accession - UniProt accession number - - - - - - - - - - - NCBI genetic code ID - - - Identifier of a genetic code in the NCBI list of genetic codes. - [1-9][0-9]? - 16 - beta12orEarlier - - - - - - - - - - - Ontology concept identifier - - - - - - - - Identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - GO concept name (biological process) - - true - The name of a concept for a biological process from the GO ontology. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GO concept name (molecular function) - - true - beta12orEarlier - The name of a concept for a molecular function from the GO ontology. - beta12orEarlier - - - - - - - - - - Taxonomy - - - - - - - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification, identification and naming of organisms. 
- Taxonomic data - - - - - - - - - - Protein ID (EMBL/GenBank/DDBJ) - - beta13 - EMBL/GENBANK/DDBJ coding feature protein identifier, issued by International collaborators. - This qualifier consists of a stable ID portion (3+5 format with 3 position letters and 5 numbers) plus a version number after the decimal point. When the protein sequence encoded by the CDS changes, only the version number of the /protein_id value is incremented; the stable part of the /protein_id remains unchanged and as a result will permanently be associated with a given protein; this qualifier is valid only on CDS features which translate into a valid protein. - - - - - - - - - - - Core data - - Core data entities typically have a format and may be identified by an accession number. - A type of data that (typically) corresponds to entries from the primary biological databases and which is (typically) the primary input or output of a tool, i.e. the data the tool processes or generates, as distinct from metadata and identifiers which describe and identify such core data, parameters that control the behaviour of tools, reports of derivative data generated by tools and annotation. - 1.5 - true - beta13 - - - - - - - - - - Sequence feature identifier - - - - - - - - beta13 - Name or other identifier of molecular sequence feature(s). - - - - - - - - - - - Structure identifier - - - - - - - - beta13 - An identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Matrix identifier - - - - - - - - An identifier of an array of numerical values, such as a comparison matrix. - beta13 - - - - - - - - - - - Protein sequence composition - - beta13 - 1.8 - true - A report (typically a table) on character or word composition / frequency of protein sequence(s). - - - - - - - - - - Nucleic acid sequence composition (report) - - 1.8 - A report (typically a table) on character or word composition / frequency of nucleic acid sequence(s). 
- true - beta13 - - - - - - - - - - Protein domain classification node - - beta13 - A node from a classification of protein structural domain(s). - true - 1.5 - - - - - - - - - - CAS number - - beta13 - CAS registry number - Unique numerical identifier of chemicals in the scientific literature, as assigned by the Chemical Abstracts Service. - - - - - - - - - - - ATC code - - Unique identifier of a drug conforming to the Anatomical Therapeutic Chemical (ATC) Classification System, a drug classification system controlled by the WHO Collaborating Centre for Drug Statistics Methodology (WHOCC). - beta13 - - - - - - - - - - - UNII - - beta13 - A unique, unambiguous, alphanumeric identifier of a chemical substance as catalogued by the Substance Registration System of the Food and Drug Administration (FDA). - Unique Ingredient Identifier - - - - - - - - - - - Geotemporal metadata - - 1.5 - beta13 - true - Basic information concerning geographical location or time. - - - - - - - - - - System metadata - - Metadata concerning the software, hardware or other aspects of a computer system. - beta13 - - - - - - - - - - Sequence feature name - - - A name of a sequence feature, e.g. the name of a feature to be displayed to an end-user. - beta13 - - - - - - - - - - - Experimental measurement - - beta13 - Raw data such as measurements or other results from laboratory experiments, as generated from laboratory hardware. - Experimental measurement data - Measurement - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Measured data - Experimentally measured data - Measurement metadata - Measurement data - Raw experimental data - - - - - - - - - - Raw microarray data - - - beta13 - Raw data (typically MIAME-compliant) for hybridisations from a microarray experiment. - Such data as found in Affymetrix CEL or GPR files. 
- - - - - - - - - - Processed microarray data - - - - - - - - Data generated from processing and analysis of probe set data from a microarray experiment. - Gene annotation (expression) - Microarray probe set data - beta13 - Gene expression report - Such data as found in Affymetrix .CHP files or data from other software such as RMA or dChip. - - - - - - - - - - Gene expression matrix - - - This combines data from all hybridisations. - beta13 - Normalised microarray data - The final processed (normalised) data for a set of hybridisations in a microarray experiment. - Gene expression data matrix - - - - - - - - - - Sample annotation - - Annotation on a biological sample, for example experimental factors and their values. - This might include compound and dose in a dose response experiment. - beta13 - - - - - - - - - - Microarray metadata - - This might include gene identifiers, genomic coordinates, probe oligonucleotide sequences etc. - Annotation on the array itself used in a microarray experiment. - beta13 - - - - - - - - - - Microarray protocol annotation - - true - This might describe e.g. the normalisation methods used to process the raw data. - beta13 - 1.8 - Annotation on laboratory and/or data processing protocols used in an microarray experiment. - - - - - - - - - - Microarray hybridisation data - - Data concerning the hybridisations measured during a microarray experiment. - beta13 - - - - - - - - - - Protein features report (topological domains) - - 1.8 - beta13 - topological domains such as cytoplasmic regions in a protein. - true - - - - - - - - - - Sequence features (compositionally-biased regions) - - 1.5 - beta13 - true - A report of regions in a molecular sequence that are biased to certain characters. - - - - - - - - - - Nucleic acid features (difference and change) - - beta13 - A report on features in a nucleic acid sequence that indicate changes to or differences between sequences. 
- 1.5 - true - - - - - - - - - - Nucleic acid features report (expression signal) - - true - beta13 - regions within a nucleic acid sequence containing a signal that alters a biological function. - 1.8 - - - - - - - - - - Nucleic acid features report (binding) - - nucleic acids binding to some other molecule. - 1.8 - true - beta13 - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes). - - - - - - - - - - Nucleic acid repeats (report) - - true - repetitive elements within a nucleic acid sequence. - 1.8 - beta13 - - - - - - - - - - Nucleic acid features report (replication and recombination) - - beta13 - true - 1.8 - DNA replication or recombination. - - - - - - - - - - Nucleic acid structure report - - - A report on regions within a nucleic acid sequence which form secondary or tertiary (3D) structures. - Stem loop (report) - d-loop (report) - Nucleic acid features (structure) - Quadruplexes (report) - beta13 - - - - - - - - - - Protein features report (repeats) - - 1.8 - short repetitive subsequences (repeat sequences) in a protein sequence. - beta13 - true - - - - - - - - - - Sequence motif matches (protein) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more protein sequences. - 1.8 - beta13 - true - - - - - - - - - - Sequence motif matches (nucleic acid) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more nucleic acid sequences. - beta13 - true - 1.8 - - - - - - - - - - Nucleic acid features (d-loop) - - beta13 - true - 1.5 - A report on displacement loops in a mitochondrial DNA sequence. - A displacement loop is a region of mitochondrial DNA in which one of the strands is displaced by an RNA molecule. - - - - - - - - - - Nucleic acid features (stem loop) - - beta13 - true - A report on stem loops in a DNA sequence. 
- 1.5 - A stem loop is a hairpin structure; a double-helical structure formed when two complementary regions of a single strand of RNA or DNA molecule form base-pairs. - - - - - - - - - - Gene transcript report - - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Nucleic acid features (mRNA features) - beta13 - Transcript (report) - mRNA features - Gene transcript annotation - Clone or EST (report) - mRNA (report) - An informative report on features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. This includes reports on a specific gene transcript, clone or EST. - - - - - - - - - - - Nucleic acid features report (signal or transit peptide) - - true - coding sequences for a signal or transit peptide. - 1.8 - beta13 - - - - - - - - - - Non-coding RNA - - beta13 - true - features of non-coding or functional RNA molecules, including tRNA and rRNA. - 1.8 - - - - - - - - - - Transcriptional features (report) - - 1.5 - true - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - Features concerning transcription of DNA into RNA including the regulation of transcription. - beta13 - - - - - - - - - - Nucleic acid features report (STS) - - sequence tagged sites (STS) in nucleic acid sequences. - 1.8 - true - beta13 - - - - - - - - - - Nucleic acid features (immunoglobulin gene structure) - - true - beta13 - 1.5 - A report on predicted or actual immunoglobulin gene structure including constant, switch and variable regions and diversity, joining and variable segments. - - - - - - - - - - SCOP class - - 1.5 - beta13 - true - Information on a 'class' node from the SCOP database. 
- - - - - - - - - - SCOP fold - - beta13 - Information on a 'fold' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP superfamily - - beta13 - Information on a 'superfamily' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP family - - 1.5 - true - Information on a 'family' node from the SCOP database. - beta13 - - - - - - - - - - SCOP protein - - Information on a 'protein' node from the SCOP database. - true - beta13 - 1.5 - - - - - - - - - - SCOP species - - 1.5 - true - beta13 - Information on a 'species' node from the SCOP database. - - - - - - - - - - Mass spectrometry experiment - - 1.8 - true - mass spectrometry experiments. - beta13 - - - - - - - - - - Gene family report - - An informative report on a particular family of genes, typically a set of genes with similar sequence that originate from duplication of a common ancestor gene, or any other classification of nucleic acid sequences or structures that reflects gene structure. - This includes reports on on gene homologues between species. - beta13 - Gene annotation (homology information) - Homology information - Gene annotation (homology) - Nucleic acid classification - Gene family annotation - Gene homology (report) - - - - - - - - - - Protein image - - beta13 - An image of a protein. - - - - - - - - - - Protein alignment - - An alignment of protein sequences and/or structures. - beta13 - - - - - - - - - - NGS experiment - - 1.8 - 1.0 - sequencing experiment, including samples, sampling, preparation, sequencing, and analysis. - true - - - - - - - - - - Sequence assembly report - - An informative report about a DNA sequence assembly. - 1.1 - This might include an overall quality assement of the assembly and summary statistics including counts, average length and number of bases for reads, matches and non-matches, contigs, reads in pairs etc. 
- Assembly report - - - - - - - - - - Genome index - - 1.1 - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. - An index of a genome sequence. - - - - - - - - - - GWAS report - - 1.8 - 1.1 - Report concerning genome-wide association study experiments. - true - Genome-wide association study - - - - - - - - - - Cytoband position - - 1.2 - The position of a cytogenetic band in a genome. - Information might include start and end position in a chromosome sequence, chromosome identifier, name of band and so on. - - - - - - - - - - Cell type ontology ID - - - CL ID - Cell type ontology concept ID. - CL_[0-9]{7} - 1.2 - beta12orEarlier - - - - - - - - - - - Kinetic model - - 1.2 - Mathematical model of a network, that contains biochemical kinetics. - - - - - - - - - - COSMIC ID - - COSMIC identifier - cosmic ID - Identifier of a COSMIC database entry. - cosmic identifier - cosmic id - 1.3 - - - - - - - - - - - HGMD ID - - Identifier of a HGMD database entry. - hgmd ID - hgmd identifier - beta12orEarlier - hgmd id - HGMD identifier - - - - - - - - - - - Sequence assembly ID - - Sequence assembly version - Unique identifier of sequence assembly. - 1.3 - - - - - - - - - - - Sequence feature type - - true - A label (text token) describing a type of sequence feature such as gene, transcript, cds, exon, repeat, simple, misc, variation, somatic variation, structural variation, somatic structural variation, constrained or regulatory. - 1.3 - 1.5 - - - - - - - - - - Gene homology (report) - - beta12orEarlier - true - An informative report on gene homologues between species. - 1.5 - - - - - - - - - - Ensembl gene tree ID - - - ENSGT00390000003602 - Ensembl ID (gene tree) - Unique identifier for a gene tree from the Ensembl database. - 1.3 - - - - - - - - - - - Gene tree - - 1.3 - A phylogenetic tree that is an estimate of the character's phylogeny. 
- - - - - - - - - - Species tree - - A phylogenetic tree that reflects phylogeny of the taxa from which the characters (used in calculating the tree) were sampled. - 1.3 - - - - - - - - - - Sample ID - - - - - - - - - 1.3 - Sample accession - Name or other identifier of an entry from a biosample database. - - - - - - - - - - - MGI accession - - - Identifier of an object from the MGI database. - 1.3 - - - - - - - - - - - Phenotype name - - - 1.3 - Name of a phenotype. - Phenotypes - Phenotype - - - - - - - - - - - Transition matrix - - A HMM transition matrix contains the probabilities of switching from one HMM state to another. - Consider for example an HMM with two states (AT-rich and GC-rich). The transition matrix will hold the probabilities of switching from the AT-rich to the GC-rich state, and vica versa. - HMM transition matrix - 1.4 - - - - - - - - - Emission matrix - - A HMM emission matrix holds the probabilities of choosing the four nucleotides (A, C, G and T) in each of the states of a HMM. - 1.4 - Consider for example an HMM with two states (AT-rich and GC-rich). The emission matrix holds the probabilities of choosing each of the four nucleotides (A, C, G and T) in the AT-rich state and in the GC-rich state. - HMM emission matrix - - - - - - - - - Hidden Markov model - - A statistical Markov model of a system which is assumed to be a Markov process with unobserved (hidden) states. - 1.4 - - - - - - - - - Format identifier - - An identifier of a data format. - 1.4 - - - - - - - - - Raw image - - 1.5 - Amino acid data - http://semanticscience.org/resource/SIO_000081 - beta12orEarlier - Image data - Raw biological or biomedical image generated by some experimental technique. - - - - - - - - - - Carbohydrate property - - Carbohydrate data - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all carbohydrates. 
- 1.5 - - - - - - - - - - Proteomics experiment report - - true - 1.8 - Report concerning proteomics experiments. - 1.5 - - - - - - - - - - RNAi report - - 1.5 - RNAi experiments. - true - 1.8 - - - - - - - - - - Simulation experiment report - - 1.5 - biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - true - 1.8 - - - - - - - - - - MRI image - - - - - - - - MRT image - 1.7 - Magnetic resonance tomography image - Nuclear magnetic resonance imaging image - - Magnetic resonance imaging image - - NMRI image - An imaging technique that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - - - - - - - - - - Cell migration track image - - - - - - - - 1.7 - An image from a cell migration track assay. - - - - - - - - - - Rate of association - - kon - 1.7 - Rate of association of a protein with another protein or some other molecule. - - - - - - - - - - Gene order - - Such data are often used for genome rearrangement tools and phylogenetic tree labeling. - Multiple gene identifiers in a specific order. - 1.7 - - - - - - - - - - Spectrum - - 1.7 - The spectrum of frequencies of electromagnetic radiation emitted from a molecule as a result of some spectroscopy experiment. - Spectra - - - - - - - - - - NMR spectrum - - - - - - - - Spectral information for a molecule from a nuclear magnetic resonance experiment. - 1.7 - NMR spectra - - - - - - - - - - Chemical structure sketch - - Chemical structure sketches are used for presentational purposes but also as inputs to various analysis software. - 1.8 - Small molecule sketch - A sketch of a small molecule made with some specialised drawing package. - - - - - - - - - - Nucleic acid signature - - 1.8 - An informative report about a specific or conserved nucleic acid sequence pattern. 
- - - - - - - - - - DNA sequence - - DNA sequences - 1.8 - A DNA sequence. - - - - - - - - - - RNA sequence - - A DNA sequence. - DNA sequences - RNA sequences - 1.8 - - - - - - - - - - RNA sequence (raw) - - - Raw sequence (RNA) - 1.8 - A raw RNA sequence. - RNA raw sequence - - - - - - - - - - DNA sequence (raw) - - - Raw sequence (DNA) - A raw DNA sequence. - 1.8 - DNA raw sequence - - - - - - - - - - Sequence variations - - - - - - - - 1.8 - Data on gene sequence variations resulting large-scale genotyping and DNA sequencing projects. - Gene sequence variations - Variations are stored along with a reference genome. - - - - - - - - - - Bibliography - - 1.8 - A list of publications such as scientic papers or books. - - - - - - - - - - Ontology mapping - - A mapping of supplied textual terms or phrases to ontology concepts (URIs). - beta12orEarlier - - - - - - - - - - Image metadata - - Image-associated data - This can include basic provenance and technical information about the image, scientific annotation and so on. - Any data concerning a specific biological or biomedical image. - 1.9 - Image data - Image-related data - - - - - - - - - - Clinical trial report - - Clinical trial information - A report concerning a clinical trial. - 1.9 - - - - - - - - - - Reference sample report - - 1.10 - A report about a biosample. - Biosample report - - - - - - - - - - Gene Expression Atlas Experiment ID - - Accession number of an entry from the Gene Expression Atlas. - 1.10 - - - - - - - - - - - Disease identifier - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of disease. - - - - - - - - - - - Disease name - - - The name of some disease. - 1.12 - - - - - - - - - - - Training material - - Open educational resource - Some material that is used for educational (training) purposes. - OER - 1.12 - - - - - - - - - - Online course - - MOOC - A training course available for use on the Web. 
- On-line course - 1.12 - Massive open online course - - - - - - - - - - Text - - - Any free or plain text, as often specified as some search query. - Plain text - Free text - 1.12 - - - - - - - - - - SMILES - - - Chemical structure specified in Simplified Molecular Input Line Entry System (SMILES) line notation. - beta12orEarlier - - - - - - - - - - - - - - InChI - - - Chemical structure specified in IUPAC International Chemical Identifier (InChI) line notation. - beta12orEarlier - - - - - - - - - - mf - - - Chemical structure specified by Molecular Formula (MF), including a count of each element in a compound. - beta12orEarlier - The general MF query format consists of a series of valid atomic symbols, with an optional number or range. - - - - - - - - - - InChIKey - - - An InChIKey identifier is not human- nor machine-readable but is more suitable for web searches than an InChI chemical structure specification. - The InChIKey (hashed InChI) is a fixed length (25 character) condensed digital representation of an InChI chemical structure specification. It uniquely identifies a chemical compound. - beta12orEarlier - - - - - - - - - - smarts - - SMILES ARbitrary Target Specification (SMARTS) format for chemical structure specification, which is a subset of the SMILES line notation. - beta12orEarlier - - - - - - - - - - unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - nucleotide - - - Non-sequence characters may be used for example for gaps. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Nucleotide_sequence - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - protein - - - Alphabet for a protein sequence with possible ambiguity, unknown positions and non-sequence characters. 
- beta12orEarlier - Non-sequence characters may be used for gaps and translation stop. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Amino_acid_sequence - - - - - - - - - - consensus - - - beta12orEarlier - Alphabet for the consensus of two or more molecular sequences. - - - - - - - - - - pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - unambiguous pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence (characters ACGTU only) with possible unknown positions but without ambiguity or non-sequence characters . - - - - - - - - - - dna - - beta12orEarlier - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#DNA_sequence - Alphabet for a DNA sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - rna - - Alphabet for an RNA sequence with possible ambiguity, unknown positions and non-sequence characters. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#RNA_sequence - beta12orEarlier - - - - - - - - - - unambiguous pure dna - - - Alphabet for a DNA sequence (characters ACGT only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure dna - - - Alphabet for a DNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure rna - - - Alphabet for an RNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible unknown positions but without ambiguity or non-sequence characters. 
- - - - - - - - - - pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - UniGene entry format - - beta12orEarlier - Format of an entry from UniGene. - A UniGene entry includes a set of transcript sequences assigned to the same transcription locus (gene or expressed pseudogene), with information on protein similarities, gene expression, cDNA clone reagents, and genomic location. - beta12orEarlier - true - - - - - - - - - - COG sequence cluster format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the COG database of clusters of (related) protein sequences. - - - - - - - - - - EMBL feature location - - - beta12orEarlier - Feature location - Format for sequence positions (feature location) as used in DDBJ/EMBL/GenBank database. - - - - - - - - - - quicktandem - - - Report format for tandem repeats in a nucleotide sequence (format generated by the Sanger Centre quicktandem program). - beta12orEarlier - - - - - - - - - - Sanger inverted repeats - - - beta12orEarlier - Report format for inverted repeats in a nucleotide sequence (format generated by the Sanger Centre inverted program). - - - - - - - - - - EMBOSS repeat - - - Report format for tandem repeats in a sequence (an EMBOSS report format). - beta12orEarlier - - - - - - - - - - est2genome format - - - beta12orEarlier - Format of a report on exon-intron structure generated by EMBOSS est2genome. - - - - - - - - - - restrict format - - - Report format for restriction enzyme recognition sites used by EMBOSS restrict program. - beta12orEarlier - - - - - - - - - - restover format - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by EMBOSS restover program. - - - - - - - - - - REBASE restriction sites - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by REBASE database. 
- - - - - - - - - - FASTA search results format - - - Format of results of a sequence database search using FASTA. - beta12orEarlier - This includes (typically) score data, alignment data and a histogram (of observed and expected distribution of E values.) - - - - - - - - - - BLAST results - - - Format of results of a sequence database search using some variant of BLAST. - beta12orEarlier - This includes score data, alignment data and summary table. - - - - - - - - - - mspcrunch - - - beta12orEarlier - Format of results of a sequence database search using some variant of MSPCrunch. - - - - - - - - - - Smith-Waterman format - - - beta12orEarlier - Format of results of a sequence database search using some variant of Smith Waterman. - - - - - - - - - - dhf - - - The hits are relatives to a SCOP or CATH family and are found from a search of a sequence database. - beta12orEarlier - Format of EMBASSY domain hits file (DHF) of hits (sequences) with domain classification information. - - - - - - - - - - lhf - - - beta12orEarlier - Format of EMBASSY ligand hits file (LHF) of database hits (sequences) with ligand classification information. - The hits are putative ligand-binding sequences and are found from a search of a sequence database. - - - - - - - - - - InterPro hits format - - - Results format for searches of the InterPro database. - beta12orEarlier - - - - - - - - - - InterPro protein view report format - - Format of results of a search of the InterPro database showing matches of query protein sequence(s) to InterPro entries. - The report includes a classification of regions in a query protein sequence which are assigned to a known InterPro protein family or group. - beta12orEarlier - - - - - - - - - - InterPro match table format - - Format of results of a search of the InterPro database showing matches between protein sequence(s) and signatures for an InterPro entry. 
- beta12orEarlier - The table presents matches between query proteins (rows) and signature methods (columns) for this entry. Alternatively the sequence(s) might be from from the InterPro entry itself. The match position in the protein sequence and match status (true positive, false positive etc) are indicated. - - - - - - - - - - HMMER Dirichlet prior - - - beta12orEarlier - Dirichlet distribution HMMER format. - - - - - - - - - - MEME Dirichlet prior - - - beta12orEarlier - Dirichlet distribution MEME format. - - - - - - - - - - HMMER emission and transition - - - Format of a report from the HMMER package on the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - prosite-pattern - - - Format of a regular expression pattern from the Prosite database. - beta12orEarlier - - - - - - - - - - EMBOSS sequence pattern - - - Format of an EMBOSS sequence pattern. - beta12orEarlier - - - - - - - - - - meme-motif - - - A motif in the format generated by the MEME program. - beta12orEarlier - - - - - - - - - - prosite-profile - - - Sequence profile (sequence classifier) format used in the PROSITE database. - beta12orEarlier - - - - - - - - - - JASPAR format - - - beta12orEarlier - A profile (sequence classifier) in the format used in the JASPAR database. - - - - - - - - - - MEME background Markov model - - - Format of the model of random sequences used by MEME. - beta12orEarlier - - - - - - - - - - HMMER format - - - Format of a hidden Markov model representation used by the HMMER package. - beta12orEarlier - - - - - - - - - - HMMER-aln - - - - beta12orEarlier - FASTA-style format for multiple sequences aligned by HMMER package to an HMM. - - - - - - - - - - DIALIGN format - - - Format of multiple sequences aligned by DIALIGN package. - beta12orEarlier - - - - - - - - - - daf - - - The format is clustal-like and includes annotation of domain family classification information. 
- EMBASSY 'domain alignment file' (DAF) format, containing a sequence alignment of protein domains belonging to the same SCOP or CATH family. - beta12orEarlier - - - - - - - - - - Sequence-MEME profile alignment - - - beta12orEarlier - Format for alignment of molecular sequences to MEME profiles (position-dependent scoring matrices) as generated by the MAST tool from the MEME package. - - - - - - - - - - HMMER profile alignment (sequences versus HMMs) - - - Format used by the HMMER package for an alignment of a sequence against a hidden Markov model database. - beta12orEarlier - - - - - - - - - - HMMER profile alignment (HMM versus sequences) - - - Format used by the HMMER package for of an alignment of a hidden Markov model against a sequence database. - beta12orEarlier - - - - - - - - - - Phylip distance matrix - - - Data Type must include the distance matrix, probably as pairs of sequence identifiers with a distance (integer or float). - beta12orEarlier - Format of PHYLIP phylogenetic distance matrix data. - - - - - - - - - - ClustalW dendrogram - - - beta12orEarlier - Dendrogram (tree file) format generated by ClustalW. - - - - - - - - - - Phylip tree raw - - - Raw data file format used by Phylip from which a phylogenetic tree is directly generated or plotted. - beta12orEarlier - - - - - - - - - - Phylip continuous quantitative characters - - - beta12orEarlier - PHYLIP file format for continuous quantitative character data. - - - - - - - - - - Phylogenetic property values format - - Format of phylogenetic property data. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Phylip character frequencies format - - - beta12orEarlier - PHYLIP file format for phylogenetics character frequency data. - - - - - - - - - - Phylip discrete states format - - - Format of PHYLIP discrete states data. - beta12orEarlier - - - - - - - - - - Phylip cliques format - - - beta12orEarlier - Format of PHYLIP cliques data. 
- - - - - - - - - - Phylip tree format - - - Phylogenetic tree data format used by the PHYLIP program. - beta12orEarlier - - - - - - - - - - TreeBASE format - - - beta12orEarlier - The format of an entry from the TreeBASE database of phylogenetic data. - - - - - - - - - - TreeFam format - - - beta12orEarlier - The format of an entry from the TreeFam database of phylogenetic data. - - - - - - - - - - Phylip tree distance format - - - Format for distances, such as Branch Score distance, between two or more phylogenetic trees as used by the Phylip package. - beta12orEarlier - - - - - - - - - - dssp - - - beta12orEarlier - The DSSP database is built using the DSSP application which defines secondary structure, geometrical features and solvent exposure of proteins, given atomic coordinates in PDB format. - Format of an entry from the DSSP database (Dictionary of Secondary Structure in Proteins). - - - - - - - - - - hssp - - - Entry format of the HSSP database (Homology-derived Secondary Structure in Proteins). - beta12orEarlier - - - - - - - - - - Dot-bracket format - - - beta12orEarlier - Format of RNA secondary structure in dot-bracket notation, originally generated by the Vienna RNA package/server. - Vienna RNA secondary structure format - Vienna RNA format - - - - - - - - - - Vienna local RNA secondary structure format - - - Format of local RNA secondary structure components with free energy values, generated by the Vienna RNA package/server. - beta12orEarlier - - - - - - - - - - PDB database entry format - - - - - - - - beta12orEarlier - PDB entry format - Format of an entry (or part of an entry) from the PDB database. - - - - - - - - - - PDB - - - PDB format - beta12orEarlier - Entry format of PDB database in PDB format. - - - - - - - - - - mmCIF - - - Chemical MIME (http://www.ch.ic.ac.uk/chemime): chemical/x-mmcif - Entry format of PDB database in mmCIF format. 
- beta12orEarlier - mmcif - - - - - - - - - - PDBML - - - Entry format of PDB database in PDBML (XML) format. - beta12orEarlier - - - - - - - - - - Domainatrix 3D-1D scoring matrix format - - beta12orEarlier - true - beta12orEarlier - Format of a matrix of 3D-1D scores used by the EMBOSS Domainatrix applications. - - - - - - - - - - aaindex - - - Amino acid index format used by the AAindex database. - beta12orEarlier - - - - - - - - - - IntEnz enzyme report format - - beta12orEarlier - beta12orEarlier - Format of an entry from IntEnz (The Integrated Relational Enzyme Database). - IntEnz is the master copy of the Enzyme Nomenclature, the recommendations of the NC-IUBMB on the Nomenclature and Classification of Enzyme-Catalysed Reactions. - true - - - - - - - - - - BRENDA enzyme report format - - true - Format of an entry from the BRENDA enzyme database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - KEGG REACTION enzyme report format - - true - beta12orEarlier - Format of an entry from the KEGG REACTION database of biochemical reactions. - beta12orEarlier - - - - - - - - - - KEGG ENZYME enzyme report format - - beta12orEarlier - true - Format of an entry from the KEGG ENZYME database. - beta12orEarlier - - - - - - - - - - REBASE proto enzyme report format - - Format of an entry from the proto section of the REBASE enzyme database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - REBASE withrefm enzyme report format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the withrefm section of the REBASE enzyme database. - - - - - - - - - - Pcons report format - - - Format of output of the Pcons Model Quality Assessment Program (MQAP). - beta12orEarlier - Pcons ranks protein models by assessing their quality based on the occurrence of recurring common three-dimensional structural patterns. 
Pcons returns a score reflecting the overall global quality and a score for each individual residue in the protein reflecting the local residue quality. - - - - - - - - - - ProQ report format - - - beta12orEarlier - ProQ is a neural network-based predictor that predicts the quality of a protein model based on the number of structural features. - Format of output of the ProQ protein model quality predictor. - - - - - - - - - - SMART domain assignment report format - - beta12orEarlier - true - Format of SMART domain assignment data. - The SMART output file includes data on genetically mobile domains / analysis of domain architectures, including phyletic distributions, functional class, tertiary structures and functionally important residues. - beta12orEarlier - - - - - - - - - - BIND entry format - - Entry format for the BIND database of protein interaction. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - IntAct entry format - - beta12orEarlier - beta12orEarlier - Entry format for the IntAct database of protein interaction. - true - - - - - - - - - - InterPro entry format - - Entry format for the InterPro database of protein signatures (sequence classifiers) and classified sequences. - true - beta12orEarlier - This includes signature metadata, sequence references and a reference to the signature itself. There is normally a header (entry accession numbers and name), abstract, taxonomy information, example proteins etc. Each entry also includes a match list which give a number of different views of the signature matches for the sequences in each InterPro entry. - beta12orEarlier - - - - - - - - - - InterPro entry abstract format - - true - beta12orEarlier - References are included and a functional inference is made where possible. - beta12orEarlier - Entry format for the textual abstract of signatures in an InterPro entry and its protein matches. - - - - - - - - - - Gene3D entry format - - Entry format for the Gene3D protein secondary database. 
- true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PIRSF entry format - - beta12orEarlier - Entry format for the PIRSF protein secondary database. - true - beta12orEarlier - - - - - - - - - - PRINTS entry format - - beta12orEarlier - beta12orEarlier - true - Entry format for the PRINTS protein secondary database. - - - - - - - - - - Panther Families and HMMs entry format - - beta12orEarlier - beta12orEarlier - Entry format for the Panther library of protein families and subfamilies. - true - - - - - - - - - - Pfam entry format - - Entry format for the Pfam protein secondary database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - SMART entry format - - true - beta12orEarlier - Entry format for the SMART protein secondary database. - beta12orEarlier - - - - - - - - - - Superfamily entry format - - Entry format for the Superfamily protein secondary database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - TIGRFam entry format - - beta12orEarlier - true - Entry format for the TIGRFam protein secondary database. - beta12orEarlier - - - - - - - - - - ProDom entry format - - Entry format for the ProDom protein domain classification database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FSSP entry format - - Entry format for the FSSP database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - findkm - - - beta12orEarlier - A report format for the kinetics of enzyme-catalysed reaction(s) in a format generated by EMBOSS findkm. This includes Michaelis Menten plot, Hanes Woolf plot, Michaelis Menten constant (Km) and maximum velocity (Vmax). - - - - - - - - - - Ensembl gene report format - - beta12orEarlier - Entry format of Ensembl genome database. - beta12orEarlier - true - - - - - - - - - - DictyBase gene report format - - true - beta12orEarlier - Entry format of DictyBase genome database. 
- beta12orEarlier - - - - - - - - - - CGD gene report format - - beta12orEarlier - true - beta12orEarlier - Entry format of Candida Genome database. - - - - - - - - - - DragonDB gene report format - - beta12orEarlier - Entry format of DragonDB genome database. - beta12orEarlier - true - - - - - - - - - - EcoCyc gene report format - - Entry format of EcoCyc genome database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FlyBase gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of FlyBase genome database. - - - - - - - - - - Gramene gene report format - - beta12orEarlier - beta12orEarlier - Entry format of Gramene genome database. - true - - - - - - - - - - KEGG GENES gene report format - - true - beta12orEarlier - Entry format of KEGG GENES genome database. - beta12orEarlier - - - - - - - - - - MaizeGDB gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Maize genetics and genomics database (MaizeGDB). - - - - - - - - - - MGD gene report format - - Entry format of the Mouse Genome Database (MGD). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RGD gene report format - - true - beta12orEarlier - Entry format of the Rat Genome Database (RGD). - beta12orEarlier - - - - - - - - - - SGD gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of the Saccharomyces Genome Database (SGD). - - - - - - - - - - GeneDB gene report format - - Entry format of the Sanger GeneDB genome database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - TAIR gene report format - - beta12orEarlier - beta12orEarlier - Entry format of The Arabidopsis Information Resource (TAIR) genome database. - true - - - - - - - - - - WormBase gene report format - - Entry format of the WormBase genomes database. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - ZFIN gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Zebrafish Information Network (ZFIN) genome database. - - - - - - - - - - TIGR gene report format - - true - Entry format of the TIGR genome database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - dbSNP polymorphism report format - - beta12orEarlier - Entry format for the dbSNP database. - true - beta12orEarlier - - - - - - - - - - OMIM entry format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the OMIM database of genotypes and phenotypes. - - - - - - - - - - HGVbase entry format - - true - Format of a record from the HGVbase database of genotypes and phenotypes. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HIVDB entry format - - beta12orEarlier - beta12orEarlier - true - Format of a record from the HIVDB database of genotypes and phenotypes. - - - - - - - - - - KEGG DISEASE entry format - - beta12orEarlier - Format of an entry from the KEGG DISEASE database. - true - beta12orEarlier - - - - - - - - - - Primer3 primer - - - Report format on PCR primers and hybridization oligos as generated by Whitehead primer3 program. - beta12orEarlier - - - - - - - - - - ABI - - - A format of raw sequence read data from an Applied Biosystems sequencing machine. - beta12orEarlier - - - - - - - - - - mira - - - Format of MIRA sequence trace information file. - beta12orEarlier - - - - - - - - - - CAF - - - Common Assembly Format (CAF). A sequence assembly format including contigs, base-call qualities, and other metadata. - beta12orEarlier - - - - - - - - - - - - exp - - - Sequence assembly project file EXP format. - beta12orEarlier - - - - - - - - - - SCF - - - Staden Chromatogram Files format (SCF) of base-called sequence reads, qualities, and other metadata. 
- beta12orEarlier - - - - - - - - - - - - PHD - - - beta12orEarlier - PHD sequence trace format to store serialised chromatogram data (reads). - - - - - - - - - - - - dat - - - - - - - - - beta12orEarlier - Format of Affymetrix data file of raw image data. - Affymetrix image data file format - - - - - - - - - - cel - - - - - - - - - beta12orEarlier - Affymetrix probe raw data format - Format of Affymetrix data file of information about (raw) expression levels of the individual probes. - - - - - - - - - - affymetrix - - - Format of affymetrix gene cluster files (hc-genes.txt, hc-chips.txt) from hierarchical clustering. - beta12orEarlier - - - - - - - - - - ArrayExpress entry format - - beta12orEarlier - true - Entry format for the ArrayExpress microarrays database. - beta12orEarlier - - - - - - - - - - affymetrix-exp - - - Affymetrix data file format for information about experimental conditions and protocols. - Affymetrix experimental conditions data file format - beta12orEarlier - - - - - - - - - - CHP - - - - - - - - - Affymetrix probe normalised data format - beta12orEarlier - Format of Affymetrix data file of information about (normalised) expression levels of the individual probes. - - - - - - - - - - EMDB entry format - - beta12orEarlier - Format of an entry from the Electron Microscopy DataBase (EMDB). - true - beta12orEarlier - - - - - - - - - - KEGG PATHWAY entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PATHWAY database of pathway maps for molecular interactions and reaction networks. - true - - - - - - - - - - MetaCyc entry format - - true - beta12orEarlier - The format of an entry from the MetaCyc metabolic pathways database. - beta12orEarlier - - - - - - - - - - HumanCyc entry format - - The format of a report from the HumanCyc metabolic pathways database. 
- true - beta12orEarlier - beta12orEarlier - - - - - - - - - - INOH entry format - - beta12orEarlier - true - The format of an entry from the INOH signal transduction pathways database. - beta12orEarlier - - - - - - - - - - PATIKA entry format - - beta12orEarlier - The format of an entry from the PATIKA biological pathways database. - beta12orEarlier - true - - - - - - - - - - Reactome entry format - - beta12orEarlier - The format of an entry from the reactome biological pathways database. - true - beta12orEarlier - - - - - - - - - - aMAZE entry format - - beta12orEarlier - true - The format of an entry from the aMAZE biological pathways and molecular interactions database. - beta12orEarlier - - - - - - - - - - CPDB entry format - - The format of an entry from the CPDB database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Panther Pathways entry format - - beta12orEarlier - true - beta12orEarlier - The format of an entry from the Panther Pathways database. - - - - - - - - - - Taverna workflow format - - - Format of Taverna workflows. - beta12orEarlier - - - - - - - - - - BioModel mathematical model format - - beta12orEarlier - beta12orEarlier - Format of mathematical models from the BioModel database. - true - Models are annotated and linked to relevant data resources, such as publications, databases of compounds and pathways, controlled vocabularies, etc. - - - - - - - - - - KEGG LIGAND entry format - - The format of an entry from the KEGG LIGAND chemical database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - KEGG COMPOUND entry format - - beta12orEarlier - The format of an entry from the KEGG COMPOUND database. - true - beta12orEarlier - - - - - - - - - - KEGG PLANT entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PLANT database. - true - - - - - - - - - - KEGG GLYCAN entry format - - true - beta12orEarlier - The format of an entry from the KEGG GLYCAN database. 
- beta12orEarlier - - - - - - - - - - PubChem entry format - - beta12orEarlier - The format of an entry from PubChem. - true - beta12orEarlier - - - - - - - - - - ChemSpider entry format - - beta12orEarlier - The format of an entry from a database of chemical structures and property predictions. - beta12orEarlier - true - - - - - - - - - - ChEBI entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from Chemical Entities of Biological Interest (ChEBI). - true - ChEBI includes an ontological classification defining relations between entities or classes of entities. - - - - - - - - - - MSDchem ligand dictionary entry format - - The format of an entry from the MSDchem ligand dictionary. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - HET group dictionary entry format - - - The format of an entry from the HET group dictionary (HET groups from PDB files). - beta12orEarlier - - - - - - - - - - KEGG DRUG entry format - - The format of an entry from the KEGG DRUG database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PubMed citation - - - beta12orEarlier - Format of bibliographic reference as used by the PubMed database. - - - - - - - - - - Medline Display Format - - - beta12orEarlier - Format for abstracts of scientific articles from the Medline database. - Bibliographic reference information including citation information is included - - - - - - - - - - CiteXplore-core - - - beta12orEarlier - CiteXplore 'core' citation format including title, journal, authors and abstract. - - - - - - - - - - CiteXplore-all - - - CiteXplore 'all' citation format includes all known details such as Mesh terms and cross-references. - beta12orEarlier - - - - - - - - - - pmc - - - beta12orEarlier - Article format of the PubMed Central database. - - - - - - - - - - iHOP text mining abstract format - - - beta12orEarlier - iHOP abstract format. 
- - - - - - - - - - Oscar3 - - - Oscar 3 performs chemistry-specific parsing of chemical documents. It attempts to identify chemical names, ontology concepts and chemical data from a document. - Text mining abstract format from the Oscar 3 application. - beta12orEarlier - - - - - - - - - - PDB atom record format - - true - beta13 - beta12orEarlier - Format of an ATOM record (describing data for an individual atom) from a PDB file. - - - - - - - - - - CATH chain report format - - The report (for example http://www.cathdb.info/chain/1cukA) includes chain identifiers, domain identifiers and CATH codes for domains in a given protein chain. - beta12orEarlier - Format of CATH domain classification information for a polypeptide chain. - beta12orEarlier - true - - - - - - - - - - CATH PDB report format - - beta12orEarlier - beta12orEarlier - true - Format of CATH domain classification information for a protein PDB file. - The report (for example http://www.cathdb.info/pdb/1cuk) includes chain identifiers, domain identifiers and CATH codes for domains in a given PDB file. - - - - - - - - - - NCBI gene report format - - true - Entry (gene) format of the NCBI database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GeneIlluminator gene report format - - Report format for biological functions associated with a gene name and its alternative names (synonyms, homonyms), as generated by the GeneIlluminator service. - This includes a gene name and abbreviation of the name which may be in a name space indicating the gene status and relevant organisation. - beta12orEarlier - beta12orEarlier - Moby:GI_Gene - true - - - - - - - - - - BacMap gene card format - - Format of a report on the DNA and protein sequences for a given gene label from a bacterial chromosome maps from the BacMap database. 
- true - beta12orEarlier - beta12orEarlier - Moby:BacMapGeneCard - - - - - - - - - - ColiCard report format - - Format of a report on Escherichia coli genes, proteins and molecules from the CyberCell Database (CCDB). - true - beta12orEarlier - Moby:ColiCard - beta12orEarlier - - - - - - - - - - PlasMapper TextMap - - - beta12orEarlier - Map of a plasmid (circular DNA) in PlasMapper TextMap format. - - - - - - - - - - newick - - - nh - beta12orEarlier - Phylogenetic tree Newick (text) format. - - - - - - - - - - TreeCon format - - - beta12orEarlier - Phylogenetic tree TreeCon (text) format. - - - - - - - - - - Nexus format - - - Phylogenetic tree Nexus (text) format. - beta12orEarlier - - - - - - - - - - Format - - - - http://en.wikipedia.org/wiki/File_format - http://purl.org/biotop/biotop.owl#MachineLanguage - File format - Data model - http://www.onto-med.de/ontologies/gfo.owl#Symbol_structure - Exchange format - "http://purl.obolibrary.org/obo/IAO_0000098" - http://semanticscience.org/resource/SIO_000612 - http://semanticscience.org/resource/SIO_000618 - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - "http://purl.org/dc/elements/1.1/format" - http://wsio.org/compression_004 - A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere. - http://en.wikipedia.org/wiki/List_of_file_formats - http://www.ifomis.org/bfo/1.1/snap#Quality - Data format - http://purl.org/biotop/biotop.owl#Quality - The main focus in EDAM lies on formats as means of structuring data exchanged between different tools or resources. The serialisation, compression, or encoding of concrete data formats/models is not in scope of EDAM. Format 'is format of' Data. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - - - - - File format - File format denotes only formats of a computer file, but the same formats apply also to data blobs or exchanged messages. 
- - - - - Data model - A defined data format has its implicit or explicit data model, and EDAM does not distinguish the two. Some data models however do not have any standard way of serialisation into an exchange format, and those are thus not considered formats in EDAM. (Remark: even broader - or closely related - term to 'Data model' would be an 'Information model'.) - - - - - - - - - - Atomic data format - - beta12orEarlier - beta13 - Data format for an individual atom. - true - - - - - - - - - - Sequence record format - - - - - - - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence feature annotation format - - - - - - - - beta12orEarlier - Data format for molecular sequence feature information. - - - - - - - - - - Alignment format - - - - - - - - Data format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - acedb - - beta12orEarlier - ACEDB sequence format. - - - - - - - - - - clustal sequence format - - true - beta12orEarlier - Clustalw output format. - beta12orEarlier - - - - - - - - - - codata - - - Codata entry format. - beta12orEarlier - - - - - - - - - - dbid - - beta12orEarlier - Fasta format variant with database name before ID. - - - - - - - - - - EMBL format - - - EMBL entry format. - EMBL sequence format - EMBL - beta12orEarlier - - - - - - - - - - Staden experiment format - - - Staden experiment file format. - beta12orEarlier - - - - - - - - - - FASTA - - - beta12orEarlier - FASTA format - FASTA sequence format - FASTA format including NCBI-style IDs. - - - - - - - - - - FASTQ - - FASTQ short read format ignoring quality scores. - beta12orEarlier - FASTAQ - fq - - - - - - - - - - FASTQ-illumina - - FASTQ Illumina 1.3 short read format. - beta12orEarlier - - - - - - - - - - FASTQ-sanger - - FASTQ short read format with phred quality. - beta12orEarlier - - - - - - - - - - FASTQ-solexa - - FASTQ Solexa/Illumina 1.0 short read format. 
- beta12orEarlier - - - - - - - - - - fitch program - - - Fitch program format. - beta12orEarlier - - - - - - - - - - GCG - - - GCG SSF - beta12orEarlier - GCG SSF (single sequence file) file format. - GCG sequence file format. - - - - - - - - - - GenBank format - - - beta12orEarlier - Genbank entry format. - GenBank - - - - - - - - - - genpept - - beta12orEarlier - Genpept protein entry format. - Currently identical to refseqp format - - - - - - - - - - GFF2-seq - - - GFF feature file format with sequence in the header. - beta12orEarlier - - - - - - - - - - GFF3-seq - - - GFF3 feature file format with sequence. - beta12orEarlier - - - - - - - - - - giFASTA format - - FASTA sequence format including NCBI-style GIs. - beta12orEarlier - - - - - - - - - - hennig86 - - - beta12orEarlier - Hennig86 output sequence format. - - - - - - - - - - ig - - - Intelligenetics sequence format. - beta12orEarlier - - - - - - - - - - igstrict - - - beta12orEarlier - Intelligenetics sequence format (strict version). - - - - - - - - - - jackknifer - - - Jackknifer interleaved and non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - mase format - - - beta12orEarlier - Mase program sequence format. - - - - - - - - - - mega-seq - - - beta12orEarlier - Mega interleaved and non-interleaved sequence format. - - - - - - - - - - MSF - - GCG MSF - beta12orEarlier - GCG MSF (multiple sequence file) file format. - - - - - - - - - - nbrf/pir - - NBRF/PIR entry sequence format. - nbrf - beta12orEarlier - pir - - - - - - - - - - nexus-seq - - - - beta12orEarlier - Nexus/paup interleaved sequence format. - - - - - - - - - - pdbatom - - - - pdb format in EMBOSS. - beta12orEarlier - PDB sequence format (ATOM lines). - - - - - - - - - - pdbatomnuc - - - - beta12orEarlier - pdbnuc format in EMBOSS. - PDB nucleotide sequence format (ATOM lines). - - - - - - - - - - pdbseqresnuc - - - - pdbnucseq format in EMBOSS. - PDB nucleotide sequence format (SEQRES lines). 
- beta12orEarlier - - - - - - - - - - pdbseqres - - - - PDB sequence format (SEQRES lines). - beta12orEarlier - pdbseq format in EMBOSS. - - - - - - - - - - Pearson format - - beta12orEarlier - Plain old FASTA sequence format (unspecified format for IDs). - - - - - - - - - - phylip sequence format - - beta12orEarlier - Phylip interleaved sequence format. - true - beta12orEarlier - - - - - - - - - - phylipnon sequence format - - true - Phylip non-interleaved sequence format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - raw - - - beta12orEarlier - Raw sequence format with no non-sequence characters. - - - - - - - - - - refseqp - - - beta12orEarlier - Refseq protein entry sequence format. - Currently identical to genpept format - - - - - - - - - - selex sequence format - - beta12orEarlier - true - beta12orEarlier - Selex sequence format. - - - - - - - - - - Staden format - - - beta12orEarlier - Staden suite sequence format. - - - - - - - - - - - - - - Stockholm format - - - Stockholm multiple sequence alignment format (used by Pfam and Rfam). - beta12orEarlier - - - - - - - - - - - - strider format - - - DNA strider output sequence format. - beta12orEarlier - - - - - - - - - - UniProtKB format - - UniProt format - SwissProt format - beta12orEarlier - UniProtKB entry sequence format. - - - - - - - - - - plain text format (unformatted) - - beta12orEarlier - Plain text sequence format (essentially unformatted). - - - - - - - - - - treecon sequence format - - true - beta12orEarlier - beta12orEarlier - Treecon output sequence format. - - - - - - - - - - ASN.1 sequence format - - - NCBI ASN.1-based sequence format. - beta12orEarlier - - - - - - - - - - DAS format - - - das sequence format - DAS sequence (XML) format (any type). - beta12orEarlier - - - - - - - - - - dasdna - - - beta12orEarlier - DAS sequence (XML) format (nucleotide-only). - The use of this format is deprecated. 
- - - - - - - - - - debug-seq - - - EMBOSS debugging trace sequence format of full internal data content. - beta12orEarlier - - - - - - - - - - jackknifernon - - - beta12orEarlier - Jackknifer output sequence non-interleaved format. - - - - - - - - - - meganon sequence format - - beta12orEarlier - beta12orEarlier - Mega non-interleaved output sequence format. - true - - - - - - - - - - NCBI format - - NCBI FASTA sequence format with NCBI-style IDs. - beta12orEarlier - There are several variants of this. - - - - - - - - - - nexusnon - - - - Nexus/paup non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - GFF2 - - beta12orEarlier - General Feature Format (GFF) of sequence features. - - - - - - - - - - - - GFF3 - - beta12orEarlier - Generic Feature Format version 3 (GFF3) of sequence features. - - - - - - - - - - - - pir - - true - 1.7 - PIR feature format. - beta12orEarlier - - - - - - - - - - swiss feature - - true - Swiss-Prot feature format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - DASGFF - - - DAS GFF (XML) feature format. - das feature - DASGFF feature - beta12orEarlier - - - - - - - - - - debug-feat - - - EMBOSS debugging trace feature format of full internal data content. - beta12orEarlier - - - - - - - - - - EMBL feature - - beta12orEarlier - EMBL feature format. - true - beta12orEarlier - - - - - - - - - - GenBank feature - - beta12orEarlier - Genbank feature format. - beta12orEarlier - true - - - - - - - - - - ClustalW format - - - clustal - beta12orEarlier - ClustalW format for (aligned) sequences. - - - - - - - - - - debug - - - EMBOSS alignment format for debugging trace of full internal data content. - beta12orEarlier - - - - - - - - - - FASTA-aln - - - beta12orEarlier - Fasta format for (aligned) sequences. - - - - - - - - - - markx0 - - beta12orEarlier - Pearson MARKX0 alignment format. - - - - - - - - - - markx1 - - Pearson MARKX1 alignment format. 
- beta12orEarlier - - - - - - - - - - markx10 - - beta12orEarlier - Pearson MARKX10 alignment format. - - - - - - - - - - markx2 - - beta12orEarlier - Pearson MARKX2 alignment format. - - - - - - - - - - markx3 - - beta12orEarlier - Pearson MARKX3 alignment format. - - - - - - - - - - match - - - Alignment format for start and end of matches between sequence pairs. - beta12orEarlier - - - - - - - - - - mega - - Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - meganon - - Mega non-interleaved format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - msf alignment format - - true - beta12orEarlier - beta12orEarlier - MSF format for (aligned) sequences. - - - - - - - - - - nexus alignment format - - Nexus/paup format for (aligned) sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - nexusnon alignment format - - beta12orEarlier - true - Nexus/paup non-interleaved format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - pair - - EMBOSS simple sequence pair alignment format. - beta12orEarlier - - - - - - - - - - PHYLIP format - - phy - beta12orEarlier - ph - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - PHYLIP interleaved format - Phylip format for (aligned) sequences. - - - - - - - - - - phylipnon - - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - beta12orEarlier - PHYLIP sequential format - Phylip non-interleaved format for (aligned) sequences. - - - - - - - - - - scores format - - - Alignment format for score values for pairs of sequences. - beta12orEarlier - - - - - - - - - - selex - - - - beta12orEarlier - SELEX format for (aligned) sequences. - - - - - - - - - - EMBOSS simple format - - - EMBOSS simple multiple alignment format. - beta12orEarlier - - - - - - - - - - srs format - - - beta12orEarlier - Simple multiple sequence (alignment) format for SRS. 
- - - - - - - - - - srspair - - - beta12orEarlier - Simple sequence pair (alignment) format for SRS. - - - - - - - - - - T-Coffee format - - - T-Coffee program alignment format. - beta12orEarlier - - - - - - - - - - TreeCon-seq - - - - Treecon format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format - - - - - - - - Data format for a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Biological pathway or network format - - - - - - - - beta12orEarlier - Data format for a biological pathway or network. - - - - - - - - - - Sequence-profile alignment format - - - - - - - - beta12orEarlier - Data format for a sequence-profile alignment. - - - - - - - - - - Sequence-profile alignment (HMM) format - - beta12orEarlier - beta12orEarlier - true - Data format for a sequence-HMM profile alignment. - - - - - - - - - - Amino acid index format - - - - - - - - Data format for an amino acid index. - beta12orEarlier - - - - - - - - - - Article format - - - - - - - - beta12orEarlier - Literature format - Data format for a full-text scientific article. - - - - - - - - - - Text mining report format - - - - - - - - beta12orEarlier - Data format for an abstract (report) from text mining. - - - - - - - - - - Enzyme kinetics report format - - - - - - - - Data format for reports on enzyme kinetics. - beta12orEarlier - - - - - - - - - - Small molecule report format - - - - - - - - beta12orEarlier - Chemical compound annotation format - Format of a report on a chemical compound. - - - - - - - - - - Gene annotation format - - - - - - - - Format of a report on a particular locus, gene, gene system or groups of genes. - beta12orEarlier - Gene features format - - - - - - - - - - Workflow format - - beta12orEarlier - Format of a workflow. - - - - - - - - - - Tertiary structure format - - beta12orEarlier - Data format for a molecular tertiary structure. - - - - - - - - - - Biological model format - - Data format for a biological model. 
- beta12orEarlier - 1.2 - true - - - - - - - - - - Chemical formula format - - - - - - - - beta12orEarlier - Text format of a chemical formula. - - - - - - - - - - Phylogenetic character data format - - - - - - - - beta12orEarlier - Format of raw (unplotted) phylogenetic data. - - - - - - - - - - Phylogenetic continuous quantitative character format - - - - - - - - Format of phylogenetic continuous quantitative character data. - beta12orEarlier - - - - - - - - - - Phylogenetic discrete states format - - - - - - - - Format of phylogenetic discrete states data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (cliques) format - - - - - - - - Format of phylogenetic cliques data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (invariants) format - - - - - - - - beta12orEarlier - Format of phylogenetic invariants data. - - - - - - - - - - Electron microscopy model format - - beta12orEarlier - true - beta12orEarlier - Annotation format for electron microscopy models. - - - - - - - - - - Phylogenetic tree report (tree distances) format - - - - - - - - Format for phylogenetic tree distance data. - beta12orEarlier - - - - - - - - - - Polymorphism report format - - beta12orEarlier - true - 1.0 - Format for sequence polymorphism data. - - - - - - - - - - Protein family report format - - - - - - - - beta12orEarlier - Format for reports on a protein family. - - - - - - - - - - Protein interaction format - - - - - - - - beta12orEarlier - Format for molecular interaction data. - Molecular interaction format - - - - - - - - - - Sequence assembly format - - - - - - - - beta12orEarlier - Format for sequence assembly data. - - - - - - - - - - Microarray experiment data format - - Format for information about a microarray experimental per se (not the data generated from that experiment). - beta12orEarlier - - - - - - - - - - Sequence trace format - - - - - - - - Format for sequence trace data (i.e. including base call information). 
- beta12orEarlier - - - - - - - - - - Gene expression report format - - - - - - - - Gene expression data format - Format of a file of gene expression data, e.g. a gene expression matrix or profile. - beta12orEarlier - - - - - - - - - - Genotype and phenotype annotation format - - beta12orEarlier - true - Format of a report on genotype / phenotype information. - beta12orEarlier - - - - - - - - - - Map format - - - - - - - - Format of a map of (typically one) molecular sequence annotated with features. - beta12orEarlier - - - - - - - - - - Nucleic acid features (primers) format - - beta12orEarlier - Format of a report on PCR primers or hybridization oligos in a nucleic acid sequence. - - - - - - - - - - Protein report format - - - - - - - - Format of a report of general information about a specific protein. - beta12orEarlier - - - - - - - - - - Protein report (enzyme) format - - Format of a report of general information about a specific enzyme. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix format - - - - - - - - beta12orEarlier - Format of a matrix of 3D-1D scores (amino acid environment probabilities). - - - - - - - - - - Protein structure report (quality evaluation) format - - - - - - - - Format of a report on the quality of a protein three-dimensional model. - beta12orEarlier - - - - - - - - - - Database hits (sequence) format - - - - - - - - Format of a report on sequence hits and associated data from searching a sequence database. - beta12orEarlier - - - - - - - - - - Sequence distance matrix format - - - - - - - - beta12orEarlier - Format of a matrix of genetic distances between molecular sequences. - - - - - - - - - - Sequence motif format - - - - - - - - Format of a sequence motif. - beta12orEarlier - - - - - - - - - - Sequence profile format - - - - - - - - Format of a sequence profile. - beta12orEarlier - - - - - - - - - - Hidden Markov model format - - - - - - - - beta12orEarlier - Format of a hidden Markov model. 
- - - - - - - - - - Dirichlet distribution format - - - - - - - - Data format of a dirichlet distribution. - beta12orEarlier - - - - - - - - - - HMM emission and transition counts format - - - - - - - - - - - - - - Data format for the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - RNA secondary structure format - - - - - - - - beta12orEarlier - Format for secondary structure (predicted or real) of an RNA molecule. - - - - - - - - - - Protein secondary structure format - - Format for secondary structure (predicted or real) of a protein molecule. - beta12orEarlier - - - - - - - - - - Sequence range format - - - - - - - - beta12orEarlier - Format used to specify range(s) of sequence positions. - - - - - - - - - - pure - - - Alphabet for molecular sequence with possible unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unpure - - - Alphabet for a molecular sequence with possible unknown positions but possibly with non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous sequence - - - Alphabet for a molecular sequence with possible unknown positions but without ambiguity characters. - beta12orEarlier - - - - - - - - - - ambiguous - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions and possible ambiguity characters. - - - - - - - - - - Sequence features (repeats) format - - beta12orEarlier - Format used for map of repeats in molecular (typically nucleotide) sequences. - - - - - - - - - - Nucleic acid features (restriction sites) format - - beta12orEarlier - Format used for report on restriction enzyme recognition sites in nucleotide sequences. - - - - - - - - - - Gene features (coding region) format - - beta12orEarlier - Format used for report on coding regions in nucleotide sequences. - true - 1.10 - - - - - - - - - - Sequence cluster format - - - - - - - - beta12orEarlier - Format used for clusters of molecular sequences. 
- - - - - - - - - - Sequence cluster format (protein) - - Format used for clusters of protein sequences. - beta12orEarlier - - - - - - - - - - Sequence cluster format (nucleic acid) - - Format used for clusters of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Gene cluster format - - true - beta13 - beta12orEarlier - Format used for clusters of genes. - - - - - - - - - - EMBL-like (text) - - - This concept may be used for the many non-standard EMBL-like text formats. - beta12orEarlier - A text format resembling EMBL entry format. - - - - - - - - - - FASTQ-like format (text) - - - A text format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - EMBLXML - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - cdsxml - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - insdxml - - beta12orEarlier - XML format for EMBL entries. - - - - - - - - - - geneseq - - Geneseq sequence format. - beta12orEarlier - - - - - - - - - - UniProt-like (text) - - - A text sequence format resembling uniprotkb entry format. - beta12orEarlier - - - - - - - - - - UniProt format - - beta12orEarlier - true - UniProt entry sequence format. - 1.8 - - - - - - - - - - ipi - - 1.8 - beta12orEarlier - ipi sequence format. - true - - - - - - - - - - medline - - - Abstract format used by MedLine database. - beta12orEarlier - - - - - - - - - - Ontology format - - - - - - - - Format used for ontologies. - beta12orEarlier - - - - - - - - - - OBO format - - beta12orEarlier - A serialisation format conforming to the Open Biomedical Ontologies (OBO) model. - - - - - - - - - - OWL format - - A serialisation format conforming to the Web Ontology Language (OWL) model. - beta12orEarlier - - - - - - - - - - FASTA-like (text) - - - This concept may also be used for the many non-standard FASTA-like formats. 
- http://filext.com/file-extension/FASTA - beta12orEarlier - A text format resembling FASTA format. - - - - - - - - - - Sequence record full format - - 1.8 - beta12orEarlier - Data format for a molecular sequence record, typically corresponding to a full entry from a molecular sequence database. - true - - - - - - - - - - Sequence record lite format - - true - 1.8 - beta12orEarlier - Data format for a molecular sequence record 'lite', typically molecular sequence and minimal metadata, such as an identifier of the sequence and/or a comment. - - - - - - - - - - EMBL format (XML) - - beta12orEarlier - An XML format for EMBL entries. - This is a placeholder for other more specific concepts. It should not normally be used for annotation. - - - - - - - - - - GenBank-like format (text) - - - A text format resembling GenBank entry (plain text) format. - This concept may be used for the non-standard GenBank-like text formats. - beta12orEarlier - - - - - - - - - - Sequence feature table format (text) - - Text format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Strain data format - - Format of a report on organism strain data / cell line. - beta12orEarlier - true - 1.0 - - - - - - - - - - CIP strain data format - - Format for a report of strain data as used for CIP database entries. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - phylip property values - - true - PHYLIP file format for phylogenetic property data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - STRING entry format (HTML) - - beta12orEarlier - true - beta12orEarlier - Entry format (HTML) for the STRING database of protein interaction. - - - - - - - - - - STRING entry format (XML) - - - Entry format (XML) for the STRING database of protein interaction. - beta12orEarlier - - - - - - - - - - GFF - - - GFF feature format (of indeterminate version). - beta12orEarlier - - - - - - - - - - GTF - - Gene Transfer Format (GTF), a restricted version of GFF. 
- beta12orEarlier - - - - - - - - - - - - - FASTA-HTML - - - FASTA format wrapped in HTML elements. - beta12orEarlier - - - - - - - - - - EMBL-HTML - - - EMBL entry format wrapped in HTML elements. - beta12orEarlier - - - - - - - - - - BioCyc enzyme report format - - true - beta12orEarlier - beta12orEarlier - Format of an entry from the BioCyc enzyme database. - - - - - - - - - - ENZYME enzyme report format - - Format of an entry from the Enzyme nomenclature database (ENZYME). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - PseudoCAP gene report format - - true - beta12orEarlier - beta12orEarlier - Format of a report on a gene from the PseudoCAP database. - - - - - - - - - - GeneCards gene report format - - Format of a report on a gene from the GeneCards database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Textual format - - http://filext.com/file-extension/TSV - http://www.iana.org/assignments/media-types/text/plain - Textual format. - Data in text format can be compressed into binary format, or can be a value of an XML element or attribute. Markup formats are not considered textual (or more precisely, not plain-textual). - txt - http://filext.com/file-extension/TXT - Plain text - http://www.iana.org/assignments/media-types/media-types.xhtml#text - beta12orEarlier - - - - - - - - - - HTML - - - - - - - - HTML format. - beta12orEarlier - http://filext.com/file-extension/HTML - Hypertext Markup Language - - - - - - - - - - XML - - Data in XML format can be serialised into text, or binary format. - beta12orEarlier - eXtensible Markup Language (XML) format. - xml - - Extensible Markup Language - - - - - - - - - - - - - Binary format - - Only specific native binary formats are listed under 'Binary format' in EDAM. Generic binary formats - such as any data being zipped, or any XML data being serialised into the Efficient XML Interchange (EXI) format - are not modelled in EDAM. Refer to http://wsio.org/compression_004. 
- beta12orEarlier - Binary format. - - - - - - - - - - URI format - - beta13 - true - Typical textual representation of a URI. - beta12orEarlier - - - - - - - - - - NCI-Nature pathway entry format - - beta12orEarlier - true - The format of an entry from the NCI-Nature pathways database. - beta12orEarlier - - - - - - - - - - Format (typed) - - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. The concept branch under 'Format (typed)' provides an alternative organisation of the concepts nested under the other top-level branches ('Binary', 'HTML', 'RDF', 'Text' and 'XML'. All concepts under here are already included under those branches. - beta12orEarlier - A broad class of format distinguished by the scientific nature of the data that is identified. - - - - - - - - - - BioXSD - - - - - - - - - - - - - - - - - - - - - - - - BioXSD XML format - beta12orEarlier - BioXSD XML format of basic bioinformatics types of data (sequence records, alignments, feature records, references to resources, and more). - - - - - - - - - - - - RDF format - - - beta12orEarlier - A serialisation format conforming to the Resource Description Framework (RDF) model. - - - - - - - - - - GenBank-HTML - - - beta12orEarlier - Genbank entry format wrapped in HTML elements. - - - - - - - - - - Protein features (domains) format - - beta12orEarlier - true - beta12orEarlier - Format of a report on protein features (domain composition). - - - - - - - - - - EMBL-like format - - beta12orEarlier - A format resembling EMBL entry (plain text) format. - This concept may be used for the many non-standard EMBL-like formats. - - - - - - - - - - FASTQ-like format - - A format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - FASTA-like - - This concept may be used for the many non-standard FASTA-like formats. 
- beta12orEarlier - A format resembling FASTA format. - - - - - - - - - - uniprotkb-like format - - - beta12orEarlier - A sequence format resembling uniprotkb entry format. - - - - - - - - - - Sequence feature table format - - - - - - - - Format for a sequence feature table. - beta12orEarlier - - - - - - - - - - OBO - - - beta12orEarlier - OBO ontology text format. - - - - - - - - - - OBO-XML - - - beta12orEarlier - OBO ontology XML format. - - - - - - - - - - Sequence record format (text) - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence record format (XML) - - beta12orEarlier - Data format for a molecular sequence record. - - - - - - - - - - Sequence feature table format (XML) - - XML format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Alignment format (text) - - Text format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Alignment format (XML) - - XML format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format (text) - - beta12orEarlier - Text format for a phylogenetic tree. - - - - - - - - - - Phylogenetic tree format (XML) - - beta12orEarlier - XML format for a phylogenetic tree. - - - - - - - - - - EMBL-like (XML) - - - An XML format resembling EMBL entry format. - This concept may be used for the any non-standard EMBL-like XML formats. - beta12orEarlier - - - - - - - - - - GenBank-like format - - A format resembling GenBank entry (plain text) format. - beta12orEarlier - This concept may be used for the non-standard GenBank-like formats. - - - - - - - - - - STRING entry format - - beta12orEarlier - Entry format for the STRING database of protein interaction. - beta12orEarlier - true - - - - - - - - - - Sequence assembly format (text) - - beta12orEarlier - Text format for sequence assembly data. 
- - - - - - - - - - Amino acid identifier format - - beta13 - Text format (representation) of amino acid residues. - true - beta12orEarlier - - - - - - - - - - completely unambiguous - - - beta12orEarlier - Alphabet for a molecular sequence without any unknown positions or ambiguity characters. - - - - - - - - - - completely unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure nucleotide - - - Alphabet for a nucleotide sequence (characters ACGTU only) without unknown positions, ambiguity or non-sequence characters . - beta12orEarlier - - - - - - - - - - completely unambiguous pure dna - - - beta12orEarlier - Alphabet for a DNA sequence (characters ACGT only) without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) without unknown positions, ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - Raw sequence format - - - - - - - - http://www.onto-med.de/ontologies/gfo.owl#Symbol_sequence - beta12orEarlier - Format of a raw molecular sequence (i.e. the alphabet used). - - - - - - - - - - BAM - - - - beta12orEarlier - BAM format, the binary, BGZF-formatted compressed version of SAM format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). May contain base-call and alignment qualities and other data. - - - - - - - - - - - - SAM - - - - The format supports short and long reads (up to 128Mbp) produced by different sequencing platforms and is used to hold mapped data within the GATK and across the Broad Institute, the Sanger Centre, and throughout the 1000 Genomes project. - beta12orEarlier - Sequence Alignment/Map (SAM) format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). 
May contain base-call and alignment qualities and other data. - - - - - - - - - - - - SBML - - - Systems Biology Markup Language (SBML), the standard XML format for models of biological processes such as for example metabolism, cell signaling, and gene regulation. - beta12orEarlier - - - - - - - - - - - - completely unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - Bibliographic reference format - - - - - - - - - - - - - - Format of a bibliographic reference. - beta12orEarlier - - - - - - - - - - Sequence annotation track format - - - - - - - - Format of a sequence annotation track. - beta12orEarlier - - - - - - - - - - Alignment format (pair only) - - - - - - - - beta12orEarlier - Data format for molecular sequence alignment information that can hold sequence alignment(s) of only 2 sequences. - - - - - - - - - - Sequence variation annotation format - - - - - - - - Format of sequence variation annotation. - beta12orEarlier - - - - - - - - - - markx0 variant - - - Some variant of Pearson MARKX alignment format. - beta12orEarlier - - - - - - - - - - mega variant - - - - Some variant of Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylip format variant - - - - beta12orEarlier - Some variant of Phylip format for (aligned) sequences. - - - - - - - - - - AB1 - - - beta12orEarlier - AB1 binary format of raw DNA sequence reads (output of Applied Biosystems' sequencing analysis software). Contains an electropherogram and the DNA base sequence. - AB1 uses the generic binary Applied Biosystems, Inc. Format (ABIF). - - - - - - - - - - ACE - - - ACE sequence assembly format including contigs, base-call qualities, and other metadata (version Aug 1998 and onwards). 
- beta12orEarlier - - - - - - - - - - - - BED - - - beta12orEarlier - BED detail format includes 2 additional columns (http://genome.ucsc.edu/FAQ/FAQformat#format1.7) and BED 15 includes 3 additional columns for experiment scores (http://genomewiki.ucsc.edu/index.php/Microarray_track). - Browser Extensible Data (BED) format of sequence annotation track, typically to be displayed in a genome browser. - - - - - - - - - - - - bigBed - - - beta12orEarlier - bigBed format for large sequence annotation tracks, similar to textual BED format. - - - - - - - - - - - - WIG - - - Wiggle format (WIG) of a sequence annotation track that consists of a value for each sequence position. Typically to be displayed in a genome browser. - beta12orEarlier - - - - - - - - - - - - bigWig - - - beta12orEarlier - bigWig format for large sequence annotation tracks that consist of a value for each sequence position. Similar to textual WIG format. - - - - - - - - - - - - PSL - - - - PSL format of alignments, typically generated by BLAT or psLayout. Can be displayed in a genome browser like a sequence annotation track. - beta12orEarlier - - - - - - - - - - - - MAF - - - - Multiple Alignment Format (MAF) supporting alignments of whole genomes with rearrangements, directions, multiple pieces to the alignment, and so forth. - Typically generated by Multiz and TBA aligners; can be displayed in a genome browser like a sequence annotation track. This should not be confused with MIRA Assembly Format or Mutation Annotation Format. - beta12orEarlier - - - - - - - - - - - - 2bit - - - beta12orEarlier - 2bit binary format of nucleotide sequences using 2 bits per nucleotide. In addition encodes unknown nucleotides and lower-case 'masking'. - - - - - - - - - - - - - .nib - - - beta12orEarlier - .nib (nibble) binary format of a nucleotide sequence using 4 bits per nucleotide (including unknown) and its lower-case 'masking'. 
- - - - - - - - - - - - genePred - - - genePred table format for gene prediction tracks. - genePred format has 3 main variations (http://genome.ucsc.edu/FAQ/FAQformat#format9 http://www.broadinstitute.org/software/igv/genePred). They reflect UCSC Browser DB tables. - beta12orEarlier - - - - - - - - - - - - pgSnp - - - Personal Genome SNP (pgSnp) format for sequence variation tracks (indels and polymorphisms), supported by the UCSC Genome Browser. - beta12orEarlier - - - - - - - - - - - - axt - - - beta12orEarlier - axt format of alignments, typically produced from BLASTZ. - - - - - - - - - - - - LAV - - - beta12orEarlier - LAV format of alignments generated by BLASTZ and LASTZ. - - - - - - - - - - - - Pileup - - - beta12orEarlier - Pileup format of alignment of sequences (e.g. sequencing reads) to (a) reference sequence(s). Contains aligned bases per base of the reference sequence(s). - - - - - - - - - - - - VCF - - - beta12orEarlier - Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - - SRF - - - Sequence Read Format (SRF) of sequence trace data. Supports submission to the NCBI Short Read Archive. - beta12orEarlier - - - - - - - - - - - - ZTR - - - ZTR format for storing chromatogram data from DNA sequencing instruments. - beta12orEarlier - - - - - - - - - - - - GVF - - - Genome Variation Format (GVF). A GFF3-compatible format with defined header and attribute tags for sequence variation. - beta12orEarlier - - - - - - - - - - - - BCF - - - beta12orEarlier - BCF, the binary version of Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - Matrix format - - - - - - - - Format of a matrix (array) of numerical values. - beta13 - - - - - - - - - - Protein domain classification format - - - - - - - - Format of data concerning the classification of the sequences and/or structures of protein structural domain(s). 
- beta13 - - - - - - - - - - Raw SCOP domain classification format - - Format of raw SCOP domain classification data files. - These are the parsable data files provided by SCOP. - beta13 - - - - - - - - - - Raw CATH domain classification format - - These are the parsable data files provided by CATH. - beta13 - Format of raw CATH domain classification data files. - - - - - - - - - - CATH domain report format - - Format of summary of domain classification information for a CATH domain. - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - - - - - - - - - - SBRML - - - 1.0 - Systems Biology Result Markup Language (SBRML), the standard XML format for simulated or calculated results (e.g. trajectories) of systems biology models. - - - - - - - - - - - - BioPAX - - BioPAX is an exchange format for pathway data, with its data model defined in OWL. - 1.0 - - - - - - - - - - - - EBI Application Result XML - - - - EBI Application Result XML is a format returned by sequence similarity search Web services at EBI. - 1.0 - - - - - - - - - - - - PSI MI XML (MIF) - - - 1.0 - XML Molecular Interaction Format (MIF), standardised by HUPO PSI MI. - MIF - - - - - - - - - - - - phyloXML - - - phyloXML is a standardised XML format for phylogenetic trees, networks, and associated data. - 1.0 - - - - - - - - - - - - NeXML - - - 1.0 - NeXML is a standardised XML format for rich phyloinformatic data. - - - - - - - - - - - - MAGE-ML - - - - - - - - - 1.0 - MAGE-ML XML format for microarray expression data, standardised by MGED (now FGED). - - - - - - - - - - - - MAGE-TAB - - - - - - - - - MAGE-TAB textual format for microarray expression data, standardised by MGED (now FGED). 
- 1.0 - - - - - - - - - - - - GCDML - - - GCDML XML format for genome and metagenome metadata according to MIGS/MIMS/MIMARKS information standards, standardised by the Genomic Standards Consortium (GSC). - 1.0 - - - - - - - - - - - - GTrack - - - 1.0 - GTrack is an optimised tabular format for genome/sequence feature tracks unifying the power of other tabular formats (e.g. GFF3, BED, WIG). - - - - - - - - - - - - Biological pathway or network report format - - - - - - - - Data format for a report of information derived from a biological pathway or network. - beta12orEarlier - - - - - - - - - - Experiment annotation format - - - - - - - - beta12orEarlier - Data format for annotation on a laboratory experiment. - - - - - - - - - - Cytoband format - - - - - - - - - 1.2 - Cytoband format for chromosome cytobands. - Reflects a UCSC Browser DB table. - - - - - - - - - - - - CopasiML - - - - 1.2 - CopasiML, the native format of COPASI. - - - - - - - - - - - - CellML - - - CellML, the format for mathematical models of biological and other networks. - 1.2 - - - - - - - - - - - - - - PSI MI TAB (MITAB) - - - 1.2 - Tabular Molecular Interaction format (MITAB), standardised by HUPO PSI MI. - - - - - - - - - - - - PSI-PAR - - Protein affinity format (PSI-PAR), standardised by HUPO PSI MI. It is compatible with PSI MI XML (MIF) and uses the same XML Schema. - 1.2 - - - - - - - - - - - - mzML - - - mzML is the successor and unifier of the mzData format developed by PSI and mzXML developed at the Seattle Proteome Center. - 1.2 - mzML format for raw spectrometer output data, standardised by HUPO PSI MSS. - - - - - - - - - - - - Mass spectrometry data format - - - - - - - - Format for mass pectra and derived data, include peptide sequences etc. - 1.2 - - - - - - - - - - TraML - - - TraML (Transition Markup Language) is the format for mass spectrometry transitions, standardised by HUPO PSI MSS. 
- 1.2 - - - - - - - - - - - - mzIdentML - - - mzIdentML is the exchange format for peptides and proteins identified from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of proteomics search engines. - 1.2 - - - - - - - - - - - - mzQuantML - - - mzQuantML is the format for quantitation values associated with peptides, proteins and small molecules from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of quantitation software for proteomics. - 1.2 - - - - - - - - - - - - GelML - - - 1.2 - GelML is the format for describing the process of gel electrophoresis, standardised by HUPO PSI PS. - - - - - - - - - - - - spML - - - 1.2 - spML is the format for describing proteomics sample processing, other than using gels, prior to mass spectrometric protein identification, standardised by HUPO PSI PS. It may also be applicable for metabolomics. - - - - - - - - - - - - OWL Functional Syntax - - - A human-readable encoding for the Web Ontology Language (OWL). - 1.2 - - - - - - - - - - Manchester OWL Syntax - - - A syntax for writing OWL class expressions. - 1.2 - This format was influenced by the OWL Abstract Syntax and the DL style syntax. - - - - - - - - - - KRSS2 Syntax - - - This format is used in Protege 4. - A superset of the "Description-Logic Knowledge Representation System Specification from the KRSS Group of the ARPA Knowledge Sharing Effort". - 1.2 - - - - - - - - - - Turtle - - - The SPARQL Query Language incorporates a very similar syntax. - 1.2 - The Terse RDF Triple Language (Turtle) is a human-friendly serialization format for RDF (Resource Description Framework) graphs. - - - - - - - - - - N-Triples - - - N-Triples should not be confused with Notation 3 which is a superset of Turtle. - 1.2 - A plain text serialisation format for RDF (Resource Description Framework) graphs, and a subset of the Turtle (Terse RDF Triple Language) format. 
- - - - - - - - - - Notation3 - - - N3 - A shorthand non-XML serialization of Resource Description Framework model, designed with human-readability in mind. - - - - - - - - - - RDF/XML - - - - RDF - Resource Description Framework (RDF) XML format. - 1.2 - http://www.ebi.ac.uk/SWO/data/SWO_3000006 - RDF/XML is a serialization syntax for OWL DL, but not for OWL Full. - - - - - - - - - - OWL/XML - - - OWL ontology XML serialisation format. - 1.2 - OWL - - - - - - - - - - A2M - - - The A2M format is used as the primary format for multiple alignments of protein or nucleic-acid sequences in the SAM suite of tools. It is a small modification of FASTA format for sequences and is compatible with most tools that read FASTA. - 1.3 - - - - - - - - - - - - SFF - - - Standard flowgram format - Standard flowgram format (SFF) is a binary file format used to encode results of pyrosequencing from the 454 Life Sciences platform for high-throughput sequencing. - 1.3 - - - - - - - - - - - - MAP - - The MAP file describes SNPs and is used by the Plink package. - 1.3 - Plink MAP - - - - - - - - - - - PED - - Plink PED - 1.3 - The PED file describes individuals and genetic data and is used by the Plink package. - - - - - - - - - - - Individual genetic data format - - Data format for a metadata on an individual and their genetic data. - 1.3 - - - - - - - - - - PED/MAP - - - The PED/MAP file describes data used by the Plink package. - Plink PED/MAP - 1.3 - - - - - - - - - - - CT - - - File format of a CT (Connectivity Table) file from the RNAstructure package. - beta12orEarlier - Connect format - Connectivity Table file format - - - - - - - - - - - - SS - - - beta12orEarlier - XRNA old input style format. - - - - - - - - - - - RNAML - - - - RNA Markup Language. - beta12orEarlier - - - - - - - - - - - GDE - - - Format for the Genetic Data Environment (GDE). 
- beta12orEarlier - - - - - - - - - - - BLC - - 1.3 - Block file format - A multiple alignment in vertical format, as used in the AMPS (Alignment of Multiple Protein Sequences) package. - - - - - - - - - - - Data index format - - - - - - - - - 1.3 - - - - - - - - - - BAI - - - - - - - - 1.3 - BAM indexing format - - - - - - - - - - - HMMER2 - - HMMER profile HMM file for HMMER versions 2.x - 1.3 - - - - - - - - - - - HMMER3 - - 1.3 - HMMER profile HMM file for HMMER versions 3.x - - - - - - - - - - - PO - - EMBOSS simple sequence pair alignment format. - 1.3 - - - - - - - - - - - BLAST XML results format - - - XML format as produced by the NCBI Blast package - 1.3 - - - - - - - - - - CRAM - - - Reference-based compression of alignment format - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - 1.7 - - - - - - - - - - JSON - - 1.7 - JavaScript Object Notation format; a lightweight, text-based format to represent structured data using key-value pairs. - - - - - - - - - - EPS - - Encapsulated PostScript format - 1.7 - - - - - - - - - - GIF - - 1.7 - Graphics Interchange Format. - - - - - - - - - - xls - - - Microsoft Excel spreadsheet format. - Microsoft Excel format - 1.7 - - - - - - - - - - TSV - - Tabular format - http://filext.com/file-extension/CSV - http://www.iana.org/assignments/media-types/text/csv - Tabular data represented as tab-separated values in a text file. - 1.7 - http://filext.com/file-extension/TSV - CSV - - - - - - - - - - Gene expression data format - - true - 1.10 - 1.7 - Format of a file of gene expression data, e.g. a gene expression matrix or profile. - - - - - - - - - - Cytoscape input file format - - - Format of the Cytoscape input file, in which gene expression ratios or values are specified over one or more experiments.
- 1.7 - - - - - - - - - - ebwt - - - - - - - - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie index format - 1.7 - Bowtie format for indexed reference genome for "small" genomes. - - - - - - - - - - RSF - - http://www.molbiol.ox.ac.uk/tutorials/Seqlab_GCG.pdf - RSF-format files contain one or more sequences that may or may not be related. In addition to the sequence data, each sequence can be annotated with descriptive sequence information (from the GCG manual). - Rich sequence format. - 1.7 - GCG RSF - - - - - - - - - - GCG format variant - - - - 1.7 - Some format based on the GCG format. - - - - - - - - - - BSML - - - http://rothlab.ucdavis.edu/genhelp/chapter_2_using_sequences.html#_Creating_and_Editing_Single_Sequenc - Bioinformatics Sequence Markup Language format. - 1.7 - - - - - - - - - - ebwtl - - - - - - - - 1.7 - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie long index format - Bowtie format for indexed reference genome for "large" genomes. - - - - - - - - - - Ensembl variation file format - - - Ensembl standard format for variation data. - 1.8 - - - - - - - - - - - docx - - - 1.8 - Microsoft Word format - doc - Microsoft Word format. - - - - - - - - - - Document format - - Format of documents including word processor, spreadsheet and presentation. - 1.8 - - - - - - - - - - PDF - - - 1.8 - Portable Document Format - - - - - - - - - - Image format - - - - - - - - Format used for images and image metadata. - 1.9 - - - - - - - - - - DICOM format - - - 1.9 - Medical image format corresponding to the Digital Imaging and Communications in Medicine (DICOM) standard. - - - - - - - - - - - - - nii - - - Medical image and metadata format of the Neuroimaging Informatics Technology Initiative. - - - NIfTI-1 format - 1.9 - - - - - - - - - - - mhd - - - MetaImage format - 1.9 - Text-based tagged file format for medical images generated using the MetaImage software package.
- - - - - - - - - - - nrrd - - - 1.9 - Nearly Raw Raster Data format designed to support scientific visualization and image processing involving N-dimensional raster data. - - - - - - - - - - - R file format - - File format used for scripts written in the R programming language for execution within the R software environment, typically for statistical computation and graphics. - - 1.9 - - - - - - - - - - SPSS - - 1.9 - File format used for scripts for the Statistical Package for the Social Sciences. - - - - - - - - - - - MHT - MIME HTML format for Web pages, which can include external resources, including images, Flash animations and so on. - - EMBL entry format wrapped in HTML elements. - 1.9 - MHTML - - - - - - - - - - IDAT - - - - - - - - - Proprietary file format for (raw) BeadArray data used by genome-wide profiling platforms from Illumina Inc. This format is output directly from the scanner and stores summary intensities for each probe-type on an array. - 1.10 - - - - - - - - - - JPG - - - 1.10 - Joint Photographic Experts Group (JPEG) file format for lossy graphics files. - - Sequence of segments with markers. Each segment begins with a 0xFF byte followed by a marker type. - - - - - - - - - - - rcc - - - 1.10 - Reporter Code Count-A data file (.csv) output by the Nanostring nCounter Digital Analyzer, which contains gene sample information, probe information and probe counts. - - - - - - - - - - arff - - ARFF (Attribute-Relation File Format) is an ASCII text file format that describes a list of instances sharing a set of attributes. - 1.11 - This file format is for machine learning. - - - - - - - - - - - - afg - - - 1.11 - AFG is a single text-based file assembly format that holds read and consensus information together - - - - - - - - - - - - bedgraph - - - Holds a tab-delimited chromosome /start /end / datavalue dataset. - 1.11 - The bedGraph format allows display of continuous-valued data in track format.
This display type is useful for probability scores and transcriptome data - - - - - - - - - - - - bedstrict - - Browser Extensible Data (BED) format of sequence annotation track that strictly does not contain non-standard fields beyond the first 3 columns. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - 1.11 - - - - - - - - - - - - bed6 - - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 6 - BED file format where each feature is described by chromosome, start, end, name, score, and strand. - 1.11 - - - - - - - - - - - - bed12 - - 1.11 - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 12 - A BED file where each feature is described by all twelve columns. - - - - - - - - - - - - chrominfo - - - 1.11 - Tabular format of chromosome names and sizes used by Galaxy. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - - - - - - - - - - - - customtrack - - - 1.11 - Custom Sequence annotation track format used by Galaxy. - Used for tracks/track views within galaxy. - - - - - - - - - - - - csfasta - - - Color space FASTA format sequence variant. - 1.3 - FASTA format extended for color space information. - - - - - - - - - - - - hdf5 - - An HDF5 file appears to the user as a directed graph. The nodes of this graph are the higher-level HDF5 objects that are exposed by the HDF5 APIs: Groups, Datasets, Named datatypes. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. - 1.11 - h5 - Binary format used by Galaxy for hierarchical data. - - - - - - - - - - - - tiff - - - The TIFF format is perhaps the most versatile and diverse bitmap format in existence. 
Its extensible nature and support for numerous data compression schemes allow developers to customize the TIFF format to fit any peculiar data storage needs. - - A versatile bitmap format. - 1.11 - - - - - - - - - - - bmp - - - Standard bitmap storage format in the Microsoft Windows environment. - 1.11 - Although it is based on Windows internal bitmap data structures, it is supported by many non-Windows and non-PC applications. - - - - - - - - - - - im - - - IM is a format used by LabEye and other applications based on the IFUNC image processing library. - IFUNC library reads and writes most uncompressed interchange versions of this format. - - 1.11 - - - - - - - - - - - pcd - - - PCD was developed by Kodak. A PCD file contains five different resolution (ranging from low to high) of a slide or film negative. Due to it PCD is often used by many photographers and graphics professionals for high-end printed applications. - 1.11 - Photo CD format, which is the highest resolution format for images on a CD. - - - - - - - - - - - pcx - - - 1.11 - PCX is an image file format that uses a simple form of run-length encoding. It is lossless. - - - - - - - - - - - - ppm - - - The PPM format is a lowest common denominator color image file format. - - 1.11 - - - - - - - - - - - psd - - - 1.11 - PSD (Photoshop Document) is a proprietary file that allows the user to work with the images’ individual layers even after the file has been saved. - - - - - - - - - - - xbm - - - The XBM format was replaced by XPM for X11 in 1989. - 1.11 - X BitMap is a plain text binary image format used by the X Window System used for storing cursor and icon bitmaps used in the X GUI. - - - - - - - - - - - xpm - - - X PixMap (XPM) is an image file format used by the X Window System, it is intended primarily for creating icon pixmaps, and supports transparent pixels. - - 1.11 - Sequence of segments with markers. Begins with byte of 0xFF and follows by marker type. 
- - - - - - - - - - - rgb - - - RGB file format is the native raster graphics file format for Silicon Graphics workstations. - - 1.11 - - - - - - - - - - - pbm - - - The PBM format is a lowest common denominator monochrome file format. It serves as the common language of a large family of bitmap image conversion filters. - - 1.11 - - - - - - - - - - - pgm - - - It is designed to be extremely easy to learn and write programs for. - The PGM format is a lowest common denominator grayscale file format. - - 1.11 - - - - - - - - - - - PNG - - - 1.11 - png - PNG is a file format for image compression. - - It is expected to replace the Graphics Interchange Format (GIF). - - - - - - - - - - - SVG - - - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - Scalable Vector Graphics (SVG) is an XML-based vector image format for two-dimensional graphics with support for interactivity and animation. - svg - Scalable Vector Graphics - 1.11 - - - - - - - - - - - rast - - - Sun Raster is a raster graphics file format used on SunOS by Sun Microsystems - 1.11 - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - - - - - - - - - - - Sequence quality report format (text) - - - - - - - - - Textual report format for sequence quality for reports from sequencing machines. - 1.11 - - - - - - - - - - qual - - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - Phred quality scores are defined as a property which is logarithmically related to the base-calling error probabilities. - FASTQ format subset for Phred sequencing quality score data only (no sequences).
- - - - - - - - - - qualsolexa - - - Solexa/Illumina 1.0 format can encode a Solexa/Illumina quality score from -5 to 62 using ASCII 59 to 126 (although in raw read data Solexa scores from -5 to 40 only are expected) - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) for Solexa/Illumina 1.0 format. - - - - - - - - - - qualillumina - - - Starting in Illumina 1.5 and before Illumina 1.8, the Phred scores 0 to 2 have a slightly different meaning. The values 0 and 1 are no longer used and the value 2, encoded by ASCII 66 "B", is used also at the end of reads as a Read Segment Quality Control Indicator. - FASTQ format subset for Phred sequencing quality score data only (no sequences) from Illumina 1.5 and before Illumina 1.8. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qualsolid - - For SOLiD data, the sequence is in color space, except the first position. The quality values are those of the Sanger format. - FASTQ format subset for Phred sequencing quality score data only (no sequences) for SOLiD data. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qual454 - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) from 454 sequencers. - - - - - - - - - - ENCODE peak format - - 1.11 - Human ENCODE peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE narrow peak format - - 1.11 - Human ENCODE narrow peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE broad peak format - - 1.11 - Human ENCODE broad peak format. - - - - - - - - - - - - bgzip - - - BAM files are compressed using a variant of GZIP (GNU ZIP), into a format called BGZF (Blocked GNU Zip Format). - Blocked GNU Zip format. 
- 1.11 - - - - - - - - - - - tabix - - - TAB-delimited genome position file index format. - 1.11 - - - - - - - - - - - - Graph format - - Data format for graph data. - 1.11 - - - - - - - - - - xgmml - - XML-based format used to store graph descriptions within Galaxy. - 1.11 - - - - - - - - - - - sif - - 1.11 - SIF (simple interaction file) Format - a network/pathway format used for instance in cytoscape. - - - - - - - - - - - xlsx - - - 1.11 - MS Excel spreadsheet format consisting of a set of XML documents stored in a ZIP-compressed file. - - - - - - - - - - SQLite - - https://www.sqlite.org/fileformat2.html - Data format used by the SQLite database. - 1.11 - - - - - - - - - - GeminiSQLite - - https://gemini.readthedocs.org/en/latest/content/quick_start.html - 1.11 - Data format used by the SQLite database conformant to the Gemini schema. - - - - - - - - - - Index format - - - - - - - - - Format of a data index of some type. - 1.11 - - - - - - - - - - snpeffdb - - An index of a genome database, indexed for use by the snpeff tool. - 1.11 - - - - - - - - - - MAT - - - - - - - - MATLAB file format - Binary format used by MATLAB files to store workspace variables. - 1.12 - MAT file format - .mat file format - - - - - - - - - - - netCDF - - 1.12 - ANDI-MS - Format used by netCDF software library for writing and reading chromatography-MS data files. - - - - - - - - - - - MGF - - Files includes *m*/*z*, intensity pairs separated by headers; headers can contain a bit more information, including search engine instructions. - Mascot Generic Format. Encodes multiple MS/MS spectra in a single file. - 1.12 - - - - - - - - - - dta - - Each file contains one header line for the known or assumed charge and the mass of the precursor peptide ion, calculated from the measured *m*/*z* and the charge. This one line was then followed by all the *m*/*z*, intensity pairs that represent the spectrum. - 1.12 - Spectral data format file where each spectrum is written to a separate file. 
- - - - - - - - - - pkl - - Spectral data file similar to dta. - Differ from .dta only in subtleties of the header line format and content and support the added feature of being able to. - 1.12 - - - - - - - - - - mzXML - - 1.12 - https://dx.doi.org/10.1038%2Fnbt1031 - Common file format for proteomics mass spectrometric data developed at the Seattle Proteome Center/Institute for Systems Biology. - - - - - - - - - - pepXML - - http://sashimi.sourceforge.net/schema_revision/pepXML/pepXML_v118.xsd - Open data format for the storage, exchange, and processing of peptide sequence assignments of MS/MS scans, intended to provide a common data output format for many different MS/MS search engines and subsequent peptide-level analyses. - 1.12 - - - - - - - - - - GPML - - - 1.12 - Graphical Pathway Markup Language (GPML) is an XML format used - for exchanging biological pathways. - - - - - - - - - - - K-mer countgraph - - - 1.12 - oxlicg - http://www.iana.org/assignments/media-types/application/vnd.oxli.countgraph - A list of k-mers and their occurrences in a dataset. Can also be used as an implicit De Bruijn graph. - - - - - - - - - - - mzTab - - - 1.13 - mzTab is a tab-delimited format for mass spectrometry-based proteomics and metabolomics results. - - - - - - - - - - - - - imzML - - - - imzML is a data format for mass spectrometry imaging data. NB.: See comment. - 1.13 - imzML|ibd - Data is recorded in 2 files: '.imzML' is a metadata XML file based on mzML by HUPO-PSI, and '.ibd' is a binary file containing the mass spectra. - - - - - - - - - - - - - qcML - - - - The focus of qcML is towards mass spectrometry based proteomics, but the format is suitable for metabolomics and sequencing as well. - qcML is an XML format for quality-related data of mass spectrometry and other high-throughput measurements.
- 1.13 - - - - - - - - - - - - PRIDE XML - - - - 1.13 - PRIDE XML is an XML format for mass spectra, peptide and protein identifications, and metadata about a corresponding measurement, sample, experiment. - - - - - - - - - - - - SED-ML - - - Simulation Experiment Description Markup Language (SED-ML) is an XML format for encoding simulation setups, according to the MIASE (Minimum Information About a Simulation Experiment) requirements. - 1.13 - - - - - - - - - - - - - - COMBINE OMEX - - - - 1.13 - An OMEX file is a ZIP container that includes a manifest file, listing the content of the archive, an optional metadata file adding information about the archive and its content, and the files describing the model. OMEX is one of the standardised formats within COMBINE (Computational Modeling in Biology Network). - Open Modeling EXchange format (OMEX) is a ZIPped format for encapsulating all information necessary for a modeling and simulation project in systems biology. - - - - - - - - - - - - - ISA-TAB - - - - ISA-TAB is based on MAGE-TAB. Other than tabular, the ISA model can also be represented in RDF, and in JSON (compliable with a set of defined JSON Schemata). - The Investigation / Study / Assay (ISA) tab-delimited (TAB) format incorporates metadata from -experiments employing a combination of technologies. - 1.13 - ISA-Tab - - - - - - - - - - - - SBtab - - - 1.13 - SBtab is a tabular format for biochemical network models. - - - - - - - - - - - - - BCML - - - 1.13 - Biological Connection Markup Language (BCML) is an XML format for biological pathways. - - - - - - - - - - - - BDML - - Biological Dynamics Markup Language (BDML) is an XML format for quantitative data describing biological dynamics. - 1.13 - - - - - - - - - - - - - BEL - - 1.13 - Biological Expression Language (BEL) is a textual format for representing scientific findings in life sciences in a computable form. 
- - - - - - - - - - - - SBGN-ML - - - SBGN-ML is an XML format for Systems Biology Graphical Notation (SBGN) diagrams of biological pathways or networks. - 1.13 - - - - - - - - - - - - AGP - - - 1.13 - AGP is a tabular format for a sequence assembly (a contig, a scaffold/supercontig, or a chromosome). - - - - - - - - - - - - PS - - PostScript - PostScript format - 1.13 - - - - - - - - - - SRA format - - SRA archive format (SRA) is the archive format used for input to the NCBI Sequence Read Archive. - SRA archive format - 1.13 - SRA - - - - - - - - - - - VDB - - VDB ('vertical database') is the format (SRA) is the native format used for export from the NCBI Sequence Read Archive. - SRA native format - 1.13 - SRA - - - - - - - - - - - Tabix index file format - - - - - - - - 1.3 - Index file format used by the samtools package to index TAB-delimited genome position files. - - - - - - - - - - - sequin - - A five-column, tab-delimited table of feature locations and qualifiers for importing annotation into an existing Sequin submission (an NCBI tool for submitting and updating GenBank entries). - 1.13 - - - - - - - - - - Operation - - - A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs). - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - Computational tool - Function - http://purl.org/biotop/biotop.owl#Function - http://www.ifomis.org/bfo/1.1/snap#Function - http://en.wikipedia.org/wiki/Function_(mathematics) - Computational method - http://semanticscience.org/resource/SIO_000017 - http://www.ebi.ac.uk/swo/SWO_0000003 - Mathematical operation - sumo:Function - beta12orEarlier - Process - Computational operation - Computational subroutine - http://semanticscience.org/resource/SIO_000649 - Special cases are: a) An operation that consumes no input (has no input arguments). Such operation is either a constant function, or an operation depending only on the underlying state. 
b) An operation that may modify the underlying state but has no output. c) The singular-case operation with no input or output, that still may modify the underlying state. - http://www.ifomis.org/bfo/1.1/span#Process - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - Computational procedure - Mathematical function - Lambda abstraction - Function (programming) - http://www.onto-med.de/ontologies/gfo.owl#Process - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - http://wsio.org/operation_001 - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#process - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Function - http://en.wikipedia.org/wiki/Function_(computer_science) - http://en.wikipedia.org/wiki/Subroutine - - - - - Process - Process can have a function (as its quality/attribute), and can also perform an operation with inputs and outputs. - - - - - Computational tool - Computational tool provides one or more operations. - - - - - Operation is a function that is computational. It typically has input(s) and output(s), which are always data. - Function - - - - - - - - - - Query and retrieval - - - - - - - - - - - - - - beta12orEarlier - Query - Search or query a data resource and retrieve entries and / or annotation. - Database retrieval - - - - - - - - - - Data retrieval (database cross-reference) - - beta12orEarlier - Search database to retrieve all relevant references to a particular entity or entry. - true - beta13 - - - - - - - - - - Annotation - - - - - - - - - - - - - - Annotate an entity (typically a biological or biomedical database entity) with terms from a controlled vocabulary. - beta12orEarlier - This is a broad concept and is used a placeholder for other, more specific concepts. - - - - - - - - - - Indexing - - - - - - - - Data indexing - beta12orEarlier - Generate an index of (typically a file of) biological data. 
- Database indexing - - - - - - - - - - Data index analysis - - Database index analysis - Analyse an index of biological data. - beta12orEarlier - true - 1.6 - - - - - - - - - - Annotation retrieval (sequence) - - true - beta12orEarlier - Retrieve basic information about a molecular sequence. - beta12orEarlier - - - - - - - - - - Sequence generation - - - beta12orEarlier - Generate a molecular sequence by some means. - - - - - - - - - - Sequence editing - - - Edit or change a molecular sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence merging - - beta12orEarlier - Merge two or more (typically overlapping) molecular sequences. - Sequence splicing - - - - - - - - - - Sequence conversion - - - Convert a molecular sequence from one type to another. - beta12orEarlier - - - - - - - - - - Sequence complexity calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate sequence complexity, for example to find low-complexity regions in sequences. - - - - - - - - - - Sequence ambiguity calculation - - - - - - - - - - - - - - Calculate sequence ambiguity, for example identity regions in protein or nucleotide sequences with many ambiguity codes. - beta12orEarlier - - - - - - - - - - Sequence composition calculation - - - - - - - - - - - - - - - beta12orEarlier - Calculate character or word composition or frequency of a molecular sequence. - - - - - - - - - - Repeat sequence analysis - - - - - - - - Find and/or analyse repeat sequences in (typically nucleotide) sequences. - beta12orEarlier - Repeat sequences include tandem repeats, inverted or palindromic repeats, DNA microsatellites (Simple Sequence Repeats or SSRs), interspersed repeats, maximal duplications and reverse, complemented and reverse complemented repeats etc. Repeat units can be exact or imperfect, in tandem or dispersed, of specified or unspecified length. 
- - - - - - - - - - Sequence motif discovery - - - - - - - - - - - - - - - Motifs and patterns might be conserved or over-represented (occur with improbable frequency). - beta12orEarlier - Discover new motifs or conserved patterns in sequences or sequence alignments (de-novo discovery). - Motif discovery - - - - - - - - - - Sequence motif recognition - - - - - - - - - - - - - - - beta12orEarlier - Sequence signature recognition - Motif scanning - Motif search - Sequence motif search - Protein secondary database search - Motif detection - Sequence signature detection - Sequence profile search - Find (scan for) known motifs, patterns and regular expressions in molecular sequence(s). - Sequence motif detection - Motif recognition - - - - - - - - - - Sequence motif comparison - - - - - - - - - - - - - - - beta12orEarlier - Find motifs shared by molecular sequences. - - - - - - - - - - Transcription regulatory sequence analysis - - beta12orEarlier - beta13 - Analyse the sequence, conformational or physicochemical properties of transcription regulatory elements in DNA sequences. - For example transcription factor binding sites (TFBS) analysis to predict accessibility of DNA to binding factors. - true - - - - - - - - - - Conserved transcription regulatory sequence identification - - - For example cross-species comparison of transcription factor binding sites (TFBS). Methods might analyse co-regulated or co-expressed genes, or sets of oppositely expressed genes. - beta12orEarlier - Identify common, conserved (homologous) or synonymous transcriptional regulatory motifs (transcription factor binding sites). - - - - - - - - - - Protein property calculation (from structure) - - - - - - - - - - - - - - - This might be a residue-level search for properties such as solvent accessibility, hydropathy, secondary structure, ligand-binding etc. - Extract, calculate or predict non-positional (physical or chemical) properties of a protein from processing a protein (3D) structure. 
- beta12orEarlier - Protein structural property calculation - - - - - - - - - - Protein flexibility and motion analysis - - - beta12orEarlier - Analyse flexibility and motion in protein structure. - Use this concept for analysis of flexible and rigid residues, local chain deformability, regions undergoing conformational change, molecular vibrations or fluctuational dynamics, domain motions or other large-scale structural transitions in a protein structure. - - - - - - - - - - Protein structural motif recognition - - - - - - - - - Identify or screen for 3D structural motifs in protein structure(s). - This includes conserved substructures and conserved geometry, such as spatial arrangement of secondary structure or protein backbone. Methods might use structure alignment, structural templates, searches for similar electrostatic potential and molecular surface shape, surface-mapping of phylogenetic information etc. - beta12orEarlier - Protein structural feature identification - - - - - - - - - - Protein domain recognition - - - - - - - - - beta12orEarlier - Identify structural domains in a protein structure from first principles (for example calculations on structural compactness). - - - - - - - - - - Protein architecture analysis - - beta12orEarlier - Analyse the architecture (spatial arrangement of secondary structure) of protein structure(s). - - - - - - - - - - Residue interaction calculation - - - - - - - - - WHATIF: SymShellTenXML - WHATIF:ListContactsRelaxed - WHATIF: SymShellTwoXML - WHATIF:ListSideChainContactsRelaxed - beta12orEarlier - WHATIF:ListSideChainContactsNormal - WHATIF:ListContactsNormal - Calculate or extract inter-atomic, inter-residue or residue-atom contacts, distances and interactions in protein structure(s). 
- WHATIF: SymShellFiveXML - WHATIF: SymShellOneXML - - - - - - - - - - Protein geometry calculation - - - - - - - - WHATIF:ResidueTorsions - beta12orEarlier - Backbone torsion angle calculation - WHATIF:CysteineTorsions - Calculate, visualise or analyse phi/psi angles of a protein structure. - WHATIF:ResidueTorsionsBB - WHATIF:ShowTauAngle - Torsion angle calculation - Tau angle calculation - Cysteine torsion angle calculation - - - - - - - - - - Protein property calculation - - - - This includes methods to render and visualise the properties of a protein sequence. - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - beta12orEarlier - Protein property rendering - - - - - - - - - - Peptide immunogenicity prediction - - - - - - - - - - - - - - - Immunogenicity prediction - beta12orEarlier - This is usually done in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. - This includes methods that generate a graphical rendering of antigenicity of a protein, such as a Hopp and Woods plot. - Hopp and Woods plotting - Predict antigenicity, allergenicity / immunogenicity, allergic cross-reactivity etc of peptides and proteins. - MHC peptide immunogenicity prediction - - - - - - - - - - Sequence feature detection - - - - - - - - - - - - - - - Sequence feature prediction - Predict, recognise and identify positional features in molecular sequences such as key functional sites or regions. - Sequence feature recognition - beta12orEarlier - Motif database search - SO:0000110 - - - - - - - - - - Data retrieval (feature table) - - beta13 - Extract a sequence feature table from a sequence database entry. 
- true - beta12orEarlier - - - - - - - - - - Feature table query - - 1.6 - beta12orEarlier - true - Query the features (in a feature table) of molecular sequence(s). - - - - - - - - - - Sequence feature comparison - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Compare the feature tables of two or more molecular sequences. - Feature comparison - Feature table comparison - - - - - - - - - - Data retrieval (sequence alignment) - - beta12orEarlier - true - beta13 - Display basic information about a sequence alignment. - - - - - - - - - - Sequence alignment analysis - - - - - - - - Analyse a molecular sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence alignment comparison - - - Compare (typically by aligning) two molecular sequence alignments. - beta12orEarlier - See also 'Sequence profile alignment'. - - - - - - - - - - Sequence alignment conversion - - - beta12orEarlier - Convert a molecular sequence alignment from one type to another (for example amino acid to coding nucleotide sequence). - - - - - - - - - - Nucleic acid property processing - - beta12orEarlier - true - Process (read and / or write) physicochemical property data of nucleic acids. - beta13 - - - - - - - - - - Nucleic acid property calculation - - - - - - - - - beta12orEarlier - Calculate or predict physical or chemical properties of nucleic acid molecules, including any non-positional properties of the molecular sequence. - - - - - - - - - - Splice transcript prediction - - - - - - - - beta12orEarlier - Predict splicing alternatives or transcript isoforms from analysis of sequence data. - - - - - - - - - - Frameshift detection - - - - - - - - - Detect frameshifts in DNA sequences, including frameshift sites and signals, and frameshift errors from sequencing projects. - Frameshift error detection - beta12orEarlier - Methods include sequence alignment (if related sequences are available) and word-based sequence comparison. 
- - - - - - - - - - Vector sequence detection - - - beta12orEarlier - Detect vector sequences in nucleotide sequence, typically by comparison to a set of known vector sequences. - - - - - - - - - - Protein secondary structure prediction - - - - Methods might use amino acid composition, local sequence information, multiple sequence alignments, physicochemical features, estimated energy content, statistical algorithms, hidden Markov models, support vector machines, kernel machines, neural networks etc. - Predict secondary structure of protein sequences. - Secondary structure prediction (protein) - beta12orEarlier - - - - - - - - - - Protein super-secondary structure prediction - - - - - - - - beta12orEarlier - Predict super-secondary structure of protein sequence(s). - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - - - - - - - - - - Transmembrane protein prediction - - - Predict and/or classify transmembrane proteins or transmembrane (helical) domains or regions in protein sequences. - beta12orEarlier - - - - - - - - - - Transmembrane protein analysis - - - - - - - - beta12orEarlier - Analyse transmembrane protein(s), typically by processing sequence and / or structural data, and write an informative report for example about the protein and its transmembrane domains / regions. - Use this (or child) concept for analysis of transmembrane domains (buried and exposed faces), transmembrane helices, helix topology, orientation, inter-helical contacts, membrane dipping (re-entrant) loops and other secondary structure etc. Methods might use pattern discovery, hidden Markov models, sequence alignment, structural profiles, amino acid property analysis, comparison to known domains or some combination (hybrid methods). - - - - - - - - - - Structure prediction - - - - - - - - - - - - - - - Predict tertiary structure of a molecular (biopolymer) sequence. 
- beta12orEarlier - - - - - - - - - - Residue interaction prediction - - - - - - - - - Methods usually involve multiple sequence alignment analysis. - Predict contacts, non-covalent interactions and distance (constraints) between amino acids in protein sequences. - beta12orEarlier - - - - - - - - - - Protein interaction raw data analysis - - - - - - - - - - - - - - Analyse experimental protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. - beta12orEarlier - - - - - - - - - - Protein-protein interaction prediction (from protein sequence) - - beta12orEarlier - 1.12 - true - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein sequences. - - - - - - - - - - Protein-protein interaction prediction (from protein structure) - - true - 1.12 - beta12orEarlier - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein structures. - - - - - - - - - - Protein interaction network analysis - - - - - - - - - - - - - - - beta12orEarlier - Analyse a network of protein interactions. - - - - - - - - - - Protein interaction network comparison - - - beta12orEarlier - Compare two or more networks of protein interactions. - - - - - - - - - - RNA secondary structure prediction - - - - - - - - - - Predict RNA secondary structure (for example knots, pseudoknots, alternative structures etc). - beta12orEarlier - Methods might use RNA motifs, predicted intermolecular contacts, or RNA sequence-structure compatibility (inverse RNA folding). - - - - - - - - - - Nucleic acid folding analysis - - - - - - - - - - beta12orEarlier - Analyse some aspect of RNA/DNA folding, typically by processing sequence and/or structural data. 
- Nucleic acid folding modelling - Nucleic acid folding prediction - Nucleic acid folding - - - - - - - - - - Data retrieval (restriction enzyme annotation) - - beta13 - Restriction enzyme information retrieval - true - Retrieve information on restriction enzymes or restriction enzyme sites. - beta12orEarlier - - - - - - - - - - Genetic marker identification - - true - beta12orEarlier - beta13 - Identify genetic markers in DNA sequences. - A genetic marker is any DNA sequence of known chromosomal location that is associated with and specific to a particular gene or trait. This includes short sequences surrounding a SNP, Sequence-Tagged Sites (STS) which are well suited for PCR amplification, a longer minisatellites sequence etc. - - - - - - - - - - Genetic mapping - - - - - - - - - beta12orEarlier - QTL mapping - This includes mapping of the genetic architecture of dynamic complex traits (functional mapping), e.g. by characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Linkage mapping - Genetic map generation - Mapping involves ordering genetic loci along a chromosome and estimating the physical distance between loci. A genetic map shows the relative (not physical) position of known genes and genetic markers. - Generate a genetic (linkage) map of a DNA sequence (typically a chromosome) showing the relative positions of genetic markers based on estimation of non-physical distances. - Genetic map construction - Functional mapping - - - - - - - - - - Linkage analysis - - - - - - - - - - - - - - beta12orEarlier - For example, estimate how close two genes are on a chromosome by calculating how often they are transmitted together to an offspring, ascertain whether two genes are linked and parental linkage, calculate linkage map distance etc. - Analyse genetic linkage. - - - - - - - - - - Codon usage table generation - - - - - - - - - Calculate codon usage statistics and create a codon usage table. 
- beta12orEarlier - Codon usage table construction - - - - - - - - - - Codon usage table comparison - - - beta12orEarlier - Compare two or more codon usage tables. - - - - - - - - - - Codon usage analysis - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - synon: Codon usage data analysis - Process (read and / or write) codon usage data, e.g. analyse codon usage tables or codon usage in molecular sequences. - synon: Codon usage table analysis - - - - - - - - - - Base position variability plotting - - - - - - - - - - - - - - - Identify and plot third base position variability in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Sequence word comparison - - Find exact character or word matches between molecular sequences without full sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence distance matrix generation - - - - - - - - - - - - - - - Sequence distance matrix construction - Phylogenetic distance matrix generation - beta12orEarlier - Calculate a sequence distance matrix or otherwise estimate genetic distances between molecular sequences. - - - - - - - - - - Sequence redundancy removal - - - - - - - - beta12orEarlier - Compare two or more molecular sequences, identify and remove redundant sequences based on some criteria. - - - - - - - - - - Sequence clustering - - - - - - - - - - The clusters may be output or used internally for some other purpose. - Sequence cluster construction - beta12orEarlier - Build clusters of similar sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - Sequence cluster generation - - - - - - - - - - Sequence alignment - - - - - - - - - - Sequence alignment construction - beta12orEarlier - Align (identify equivalent sites within) molecular sequences. 
- Sequence alignment generation - Sequence alignment computation - - - - - - - - - - Hybrid sequence alignment construction - - Hybrid sequence alignment - true - beta13 - beta12orEarlier - Align two or more molecular sequences of different types (for example genomic DNA to EST, cDNA or mRNA). - Hybrid sequence alignment generation - - - - - - - - - - Structure-based sequence alignment - - Sequence alignment generation (structure-based) - Structure-based sequence alignment construction - beta12orEarlier - Sequence alignment (structure-based) - Structure-based sequence alignment generation - Align molecular sequences using sequence and structural information. - - - - - - - - - - Structure alignment - - - - - - - - - - Align (superimpose) molecular tertiary structures. - Structure alignment generation - Structure alignment construction - beta12orEarlier - Multiple structure alignment construction - Multiple structure alignment generation - - - - - - - - - - Sequence profile generation - - - - - - - - - - - - - - - - - - - - - Sequence profile construction - beta12orEarlier - Generate some type of sequence profile (for example a hidden Markov model) from a sequence alignment. - - - - - - - - - - 3D profile generation - - - - - - - - - - - - - - - - - - - - - Structural profile generation - Generate some type of structural (3D) profile or template from a structure or structure alignment. - Structural profile construction - beta12orEarlier - - - - - - - - - - Profile-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence profile alignment - beta12orEarlier - See also 'Sequence alignment comparison'. - Sequence profile alignment construction - Align sequence profiles (representing sequence alignments). 
- Sequence profile alignment generation - - - - - - - - - - 3D profile-to-3D profile alignment - - - - - - - - - - - - - - beta12orEarlier - 3D profile alignment (multiple) - 3D profile alignment - Multiple 3D profile alignment construction - Structural profile alignment construction (multiple) - Structural profile alignment - Structural profile alignment generation - Structural profile alignment construction - Align structural (3D) profiles or templates (representing structures or structure alignments). - - - - - - - - - - Sequence-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence-profile alignment construction - Sequence-profile alignment generation - beta12orEarlier - Align molecular sequence(s) to sequence profile(s). - Sequence-profile alignment - A sequence profile typically represents a sequence alignment. Methods might perform one-to-one, one-to-many or many-to-many comparisons. - - - - - - - - - - Sequence-to-3D-profile alignment - - - - - - - - - - - - - - - beta12orEarlier - Sequence-3D profile alignment construction - Align molecular sequence(s) to structural (3D) profile(s) or template(s) (representing a structure or structure alignment). - Sequence-3D profile alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Sequence-3D profile alignment - - - - - - - - - - Protein threading - - - - - - - - - - - - - - - beta12orEarlier - Align molecular sequence to structure in 3D space (threading). - Use this concept for methods that evaluate sequence-structure compatibility by assessing residue interactions in 3D. Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Sequence-structure alignment - - - - - - - - - - Protein fold recognition - - - - - beta12orEarlier - Protein domain prediction - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - Recognize (predict and identify) known protein structural domains or folds in protein sequence(s). - Protein fold prediction - - - - - - - - - - Metadata retrieval - - - - - - - - Data retrieval (documentation) - Search for and retrieve data concerning or describing some core data, as distinct from the primary data that is being described. - Data retrieval (metadata) - beta12orEarlier - This includes documentation, general information and other metadata on entities such as databases, database entries and tools. - - - - - - - - - - Literature search - - - - - - - - - - - - - - beta12orEarlier - Query the biomedical and informatics literature. - - - - - - - - - - Text mining - - - - - - - - - - - - - - - - - - - - Text data mining - beta12orEarlier - Process and analyse text (typically the biomedical and informatics literature) to extract information from it. - - - - - - - - - - Virtual PCR - - - - - - - - beta12orEarlier - Perform in-silico (virtual) PCR. - - - - - - - - - - PCR primer design - - - - - - - - - - - - - - - - - - - - This includes predicting primers based on gene structure, promoters, exon-exon junctions, predicting primers that are conserved across multiple genomes or species, primers for gene transcription profiling, for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs), for large scale sequencing, or for methylation PCRs.
- PCR primer design (based on gene structure) - PCR primer design (for methylation PCRs) - beta12orEarlier - PCR primer design (for large scale sequencing) - PCR primer prediction - Primer design involves predicting or selecting primers that are specific to a provided PCR template. Primers can be designed with certain properties such as size of product desired, primer size etc. The output might be a minimal or overlapping primer set. - PCR primer design (for conserved primers) - Design or predict oligonucleotide primers for PCR and DNA amplification etc. - PCR primer design (for gene transcription profiling) - PCR primer design (for genotyping polymorphisms) - - - - - - - - - - Microarray probe design - - - - - - - - - - - - - - - - - - - - - - - - - - - Predict and/or optimize oligonucleotide probes for DNA microarrays, for example for transcription profiling of genes, or for genomes and gene families. - beta12orEarlier - Microarray probe prediction - - - - - - - - - - Sequence assembly - - - - - - - - - - - - - - - beta12orEarlier - For example, assemble overlapping reads from paired-end sequencers into contigs (a contiguous sequence corresponding to read overlaps). Or assemble contigs, for example ESTs and genomic DNA fragments, depending on the detected fragment overlaps. - Combine (align and merge) overlapping fragments of a DNA sequence to reconstruct the original sequence. - - - - - - - - - - Microarray data standardization and normalization - - - - - - - - - - - - - - - beta12orEarlier - Standardize or normalize microarray data. - This includes statistical analysis, for example of variability amongst microarrays experiments, comparison of heterogeneous microarray platforms etc. - - - - - - - - - - Sequencing-based expression profile data processing - - Process (read and / or write) SAGE, MPSS or SBS experimental data. 
- true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gene expression profile clustering - - - - - - - - - beta12orEarlier - Perform cluster analysis of gene expression (microarray) data, for example clustering of similar gene expression profiles. - - - - - - - - - - Gene expression profiling - - - - - - - - - Expression profiling - Gene expression profile construction - Functional profiling - Generate a gene expression profile or pattern, for example from microarray data. - beta12orEarlier - Gene expression profile generation - - - - - - - - - - Gene expression profile comparison - - - - - - - - - beta12orEarlier - Compare gene expression profiles or patterns. - - - - - - - - - - Functional profiling - - true - beta12orEarlier - Interpret (in functional terms) and annotate gene expression data. - beta12orEarlier - - - - - - - - - - EST and cDNA sequence analysis - - Analyse EST or cDNA sequences. - For example, identify full-length cDNAs from EST sequences or detect potential EST antisense transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Structural genomics target selection - - beta12orEarlier - Identify and select targets for protein structural determination. - beta12orEarlier - Methods will typically navigate a graph of protein families of known structure. - true - - - - - - - - - - Protein secondary structure assignment - - - - - - - - - - - - - - beta12orEarlier - Assign secondary structure from protein coordinate or experimental data. - - - - - - - - - - Protein structure assignment - - - - - - - - - - - - - - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw experimental data. - - - - - - - - - - Protein model validation - - - - - - - - - - - - - - - WHATIF: UseResidueDB - Evaluate the quality or correctness of a protein three-dimensional model. - This includes methods that calculate poor quality residues.
The scoring function to identify poor quality residues may consider residues with bad atoms or atoms with high B-factor, residues in the N- or C-terminal position, adjacent to an unstructured residue, non-canonical residues, glycine and proline (or adjacent to these such residues). - Model validation might involve checks for atomic packing, steric clashes (bumps), volume irregularities, agreement with electron density maps, number of amino acid residues, percentage of residues with missing or bad atoms, irregular Ramachandran Z-scores, irregular Chi-1 / Chi-2 normality scores, RMS-Z score on bonds and angles etc. - Residue validation - WHATIF: CorrectedPDBasXML - Protein structure validation - WHATIF: UseFileDB - The PDB file format has had difficulties, inconsistencies and errors. Corrections can include identifying a meaningful sequence, removal of alternate atoms, correction of nomenclature problems, removal of incomplete residues and spurious waters, addition or removal of water, modelling of missing side chains, optimisation of cysteine bonds, regularisation of bond lengths, bond angles and planarities etc. - beta12orEarlier - - - - - - - - - - Molecular model refinement - - - Protein model refinement - WHATIF: CorrectedPDBasXML - beta12orEarlier - Refine (after evaluation) a model of a molecular structure (typically a protein structure) to reduce steric clashes, volume irregularities etc. - - - - - - - - - - Phylogenetic tree generation - - - - - - - - - - - - - - - Phylogenetic trees are usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. - Phylogenetic tree construction - Construct a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis - - - - - - - - beta12orEarlier - Analyse an existing phylogenetic tree or trees, typically to detect features or make predictions. - - - - - - - - - - Phylogenetic tree comparison - - - beta12orEarlier - Compare two or more phylogenetic trees. 
- For example, to produce a consensus tree, subtrees, supertrees, calculate distances between trees or test topological similarity between trees (e.g. a congruence index) etc. - - - - - - - - - - Phylogenetic tree editing - - - - - - - - - - - - - - - Edit a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic footprinting / shadowing - - - - - - - - A phylogenetic 'shadow' represents the additive differences between individual sequences. By masking or 'shadowing' variable positions a conserved sequence is produced with few or none of the variations, which is then compared to the sequences of interest to identify significant regions of conservation. - beta12orEarlier - Infer a phylogenetic tree by comparing orthologous sequences in different species, particularly many closely related species (phylogenetic shadowing). - - - - - - - - - - Protein folding simulation - - beta12orEarlier - Simulate the folding of a protein. - - - - - - - - - - Protein folding pathway prediction - - - Predict the folding pathway(s) or non-native structural intermediates of a protein. - beta12orEarlier - - - - - - - - - - Protein SNP mapping - - true - beta12orEarlier - Map and model the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). - 1.12 - - - - - - - - - - Protein modelling (mutation) - - - - - - - - - - - - - - - Protein SNP mapping - Protein mutation modelling - Predict the effect of point mutation on a protein structure, in terms of structural effects and protein folding, stability and function. - Rotamer likelihood prediction - beta12orEarlier - This includes 1) rotamer likelihood prediction: the prediction of rotamer likelihoods for all 20 amino acid types at each position in a protein structure, where output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods.
2) Protein SNP mapping, which maps and models the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). Methods might predict silent or pathological mutations. - - - - - - - - - - Immunogen design - - true - Design molecules that elicit an immune response (immunogens). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Zinc finger prediction - - - - - - - - - - - - - - Predict and optimise zinc finger protein domains for DNA/RNA binding (for example for transcription factors and nucleases). - beta12orEarlier - - - - - - - - - - Enzyme kinetics calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate Km, Vmax and derived data for an enzyme reaction. - - - - - - - - - - Formatting - - beta12orEarlier - Reformat a file of data (or equivalent entity in memory). - Format conversion - File formatting - Reformatting - File reformatting - File format conversion - - - - - - - - - - Format validation - - Test and validate the format and content of a data file. - File format validation - beta12orEarlier - - - - - - - - - - Visualisation - - - - - - - - - - - - - - - - - - - - Visualization - beta12orEarlier - Visualise, plot or render (graphically) biomolecular data such as molecular sequences or structures. - Rendering - - - - - - - - - - Sequence database search - - - - - - - - - Search a sequence database by sequence comparison and retrieve similar sequences. - -sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - This excludes direct retrieval methods (e.g. the dbfetch program). - - - - - - - - - - Structure database search - - - - - - - - beta12orEarlier - Search a tertiary structure database, typically by sequence and/or structure comparison, or some other means, and retrieve structures and associated data.
- - - - - - - - - - Protein secondary database search - - 1.8 - beta12orEarlier - true - Search a secondary protein database (of classification information) to assign a protein sequence(s) to a known protein family or group. - - - - - - - - - - Motif database search - - beta12orEarlier - Screen a sequence against a motif or pattern database. - true - 1.8 - - - - - - - - - - Sequence profile database search - - true - beta12orEarlier - Search a database of sequence profiles with a query sequence. - 1.4 - - - - - - - - - - Transmembrane protein database search - - true - beta12orEarlier - Search a database of transmembrane proteins, for example for sequence or structural similarities. - beta12orEarlier - - - - - - - - - - Sequence retrieval (by code) - - Query a database and retrieve sequences with a given entry code or accession number. - true - 1.6 - beta12orEarlier - - - - - - - - - - Sequence retrieval (by keyword) - - true - Query a database and retrieve sequences containing a given keyword. - beta12orEarlier - 1.6 - - - - - - - - - - Sequence similarity search - - - Structure database search (by sequence) - Sequence database search (by sequence) - beta12orEarlier - Search a sequence database and retrieve sequences that are similar to a query sequence. - - - - - - - - - - Sequence database search (by motif or pattern) - - 1.8 - Search a sequence database and retrieve sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - true - - - - - - - - - - Sequence database search (by amino acid composition) - - true - Search a sequence database and retrieve sequences of a given amino acid composition. - 1.6 - beta12orEarlier - - - - - - - - - - Sequence database search (by property) - - Search a sequence database and retrieve sequences with a specified property, typically a physicochemical or compositional property. 
- beta12orEarlier - - - - - - - - - - Sequence database search (by sequence using word-based methods) - - beta12orEarlier - Word-based methods (for example BLAST, gapped BLAST, MEGABLAST, WU-BLAST etc.) are usually quicker than alignment-based methods. They may or may not handle gaps. - 1.6 - true - Sequence similarity search (word-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a word-based method. - - - - - - - - - - Sequence database search (by sequence using profile-based methods) - - true - Sequence similarity search (profile-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a sequence profile-based method, or with a supplied profile as query. - beta12orEarlier - This includes tools based on PSI-BLAST. - 1.6 - - - - - - - - - - Sequence database search (by sequence using local alignment-based methods) - - Search a sequence database for sequences that are similar to a query sequence using a local alignment-based method. - 1.6 - beta12orEarlier - true - Sequence similarity search (local alignment-based methods) - This includes tools based on the Smith-Waterman algorithm or FASTA. - - - - - - - - - - Sequence database search (by sequence using global alignment-based methods) - - This includes tools based on the Needleman and Wunsch algorithm. - Search sequence(s) or a sequence database for sequences that are similar to a query sequence using a global alignment-based method. - 1.6 - Sequence similarity search (global alignment-based methods) - beta12orEarlier - true - - - - - - - - - - Sequence database search (by sequence for primer sequences) - - true - beta12orEarlier - Search a DNA database (for example a database of conserved sequence tags) for matches to Sequence-Tagged Site (STS) primer sequences. - 1.6 - STSs are genetic markers that are easily detected by the polymerase chain reaction (PCR) using specific primers. 
- Sequence similarity search (primer sequences) - - - - - - - - - - Sequence database search (by molecular weight) - - Search sequence(s) or a sequence database for sequences which match a set of peptide masses, for example a peptide mass fingerprint from mass spectrometry. - 1.6 - Protein fingerprinting - true - beta12orEarlier - Peptide mass fingerprinting - - - - - - - - - - Sequence database search (by isoelectric point) - - 1.6 - beta12orEarlier - Search sequence(s) or a sequence database for sequences of a given isoelectric point. - true - - - - - - - - - - Structure retrieval (by code) - - Query a tertiary structure database and retrieve entries with a given entry code or accession number. - 1.6 - beta12orEarlier - true - - - - - - - - - - Structure retrieval (by keyword) - - true - 1.6 - Query a tertiary structure database and retrieve entries containing a given keyword. - beta12orEarlier - - - - - - - - - - Structure database search (by sequence) - - beta12orEarlier - true - Search a tertiary structure database and retrieve structures with a sequence similar to a query sequence. - 1.8 - - - - - - - - - - Structural similarity search - - - beta12orEarlier - Search a database of molecular structure and retrieve structures that are similar to a query structure. - Structure database search (by structure) - Structure retrieval by structure - - - - - - - - - - Sequence annotation - - - - - - - - - - - - - - beta12orEarlier - Annotate a molecular sequence record with terms from a controlled vocabulary. - - - - - - - - - - Genome annotation - - beta12orEarlier - Metagenome annotation - Annotate a genome sequence with terms from a controlled vocabulary. - - - - - - - - - - Nucleic acid sequence reverse and complement - - beta12orEarlier - Generate the reverse and / or complement of a nucleotide sequence. - - - - - - - - - - Random sequence generation - - Generate a random sequence, for example, with a specific character composition. 
- beta12orEarlier - - - - - - - - - - Nucleic acid restriction digest - - - - - - - - - beta12orEarlier - Generate digest fragments for a nucleotide sequence containing restriction sites. - - - - - - - - - - Protein sequence cleavage - - - - - - - - - - - - - - - beta12orEarlier - Cleave a protein sequence into peptide fragments (by enzymatic or chemical cleavage) and calculate the fragment masses. - - - - - - - - - - Sequence mutation and randomization - - beta12orEarlier - Mutate a molecular sequence a specified amount or shuffle it to produce a randomized sequence with the same overall composition. - - - - - - - - - - Sequence masking - - Mask characters in a molecular sequence (replacing those characters with a mask character). - For example, SNPs or repeats in a DNA sequence might be masked. - beta12orEarlier - - - - - - - - - - Sequence cutting - - Cut (remove) characters or a region from a molecular sequence. - beta12orEarlier - - - - - - - - - - Restriction site creation - - Create (or remove) restriction sites in sequences, for example using silent mutations. - beta12orEarlier - - - - - - - - - - DNA translation - - - - - - - - beta12orEarlier - Translate a DNA sequence into protein. - - - - - - - - - - DNA transcription - - - - - - - - beta12orEarlier - Transcribe a nucleotide sequence into mRNA sequence(s). - - - - - - - - - - Sequence composition calculation (nucleic acid) - - true - Calculate base frequency or word composition of a nucleotide sequence. - 1.8 - beta12orEarlier - - - - - - - - - - Sequence composition calculation (protein) - - 1.8 - Calculate amino acid frequency or word composition of a protein sequence. - beta12orEarlier - true - - - - - - - - - - Repeat sequence detection - - - beta12orEarlier - Find (and possibly render) short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. 
- - - - - - - - - - Repeat sequence organisation analysis - - - beta12orEarlier - Analyse repeat sequence organization such as periodicity. - - - - - - - - - - Protein hydropathy calculation (from structure) - - true - Analyse the hydrophobic, hydrophilic or charge properties of a protein structure. - 1.12 - beta12orEarlier - - - - - - - - - - Accessible surface calculation - - - - - - - - beta12orEarlier - WHATIF:AtomAccessibilitySolventPlus - Protein solvent accessibility calculation - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate solvent accessible or buried surface areas in protein or other molecular structures. - WHATIF:AtomAccessibilitySolvent - - - - - - - - - - Protein hydropathy cluster calculation - - true - 1.12 - beta12orEarlier - Identify clusters of hydrophobic or charged residues in a protein structure. - - - - - - - - - - Protein dipole moment calculation - - - - - - - - beta12orEarlier - Calculate whether a protein structure has an unusually large net charge (dipole moment). - - - - - - - - - - Molecular surface calculation - - WHATIF:ResidueAccessibilityMolecular - Protein surface calculation - Protein surface and interior calculation - WHATIF:AtomAccessibilityMolecularPlus - WHATIF:TotAccessibilityMolecular - Protein atom surface calculation - Calculate the molecular surface area in proteins and other macromolecules. - Protein residue surface calculation - WHATIF:ResidueAccessibilityVacuum - beta12orEarlier - WHATIF:TotAccessibilitySolvent - WHATIF:ResidueAccessibilitySolvent - WHATIF:ResidueAccessibilityVacuumMolecular - WHATIF:AtomAccessibilityMolecular - - - - - - - - - - Protein binding site prediction (from structure) - - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein structures. 
- beta12orEarlier - 1.12 - true - - - - - - - - - - Protein-nucleic acid binding site analysis - - - - - - - - Analyse RNA or DNA-binding sites in protein structure. - beta12orEarlier - - - - - - - - - - Protein peeling - - beta12orEarlier - Decompose a structure into compact or globular fragments (protein peeling). - - - - - - - - - - Protein distance matrix calculation - - - - - - - - beta12orEarlier - Calculate a matrix of distance between residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map calculation - - - - - - - - beta12orEarlier - Calculate a residue contact map (typically all-versus-all inter-residue contacts) for a protein structure. - - - - - - - - - - Residue cluster calculation - - - - - - - - Calculate clusters of contacting residues in protein structures. - This includes for example clusters of hydrophobic or charged residues, or clusters of contacting residues which have a key structural or functional role. - beta12orEarlier - - - - - - - - - - Hydrogen bond calculation - - - - - - - - WHATIF:ShowHydrogenBonds - WHATIF:HasHydrogenBonds - The output might include the atoms involved in the bond, bond geometric parameters and bond enthalpy. - beta12orEarlier - WHATIF:ShowHydrogenBondsM - Identify potential hydrogen bonds between amino acids and other groups. - - - - - - - - - - Residue non-canonical interaction detection - - beta12orEarlier - 1.12 - Calculate non-canonical atomic interactions in protein structures. - true - - - - - - - - - - Ramachandran plot calculation - - - - - - - - Calculate a Ramachandran plot of a protein structure. - beta12orEarlier - - - - - - - - - - Ramachandran plot validation - - - - - - - - - - - - - - beta12orEarlier - Validate a Ramachandran plot of a protein structure. - - - - - - - - - - Protein molecular weight calculation - - - - - - - - - - - - - - Calculate the molecular weight of a protein sequence or fragments. 
- beta12orEarlier - - - - - - - - - - Protein extinction coefficient calculation - - - - - - - - beta12orEarlier - Predict extinction coefficients or optical density of a protein sequence. - - - - - - - - - - Protein pH-dependent property calculation - - - - - - - - - - - - - - Calculate pH-dependent properties from pKa calculations of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydropathy calculation (from sequence) - - 1.12 - Hydropathy calculation on a protein sequence. - beta12orEarlier - true - - - - - - - - - - Protein titration curve plotting - - - - - - - - - beta12orEarlier - Plot a protein titration curve. - - - - - - - - - - Protein isoelectric point calculation - - - - - - - - beta12orEarlier - Calculate isoelectric point of a protein sequence. - - - - - - - - - - Protein hydrogen exchange rate calculation - - - - - - - - Estimate hydrogen exchange rate of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydrophobic region calculation - - Calculate hydrophobic or hydrophilic / charged regions of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein aliphatic index calculation - - - - - - - - beta12orEarlier - Calculate aliphatic index (relative volume occupied by aliphatic side chains) of a protein. - - - - - - - - - - Protein hydrophobic moment plotting - - - - - - - - - beta12orEarlier - Hydrophobic moment is a peptides hydrophobicity measured for different angles of rotation. - Calculate the hydrophobic moment of a peptide sequence and recognize amphiphilicity. - - - - - - - - - - Protein globularity prediction - - - - - - - - Predict the stability or globularity of a protein sequence, whether it is intrinsically unfolded etc. - beta12orEarlier - - - - - - - - - - Protein solubility prediction - - - - - - - - Predict the solubility or atomic solvation energy of a protein sequence. 
- beta12orEarlier - - - - - - - - - - Protein crystallizability prediction - - - - - - - - beta12orEarlier - Predict crystallizability of a protein sequence. - - - - - - - - - - Protein signal peptide detection (eukaryotes) - - beta12orEarlier - Detect or predict signal peptides (and typically predict subcellular localization) of eukaryotic proteins. - - - - - - - - - - Protein signal peptide detection (bacteria) - - Detect or predict signal peptides (and typically predict subcellular localization) of bacterial proteins. - beta12orEarlier - - - - - - - - - - MHC peptide immunogenicity prediction - - true - - Predict MHC class I or class II binding peptides, promiscuous binding peptides, immunogenicity etc. - beta12orEarlier - 1.12 - - - - - - - - - - Protein feature prediction (from sequence) - - Methods typically involve scanning for known motifs, patterns and regular expressions. - beta12orEarlier - true - Sequence feature detection (protein) - 1.6 - Predict, recognise and identify positional features in protein sequences such as functional sites or regions and secondary structure. - - - - - - - - - - Nucleic acid feature detection - - - - - - - - - - - - - - - Sequence feature detection (nucleic acid) - Predict, recognise and identify features in nucleotide sequences such as functional sites or regions, typically by scanning for known motifs, patterns and regular expressions. - Methods typically involve scanning for known motifs, patterns and regular expressions. - beta12orEarlier - Nucleic acid feature recognition - Nucleic acid feature prediction - - - - - - - - - - Epitope mapping - - - - - - - - - beta12orEarlier - Predict antigenic determinant sites (epitopes) in protein sequences. - Epitope mapping is commonly done during vaccine design. - - - - - - - - - - Protein post-translation modification site prediction - - - - - - - - Predict post-translation modification sites in protein sequences. 
- beta12orEarlier - Methods might predict sites of methylation, N-terminal myristoylation, N-terminal acetylation, sumoylation, palmitoylation, phosphorylation, sulfation, glycosylation, glycosylphosphatidylinositol (GPI) modification sites (GPI lipid anchor signals) etc. - - - - - - - - - - Protein signal peptide detection - - - - - - - - - beta12orEarlier - Methods might use sequence motifs and features, amino acid composition, profiles, machine-learned classifiers, etc. - Detect or predict signal peptides and signal peptide cleavage sites in protein sequences. - - - - - - - - - - Protein binding site prediction (from sequence) - - 1.12 - Predict catalytic residues, active sites or other ligand-binding sites in protein sequences. - true - beta12orEarlier - - - - - - - - - - Protein-nucleic acid binding prediction - - beta12orEarlier - Predict RNA and DNA-binding binding sites in protein sequences. - - - - - - - - - - Protein folding site prediction - - - Predict protein sites that are key to protein folding, such as possible sites of nucleation or stabilization. - beta12orEarlier - - - - - - - - - - Protein cleavage site prediction - - - - - - - - beta12orEarlier - Detect or predict cleavage sites (enzymatic or chemical) in protein sequences. - - - - - - - - - - Epitope mapping (MHC Class I) - - 1.8 - true - beta12orEarlier - Predict epitopes that bind to MHC class I molecules. - - - - - - - - - - Epitope mapping (MHC Class II) - - Predict epitopes that bind to MHC class II molecules. - 1.8 - true - beta12orEarlier - - - - - - - - - - - Whole gene prediction - - beta12orEarlier - 1.12 - true - Detect, predict and identify whole gene structure in DNA sequences. This includes protein coding regions, exon-intron structure, regulatory regions etc. - - - - - - - - - - Gene component prediction - - true - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. 
- beta12orEarlier - Detect, predict and identify genetic elements such as promoters, coding regions, splice sites, etc in DNA sequences. - 1.12 - - - - - - - - - - Transposon prediction - - beta12orEarlier - Detect or predict transposons, retrotransposons / retrotransposition signatures etc. - - - - - - - - - - PolyA signal detection - - Detect polyA signals in nucleotide sequences. - beta12orEarlier - - - - - - - - - - Quadruplex formation site detection - - - - - - - - beta12orEarlier - Quadruplex structure prediction - Detect quadruplex-forming motifs in nucleotide sequences. - Quadruplex (4-stranded) structures are formed by guanine-rich regions and are implicated in various important biological processes and as therapeutic targets. - - - - - - - - - - CpG island and isochore detection - - - - - - - - An isochore is long region (> 3 KB) of DNA with very uniform GC content, in contrast to the rest of the genome. Isochores tend tends to have more genes, higher local melting or denaturation temperatures, and different flexibility. Methods might calculate fractional GC content or variation of GC content, predict methylation status of CpG islands etc. This includes methods that visualise CpG rich regions in a nucleotide sequence, for example plot isochores in a genome sequence. - beta12orEarlier - Find CpG rich regions in a nucleotide sequence or isochores in genome sequences. - CpG island and isochores rendering - CpG island and isochores detection - - - - - - - - - - Restriction site recognition - - - - - - - - beta12orEarlier - Find and identify restriction enzyme cleavage sites (restriction sites) in (typically) DNA sequences, for example to generate a restriction map. - - - - - - - - - - Nucleosome formation or exclusion sequence prediction - - beta12orEarlier - Identify or predict nucleosome exclusion sequences (nucleosome free regions) in DNA. 
- - - - - - - - - - Splice site prediction - - - - - - - - beta12orEarlier - Identify, predict or analyse splice sites in nucleotide sequences. - Methods might require a pre-mRNA or genomic DNA sequence. - - - - - - - - - - Integrated gene prediction - - Predict whole gene structure using a combination of multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Operon prediction - - Find operons (operators, promoters and genes) in bacteria genes. - beta12orEarlier - - - - - - - - - - Coding region prediction - - Predict protein-coding regions (CDS or exon) or open reading frames in nucleotide sequences. - ORF prediction - ORF finding - beta12orEarlier - - - - - - - - - - Selenocysteine insertion sequence (SECIS) prediction - - - - - - - - Predict selenocysteine insertion sequence (SECIS) in a DNA sequence. - SECIS elements are around 60 nucleotides in length with a stem-loop structure directs the cell to translate UGA codons as selenocysteines. - beta12orEarlier - - - - - - - - - - Regulatory element prediction - - - - - - - - Identify or predict transcription regulatory motifs, patterns, elements or regions in DNA sequences. - Translational regulatory element prediction - Transcription regulatory element prediction - This includes promoters, enhancers, silencers and boundary elements / insulators, regulatory protein or transcription factor binding sites etc. Methods might be specific to a particular genome and use motifs, word-based / grammatical methods, position-specific frequency matrices, discriminative pattern analysis etc. - beta12orEarlier - - - - - - - - - - Translation initiation site prediction - - - - - - - - Predict translation initiation sites, possibly by searching a database of sites. 
- beta12orEarlier - - - - - - - - - - Promoter prediction - - Identify or predict whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in DNA sequences. - Methods might recognize CG content, CpG islands, splice sites, polyA signals etc. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (DNA-cis) - - beta12orEarlier - Cis-regulatory elements (cis-elements) regulate the expression of genes located on the same strand. Cis-elements are found in the 5' promoter region of the gene, in an intron, or in the 3' untranslated region. Cis-elements are often binding sites of one or more trans-acting factors. - Identify, predict or analyse cis-regulatory elements (TATA box, Pribnow box, SOS box, CAAT box, CCAAT box, operator etc.) in DNA sequences. - - - - - - - - - - Transcription regulatory element prediction (RNA-cis) - - Cis-regulatory elements (cis-elements) regulate genes located on the same strand from which the element was transcribed. A riboswitch is a region of an mRNA molecule that bind a small target molecule that regulates the gene's activity. - Identify, predict or analyse cis-regulatory elements (for example riboswitches) in RNA sequences. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (trans) - - - - - - - - beta12orEarlier - Trans-regulatory elements regulate genes distant from the gene from which they were transcribed. - Identify or predict functional RNA sequences with a gene regulatory role (trans-regulatory elements) or targets. - Functional RNA identification - - - - - - - - - - Matrix/scaffold attachment site prediction - - MAR/SAR sites often flank a gene or gene cluster and are found nearby cis-regulatory sequences. They might contribute to transcription regulation. - Identify matrix/scaffold attachment regions (MARs/SARs) in DNA sequences. 
- beta12orEarlier - - - - - - - - - - Transcription factor binding site prediction - - beta12orEarlier - Identify or predict transcription factor binding sites in DNA sequences. - - - - - - - - - - Exonic splicing enhancer prediction - - - - - - - - An exonic splicing enhancer (ESE) is 6-base DNA sequence motif in an exon that enhances or directs splicing of pre-mRNA or hetero-nuclear RNA (hnRNA) into mRNA. - Identify or predict exonic splicing enhancers (ESE) in exons. - beta12orEarlier - - - - - - - - - - Sequence alignment validation - - - Evaluation might be purely sequence-based or use structural information. - Sequence alignment quality evaluation - Evaluate molecular sequence alignment accuracy. - beta12orEarlier - - - - - - - - - - Sequence alignment analysis (conservation) - - beta12orEarlier - Analyse character conservation in a molecular sequence alignment, for example to derive a consensus sequence. - Residue conservation analysis - Use this concept for methods that calculate substitution rates, estimate relative site variability, identify sites with biased properties, derive a consensus sequence, or identify highly conserved or very poorly conserved sites, regions, blocks etc. - - - - - - - - - - Sequence alignment analysis (site correlation) - - - Analyse correlations between sites in a molecular sequence alignment. - This is typically done to identify possible covarying positions and predict contacts or structural constraints in protein structures. - beta12orEarlier - - - - - - - - - - Chimeric sequence detection - - beta12orEarlier - A chimera includes regions from two or more phylogenetically distinct sequences. They are usually artifacts of PCR and are thought to occur when a prematurely terminated amplicon reanneals to another DNA strand and is subsequently copied to completion in later PCR cycles. - Detects chimeric sequences (chimeras) from a sequence alignment. 
- Sequence alignment analysis (chimeric sequence detection) - - - - - - - - - - Recombination detection - - Sequence alignment analysis (recombination detection) - beta12orEarlier - Detect recombination (hotspots and coldspots) and identify recombination breakpoints in a sequence alignment. - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - - - - - - - - - - Indel detection - - - beta12orEarlier - Sequence alignment analysis (indel detection) - Indel discovery - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - Identify insertion, deletion and duplication events from a sequence alignment. - - - - - - - - - - Nucleosome formation potential prediction - - true - beta12orEarlier - Predict nucleosome formation potential of DNA sequences. - beta12orEarlier - - - - - - - - - - Nucleic acid thermodynamic property calculation - - - - - - - - Calculate a thermodynamic property of DNA or DNA/RNA, such as melting temperature, enthalpy and entropy. - beta12orEarlier - - - - - - - - - - Nucleic acid melting profile plotting - - - - - - - - - Calculate and plot a DNA or DNA/RNA melting profile. - A melting profile is used to visualise and analyse partly melted DNA conformations. - beta12orEarlier - - - - - - - - - - Nucleic acid stitch profile plotting - - - - - - - - A stitch profile represents the alternative conformations that partly melted DNA can adopt in a temperature range. - beta12orEarlier - Calculate and plot a DNA or DNA/RNA stitch profile. - - - - - - - - - - Nucleic acid melting curve plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA melting curve. - beta12orEarlier - - - - - - - - - - Nucleic acid probability profile plotting - - - - - - - - beta12orEarlier - Calculate and plot a DNA or DNA/RNA probability profile. 
- - - - - - - - - - Nucleic acid temperature profile plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA temperature profile. - beta12orEarlier - - - - - - - - - - Nucleic acid curvature calculation - - - - - - - - Calculate curvature and flexibility / stiffness of a nucleotide sequence. - beta12orEarlier - This includes properties such as. - - - - - - - - - - microRNA detection - - Identify or predict microRNA sequences (miRNA) and precursors or microRNA targets / binding sites in a DNA sequence. - beta12orEarlier - - - - - - - - - - tRNA gene prediction - - - - - - - - Identify or predict tRNA genes in genomic sequences (tRNA). - beta12orEarlier - - - - - - - - - - siRNA binding specificity prediction - - - - - - - - beta12orEarlier - Assess binding specificity of putative siRNA sequence(s), for example for a functional assay, typically with respect to designing specific siRNA sequences. - - - - - - - - - - Protein secondary structure prediction (integrated) - - Predict secondary structure of protein sequence(s) using multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (helices) - - beta12orEarlier - Predict helical secondary structure of protein sequences. - - - - - - - - - - Protein secondary structure prediction (turns) - - Predict turn structure (for example beta hairpin turns) of protein sequences. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (coils) - - beta12orEarlier - Predict open coils, non-regular secondary structure and intrinsically disordered / unstructured regions of protein sequences. - - - - - - - - - - Protein secondary structure prediction (disulfide bonds) - - beta12orEarlier - Predict cysteine bonding state and disulfide bond partners in protein sequences. - - - - - - - - - - GPCR prediction - - - beta12orEarlier - G protein-coupled receptor (GPCR) prediction - Predict G protein-coupled receptors (GPCR). 
- - - - - - - - - - GPCR analysis - - - - - - - - Analyse G-protein coupled receptor proteins (GPCRs). - beta12orEarlier - G protein-coupled receptor (GPCR) analysis - - - - - - - - - - Protein structure prediction - - - - - - - - - - - beta12orEarlier - Predict tertiary structure (backbone and side-chain conformation) of protein sequences. - - - - - - - - - - Nucleic acid structure prediction - - - - - - - - - - beta12orEarlier - Methods might identify thermodynamically stable or evolutionarily conserved structures. - Predict tertiary structure of DNA or RNA. - - - - - - - - - - Ab initio structure prediction - - Predict tertiary structure of protein sequence(s) without homologs of known structure. - de novo structure prediction - beta12orEarlier - - - - - - - - - - Protein modelling - - - - - - - - - - Comparative modelling - beta12orEarlier - Build a three-dimensional protein model based on known (for example homologs) structures. - The model might be of a whole, part or aspect of protein structure. Molecular modelling methods might use sequence-structure alignment, structural templates, molecular dynamics, energy minimization etc. - Homology modelling - Homology structure modelling - Protein structure comparative modelling - - - - - - - - - - Molecular docking - - - - - - - - - - - - - - - Model the structure of a protein in complex with a small molecule or another macromolecule. - beta12orEarlier - This includes protein-protein interactions, protein-nucleic acid, protein-ligand binding etc. Methods might predict whether the molecules are likely to bind in vivo, their conformation when bound, the strength of the interaction, possible mutations to achieve bonding and so on. - Docking simulation - Protein docking - - - - - - - - - - Protein modelling (backbone) - - Model protein backbone conformation. - Methods might require a preliminary C(alpha) trace. 
- beta12orEarlier - - - - - - - - - - Protein modelling (side chains) - - beta12orEarlier - Methods might use a residue rotamer library. - Model, analyse or edit amino acid side chain conformation in protein structure, optimize side-chain packing, hydrogen bonding etc. - - - - - - - - - - Protein modelling (loops) - - beta12orEarlier - Model loop conformation in protein structures. - - - - - - - - - - Protein-ligand docking - - - - - - - - - - - - - - beta12orEarlier - Methods aim to predict the position and orientation of a ligand bound to a protein receptor or enzyme. - Ligand-binding simulation - Model protein-ligand (for example protein-peptide) binding using comparative modelling or other techniques. - Virtual ligand screening - - - - - - - - - - Structured RNA prediction and optimisation - - - - - - - - Nucleic acid folding family identification - RNA inverse folding - beta12orEarlier - Predict or optimise RNA sequences (sequence pools) with likely secondary and tertiary structure for in vitro selection. - - - - - - - - - - SNP detection - - - - Find single nucleotide polymorphisms (SNPs) between sequences. - Single nucleotide polymorphism detection - beta12orEarlier - This includes functional SNPs for large-scale genotyping purposes, disease-associated non-synonymous SNPs etc. - SNP discovery - - - - - - - - - - Radiation Hybrid Mapping - - - - - - - - Generate a physical (radiation hybrid) map of genetic markers in a DNA sequence using provided radiation hybrid (RH) scores for one or more markers. - beta12orEarlier - - - - - - - - - - Functional mapping - - beta12orEarlier - true - This can involve characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Map the genetic architecture of dynamic complex traits. 
- beta12orEarlier - - - - - - - - - - Haplotype mapping - - - - - - - - - Haplotype map generation - Haplotype inference - Infer haplotypes, either alleles at multiple loci that are transmitted together on the same chromosome, or a set of single nucleotide polymorphisms (SNPs) on a single chromatid that are statistically associated. - beta12orEarlier - Haplotype inference can help in population genetic studies and the identification of complex disease genes, , and is typically based on aligned single nucleotide polymorphism (SNP) fragments. Haplotype comparison is a useful way to characterize the genetic variation between individuals. An individual's haplotype describes which nucleotide base occurs at each position for a set of common SNPs. Tools might use combinatorial functions (for example parsimony) or a likelihood function or model with optimization such as minimum error correction (MEC) model, expectation-maximization algorithm (EM), genetic algorithm or Markov chain Monte Carlo (MCMC). - Haplotype reconstruction - - - - - - - - - - Linkage disequilibrium calculation - - - - - - - - beta12orEarlier - Linkage disequilibrium is identified where a combination of alleles (or genetic markers) occurs more or less frequently in a population than expected by chance formation of haplotypes. - Calculate linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Genetic code prediction - - - - - - - - - beta12orEarlier - Predict genetic code from analysis of codon usage data. - - - - - - - - - - Dotplot plotting - - - - - - - - - - beta12orEarlier - Draw a dotplot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Pairwise sequence alignment - - - - - - - - Pairwise sequence alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Align exactly two molecular sequences. 
- Pairwise sequence alignment construction - beta12orEarlier - - - - - - - - - - Multiple sequence alignment - - Multiple sequence alignment construction - Align two or more molecular sequences. - This includes methods that use an existing alignment, for example to incorporate sequences into an alignment, or combine several multiple alignments into a single, improved alignment. - beta12orEarlier - Multiple sequence alignment generation - - - - - - - - - - Pairwise sequence alignment generation (local) - - beta12orEarlier - Local pairwise sequence alignment construction - Locally align exactly two molecular sequences. - Pairwise sequence alignment (local) - true - Local alignment methods identify regions of local similarity. - 1.6 - Pairwise sequence alignment construction (local) - - - - - - - - - - - Pairwise sequence alignment generation (global) - - Pairwise sequence alignment construction (global) - Global pairwise sequence alignment construction - 1.6 - true - Globally align exactly two molecular sequences. - beta12orEarlier - Global alignment methods identify similarity across the entire length of the sequences. - Pairwise sequence alignment (global) - - - - - - - - - - - Local sequence alignment - - Multiple sequence alignment (local) - Local multiple sequence alignment construction - beta12orEarlier - Local alignment methods identify regions of local similarity. - Multiple sequence alignment construction (local) - Sequence alignment generation (local) - Sequence alignment (local) - Locally align two or more molecular sequences. - Smith-Waterman - - - - - - - - - - Global sequence alignment - - Global multiple sequence alignment construction - Multiple sequence alignment (global) - beta12orEarlier - Sequence alignment (global) - Multiple sequence alignment construction (global) - Globally align two or more molecular sequences. - Sequence alignment generation (global) - Global alignment methods identify similarity across the entire length of the sequences. 
- - - - - - - - - - Constrained sequence alignment - - beta12orEarlier - Align two or more molecular sequences with user-defined constraints. - Multiple sequence alignment construction (constrained) - Sequence alignment generation (constrained) - Multiple sequence alignment (constrained) - Sequence alignment (constrained) - Constrained multiple sequence alignment construction - - - - - - - - - - Consensus-based sequence alignment - - Consensus multiple sequence alignment construction - Sequence alignment (consensus) - beta12orEarlier - Align two or more molecular sequences using multiple methods to achieve higher quality. - Sequence alignment generation (consensus) - Multiple sequence alignment construction (consensus) - Multiple sequence alignment (consensus) - - - - - - - - - - Tree-based sequence alignment - - - - - - - - Sequence alignment generation (phylogenetic tree-based) - This is supposed to give a more biologically meaningful alignment than standard alignments. - beta12orEarlier - Phylogenetic tree-based multiple sequence alignment construction - Align multiple sequences using relative gap costs calculated from neighbors in a supplied phylogenetic tree. - Sequence alignment (phylogenetic tree-based) - Multiple sequence alignment construction (phylogenetic tree-based) - Multiple sequence alignment (phylogenetic tree-based) - - - - - - - - - - Secondary structure alignment generation - - beta12orEarlier - 1.6 - Secondary structure alignment construction - Secondary structure alignment - true - Align molecular secondary structure (represented as a 1D string). - - - - - - - - - - Protein secondary structure alignment generation - - - - - - - - - Protein secondary structure alignment construction - Align protein secondary structures. 
- beta12orEarlier - Secondary structure alignment (protein) - Protein secondary structure alignment - - - - - - - - - - RNA secondary structure alignment - - - - - - - - - - - - - - - RNA secondary structure alignment generation - Align RNA secondary structures. - RNA secondary structure alignment construction - Secondary structure alignment (RNA) - beta12orEarlier - - - - - - - - - - Pairwise structure alignment - - beta12orEarlier - Pairwise structure alignment generation - Pairwise structure alignment construction - Align (superimpose) exactly two molecular tertiary structures. - - - - - - - - - - Multiple structure alignment construction - - Align (superimpose) two or more molecular tertiary structures. - This includes methods that use an existing alignment. - 1.6 - true - Multiple structure alignment - beta12orEarlier - - - - - - - - - - Structure alignment (protein) - - beta13 - true - beta12orEarlier - Align protein tertiary structures. - - - - - - - - - - Structure alignment (RNA) - - beta13 - true - Align RNA tertiary structures. - beta12orEarlier - - - - - - - - - - Pairwise structure alignment generation (local) - - Locally align (superimpose) exactly two molecular tertiary structures. - Pairwise structure alignment (local) - Local alignment methods identify regions of local similarity, common substructures etc. - Pairwise structure alignment construction (local) - 1.6 - true - Local pairwise structure alignment construction - beta12orEarlier - - - - - - - - - - - Pairwise structure alignment generation (global) - - Global pairwise structure alignment construction - Global alignment methods identify similarity across the entire structures. - true - beta12orEarlier - 1.6 - Pairwise structure alignment construction (global) - Globally align (superimpose) exactly two molecular tertiary structures. 
- Pairwise structure alignment (global) - - - - - - - - - - - Local structure alignment - - Local multiple structure alignment construction - Local alignment methods identify regions of local similarity, common substructures etc. - Structure alignment construction (local) - beta12orEarlier - Locally align (superimpose) two or more molecular tertiary structures. - Multiple structure alignment construction (local) - Multiple structure alignment (local) - Structure alignment generation (local) - - - - - - - - - - Global structure alignment - - Structure alignment construction (global) - Multiple structure alignment (global) - Structure alignment generation (global) - Multiple structure alignment construction (global) - beta12orEarlier - Global alignment methods identify similarity across the entire structures. - Global multiple structure alignment construction - Globally align (superimpose) two or more molecular tertiary structures. - - - - - - - - - - Profile-to-profile alignment (pairwise) - - Sequence alignment generation (pairwise profile) - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Pairwise sequence profile alignment construction - Sequence profile alignment construction (pairwise) - Sequence profile alignment (pairwise) - beta12orEarlier - Align exactly two molecular profiles. - Sequence profile alignment generation (pairwise) - - - - - - - - - - Sequence alignment generation (multiple profile) - - Align two or more molecular profiles. - 1.6 - true - Sequence profile alignment generation (multiple) - beta12orEarlier - Sequence profile alignment (multiple) - Sequence profile alignment construction (multiple) - Multiple sequence profile alignment construction - - - - - - - - - - 3D profile-to-3D profile alignment (pairwise) - - Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Pairwise structural (3D) profile alignment construction - Structural (3D) profile alignment (pairwise) - Structural profile alignment construction (pairwise) - Align exactly two molecular Structural (3D) profiles. - beta12orEarlier - Structural profile alignment generation (pairwise) - - - - - - - - - - Structural profile alignment generation (multiple) - - true - Structural profile alignment construction (multiple) - Align two or more molecular 3D profiles. - Multiple structural (3D) profile alignment construction - beta12orEarlier - Structural (3D) profile alignment (multiple) - 1.6 - - - - - - - - - - Data retrieval (tool metadata) - - Data retrieval (tool annotation) - 1.6 - Search and retrieve names of or documentation on bioinformatics tools, for example by keyword or which perform a particular function. - beta12orEarlier - true - Tool information retrieval - - - - - - - - - - Data retrieval (database metadata) - - beta12orEarlier - true - Data retrieval (database annotation) - Search and retrieve names of or documentation on bioinformatics databases or query terms, for example by keyword. - Database information retrieval - 1.6 - - - - - - - - - - PCR primer design (for large scale sequencing) - - 1.13 - Predict primers for large scale sequencing. - beta12orEarlier - true - - - - - - - - - - PCR primer design (for genotyping polymorphisms) - - true - beta12orEarlier - Predict primers for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs). - 1.13 - - - - - - - - - - PCR primer design (for gene transcription profiling) - - Predict primers for gene transcription profiling. - beta12orEarlier - true - 1.13 - - - - - - - - - - PCR primer design (for conserved primers) - - 1.13 - Predict primers that are conserved across multiple genomes or species. 
- beta12orEarlier - true - - - - - - - - - - PCR primer design (based on gene structure) - - 1.13 - true - beta12orEarlier - - - - - - - - - - PCR primer design (for methylation PCRs) - - true - beta12orEarlier - Predict primers for methylation PCRs. - 1.13 - - - - - - - - - - Mapping assembly - - Sequence assembly by combining fragments using an existing backbone sequence, typically a reference genome. - beta12orEarlier - Sequence assembly (mapping assembly) - The final sequence will resemble the backbone sequence. Mapping assemblers are usually much faster and less memory intensive than de-novo assemblers. - - - - - - - - - - De-novo assembly - - De Bruijn graph - Sequence assembly by combining fragments without the aid of a reference sequence or genome. - Sequence assembly (de-novo assembly) - De-novo assemblers are much slower and more memory intensive than mapping assemblers. - beta12orEarlier - - - - - - - - - - Genome assembly - - The process of assembling many short DNA sequences together such that they represent the original chromosomes from which the DNA originated. - beta12orEarlier - Sequence assembly (genome assembly) - - - - - - - - - - EST assembly - - beta12orEarlier - Sequence assembly (EST assembly) - Sequence assembly for EST sequences (transcribed mRNA). - Assemblers must handle (or be complicated by) alternative splicing, trans-splicing, single-nucleotide polymorphism (SNP), recoding, and post-transcriptional modification. - - - - - - - - - - Tag mapping - - - - - - - - - Tag mapping might assign experimentally obtained tags to known transcripts or annotate potential virtual tags in a genome. - Tag to gene assignment - Make gene to tag assignments (tag mapping) of SAGE, MPSS and SBS data, by annotating tags with ontology concepts.
- beta12orEarlier - - - - - - - - - - SAGE data processing - - beta12orEarlier - Serial analysis of gene expression data processing - beta12orEarlier - Process (read and / or write) serial analysis of gene expression (SAGE) data. - true - - - - - - - - - - MPSS data processing - - beta12orEarlier - Process (read and / or write) massively parallel signature sequencing (MPSS) data. - true - Massively parallel signature sequencing data processing - beta12orEarlier - - - - - - - - - - SBS data processing - - beta12orEarlier - Sequencing by synthesis data processing - beta12orEarlier - Process (read and / or write) sequencing by synthesis (SBS) data. - true - - - - - - - - - - Heat map generation - - - - - - - - - beta12orEarlier - The heat map usually uses a coloring scheme to represent clusters. They can show how expression of mRNA by a set of genes was influenced by experimental conditions. - Heat map construction - Generate a heat map of gene expression from microarray data. - - - - - - - - - - Gene expression profile analysis - - true - Functional profiling - beta12orEarlier - Analyse one or more gene expression profiles, typically to interpret them in functional terms. - 1.6 - - - - - - - - - - Gene expression profile pathway mapping - - - - - - - - - - beta12orEarlier - Map a gene expression profile to known biological pathways, for example, to identify or reconstruct a pathway. - - - - - - - - - - Protein secondary structure assignment (from coordinate data) - - - beta12orEarlier - Assign secondary structure from protein coordinate data. - - - - - - - - - - Protein secondary structure assignment (from CD data) - - - - - - - - Assign secondary structure from circular dichroism (CD) spectroscopic data. - beta12orEarlier - - - - - - - - - - Protein structure assignment (from X-ray crystallographic data) - - true - 1.7 - Assign a protein tertiary structure (3D coordinates) from raw X-ray crystallography data. 
- beta12orEarlier - - - - - - - - - - Protein structure assignment (from NMR data) - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw NMR spectroscopy data. - true - 1.7 - - - - - - - - - - Phylogenetic tree generation (data centric) - - Phylogenetic tree construction (data centric) - beta12orEarlier - Construct a phylogenetic tree from a specific type of data. - - - - - - - - - - Phylogenetic tree generation (method centric) - - Phylogenetic tree construction (method centric) - Construct a phylogenetic tree using a specific method. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (from molecular sequences) - - - Phylogenetic tree construction from molecular sequences. - beta12orEarlier - Phylogenetic tree construction (from molecular sequences) - Methods typically compare multiple molecular sequence and estimate evolutionary distances and relationships to infer gene families or make functional predictions. - - - - - - - - - - Phylogenetic tree generation (from continuous quantitative characters) - - - - - - - - Phylogenetic tree construction (from continuous quantitative characters) - beta12orEarlier - Phylogenetic tree construction from continuous quantitative character data. - - - - - - - - - - Phylogenetic tree generation (from gene frequencies) - - - - - - - - - - - - - - Phylogenetic tree construction (from gene frequencies) - Phylogenetic tree construction from gene frequency data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree construction (from polymorphism data) - - - - - - - - Phylogenetic tree construction from polymorphism data including microsatellites, RFLP (restriction fragment length polymorphisms), RAPD (random-amplified polymorphic DNA) and AFLP (amplified fragment length polymorphisms) data. 
- Phylogenetic tree generation (from polymorphism data) - beta12orEarlier - - - - - - - - - - Phylogenetic species tree construction - - Construct a phylogenetic species tree, for example, from a genome-wide sequence comparison. - Phylogenetic species tree generation - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (parsimony methods) - - Phylogenetic tree construction (parsimony methods) - Construct a phylogenetic tree by computing a sequence alignment and searching for the tree with the fewest number of character-state changes from the alignment. - This includes evolutionary parsimony (invariants) methods. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (minimum distance methods) - - This includes neighbor joining (NJ) clustering method. - beta12orEarlier - Phylogenetic tree construction (minimum distance methods) - Construct a phylogenetic tree by computing (or using precomputed) distances between sequences and searching for the tree with minimal discrepancies between pairwise distances. - - - - - - - - - - Phylogenetic tree generation (maximum likelihood and Bayesian methods) - - Phylogenetic tree construction (maximum likelihood and Bayesian methods) - Construct a phylogenetic tree by relating sequence data to a hypothetical tree topology using a model of sequence evolution. - Maximum likelihood methods search for a tree that maximizes a likelihood function, i.e. that is most likely given the data and model. Bayesian analysis estimate the probability of tree for branch lengths and topology, typically using a Monte Carlo algorithm. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (quartet methods) - - beta12orEarlier - Phylogenetic tree construction (quartet methods) - Construct a phylogenetic tree by computing four-taxon trees (4-trees) and searching for the phylogeny that matches most closely. 
- - - - - - - - - - Phylogenetic tree generation (AI methods) - - Construct a phylogenetic tree by using artificial-intelligence methods, for example genetic algorithms. - Phylogenetic tree construction (AI methods) - beta12orEarlier - - - - - - - - - - DNA substitution modelling - - - - - - - - - - - - - - - Identify a plausible model of DNA substitution that explains a molecular (DNA or protein) sequence alignment. - Sequence alignment analysis (phylogenetic modelling) - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (shape) - - Phylogenetic tree topology analysis - Analyse the shape (topology) of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree bootstrapping - - - Apply bootstrapping or other measures to estimate confidence of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (gene family prediction) - - - - - - - - - - - - - - Predict families of genes and gene function based on their position in a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (natural selection) - - beta12orEarlier - Stabilizing/purifying (directional) selection favors a single phenotype and tends to decrease genetic diversity as a population stabilizes on a particular trait, selecting out trait extremes or deleterious mutations. In contrast, balancing selection maintain genetic polymorphisms (or multiple alleles), whereas disruptive (or diversifying) selection favors individuals at both extremes of a trait. - Analyse a phylogenetic tree to identify allele frequency distribution and change that is subject to evolutionary pressures (natural selection, genetic drift, mutation and gene flow). Identify type of natural selection (such as stabilizing, balancing or disruptive). - - - - - - - - - - Phylogenetic tree generation (consensus) - - - Compare two or more phylogenetic trees to produce a consensus tree. 
- Methods typically test for topological similarity between trees using for example a congruence index. - beta12orEarlier - Phylogenetic tree construction (consensus) - - - - - - - - - - Phylogenetic sub/super tree detection - - beta12orEarlier - Compare two or more phylogenetic trees to detect subtrees or supertrees. - - - - - - - - - - Phylogenetic tree distances calculation - - - - - - - - beta12orEarlier - Compare two or more phylogenetic trees to calculate distances between trees. - - - - - - - - - - Phylogenetic tree annotation - - beta12orEarlier - http://www.evolutionaryontology.org/cdao.owl#CDAOAnnotation - Annotate a phylogenetic tree with terms from a controlled vocabulary. - - - - - - - - - - Immunogenicity prediction - - true - 1.12 - beta12orEarlier - Peptide immunogen prediction - Predict and optimise peptide ligands that elicit an immunological response. - - - - - - - - - - DNA vaccine design - - - - - - - - beta12orEarlier - Predict or optimise DNA to elicit (via DNA vaccination) an immunological response. - - - - - - - - - - Sequence formatting - - 1.12 - beta12orEarlier - Reformat (a file or other report of) molecular sequence(s). - true - - - - - - - - - - Sequence alignment formatting - - Reformat (a file or other report of) molecular sequence alignment(s). - beta12orEarlier - true - 1.12 - - - - - - - - - - Codon usage table formatting - - Reformat a codon usage table. - true - beta12orEarlier - 1.12 - - - - - - - - - - Sequence visualisation - - - - - - - - - - - - - - - beta12orEarlier - Visualise, format or render a molecular sequence, possibly with sequence features or properties shown. - Sequence rendering - - - - - - - - - - Sequence alignment visualisation - - - - - - - - - - - - - - - Sequence alignment rendering - Visualise, format or print a molecular sequence alignment. 
- beta12orEarlier - - - - - - - - - - Sequence cluster visualisation - - - - - - - - Sequence cluster rendering - beta12orEarlier - Visualise, format or render sequence clusters. - - - - - - - - - - Phylogenetic tree visualisation - - - - - - - - - Render or visualise a phylogenetic tree. - Phylogenetic tree rendering - beta12orEarlier - - - - - - - - - - RNA secondary structure visualisation - - - - - - - - - RNA secondary structure rendering - Visualise RNA secondary structure, knots, pseudoknots etc. - beta12orEarlier - - - - - - - - - - Protein secondary structure rendering - Protein secondary structure visualisation - - - - - - - - Render and visualise protein secondary structure. - beta12orEarlier - - - - - - - - - - Structure visualisation - - - - - - - - - - - - - - - Structure rendering - Visualise or render a molecular tertiary structure, for example a high-quality static picture or animation. - beta12orEarlier - - - - - - - - - - Microarray data rendering - - - - - - - - - - Visualise microarray data. - beta12orEarlier - - - - - - - - - - Protein interaction network rendering - Protein interaction network visualisation - - - - - - - - - beta12orEarlier - Identify and analyse networks of protein interactions. - - - - - - - - - - Map drawing - - - - - - - - beta12orEarlier - DNA map drawing - Map rendering - Draw or visualise a DNA map. - - - - - - - - - - Sequence motif rendering - - Render a sequence with motifs. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Restriction map drawing - - - - - - - - - Draw or visualise restriction maps in DNA sequences. - beta12orEarlier - - - - - - - - - - DNA linear map rendering - - beta12orEarlier - beta12orEarlier - true - Draw a linear map of DNA. - - - - - - - - - - Plasmid map drawing - - beta12orEarlier - DNA circular map rendering - Draw a circular map of DNA, for example a plasmid map. - - - - - - - - - - Operon drawing - - - - - - - - Visualise operon structure etc.
- beta12orEarlier - Operon rendering - - - - - - - - - - Nucleic acid folding family identification - - true - beta12orEarlier - Identify folding families of related RNAs. - beta12orEarlier - - - - - - - - - - Nucleic acid folding energy calculation - - beta12orEarlier - Compute energies of nucleic acid folding, e.g. minimum folding energies for DNA or RNA sequences or energy landscape of RNA mutants. - - - - - - - - - - Annotation retrieval - - beta12orEarlier - Use this concept for tools which retrieve pre-existing annotations, not for example prediction methods that might make annotations. - Retrieve existing annotation (or documentation), typically annotation on a database entity. - beta12orEarlier - true - - - - - - - - - - Protein function prediction - - - - - - - - - beta12orEarlier - Predict general functional properties of a protein. - For functional properties that can be mapped to a sequence, use 'Sequence feature detection (protein)' instead. - - - - - - - - - - Protein function comparison - - - - - - - - - Compare the functional properties of two or more proteins. - beta12orEarlier - - - - - - - - - - Sequence submission - - Submit a molecular sequence to a database. - beta12orEarlier - 1.6 - true - - - - - - - - - - Gene regulatory network analysis - - - - - - - - beta12orEarlier - Analyse a known network of gene regulation. - - - - - - - - - - - Loading - - - - - - - - Data loading - WHATIF:UploadPDB - Prepare or load a user-specified data file so that it is available for use. - beta12orEarlier - - - - - - - - - - Sequence retrieval - - This includes direct retrieval methods (e.g. the dbfetch program) but not those that perform calculations on the sequence. - Data retrieval (sequences) - 1.6 - Query a sequence data resource (typically a database) and retrieve sequences and / or annotation.
- beta12orEarlier - true - - - - - - - - - - Structure retrieval - - true - WHATIF:EchoPDB - beta12orEarlier - WHATIF:DownloadPDB - This includes direct retrieval methods but not those that perform calculations on the sequence or structure. - Query a tertiary structure data resource (typically a database) and retrieve structures, structure-related data and annotation. - 1.6 - - - - - - - - - - Surface rendering - - - beta12orEarlier - WHATIF:GetSurfaceDots - Calculate the positions of dots that are homogeneously distributed over the surface of a molecule. - A dot has three coordinates (x,y,z) and (typically) a color. - - - - - - - - - - Protein atom surface calculation (accessible) - - beta12orEarlier - 1.12 - true - Calculate the solvent accessibility ('accessible surface') for each atom in a structure. - Waters are not considered. - - - - - - - - - - Protein atom surface calculation (accessible molecular) - - beta12orEarlier - 1.12 - Calculate the solvent accessibility ('accessible molecular surface') for each atom in a structure. - Waters are not considered. - true - - - - - - - - - - Protein residue surface calculation (accessible) - - true - 1.12 - beta12orEarlier - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('accessible surface') for each residue in a structure. - - - - - - - - - - Protein residue surface calculation (vacuum accessible) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('vacuum accessible surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. 
- 1.12 - true - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (accessible molecular) - - Calculate the solvent accessibility ('accessible molecular surface') for each residue in a structure. - true - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - 1.12 - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (vacuum molecular) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - true - beta12orEarlier - Calculate the solvent accessibility ('vacuum molecular surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. - 1.12 - - - - - - - - - - Protein surface calculation (accessible molecular) - - true - 1.12 - beta12orEarlier - Calculate the solvent accessibility ('accessible molecular surface') for a structure as a whole. - - - - - - - - - - Protein surface calculation (accessible) - - Calculate the solvent accessibility ('accessible surface') for a structure as a whole. - beta12orEarlier - 1.12 - true - - - - - - - - - - Backbone torsion angle calculation - - 1.12 - beta12orEarlier - true - Calculate for each residue in a protein structure all its backbone torsion angles. - - - - - - - - - - Full torsion angle calculation - - 1.12 - beta12orEarlier - Calculate for each residue in a protein structure all its torsion angles. - true - - - - - - - - - - Cysteine torsion angle calculation - - beta12orEarlier - Calculate for each cysteine (bridge) all its torsion angles. - 1.12 - true - - - - - - - - - - Tau angle calculation - - beta12orEarlier - Tau is the backbone angle N-Calpha-C (angle over the C-alpha). - 1.12 - For each amino acid in a protein structure calculate the backbone angle tau. 
- true - - - - - - - - - - Cysteine bridge detection - - WHATIF:ShowCysteineBridge - Detect cysteine bridges (from coordinate data) in a protein structure. - beta12orEarlier - - - - - - - - - - Free cysteine detection - - beta12orEarlier - A free cysteine is neither involved in a cysteine bridge, nor functions as a ligand to a metal. - Detect free cysteines in a protein structure. - WHATIF:ShowCysteineFree - - - - - - - - - - Metal-bound cysteine detection - - - beta12orEarlier - WHATIF:ShowCysteineMetal - Detect cysteines that are bound to metal in a protein structure. - - - - - - - - - - Residue contact calculation (residue-nucleic acid) - - beta12orEarlier - 1.12 - true - Calculate protein residue contacts with nucleic acids in a structure. - - - - - - - - - - Protein-metal contact calculation - - beta12orEarlier - Calculate protein residue contacts with metal in a structure. - Residue-metal contact calculation - - - - - - - - - - Residue contact calculation (residue-negative ion) - - Calculate ion contacts in a structure (all ions for all side chain atoms). - beta12orEarlier - true - 1.12 - - - - - - - - - - Residue bump detection - - WHATIF:ShowBumps - beta12orEarlier - Detect 'bumps' between residues in a structure, i.e. those with pairs of atoms whose Van der Waals' radii interpenetrate more than a defined distance. - - - - - - - - - - Residue symmetry contact calculation - - Calculate the number of symmetry contacts made by residues in a protein structure. - true - 1.12 - WHATIF:SymmetryContact - A symmetry contact is a contact between two atoms in different asymmetric unit. - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-ligand) - - true - beta12orEarlier - 1.12 - Calculate contacts between residues and ligands in a protein structure. - - - - - - - - - - Salt bridge calculation - - Salt bridges are interactions between oppositely charged atoms in different residues. The output might include the inter-atomic distance. 
- WHATIF:HasSaltBridgePlus - WHATIF:ShowSaltBridges - beta12orEarlier - WHATIF:HasSaltBridge - WHATIF:ShowSaltBridgesH - Calculate (and possibly score) salt bridges in a protein structure. - - - - - - - - - - Rotamer likelihood prediction - - WHATIF:ShowLikelyRotamers - WHATIF:ShowLikelyRotamers500 - 1.12 - Predict rotamer likelihoods for all 20 amino acid types at each position in a protein structure. - WHATIF:ShowLikelyRotamers600 - WHATIF:ShowLikelyRotamers800 - WHATIF:ShowLikelyRotamers900 - true - Output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods. - WHATIF:ShowLikelyRotamers700 - WHATIF:ShowLikelyRotamers400 - WHATIF:ShowLikelyRotamers300 - WHATIF:ShowLikelyRotamers200 - WHATIF:ShowLikelyRotamers100 - beta12orEarlier - - - - - - - - - - Proline mutation value calculation - - true - 1.12 - Calculate for each position in a protein structure the chance that a proline, when introduced at this position, would increase the stability of the whole protein. - WHATIF:ProlineMutationValue - beta12orEarlier - - - - - - - - - - Residue packing validation - - beta12orEarlier - Identify poorly packed residues in protein structures. - WHATIF: PackingQuality - - - - - - - - - - Protein geometry validation - - WHATIF: ImproperQualitySum - beta12orEarlier - Validate protein geometry, for example bond lengths, bond angles, torsion angles, chiralities, planarities etc. - WHATIF: ImproperQualityMax - - - - - - - - - - PDB file sequence retrieval - - Extract a molecular sequence from a PDB file. - beta12orEarlier - WHATIF: PDB_sequence - true - beta12orEarlier - - - - - - - - - - HET group detection - - true - Identify HET groups in PDB files. - beta12orEarlier - 1.12 - A HET group usually corresponds to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids.
Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. - - - - - - - - - - DSSP secondary structure assignment - - Determine for residue the DSSP determined secondary structure in three-state (HSC). - beta12orEarlier - WHATIF: ResidueDSSP - beta12orEarlier - true - - - - - - - - - - Structure formatting - - 1.12 - true - Reformat (a file or other report of) tertiary structure data. - beta12orEarlier - WHATIF: PDBasXML - - - - - - - - - - Protein cysteine and disulfide bond assignment - - - - - - - - Assign cysteine bonding state and disulfide bond partners in protein structures. - beta12orEarlier - - - - - - - - - - Residue validation - - 1.12 - Identify poor quality amino acid positions in protein structures. - beta12orEarlier - true - - - - - - - - - - Structure retrieval (water) - - beta12orEarlier - 1.6 - WHATIF:MovedWaterPDB - true - Query a tertiary structure database and retrieve water molecules. - - - - - - - - - - siRNA duplex prediction - - - - - - - - beta12orEarlier - Identify or predict siRNA duplexes in RNA. - - - - - - - - - - Sequence alignment refinement - - - Refine an existing sequence alignment. - beta12orEarlier - - - - - - - - - - Listfile processing - - 1.6 - Process an EMBOSS listfile (list of EMBOSS Uniform Sequence Addresses). - true - beta12orEarlier - - - - - - - - - - Sequence file editing - - - beta12orEarlier - Perform basic (non-analytical) operations on a report or file of sequences (which might include features), such as file concatenation, removal or ordering of sequences, creation of subset or a new file of sequences. - - - - - - - - - - Sequence alignment file processing - - beta12orEarlier - Perform basic (non-analytical) operations on a sequence alignment file, such as copying or removal and ordering of sequences. 
- 1.6 - true - - - - - - - - - - Small molecule data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) physicochemical property data for small molecules. - - - - - - - - - - Data retrieval (ontology annotation) - - beta13 - Ontology information retrieval - true - Search and retrieve documentation on a bioinformatics ontology. - beta12orEarlier - - - - - - - - - - Data retrieval (ontology concept) - - Query an ontology and retrieve concepts or relations. - true - beta13 - beta12orEarlier - Ontology retrieval - - - - - - - - - - Representative sequence identification - - Identify a representative sequence from a set of sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - beta12orEarlier - - - - - - - - - - Structure file processing - - Perform basic (non-analytical) operations on a file of molecular tertiary structural data. - 1.6 - beta12orEarlier - true - - - - - - - - - - Data retrieval (sequence profile) - - Query a profile data resource and retrieve one or more profile(s) and / or associated annotation. - true - This includes direct retrieval methods that retrieve a profile by, e.g. the profile name. - beta13 - beta12orEarlier - - - - - - - - - - Statistical calculation - - Statistics - Statistical testing - Statistical analysis - Perform a statistical data operation of some type, e.g. calibration or validation. - Gibbs sampling - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix generation - - - - - - - - - - - - - - - - beta12orEarlier - 3D-1D scoring matrix construction - A 3D-1D scoring matrix scores the probability of amino acids occurring in different structural environments. - Calculate a 3D-1D scoring matrix from analysis of protein sequence and structural data. - - - - - - - - - - Transmembrane protein visualisation - - - - - - - - - Visualise transmembrane proteins, typically the transmembrane regions within a sequence. 
- beta12orEarlier - Transmembrane protein rendering - - - - - - - - - - Demonstration - - beta12orEarlier - true - An operation performing purely illustrative (pedagogical) purposes. - beta13 - - - - - - - - - - Data retrieval (pathway or network) - - beta12orEarlier - true - Query a biological pathways database and retrieve annotation on one or more pathways. - beta13 - - - - - - - - - - Data retrieval (identifier) - - beta12orEarlier - Query a database and retrieve one or more data identifiers. - beta13 - true - - - - - - - - - - Nucleic acid density plotting - - - beta12orEarlier - Calculate a density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence analysis - - - - - - - - Analyse one or more known molecular sequences. - beta12orEarlier - Sequence analysis (general) - - - - - - - - - - Sequence motif analysis - - Analyse molecular sequence motifs. - beta12orEarlier - Sequence motif processing - - - - - - - - - - Protein interaction data processing - - 1.6 - Process (read and / or write) protein interaction data. - true - beta12orEarlier - - - - - - - - - - Protein structure analysis - - - - - - - - - - - - - - - Structure analysis (protein) - beta12orEarlier - Analyse protein tertiary structural data. - - - - - - - - - - Annotation processing - - true - beta12orEarlier - beta12orEarlier - Process (read and / or write) annotation of some type, typically annotation on an entry from a biological or biomedical database entity. - - - - - - - - - - Sequence feature analysis - - beta12orEarlier - true - Analyse features in molecular sequences. - beta12orEarlier - - - - - - - - - - File handling - - - - - - - - Basic (non-analytical) operations of some data, either a file or equivalent entity in memory. - File processing - beta12orEarlier - Report handling - Data handling - Utility operation - - - - - - - - - - Gene expression analysis - - Analyse gene expression and regulation data. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structural profile processing - - beta12orEarlier - 1.6 - Process (read and / or write) one or more structural (3D) profile(s) or template(s) of some type. - 3D profile processing - true - - - - - - - - - - Data index processing - - Database index processing - true - Process (read and / or write) an index of (typically a file of) biological data. - 1.6 - beta12orEarlier - - - - - - - - - - Sequence profile processing - - true - beta12orEarlier - Process (read and / or write) some type of sequence profile. - 1.6 - - - - - - - - - - Protein function analysis - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - beta12orEarlier - Analyse protein function, typically by processing protein sequence and/or structural data, and generate an informative report. - - - - - - - - - - Protein folding analysis - - - - - - - - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - Analyse protein folding, typically by processing sequence and / or structural data, and write an informative report. - Protein folding modelling - beta12orEarlier - - - - - - - - - - Protein secondary structure analysis - - - - - - - - - - - - - - Analyse known protein secondary structure data. - beta12orEarlier - Secondary structure analysis (protein) - - - - - - - - - - Physicochemical property data processing - - beta13 - true - Process (read and / or write) data on the physicochemical property of a molecule. - beta12orEarlier - - - - - - - - - - Primer and probe design - - - - - - - - - Primer and probe prediction - beta12orEarlier - Predict oligonucleotide primers or probes. - - - - - - - - - - Operation (typed) - - true - Process (read and / or write) data of a specific type, for example applying analytical methods. 
- beta12orEarlier - 1.12 - - - - - - - - - - Database search - - - - - - - - beta12orEarlier - Typically the query is compared to each entry and high scoring matches (hits) are returned. For example, a BLAST search of a sequence database. - Search a database (or other data resource) with a supplied query and retrieve entries (or parts of entries) that are similar to the query. - Search - - - - - - - - - - Data retrieval - - - - - - - - Information retrieval - beta12orEarlier - Retrieve an entry (or part of an entry) from a data resource that matches a supplied query. This might include some primary data and annotation. The query is a data identifier or other indexed term. For example, retrieve a sequence record with the specified accession number, or matching supplied keywords. - Retrieval - - - - - - - - - - Prediction and recognition - - beta12orEarlier - Recognition - Prediction - Predict, recognise, detect or identify some properties of a biomolecule. - Detection - - - - - - - - - - Comparison - - beta12orEarlier - Compare two or more things to identify similarities. - - - - - - - - - - Optimisation and refinement - - beta12orEarlier - Refine or optimise some data model. - - - - - - - - - - Modelling and simulation - - - - - - - - beta12orEarlier - Model or simulate some biological entity or system, typically using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - Mathematical modelling - - - - - - - - - - Data handling - - true - beta12orEarlier - Perform basic operations on some data or a database. - beta12orEarlier - - - - - - - - - - Validation - - beta12orEarlier - Validation and standardisation - Quality control - Validate some data. - - - - - - - - - - Mapping - - This is a broad concept and is used a placeholder for other, more specific concepts. 
- Map properties to positions on a biological entity (typically a molecular sequence or structure), or assemble such an entity from constituent parts. - beta12orEarlier - - - - - - - - - - Design - - beta12orEarlier - Design a biological entity (typically a molecular sequence or structure) with specific properties. - - - - - - - - - - Microarray data processing - - beta12orEarlier - Process (read and / or write) microarray data. - beta12orEarlier - true - - - - - - - - - - Codon usage table processing - - Process (read and / or write) a codon usage table. - beta12orEarlier - - - - - - - - - - Data retrieval (codon usage table) - - Retrieve a codon usage table and / or associated annotation. - beta12orEarlier - true - beta13 - - - - - - - - - - Gene expression profile processing - - 1.6 - Process (read and / or write) a gene expression profile. - true - beta12orEarlier - - - - - - - - - - Functional enrichment - - - - - - - - - Analyse a set of genes (genes corresponding to an expression profile, or any other set) to find functional annotations (such as cellular processes or metabolic pathways) that the sets are significantly associated with, providing biological insight into the set of genes. - beta12orEarlier - The Gene Ontology (GO) is invariably used, the input is a set of Gene IDs and the output of the analysis is typically a ranked list of GO terms, each associated with a p-value. - GO term enrichment - - - - - - - - - - Gene regulatory network prediction - - - - - - - - - - - - - - - Predict a network of gene regulation. - beta12orEarlier - - - - - - - - - - Pathway or network processing - - Generate, analyse or handle a biological pathway or network. - beta12orEarlier - true - 1.12 - - - - - - - - - - RNA secondary structure analysis - - - - - - - - beta12orEarlier - Process (read and / or write) RNA secondary structure data. - - - - - - - - - - Structure processing (RNA) - - Process (read and / or write) RNA tertiary structure data. 
- beta12orEarlier - beta13 - true - - - - - - - - - - RNA structure prediction - - - - - - - - beta12orEarlier - Predict RNA tertiary structure. - - - - - - - - - - DNA structure prediction - - - - - - - - Predict DNA tertiary structure. - beta12orEarlier - - - - - - - - - - Phylogenetic tree processing - - beta12orEarlier - 1.12 - true - Generate, process or analyse phylogenetic tree or trees. - - - - - - - - - - Protein secondary structure processing - - Process (read and / or write) protein secondary structure data. - 1.6 - true - beta12orEarlier - - - - - - - - - - Protein interaction network processing - - true - beta12orEarlier - Process (read and / or write) a network of protein interactions. - 1.6 - - - - - - - - - - Sequence processing - - Sequence processing (general) - Process (read and / or write) one or more molecular sequences and associated annotation. - true - beta12orEarlier - 1.6 - - - - - - - - - - Sequence processing (protein) - - Process (read and / or write) a protein sequence and associated annotation. - beta12orEarlier - true - 1.6 - - - - - - - - - - Sequence processing (nucleic acid) - - 1.6 - true - beta12orEarlier - Process (read and / or write) a nucleotide sequence and associated annotation. - - - - - - - - - - Sequence comparison - - - - - - - - - - - - - - - Compare two or more molecular sequences. - beta12orEarlier - - - - - - - - - - Sequence cluster processing - - Process (read and / or write) a sequence cluster. - true - beta12orEarlier - 1.6 - - - - - - - - - - Feature table processing - - Process (read and / or write) a sequence feature table. - 1.6 - true - beta12orEarlier - - - - - - - - - - Gene prediction - - - - - - - - - - - - - - Gene component prediction - Detect, predict and identify genes or components of genes in DNA sequences, including promoters, coding regions, splice sites, etc. 
- Whole gene prediction - Gene and gene component prediction - beta12orEarlier - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - Gene finding - - - - - - - - - - GPCR classification - - - - - - - - - beta12orEarlier - G protein-coupled receptor (GPCR) classification - Classify G-protein coupled receptors (GPCRs) into families and subfamilies. - - - - - - - - - - GPCR coupling selectivity prediction - - - - - - - - - - Predict G-protein coupled receptor (GPCR) coupling selectivity. - beta12orEarlier - - - - - - - - - - Structure processing (protein) - - true - 1.6 - beta12orEarlier - Process (read and / or write) a protein tertiary structure. - - - - - - - - - - Protein atom surface calculation - - Waters are not considered. - Calculate the solvent accessibility for each atom in a structure. - beta12orEarlier - 1.12 - true - - - - - - - - - - Protein residue surface calculation - - beta12orEarlier - true - Calculate the solvent accessibility for each residue in a structure. - 1.12 - - - - - - - - - - Protein surface calculation - - beta12orEarlier - Calculate the solvent accessibility of a structure as a whole. - 1.12 - true - - - - - - - - - - Sequence alignment processing - - beta12orEarlier - true - Process (read and / or write) a molecular sequence alignment. - 1.6 - - - - - - - - - - Protein-protein interaction prediction - - - - - - - - - - - - - - - Identify or predict protein-protein interactions, interfaces, binding sites etc. - beta12orEarlier - - - - - - - - - - Structure processing - - true - 1.6 - Process (read and / or write) a molecular tertiary structure. - beta12orEarlier - - - - - - - - - - Map annotation - - Annotate a DNA map of some type with terms from a controlled vocabulary. - true - beta12orEarlier - 1.6 - - - - - - - - - - Data retrieval (protein annotation) - - Retrieve information on a protein. 
- beta13 - true - Protein information retrieval - beta12orEarlier - - - - - - - - - - Data retrieval (phylogenetic tree) - - beta12orEarlier - beta13 - Retrieve a phylogenetic tree from a data resource. - true - - - - - - - - - - Data retrieval (protein interaction annotation) - - Retrieve information on a protein interaction. - true - beta13 - beta12orEarlier - - - - - - - - - - Data retrieval (protein family annotation) - - beta12orEarlier - Protein family information retrieval - beta13 - Retrieve information on a protein family. - true - - - - - - - - - - Data retrieval (RNA family annotation) - - true - Retrieve information on an RNA family. - RNA family information retrieval - beta12orEarlier - beta13 - - - - - - - - - - Data retrieval (gene annotation) - - beta12orEarlier - Gene information retrieval - Retrieve information on a specific gene. - true - beta13 - - - - - - - - - - Data retrieval (genotype and phenotype annotation) - - Retrieve information on a specific genotype or phenotype. - Genotype and phenotype information retrieval - beta12orEarlier - beta13 - true - - - - - - - - - - Protein architecture comparison - - - Compare the architecture of two or more protein structures. - beta12orEarlier - - - - - - - - - - Protein architecture recognition - - - - beta12orEarlier - Includes methods that try to suggest the most likely biological unit for a given protein X-ray crystal structure based on crystal symmetry and scoring of putative protein-protein interfaces. - Identify the architecture of a protein structure. - - - - - - - - - - Molecular dynamics simulation - - - - - - - - - - - - - - - - - - - - - - Simulate molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence analysis - - - - - - - - - - - - - - - Analyse a nucleic acid sequence (using methods that are only applicable to nucleic acid sequences). 
- beta12orEarlier - Sequence analysis (nucleic acid) - - - - - - - - - - Protein sequence analysis - - - - - - - - - Analyse a protein sequence (using methods that are only applicable to protein sequences). - Sequence analysis (protein) - beta12orEarlier - - - - - - - - - - Structure analysis - - - - - - - - beta12orEarlier - Analyse known molecular tertiary structures. - - - - - - - - - - Nucleic acid structure analysis - - - - - - - - - - - - - - - Analyse nucleic acid tertiary structural data. - beta12orEarlier - - - - - - - - - - Secondary structure processing - - 1.6 - Process (read and / or write) a molecular secondary structure. - true - beta12orEarlier - - - - - - - - - - Structure comparison - - - - - - - - - beta12orEarlier - Compare two or more molecular tertiary structures. - - - - - - - - - - Helical wheel drawing - - - - - - - - Helical wheel rendering - beta12orEarlier - Render a helical wheel representation of protein secondary structure. - - - - - - - - - - Topology diagram drawing - - - - - - - - Topology diagram rendering - beta12orEarlier - Render a topology diagram of protein secondary structure. - - - - - - - - - - Protein structure comparison - - - - - - - - - - beta12orEarlier - Structure comparison (protein) - Methods might identify structural neighbors, find structural similarities or define a structural core. - Compare protein tertiary structures. - - - - - - - - - - Protein secondary structure comparison - - - - Compare protein secondary structures. - beta12orEarlier - Secondary structure comparison (protein) - Protein secondary structure - - - - - - - - - - Protein subcellular localization prediction - - - - - - - - - The prediction might include subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or export (extracellular proteins) of a protein. - Predict the subcellular localization of a protein sequence. 
- Protein targeting prediction - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-residue) - - true - beta12orEarlier - Calculate contacts between residues in a protein structure. - 1.12 - - - - - - - - - - Hydrogen bond calculation (inter-residue) - - Identify potential hydrogen bonds between amino acid residues. - 1.12 - true - beta12orEarlier - - - - - - - - - - Protein interaction prediction - - - - - - - - - - - - - - - Predict the interactions of proteins with other molecules. - beta12orEarlier - - - - - - - - - - Codon usage data processing - - beta12orEarlier - beta13 - Process (read and / or write) codon usage data. - true - - - - - - - - - - Gene expression data analysis - - - - - - - - Gene expression (microarray) data processing - Gene expression profile analysis - beta12orEarlier - Microarray data processing - Gene expression data processing - Gene expression analysis - Process (read and / or write) gene expression (typically microarray) data, including analysis of one or more gene expression profiles, typically to interpret them in functional terms. - - - - - - - - - - Gene regulatory network processing - - 1.6 - beta12orEarlier - Process (read and / or write) a network of gene regulation. - true - - - - - - - - - - Pathway or network analysis - - - - - - - - Pathway analysis - Generate, process or analyse a biological pathway or network. - Network analysis - beta12orEarlier - - - - - - - - - - Sequencing-based expression profile data analysis - - Analyse SAGE, MPSS or SBS experimental data, typically to identify or quantify mRNA transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Splicing model analysis - - - - - - - - - - Analyse, characterize and model alternative splicing events from comparing multiple nucleic acid sequences. - Splicing analysis - beta12orEarlier - - - - - - - - - - Microarray raw data analysis - - beta12orEarlier - beta12orEarlier - true - Analyse raw microarray data. 
- - - - - - - - - - Nucleic acid analysis - - - - - - - - Process (read and / or write) nucleic acid sequence or structural data. - Nucleic acid data processing - beta12orEarlier - - - - - - - - - - Protein analysis - - - - - - - - beta12orEarlier - Protein data processing - Process (read and / or write) protein sequence or structural data. - - - - - - - - - - Sequence data processing - - beta12orEarlier - Process (read and / or write) molecular sequence data. - beta13 - true - - - - - - - - - - Structural data processing - - Process (read and / or write) molecular structural data. - beta13 - true - beta12orEarlier - - - - - - - - - - Text processing - - true - beta12orEarlier - Process (read and / or write) text. - 1.6 - - - - - - - - - - Protein sequence alignment analysis - - - - - - - - - - Analyse a protein sequence alignment, typically to detect features or make predictions. - beta12orEarlier - Sequence alignment analysis (protein) - - - - - - - - - - Nucleic acid sequence alignment analysis - - - - - - - - - - beta12orEarlier - Sequence alignment analysis (nucleic acid) - Analyse a nucleic acid sequence alignment, typically to detect features or make predictions. - - - - - - - - - - Nucleic acid sequence comparison - - - - Sequence comparison (nucleic acid) - Compare two or more nucleic acid sequences. - beta12orEarlier - - - - - - - - - - Protein sequence comparison - - - - beta12orEarlier - Sequence comparison (protein) - Compare two or more protein sequences. - - - - - - - - - - DNA back-translation - - - - - - - - beta12orEarlier - Back-translate a protein sequence into DNA. - - - - - - - - - - Sequence editing (nucleic acid) - - 1.8 - true - Edit or change a nucleic acid sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence editing (protein) - - Edit or change a protein sequence, either randomly or specifically. 
- beta12orEarlier - true - 1.8 - - - - - - - - - - Sequence generation (nucleic acid) - - Generate a nucleic acid sequence by some means. - beta12orEarlier - - - - - - - - - - Sequence generation (protein) - - - Generate a protein sequence by some means. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence visualisation - - Visualise, format or render a nucleic acid sequence. - true - Various nucleic acid sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - 1.8 - beta12orEarlier - - - - - - - - - - Protein sequence visualisation - - true - beta12orEarlier - Visualise, format or render a protein sequence. - 1.8 - Various protein sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - - - - - - - - - - Nucleic acid structure comparison - - - - Compare nucleic acid tertiary structures. - beta12orEarlier - Structure comparison (nucleic acid) - - - - - - - - - - Structure processing (nucleic acid) - - 1.6 - beta12orEarlier - true - Process (read and / or write) nucleic acid tertiary structure data. - - - - - - - - - - - DNA mapping - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Generate a map of a DNA sequence annotated with positional or non-positional features of some type. - - - - - - - - - - Map data processing - - DNA map data processing - Process (read and / or write) a DNA map of some type. - beta12orEarlier - true - 1.6 - - - - - - - - - - Protein hydropathy calculation - - - - - - - - - - - - - - beta12orEarlier - Analyse the hydrophobic, hydrophilic or charge properties of a protein (from analysis of sequence or structural information). 
- - - - - - - - - - Protein binding site prediction - - - - - - - - - beta12orEarlier - Active site prediction - Binding site prediction - Protein binding site detection - Ligand-binding site prediction - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein sequences or structures. - - - - - - - - - - Sequence tagged site (STS) mapping - - - - - - - - beta12orEarlier - Sequence mapping - An STS is a short subsequence of known sequence and location that occurs only once in the chromosome or genome that is being mapped. Sources of STSs include expressed sequence tags (ESTs), simple sequence length polymorphisms (SSLPs), and random genomic sequences from cloned genomic DNA or database sequences. - Generate a physical DNA map (sequence map) from analysis of sequence tagged sites (STS). - - - - - - - - - - Alignment - - - - - - - - - Compare two or more entities, typically the sequence or structure (or derivatives) of macromolecules, to identify equivalent subunits. - Alignment generation - beta12orEarlier - Alignment construction - - - - - - - - - - Protein fragment weight comparison - - - beta12orEarlier - Calculate the molecular weight of a protein (or fragments) and compare it to another protein or reference data. - - - - - - - - - - Protein property comparison - - - - - - - - Compare the physicochemical properties of two or more proteins (or reference data). - beta12orEarlier - - - - - - - - - - Secondary structure comparison - - - - - - - - Compare two or more molecular secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plotting - - beta12orEarlier - 1.12 - Generate a Hopp and Woods plot of antigenicity of a protein. - true - - - - - - - - - - Microarray cluster textual view generation - - beta12orEarlier - Visualise gene clusters with gene names. 
- - - - - - - - - - Microarray wave graph plotting - - Microarray wave graph rendering - Microarray cluster temporal graph rendering - beta12orEarlier - This view can be rendered as a pie graph. The distance matrix is sorted by cluster number and typically represented as a diagonal matrix with distance values displayed in different color shades. - Visualise clustered gene expression data as a set of waves, where each wave corresponds to a gene across samples on the X-axis. - - - - - - - - - - Microarray dendrograph plotting - - Microarray dendrograph rendering - Generate a dendrograph of raw, preprocessed or clustered microarray data. - beta12orEarlier - Microarray checks view rendering - Microarray view rendering - - - - - - - - - - Microarray proximity map plotting - - beta12orEarlier - Microarray distance map rendering - Generate a plot of distances (distance matrix) between genes. - Microarray proximity map rendering - - - - - - - - - - Microarray tree or dendrogram rendering - - Microarray 2-way dendrogram rendering - beta12orEarlier - Visualise clustered gene expression data using a gene tree, array tree and color coded band of gene expression. - Microarray matrix tree plot rendering - - - - - - - - - - Microarray principal component plotting - - beta12orEarlier - Microarray principal component rendering - Generate a line graph drawn as sum of principal components (Eigen value) and individual expression values. - - - - - - - - - - Microarray scatter plot plotting - - Generate a scatter plot of microarray data, typically after principal component analysis. - beta12orEarlier - Microarray scatter plot rendering - - - - - - - - - - Whole microarray graph plotting - - Visualise gene expression data where each band (or line graph) corresponds to a sample. 
- beta12orEarlier - Whole microarray graph rendering - - - - - - - - - - Microarray tree-map rendering - - beta12orEarlier - Visualise gene expression data after hierarchical clustering for representing hierarchical relationships. - - - - - - - - - - Microarray Box-Whisker plot plotting - - beta12orEarlier - Visualise raw and pre-processed gene expression data, via a plot showing over- and under-expression along with mean, upper and lower quartiles. - - - - - - - - - - Physical mapping - - - - - - - - - - - - - - beta12orEarlier - Generate a physical (sequence) map of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. - - - - - - - - - - Analysis - - Process or apply analytical methods to existing data of a specific type. - Processing - beta12orEarlier - - - - - - - - - - Alignment analysis - - Process or analyse an alignment of molecular sequences or structures. - true - beta12orEarlier - 1.8 - - - - - - - - - - Article analysis - - - - - - - - - - - - - - - - - - - - Analyse a body of scientific text (typically a full text article from a scientific journal.) - beta12orEarlier - - - - - - - - - - Molecular interaction analysis - - Analyse the interactions of two or more molecules (or parts of molecules) that are known to interact. - beta12orEarlier - beta13 - true - - - - - - - - - - Protein interaction analysis - - - - - - - - - - - - - - beta12orEarlier - Analyse known protein-protein, protein-DNA/RNA or protein-ligand interactions. 
- - - - - - - - - - Residue distance calculation - - WHATIF:HasNegativeIonContacts - Residue contact calculation (residue-ligand) - Residue contact calculation (residue-metal) - WHATIF:SymmetryContact - Residue contact calculation (residue-negative ion) - This includes identifying HET groups, which usually correspond to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids. Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. It can also include calculation of symmetry contacts, i.e. a contact between two atoms in different asymmetric unit. - WHATIF:HasMetalContactsPlus - Calculate contacts between residues, or between residues and other groups, in a protein structure, on the basis of distance calculations. - Residue contact calculation (residue-nucleic acid) - WHATIF: HETGroupNames - HET group detection - WHATIF:ShowDrugContacts - WHATIF:ShowLigandContacts - WHATIF:HasNucleicContacts - WHATIF:ShowDrugContactsShort - WHATIF:ShowProteiNucleicContacts - beta12orEarlier - WHATIF:HasMetalContacts - WHATIF:HasNegativeIonContactsPlus - - - - - - - - - - Alignment processing - - true - Process (read and / or write) an alignment of two or more molecular sequences, structures or derived data. - 1.6 - beta12orEarlier - - - - - - - - - - - Structure alignment processing - - Process (read and / or write) a molecular tertiary (3D) structure alignment. - 1.6 - beta12orEarlier - true - - - - - - - - - - Codon usage bias calculation - - - - - - - - Calculate codon usage bias. - beta12orEarlier - - - - - - - - - - Codon usage bias plotting - - - - - - - - - beta12orEarlier - Generate a codon usage bias plot. - - - - - - - - - - Codon usage fraction calculation - - - - - - - - Calculate the differences in codon usage fractions between two sequences, sets of sequences, codon usage tables etc. 
- beta12orEarlier - - - - - - - - - - Classification - - beta12orEarlier - Assign molecular sequences, structures or other biological data to a specific group or category according to qualities it shares with that group or category. - - - - - - - - - - Molecular interaction data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) molecular interaction data. - - - - - - - - - - Sequence classification - - - beta12orEarlier - Assign molecular sequence(s) to a group or category. - - - - - - - - - - Structure classification - - - Assign molecular structure(s) to a group or category. - beta12orEarlier - - - - - - - - - - Protein comparison - - Compare two or more proteins (or some aspect) to identify similarities. - beta12orEarlier - - - - - - - - - - Nucleic acid comparison - - beta12orEarlier - Compare two or more nucleic acids to identify similarities. - - - - - - - - - - Prediction and recognition (protein) - - beta12orEarlier - Predict, recognise, detect or identify some properties of proteins. - - - - - - - - - - Prediction and recognition (nucleic acid) - - beta12orEarlier - Predict, recognise, detect or identify some properties of nucleic acids. - - - - - - - - - - Structure editing - - - - - - - - beta13 - Edit, convert or otherwise change a molecular tertiary structure, either randomly or specifically. - - - - - - - - - - Sequence alignment editing - - Edit, convert or otherwise change a molecular sequence alignment, either randomly or specifically. - beta13 - - - - - - - - - - Pathway or network visualisation - - - - - - - - - Render (visualise) a biological pathway or network. - Pathway or network rendering - beta13 - - - - - - - - - - Protein function prediction (from sequence) - - beta13 - true - Predict general (non-positional) functional properties of a protein from analysing its sequence. - For functional properties that are positional, use 'Protein site detection' instead. 
- 1.6 - - - - - - - - - - Protein sequence feature detection - - - - Protein site recognition - Predict, recognise and identify functional or other key sites within protein sequences, typically by scanning for known motifs, patterns and regular expressions. - Protein site prediction - Sequence profile database search - Protein site detection - Protein secondary database search - Sequence feature detection (protein) - beta13 - - - - - - - - - - Protein property calculation (from sequence) - - - beta13 - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - - - - - - - - - - Protein feature prediction (from structure) - - beta13 - 1.6 - true - Predict, recognise and identify positional features in proteins from analysing protein structure. - - - - - - - - - - Protein feature detection - - - - - - - - - - - - - - - Features includes functional sites or regions, secondary structure, structural domains and so on. Methods might use fingerprints, motifs, profiles, hidden Markov models, sequence alignment etc to provide a mapping of a query protein sequence to a discriminatory element. This includes methods that search a secondary protein database (Prosite, Blocks, ProDom, Prints, Pfam etc.) to assign a protein sequence(s) to a known protein family or group. - - Predict, recognise and identify positional features in proteins from analysing protein sequences or structures. - beta13 - Protein feature recognition - Protein feature prediction - - - - - - - - - - Database search (by sequence) - - Sequence screening - true - 1.6 - Screen a molecular sequence(s) against a database (of some type) to identify similarities between the sequence and database entries. - beta13 - - - - - - - - - - Protein interaction network prediction - - - - - - - - - - - - - - beta13 - Predict a network of protein interactions. 
- - - - - - - - - - Nucleic acid design - - - beta13 - Design (or predict) nucleic acid sequences with specific chemical or physical properties. - - - - - - - - - - Editing - - beta13 - Edit a data entity, either randomly or specifically. - - - - - - - - - - Sequence assembly validation - - - - - - - - - - - - - - - - - - - - - Assembly quality evaluation - Assembly QC - Sequence assembly quality evaluation - Sequence assembly QC - Evaluate a DNA sequence assembly, typically for purposes of quality control. - 1.1 - - - - - - - - - - Genome alignment - - Align two or more (typically huge) molecular sequences that represent genomes. - Genome alignment construction - 1.1 - - - - - - - - - - Localized reassembly - - Reconstruction of a sequence assembly in a localised area. - 1.1 - - - - - - - - - - Sequence assembly visualisation - - Assembly rendering - Sequence assembly rendering - Render and visualise a DNA sequence assembly. - 1.1 - Assembly visualisation - - - - - - - - - - Base-calling - - - - - - - - Phred base calling - 1.1 - Identify base (nucleobase) sequence from a fluorescence 'trace' data generated by an automated DNA sequencer. - Base calling - Phred base-calling - - - - - - - - - - Bisulfite mapping - - 1.1 - Bisulfite mapping follows high-throughput sequencing of DNA which has undergone bisulfite treatment followed by PCR amplification; unmethylated cytosines are specifically converted to thymine, allowing the methylation status of cytosine in the DNA to be detected. - The mapping of methylation sites in a DNA (genome) sequence. - Bisulfite sequence alignment - Bisulfite sequence mapping - - - - - - - - - - Sequence contamination filtering - - - - - - - - beta12orEarlier - Identify and filter a (typically large) sequence data set to remove sequences from contaminants in the sample that was sequenced. - - - - - - - - - - Trim ends - - 1.1 - Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. 
- 1.12 - For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence. - true - - - - - - - - - - Trim vector - - true - Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - 1.12 - 1.1 - - - - - - - - - - Trim to reference - - true - 1.1 - 1.12 - Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - - - - - - - - - - Sequence trimming - - 1.1 - Cut (remove) the end from a molecular sequence. - Barcode sequence removal - Trim vector - Trimming - Trim ends - Trim to reference - This includes - -end trimming -Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. -For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence. - -trimming to a reference sequence, -Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - -vector trimming -Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - - - - - - - - - - - Genome feature comparison - - Genomic elements that might be compared include genes, indels, single nucleotide polymorphisms (SNPs), retrotransposons, tandem repeats and so on. - Compare the features of two genome sequences. - 1.1 - - - - - - - - - - Sequencing error detection - - - - - - - - Short read error correction - Short-read error correction - beta12orEarlier - Detect errors in DNA sequences generated from sequencing projects. 
- - - - - - - - - - Genotyping - - 1.1 - Methods might consider cytogenetic analyses, copy number polymorphism (and calculate copy number calls for copy-number variation (CNV) regions), single nucleotide polymorphism (SNP), rare copy number variation (CNV) identification, loss of heterozygosity data and so on. - Analyse DNA sequence data to identify differences between the genetic composition (genotype) of an individual compared to other individuals or a reference sequence. - - - - - - - - - - Genetic variation analysis - - - 1.1 - Sequence variation analysis - Genetic variation annotation provides contextual interpretation of coding SNP consequences in transcripts. It allows comparisons to be made between variation data in different populations or strains for the same transcript. - Genetic variation annotation - Analyse a genetic variation, for example to annotate its location, alleles, classification, and effects on individual transcripts predicted for a gene model. - - - - - - - - - - Read mapping - - - Short oligonucleotide alignment - Oligonucleotide mapping - Oligonucleotide alignment generation - Short read mapping - Oligonucleotide alignment construction - The purpose of read mapping is to identify the location of sequenced fragments within a reference genome and assumes that there is, in fact, at least local similarity between the fragment and reference sequences. - Oligonucleotide alignment - Read alignment - 1.1 - Short read alignment - Align short oligonucleotide sequences (reads) to a larger (genomic) sequence. - Short sequence read mapping - - - - - - - - - - Split read mapping - - A variant of oligonucleotide mapping where a read is mapped to two separate locations because of possible structural variation. - 1.1 - - - - - - - - - - Community profiling - - - Analyse DNA sequences in order to identify a DNA 'barcode'; marker genes or any short fragment(s) of DNA that are useful to diagnose the taxa of biological organisms. 
- 1.1 - DNA barcoding - Sample barcoding - - - - - - - - - - SNP calling - - Identify single nucleotide change in base positions in sequencing data that differ from a reference genome and which might, especially by reference to population frequency or functional data, indicate a polymorphism. - Operations usually score confidence in the prediction or some other statistical measure of evidence. - 1.1 - - - - - - - - - - Polymorphism detection - - Polymorphism detection - Detect mutations in multiple DNA sequences, for example, from the alignment and comparison of the fluorescent traces produced by DNA sequencing hardware. - 1.1 - Mutation detection - - - - - - - - - - Chromatogram visualisation - - Visualise, format or render an image of a Chromatogram. - Chromatogram viewing - 1.1 - - - - - - - - - - Methylation analysis - - 1.1 - Determine cytosine methylation states in nucleic acid sequences. - - - - - - - - - - Methylation calling - - - 1.1 - Determine cytosine methylation status of specific positions in a nucleic acid sequences. - - - - - - - - - - Methylation level analysis (global) - - 1.1 - Global methylation analysis - Measure the overall level of methyl cytosines in a genome from analysis of experimental data, typically from chromatographic methods and methyl accepting capacity assay. - - - - - - - - - - Methylation level analysis (gene-specific) - - Gene-specific methylation analysis - Many different techniques are available for this. - Measure the level of methyl cytosines in specific genes. - 1.1 - - - - - - - - - - Genome visualisation - - 1.1 - Genome visualization - Visualise, format or render a nucleic acid sequence that is part of (and in context of) a complete genome sequence. - Genome rendering - Genome browser - Genome viewing - Genome browsing - - - - - - - - - - Genome comparison - - Compare the sequence or features of two or more genomes, for example, to find matching regions. 
- 1.1 - Genomic region matching - - - - - - - - - - Genome indexing - - - - - - - - Genome indexing (Burrows-Wheeler) - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. A suffix array consists of the lexicographically sorted list of suffixes of a genome. - Genome indexing (suffix arrays) - Generate an index of a genome sequence. - Suffix arrays - Burrows-Wheeler - 1.1 - - - - - - - - - - Genome indexing (Burrows-Wheeler) - - The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. - 1.12 - true - Generate an index of a genome sequence using the Burrows-Wheeler algorithm. - 1.1 - - - - - - - - - - Genome indexing (suffix arrays) - - 1.1 - Generate an index of a genome sequence using a suffix array algorithm. - A suffix array consists of the lexicographically sorted list of suffixes of a genome. - true - 1.12 - Suffix arrays - - - - - - - - - - Spectral analysis - - - - - - - - 1.1 - Analyse one or more spectra from mass spectrometry (or other) experiments. - Spectrum analysis - Mass spectrum analysis - - - - - - - - - - Peak detection - - - - - - - - 1.1 - Peak finding - Peak assignment - Identify peaks in a spectrum from a mass spectrometry, NMR, or some other spectrum-generating experiment. - - - - - - - - - - Scaffolding - - - - - - - - - Scaffold construction - Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contigs; contiguous sequences corresponding to read overlaps. - 1.1 - Scaffold may be positioned along a chromosome physical map to create a "golden path". 
- Scaffold generation - - - - - - - - - - Scaffold gap completion - - Fill the gaps in a sequence assembly (scaffold) by merging in additional sequences. - Different techniques are used to generate gap sequences to connect contigs, depending on the size of the gap. For small (5-20kb) gaps, PCR amplification and sequencing is used. For large (>20kb) gaps, fragments are cloned (e.g. in BAC (Bacterial artificial chromosomes) vectors) and then sequenced. - 1.1 - - - - - - - - - - Sequencing quality control - - - Raw sequence data quality control. - Analyse raw sequence data from a sequencing pipeline and identify (and possibly fix) problems. - Sequencing QC - 1.1 - - - - - - - - - - Read pre-processing - - - Sequence read pre-processing - Pre-process sequence reads to ensure (or improve) quality and reliability. - For example process paired end reads to trim low quality ends, remove short sequences, identify sequence inserts, detect chimeric reads, or remove low quality sequences including vector, adaptor, low complexity and contaminant sequences. Sequences might come from genomic DNA library, EST libraries, SSH library and so on. - 1.1 - - - - - - - - - - Species frequency estimation - - - - - - - - Estimate the frequencies of different species from analysis of the molecular sequences, typically of DNA recovered from environmental samples. - 1.1 - - - - - - - - - - Peak calling - - Peak-pair calling - Chip-sequencing combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to generate a set of reads, which are aligned to a genome sequence. The enriched areas contain the binding sites of DNA-associated proteins. For example, a transcription factor binding site. ChIP-on-chip in contrast combines chromatin immunoprecipitation ('ChIP') with microarray ('chip'). "Peak-pair calling" is similar to "Peak calling" in the context of ChIP-exo. 
- Identify putative protein-binding regions in a genome sequence from analysis of Chip-sequencing data or ChIP-on-chip data. - Protein binding peak detection - 1.1 - - - - - - - - - - Differential expression analysis - - Identify (typically from analysis of microarray or RNA-seq data) genes whose expression levels are significantly different between two sample groups. - Differentially expressed gene identification - Differential expression analysis is used, for example, to identify which genes are up-regulated (increased expression) or down-regulated (decreased expression) between a group treated with a drug and a control groups. - 1.1 - - - - - - - - - - Gene set testing - - 1.1 - Gene sets can be defined beforehand by biological function, chromosome locations and so on. - Analyse gene expression patterns (typically from DNA microarray datasets) to identify sets of genes that are associated with a specific trait, condition, clinical outcome etc. - - - - - - - - - - Variant classification - - - Classify variants based on their potential effect on genes, especially functional effects on the expressed proteins. - 1.1 - Variants are typically classified by their position (intronic, exonic, etc.) in a gene transcript and (for variants in coding exons) by their effect on the protein sequence (synonymous, non-synonymous, frameshifting, etc.) - - - - - - - - - - Variant prioritization - - Variant prioritization can be used for example to produce a list of variants responsible for 'knocking out' genes in specific genomes. Methods amino acid substitution, aggregative approaches, probabilistic approach, inheritance and unified likelihood-frameworks. - Identify biologically interesting variants by prioritizing individual variants, for example, homozygous variants absent in control genomes. 
- 1.1 - - - - - - - - - - Variant calling - - Allele calling - Somatic variant calling - Germ line variant calling - Somatic variant calling is the detection of variations established in somatic cells and hence not inherited as a germ line variant. - Methods often utilise a database of aligned reads. - Variant mapping - 1.1 - Variant detection - Identify and map genomic alterations, including single nucleotide polymorphisms, short indels and structural variants, in a genome sequence. - - - - - - - - - - Structural variation discovery - - Detect large regions in a genome subject to copy-number variation, or other structural variations in genome(s). - 1.1 - Methods might involve analysis of whole-genome array comparative genome hybridization or single-nucleotide polymorphism arrays, paired-end mapping of sequencing data, or from analysis of short reads from new sequencing technologies. - - - - - - - - - - Exome assembly - Exome analysis - - 1.1 - Exome sequence analysis - Analyse sequencing data from experiments aiming to selectively sequence the coding regions of the genome. - - - - - - - - - - Read depth analysis - - 1.1 - Analyse mapping density (read depth) of (typically) short reads from sequencing platforms, for example, to detect deletions and duplications. - - - - - - - - - - Gene expression QTL analysis - - - - - - - - expression quantitative trait loci profiling - 1.1 - eQTL profiling - Combine classical quantitative trait loci (QTL) analysis with gene expression profiling, for example, to describe cis- and trans-controlling elements for the expression of phenotype associated genes. - expression QTL profiling - - - - - - - - - - Copy number estimation - - Methods typically implement some statistical model for hypothesis testing, and methods estimate total copy number, i.e. do not distinguish the two inherited chromosomes quantities (specific copy number). 
- Transcript copy number estimation - 1.1 - Estimate the number of copies of loci of particular gene(s) in DNA sequences typically from gene-expression profiling technology based on microarray hybridization-based experiments. For example, estimate copy number (or marker dosage) of a dominant marker in samples from polyploid plant cells or tissues, or chromosomal gains and losses in tumors. - - - - - - - - - - Primer removal - - 1.2 - Remove forward and/or reverse primers from nucleic acid sequences (typically PCR products). - Adapter removal - - - - - - - - - - Transcriptome assembly - - - - - - - - - - - - - - Infer a transcriptome sequence by analysis of short sequence reads. - 1.2 - - - - - - - - - - Transcriptome assembly (de novo) - - de novo transcriptome assembly - true - 1.6 - 1.2 - Infer a transcriptome sequence without the aid of a reference genome, i.e. by comparing short sequences (reads) to each other. - - - - - - - - - - Transcriptome assembly (mapping) - - Infer a transcriptome sequence by mapping short reads to a reference genome. - 1.6 - 1.2 - true - - - - - - - - - - Sequence coordinate conversion - - - - - - - - - - - - - - 1.3 - Convert one set of sequence coordinates to another, e.g. convert coordinates of one assembly to another, cDNA to genomic, CDS to genomic, protein translation to genomic etc. - - - - - - - - - - Document similarity calculation - - Calculate similarity between 2 or more documents. - 1.3 - - - - - - - - - - Document clustering - - - Cluster (group) documents on the basis of their calculated similarity. - 1.3 - - - - - - - - - - Named entity recognition - - - Entity identification - Entity chunking - Entity extraction - Recognise named entities (text tokens) within documents. - 1.3 - - - - - - - - - - ID mapping - - - Identifier mapping - The mapping can be achieved by comparing identifier values or some other means, e.g. exact matches to a provided sequence. 
- 1.3 - Accession mapping - Map data identifiers to one another for example to establish a link between two biological databases for the purposes of data integration. - - - - - - - - - - Anonymisation - - Process data in such a way that makes it hard to trace to the person which the data concerns. - 1.3 - Data anonymisation - - - - - - - - - - ID retrieval - - - - - - - - id retrieval - Data retrieval (accession) - Data retrieval (ID) - Identifier retrieval - Data retrieval (id) - Accession retrieval - Search for and retrieve a data identifier of some kind, e.g. a database entry accession. - 1.3 - - - - - - - - - - Sequence checksum generation - - - - - - - - - - - - - - Generate a checksum of a molecular sequence. - 1.4 - - - - - - - - - - Bibliography generation - - - - - - - - Bibliography construction - Construct a bibliography from the scientific literature. - 1.4 - - - - - - - - - - Protein quaternary structure prediction - - 1.4 - Predict the structure of a multi-subunit protein and particularly how the subunits fit together. - - - - - - - - - - Molecular surface analysis - - - - - - - - - - - - - - 1.4 - Analyse the surface properties of proteins or other macromolecules, including surface accessible pockets, interior inaccessible cavities etc. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - 1.9 - - - - - - - - - - Format detection - - - - - - - - - - - - - - Recognition of which format the given data is in. - 1.4 - Format identification - Format recognition - 'Format recognition' is not a bioinformatics-specific operation, but of great relevance in bioinformatics. Should be removed from EDAM if/when captured satisfactorily in a suitable domain-generic ontology. 
- Format inference - - - - - - The has_input "Data" (data_0006) may cause visualisation or other problems although ontologically correct. But on the other hand it may be useful to distinguish from nullary operations without inputs. - - - - - - - - - - - Splitting - - File splitting - Split a file containing multiple data items into many files, each containing one item - 1.4 - - - - - - - - - - Generation - - Construction - beta12orEarlier - For non-analytical operations, see the 'Processing' branch. - Construct some data entity. - - - - - - - - - - Nucleic acid sequence feature detection - - - Nucleic acid site prediction - Predict, recognise and identify functional or other key sites within nucleic acid sequences, typically by scanning for known motifs, patterns and regular expressions. - Nucleic acid site recognition - 1.6 - Nucleic acid site detection - - - - - - - - - - Deposition - - Deposit some data in a database or some other type of repository or software system. - 1.6 - Database submission - Submission - Data submission - Data deposition - Database deposition - For non-analytical operations, see the 'Processing' branch. - - - - - - - - - - Clustering - - 1.6 - Group together some data entities on the basis of similarities such that entities in the same group (cluster) are more similar to each other than to those in other groups (clusters). - - - - - - - - - - Assembly - - 1.6 - Construct some entity (typically a molecule sequence) from component pieces. - - - - - - - - - - Conversion - - Convert a data set from one form to another. - 1.6 - - - - - - - - - - Standardization and normalization - - Normalization - 1.6 - Standardization - Standardize or normalize data. - - - - - - - - - - Aggregation - - Combine multiple files or data items into a single file or object. - 1.6 - - - - - - - - - - Article comparison - - Compare two or more scientific articles. 
- 1.6 - - - - - - - - - - Calculation - - Mathematical determination of the value of something, typically a property of a molecule. - 1.6 - - - - - - - - - - Pathway or network prediction - - - 1.6 - Predict a molecular pathway or network. - - - - - - - - - - Genome assembly - - 1.12 - 1.6 - The process of assembling many short DNA sequences together such that they represent the original chromosomes from which the DNA originated. - true - - - - - - - - - - - Plotting - - Generate a graph, or other visual representation, of data, showing the relationship between two or more variables. - 1.6 - - - - - - - - - - Image analysis - - - - - - - - 1.7 - The analysis of an image (typically a digital image) of some type in order to extract information from it. - Image processing - - - - - - - - - - - Diffraction data analysis - - 1.7 - Analysis of data from a diffraction experiment. - - - - - - - - - - Cell migration analysis - - - - - - - - 1.7 - Analysis of cell migration images in order to study cell migration, typically in order to study the processes that play a role in the disease progression. - - - - - - - - - - Diffraction data reduction - - 1.7 - Processing of diffraction data into a corrected, ordered, and simplified form. - - - - - - - - - - Neurite measurement - - - - - - - - Measurement of neurites; projections (axons or dendrites) from the cell body of a neuron, from analysis of neuron images. - 1.7 - - - - - - - - - - Diffraction data integration - - 1.7 - Diffraction summation integration - Diffraction profile fitting - The evaluation of diffraction intensities and integration of diffraction maxima from a diffraction experiment. - - - - - - - - - - Phasing - - Phase a macromolecular crystal structure, for example by using molecular replacement or experimental phasing methods. 
- 1.7 - - - - - - - - - - Molecular replacement - - 1.7 - A technique used to construct an atomic model of an unknown structure from diffraction data, based upon an atomic model of a known structure, either a related protein or the same protein from a different crystal form. - The technique solves the phase problem, i.e. retrieves information concerning the phases of the structure. - - - - - - - - - - Rigid body refinement - - 1.7 - Rigid body refinement usually follows molecular replacement in the assignment of a structure from diffraction data. - A method used to refine a structure by moving the whole molecule or parts of it as a rigid unit, rather than moving individual atoms. - - - - - - - - - - Single particle analysis - - - - - - - - - An image processing technique that combines and analyses multiple images of a particulate sample, in order to produce an image with clearer features that are more easily interpreted. - 1.7 - Single particle analysis is used to improve the information that can be obtained by relatively low resolution techniques, e.g. an image of a protein or virus from transmission electron microscopy (TEM). - - - - - - - - - - Single particle alignment and classification - - - Compare (align and classify) multiple particle images from a micrograph in order to produce a representative image of the particle. - 1.7 - A micrograph can include particles in multiple different orientations and/or conformations. Particles are compared and organised into sets based on their similarity. Typically iterations of classification and alignment are performed to optimise the final image; average images produced by classification are used as a reference image for subsequent alignment of the whole image set. - - - - - - - - - - Functional clustering - - - - - - - - 1.7 - Clustering of molecular sequences on the basis of their function, typically using information from an ontology of gene function, or some other measure of functional phenotype. 
- Functional sequence clustering - - - - - - - - - - Taxonomic classification - - Taxonomy assignment - Classification (typically of molecular sequences) by assignment to some taxonomic hierarchy. - 1.7 - - - - - - - - - - Virulence prediction - - - - - - - - - Pathogenicity prediction - The prediction of the degree of pathogenicity of a microorganism from analysis of molecular sequences. - 1.7 - - - - - - - - - - Gene expression correlation analysis - - - 1.7 - Gene co-expression network analysis - Analyse the correlation patterns among genes across a variety of experiments, microarray samples etc. - - - - - - - - - - - Correlation - - - - - - - - 1.7 - Identify a correlation, i.e. a statistical relationship between two random variables or two sets of data. - - - - - - - - - - RNA structure covariance model generation - - - - - - - - - Compute the covariance model for (a family of) RNA secondary structures. - 1.7 - - - - - - - - - - RNA secondary structure prediction (shape-based) - - RNA shape prediction - Predict RNA secondary structure by analysis, e.g. probabilistic analysis, of the shape of RNA folds. - 1.7 - - - - - - - - - - Nucleic acid folding prediction (alignment-based) - - 1.7 - Prediction of nucleic-acid folding using sequence alignments as a source of data. - - - - - - - - - - k-mer counting - - Count k-mers (substrings of length k) in DNA sequence data. - 1.7 - k-mer counting is used in genome and transcriptome assembly, metagenomic sequencing, and for error correction of sequence reads. - - - - - - - - - - Phylogenetic tree reconstruction - - - - - - - - Reconstructing the inner node labels of a phylogenetic tree from its leaves. - Note that this is somewhat different from simply analysing an existing tree or constructing a completely new one. - 1.7 - - - - - - - - - - Probabilistic data generation - - Generate some data from a chosen probabilistic model, possibly to evaluate algorithms. 
- 1.7 - - - - - - - - - - Probabilistic sequence generation - - - 1.7 - Generate sequences from some probabilistic model, e.g. a model that simulates evolution. - - - - - - - - - - Antimicrobial resistance prediction - - - - - - - - - 1.7 - Identify or predict causes for antibiotic resistance from molecular sequence analysis. - - - - - - - - - - Enrichment - - - - - - - - - A relevant ontology will be used. The input is typically a set of identifiers or other data, and the output of the analysis is typically a ranked list of ontology terms, each associated with a p-value. - Term enrichment - 1.8 - Analyse a dataset with respect to concepts from an ontology. - - - - - - - - - - Chemical class enrichment - - - - - - - - - 1.8 - Analyse a dataset with respect to concepts from an ontology of chemical structure. - - - - - - - - - - Incident curve plotting - - 1.8 - Plot an incident curve such as a survival curve, death curve, mortality curve. - - - - - - - - - - Variant pattern analysis - - Methods often utilise a database of aligned reads. - Identify and map patterns of genomic variations. - 1.8 - - - - - - - - - - Mathematical modelling - - 1.12 - Model some biological system using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - true - beta12orEarlier - - - - - - - - - - Microscope image visualisation - - - - - - - - Visualise images resulting from various types of microscopy. - 1.9 - Microscopy image visualisation - - - - - - - - - - Image annotation - - 1.9 - Annotate an image of some sort, typically with terms from a controlled vocabulary. - - - - - - - - - - Imputation - - Data imputation - Replace missing data with substituted values, usually by using some statistical or other mathematical approach. - 1.9 - - - - - - - - - - Ontology visualisation - - 1.9 - Visualise, format or render data from an ontology, typically a tree of terms. 
- Ontology browsing - - - - - - - - - - Maximum occurence analysis - - A method for making numerical assessments about the maximum percent of time that a conformer of a flexible macromolecule can exist and still be compatible with the experimental data. - beta12orEarlier - - - - - - - - - - Database comparison - - - 1.9 - Data model comparison - Compare the models or schemas used by two or more databases, or any other general comparison of databases rather than a detailed comparison of the entries themselves. - Schema comparison - - - - - - - - - - Network simulation - - - - - - - - Simulate the behaviour of a biological pathway or network. - Pathway simulation - Network topology simulation - 1.9 - - - - - - - - - - RNA-seq read count analysis - - Analyze read counts from RNA-seq experiments. - 1.9 - - - - - - - - - - Chemical redundancy removal - - 1.9 - Identify and remove redundancy from a set of small molecule structures. - - - - - - - - - - RNA-seq time series data analysis - - 1.9 - Analyze time series data from an RNA-seq experiment. - - - - - - - - - - Simulated gene expression data generation - - 1.9 - Simulate gene expression data, e.g. for purposes of benchmarking. - - - - - - - - - - Relationship inference - - - - - - - - - - - - - - - - - - - - 1.12 - Identify semantic relationships within a text or between two or more texts using text mining techniques. - - - - - - - - - - Mass spectra calibration - - - - - - - - Re-adjust the output of mass spectrometry experiments with shifted ppm values. - 1.12 - - - - - - - - - - Chromatographic alignment - - - - - - - - Align multiple data sets using information from chromatography and/or peptide identification, from mass spectrometry experiments. - 1.12 - - - - - - - - - - Deisotoping - - - - - - - - The removal of isotope peaks in a spectrum, to represent the fragment ion as one data point. 
- Deconvolution - 1.12 - Deisotoping is commonly done to reduce complexity, and done in conjunction with the charge state deconvolution. - - - - - - - - - - Quantification - - - - - - - - Technique for determining the amount of proteins in a sample. - 1.12 - Quantitation - - - - - - - - - - Peptide identification - - - - - - - - Peptide-spectrum-matching - Determination of peptide sequence from mass spectrum. - 1.12 - - - - - - - - - - Isotopic distributions calculation - - - - - - - - - - - - - - Peptide-spectrum-matching - Predict the isotope distribution of a given chemical species. - 1.12 - - - - - - - - - - Retention times calculation - - Prediction of retention times in a mass spectrometry experiment based on compositional and structural properties of the separated species. - 1.12 - - - - - - - - - - Label-free quantification - - 1.12 - Quantification without the use of chemical tags. - - - - - - - - - - Labeled quantification - - 1.12 - Quantification based on the use of chemical tags. - - - - - - - - - - MRM/SRM - - 1.12 - Quantification by Selected/multiple Reaction Monitoring workflow (XIC quantitation of precursor / fragment mass pair). - - - - - - - - - - Spectral counting - - 1.12 - Calculate number of identified MS2 spectra as approximation of peptide / protein quantity. - - - - - - - - - - SILAC - - Quantification analysis using stable isotope labeling by amino acids in cell culture. - 1.12 - - - - - - - - - - iTRAQ - - 1.12 - Quantification analysis using the AB SCIEX iTRAQ isobaric labelling workflow, wherein 2-8 reporter ions are measured in MS2 spectra near 114 m/z. - - - - - - - - - - 18O labeling - - 1.12 - Quantification analysis using labeling based on 18O-enriched H2O. - - - - - - - - - - TMT-tag - - 1.12 - Quantification analysis using the Thermo Fisher tandem mass tag labelling workflow. 
- - - - - - - - - - Dimethyl - - 1.12 - Quantification analysis using chemical labeling by stable isotope dimethylation - - - - - - - - - - Tag-based peptide identification - - Peptide sequence tags are used as piece of information about a peptide obtained by tandem mass spectrometry. - 1.12 - - - - - - - - - - de Novo sequencing - - - Analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum (MS/MS) without the assistance of a sequence database. - 1.12 - - - - - - - - - - PTM identification - - Identification of post-translational modifications (PTMs) of peptides/proteins in mass spectrum. - 1.12 - - - - - - - - - - Peptide database search - - - 1.12 - Determination of best matches between MS/MS spectrum and a database of protein or nucleic acid sequences. - - - - - - - - - - Blind peptide database search - - Modification-tolerant peptide database search - Unrestricted peptide database search - 1.12 - Peptide database search for identification of known and unknown PTMs looking for mass difference mismatches. - - - - - - - - - - Validation of peptide-spectrum matches - - - Statistical estimation of false discovery rate from score distribution for peptide-spectrum-matches, following a peptide database search. - 1.12 - - - - - - - - - - Target-Decoy - - Estimation of false discovery rate by comparison to search results with a database containing incorrect information. - 1.12 - - - - - - - - - - Statistical inference - - 1.12 - Empirical Bayes - Analyse data in order to deduce properties of an underlying distribution or population. - - - - - - - - - - Regression analysis - - A statistical calculation to estimate the relationships among variables. - Regression - 1.12 - - - - - - - - - - Metabolic network modelling - - - - - - - - Model a metabolic network, for example, to reconstruct pathways or to simulate metabolism. 
- Metabolic reconstruction - Metabolic network reconstruction - Metabolic network simulation - 1.12 - - - - - - - - - - SNP annotation - - Predict the effect or function of an individual single nucleotide polymorphism (SNP). - 1.12 - - - - - - - - - - Ab-initio gene prediction - - Prediction of genes or gene components from first principles, i.e. without reference to existing genes. - 1.12 - Gene prediction (ab-initio) - - - - - - - - - - Homology-based gene prediction - - Gene prediction (homology-based) - Prediction of genes or gene components by reference to homologous genes. - 1.12 - - - - - - - - - - Statistical modelling - - 1.12 - Construction of a statistical model, or a set of assumptions around some observed data, usually by describing a set of probability distributions which approximate the distribution of data. - - - - - - - - - - Molecular surface comparison - - - 1.12 - Compare two or more molecular surfaces. - - - - - - - - - - Gene functional annotation - - 1.12 - Annotate one or more sequences with functional information, such as cellular processes or metabolic pathways, by reference to a controlled vocabulary - invariably the Gene Ontology (GO). - - - - - - - - - - Variant filtering - - - 1.12 - Variant filtering is used to eliminate false positive variants based for example on base calling quality, strand and position information, and mapping info. - - - - - - - - - - Differential binding analysis - - 1.12 - Differential binding analysis identifies binding sites in nucleic acid sequences that are statistically significantly differentially bound between sample groups. - - - - - - - - - - RNA-Seq analysis - - Analyze data from RNA-seq experiments. - 1.13 - - - - - - - - - - Mass spectrum visualisation - - 1.1 - Visualise, format or render a mass spectrum. - - - - - - - - - - Filtering - - Filter a set of files or data items according to some property. 
- 1.13 - Sequence filtering - - - - - - - - - - Topic - - http://purl.org/biotop/biotop.owl#Quality - http://bioontology.org/ontologies/ResearchArea.owl#Area_of_Research - http://www.onto-med.de/ontologies/gfo.owl#Category - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. Topics have no clearly defined borders between each other. - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - sumo:FieldOfStudy - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - - - - - - - - - - Nucleic acid analysis - - The processing and analysis of nucleic acid sequence, structural and other data. - Nucleic acid bioinformatics - Nucleic acids - Nucleic acid informatics - http://purl.bioontology.org/ontology/MSH/D017423 - Nucleic acid properties - Nucleic acid physicochemistry - http://purl.bioontology.org/ontology/MSH/D017422 - true - beta12orEarlier - - - - - - - - - - Protein analysis - - Protein informatics - Proteins - http://purl.bioontology.org/ontology/MSH/D020539 - Protein bioinformatics - Protein databases - true - beta12orEarlier - Archival, processing and analysis of protein data, typically molecular sequence and structural data. - - - - - - - - - - Metabolites - - 1.13 - true - The structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. - beta12orEarlier - - - - - - - - - - Sequence analysis - - true - beta12orEarlier - Sequence databases - Sequences - http://purl.bioontology.org/ontology/MSH/D017421 - The archival, processing and analysis of molecular sequences (monomer composition of polymers) including molecular sequence data resources, sequence sites, alignments, motifs and profiles. 
- - - - - - - - - - - Structure analysis - - Computational structural biology - true - The curation, processing and analysis of the structure of biological molecules, typically proteins and nucleic acids and other macromolecules. - http://purl.bioontology.org/ontology/MSH/D015394 - Structural bioinformatics - Structure databases - This includes related concepts such as structural properties, alignments and structural motifs. - Structure data resources - beta12orEarlier - - - - - - - - - - - Structure prediction - - Protein fold recognition - The prediction of molecular structure, including the prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features, and the folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. - - - Nucleic acid structure prediction - beta12orEarlier - Protein structure prediction - true - DNA structure prediction - Nucleic acid design - Nucleic acid folding - RNA structure prediction - This includes the recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s), for example by threading, or the alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - - - - - - - - - - Alignment - - beta12orEarlier - true - The alignment (equivalence between sites) of molecular sequences, structures or profiles (representing a sequence or structure alignment). - beta12orEarlier - - - - - - - - - - - Phylogeny - - - Phylogeny reconstruction - Phylogenetic stratigraphy - beta12orEarlier - Phylogenetic dating - Phylogenetic clocks - true - http://purl.bioontology.org/ontology/MSH/D010802 - The study of evolutionary relationships amongst organisms. 
- Phylogenetic simulation - This includes diverse phylogenetic methods, including phylogenetic tree construction, typically from molecular sequence or morphological data, methods that simulate DNA sequence evolution, a phylogenetic tree or the underlying data, or which estimate or use molecular clock and stratigraphic (age) data, methods for studying gene evolution etc. - - - - - - - - - - - Functional genomics - - - beta12orEarlier - true - The study of gene or protein functions and their interactions in totality in a given organism, tissue, cell etc. - - - - - - - - - - - Ontology and terminology - - true - Terminology - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D002965 - Applied ontology - Ontology - The conceptualisation, categorisation and nomenclature (naming) of entities or phenomena within biology or bioinformatics. This includes formal ontologies, controlled vocabularies, structured glossary, symbols and terminology or other related resource. - Ontologies - - - - - - - - - - - Information retrieval - - beta12orEarlier - 1.13 - true - The search and query of data sources (typically databases or ontologies) in order to retrieve entries or other information. - VT 1.3.3 Information retrieval - - - - - - - - - - Bioinformatics - - This includes data processing in general, including basic handling of files and databases, datatypes, workflows and annotation. - VT 1.5.6 Bioinformatics - The archival, curation, processing and analysis of complex biological data. - http://purl.bioontology.org/ontology/MSH/D016247 - beta12orEarlier - true - - - - - - - - - - - Data visualisation - - Data rendering - Rendering (drawing on a computer screen) or visualisation of molecular sequences, structures or other biomolecular data. - true - VT 1.2.5 Computer graphics - beta12orEarlier - Computer graphics - - - - - - - - - - Nucleic acid thermodynamics - - true - The study of the thermodynamic properties of a nucleic acid. 
- 1.3 - - - - - - - - - - Nucleic acid structure analysis - - - Includes secondary and tertiary nucleic acid structural data, nucleic acid thermodynamic, thermal and conformational properties including DNA or DNA/RNA denaturation (melting) etc. - DNA melting - Nucleic acid denaturation - RNA alignment - The archival, curation, processing and analysis of nucleic acid structural information, such as whole structures, structural features and alignments, and associated annotation. - beta12orEarlier - RNA structure alignment - Nucleic acid structure - Nucleic acid thermodynamics - RNA structure - - - - - - - - - - RNA - - beta12orEarlier - Small RNA - RNA sequences and structures. - - - - - - - - - - Nucleic acid restriction - - 1.3 - beta12orEarlier - Topic for the study of restriction enzymes, their cleavage sites and the restriction of nucleic acids. - true - - - - - - - - - - Mapping - - The mapping of complete (typically nucleotide) sequences. Mapping (in the sense of short read alignment, or more generally, just alignment) has application in RNA-Seq analysis (mapping of transcriptomics reads), variant discovery (e.g. mapping of exome capture), and re-sequencing (mapping of WGS reads). - Genetic linkage - Linkage - Linkage mapping - true - Synteny - This includes resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. It also includes resources for physical (sequence) maps of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. It also covers for example the alignment of sequences of (typically millions) of short reads to a reference genome. 
- DNA mapping - beta12orEarlier - - - - - - - - - Genetic codes and codon usage - - beta12orEarlier - true - 1.3 - Codon usage analysis - The study of codon usage in nucleotide sequence(s), genetic codes and so on. - - - - - - - - - - Protein expression - - Translation - The translation of mRNA into protein and subsequent protein processing in the cell. - beta12orEarlier - - - - - - - - - - - Gene finding - - 1.3 - This includes the study of promoters, coding regions, splice sites, etc. Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - Gene discovery - Methods that aim to identify, predict, model or analyse genes or gene structure in DNA sequences. - beta12orEarlier - Gene prediction - true - - - - - - - - - - Transcription - - 1.3 - The transcription of DNA into mRNA. - beta12orEarlier - true - - - - - - - - - - Promoters - - true - beta12orEarlier - Promoters in DNA sequences (region of DNA that facilitates the transcription of a particular gene by binding RNA polymerase and transcription factor proteins). - beta13 - - - - - - - - - - Nucleic acid folding - - beta12orEarlier - The folding (in 3D space) of nucleic acid molecules. - true - beta12orEarlier - - - - - - - - - - Gene structure - - This includes the study of promoters, coding regions etc. - beta12orEarlier - Fusion genes - Gene features - true - Gene structure, regions which make an RNA product and features such as promoters, coding regions, gene fusion, splice sites etc. - - This includes operons (operators, promoters and genes) from a bacterial genome. For example the operon leader and trailer gene, gene composition of the operon and associated information. - - - - - - - - - - Proteomics - - beta12orEarlier - Protein and peptide identification, especially in the study of whole proteomes of organisms.
- Protein and peptide identification - Peptide identification - Proteomics includes any methods (especially high-throughput) that separate, characterize and identify expressed proteins such as mass spectrometry, two-dimensional gel electrophoresis and protein microarrays, as well as in-silico methods that perform proteolytic or mass calculations on a protein sequence and other analyses of protein expression data, for example in different cells or tissues. - true - http://purl.bioontology.org/ontology/MSH/D040901 - Protein expression - - - - - - - - - - - Structural genomics - - - true - beta12orEarlier - The elucidation of the three dimensional structure for all (available) proteins in a given organism. - - - - - - - - - - - Protein properties - - The study of the physical and biochemical properties of peptides and proteins, for example the hydrophobic, hydrophilic and charge properties of a protein. - Protein hydropathy - true - Protein physicochemistry - beta12orEarlier - - - - - - - - - - Protein interactions - - - Protein-protein, protein-DNA/RNA and protein-ligand interactions, including analysis of known interactions and prediction of putative interactions. - Protein-nucleic acid interactions - Protein-RNA interaction - Protein interaction networks - This includes experimental (e.g. yeast two-hybrid) and computational analysis techniques. - Protein-protein interactions - Protein-ligand interactions - beta12orEarlier - Protein-DNA interaction - true - - - - - - - - - - Protein folding, stability and design - - beta12orEarlier - Protein residue interactions - Protein design - true - Protein folding - Protein stability - Protein stability, folding (in 3D space) and protein sequence-structure-function relationships. 
This includes for example study of inter-atomic or inter-residue interactions in protein (3D) structures, the effect of mutation, and the design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - Rational protein design - - - - - - - - - - Two-dimensional gel electrophoresis - - Two-dimensional gel electrophoresis image and related data. - beta13 - beta12orEarlier - true - - - - - - - - - - Mass spectrometry - - beta12orEarlier - true - 1.13 - An analytical chemistry technique that measures the mass-to-charge ratio and abundance of ions in the gas phase. - - - - - - - - - - Protein microarrays - - Protein microarray data. - true - beta12orEarlier - beta13 - - - - - - - - - - Protein hydropathy - - beta12orEarlier - true - The study of the hydrophobic, hydrophilic and charge properties of a protein. - 1.3 - - - - - - - - - - Protein targeting and localization - - Protein targeting - Protein sorting - The study of how proteins are transported within and without the cell, including signal peptides, protein subcellular localization and export. - Protein localization - beta12orEarlier - - - - - - - - - - Protein cleavage sites and proteolysis - - true - beta12orEarlier - 1.3 - Enzyme or chemical cleavage sites and proteolytic or mass calculations on a protein sequence. - - - - - - - - - - Protein structure comparison - - The comparison of two or more protein structures. - beta12orEarlier - true - Use this concept for methods that are exclusively for protein structure. - beta12orEarlier - - - - - - - - - - Protein residue interactions - - The processing and analysis of inter-atomic or inter-residue interactions in protein (3D) structures. - true - 1.3 - beta12orEarlier - - - - - - - - - - Protein-protein interactions - - Protein interaction networks - true - Protein-protein interactions, individual interactions and networks, protein complexes, protein functional coupling etc.
- beta12orEarlier - 1.3 - - - - - - - - - - Protein-ligand interactions - - beta12orEarlier - true - 1.3 - Protein-ligand (small molecule) interactions. - - - - - - - - - - Protein-nucleic acid interactions - - beta12orEarlier - 1.3 - Protein-DNA/RNA interactions. - true - - - - - - - - - - Protein design - - 1.3 - beta12orEarlier - The design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - true - - - - - - - - - - G protein-coupled receptors (GPCR) - - G-protein coupled receptors (GPCRs). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Carbohydrates - - beta12orEarlier - Carbohydrates, typically including structural information. - true - - - - - - - - - - Lipids - - beta12orEarlier - true - Lipidomics - Lipids and their structures. - - - - - - - - - - Small molecules - - Drugs and target structures - Amino acids - Targets - Drug structures - Metabolite structures - Target structures - Small molecules of biological significance, typically archival, curation, processing and analysis of structural information. - Small molecules include organic molecules, metal-organic compounds, small polypeptides, small polysaccharides and oligonucleotides. Structural data is usually included. - true - This concept excludes macromolecules such as proteins and nucleic acids. - Toxins and targets - CHEBI:23367 - Toxins - Metabolites - Drug targets - Peptides and amino acids - beta12orEarlier - Chemical structures - This includes the structures of drugs, drug target, their interactions and binding affinities. Also the structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. Also the physicochemical, biochemical or structural properties of amino acids or peptides. Also structural and associated data for toxic chemical substances. 
- Peptides - - - - - - - - - - Sequence editing - - beta12orEarlier - true - beta12orEarlier - Edit, convert or otherwise change a molecular sequence, either randomly or specifically. - - - - - - - - - - - Sequence composition, complexity and repeats - - Repeat sequences - This includes short repetitive subsequences (repeat sequences) in a protein sequence. - true - The archival, processing and analysis of the basic character composition of molecular sequences, for example character or word frequency, ambiguity, complexity, particularly regions of low complexity, and repeats or the repetitive nature of molecular sequences. - beta12orEarlier - Protein sequence repeats - Nucleic acid repeats - This includes repetitive elements within a nucleic acid sequence, e.g. -long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a sequence and other types of repeating unit. - Sequence complexity - Low complexity sequences - Sequence repeats - Sequence composition - Protein repeats - - - - - - - - - - Sequence motifs - - beta12orEarlier - Motifs - true - 1.3 - Conserved patterns (motifs) in molecular sequences, that (typically) describe functional or other key sites. - - - - - - - - - - Sequence comparison - - true - The comparison might be on the basis of sequence, physico-chemical or some other properties of the sequences. - beta12orEarlier - 1.12 - The comparison of two or more molecular sequences, for example sequence alignment and clustering. - - - - - - - - - - Sequence sites, features and motifs - - Sequence features - true - Functional sites - The archival, detection, prediction and analysis of positional features such as functional and other key sites, in molecular sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. 
- Sequence motifs - Sequence profiles - Sequence sites - HMMs - beta12orEarlier - - - - - - - - - - Sequence database search - - beta12orEarlier - Search and retrieve molecular sequences that are similar to a sequence-based query (typically a simple sequence). - beta12orEarlier - true - The query is a sequence-based entity such as another sequence, a motif or profile. - - - - - - - - - - Sequence clustering - - This includes systems that generate, process and analyse sequence clusters. - beta12orEarlier - true - 1.7 - The comparison and grouping together of molecular sequences on the basis of their similarities. - Sequence clusters - - - - - - - - - - Protein structural motifs and surfaces - - This includes conformation of conserved substructures, conserved geometry (spatial arrangement) of secondary structure or protein backbone, solvent-exposed surfaces, internal cavities, the analysis of shape, hydropathy, electrostatic patches, role and functions etc. - Protein structural features - Structural motifs - Protein 3D motifs - true - beta12orEarlier - Protein structural motifs - Structural features or common 3D motifs within protein structures, including the surface of a protein structure, such as biological interfaces with other molecules. - Protein surfaces - - - - - - - - - - Structural (3D) profiles - - The processing, analysis or use of some type of structural (3D) profile or template; a computational entity (typically a numerical matrix) that is derived from and represents a structure or structure alignment. - true - beta12orEarlier - 1.3 - Structural profiles - - - - - - - - - - Protein structure prediction - - true - beta12orEarlier - The prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features. - 1.12 - - - - - - - - - - Nucleic acid structure prediction - - The folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. 
- 1.12 - true - beta12orEarlier - - - - - - - - - - Ab initio structure prediction - - 1.7 - The prediction of three-dimensional structure of a (typically protein) sequence from first principles, using a physics-based or empirical scoring function and without using explicit structural templates. - true - beta12orEarlier - - - - - - - - - - Homology modelling - - 1.4 - The modelling of the three-dimensional structure of a protein using known sequence and structural data. - true - beta12orEarlier - - - - - - - - - - Molecular dynamics - - This includes resources concerning flexibility and motion in protein and other molecular structures. - Protein dynamics - true - Molecular flexibility - Molecular motions - beta12orEarlier - The study and simulation of molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - - - - - - - - - - Molecular docking - - beta12orEarlier - true - The modelling of the structure of proteins in complex with small molecules or other macromolecules. - true - 1.12 - - - - - - - - - - Protein secondary structure prediction - - beta12orEarlier - 1.3 - The prediction of secondary or supersecondary structure of protein sequences. - true - - - - - - - - - - Protein tertiary structure prediction - - 1.3 - true - The prediction of tertiary structure of protein sequences. - beta12orEarlier - - - - - - - - - - Protein fold recognition - - 1.12 - The recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Sequence alignment - - This includes the generation of alignments (the identification of equivalent sites), the analysis of alignments, editing, visualisation, alignment databases, the alignment (equivalence between sites) of sequence profiles (representing sequence alignments) and so on.
- beta12orEarlier - 1.7 - The alignment of molecular sequences or sequence profiles (representing sequence alignments). - true - - - - - - - - - - Structure alignment - - The superimposition of molecular tertiary structures or structural (3D) profiles (representing a structure or structure alignment). - This includes the generation, storage, analysis, rendering etc. of structure alignments. - true - 1.7 - beta12orEarlier - - - - - - - - - - Threading - - Sequence-structure alignment - 1.3 - beta12orEarlier - The alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - true - - - - - - - - - - Sequence profiles and HMMs - - true - Sequence profiles; typically a positional, numerical matrix representing a sequence alignment. - beta12orEarlier - 1.3 - Sequence profiles include position-specific scoring matrix (position weight matrix), hidden Markov models etc. - - - - - - - - - - Phylogeny reconstruction - - The reconstruction of a phylogeny (evolutionary relatedness amongst organisms), for example, by building a phylogenetic tree. - 1.3 - true - Currently too specific for the topic sub-ontology (but might be unobsoleted). - beta12orEarlier - - - - - - - - - - Phylogenomics - - - beta12orEarlier - The integrated study of evolutionary relationships and whole genome data, for example, in the analysis of species trees, horizontal gene transfer and evolutionary reconstruction. - true - - - - - - - - - - - Virtual PCR - - beta13 - Polymerase chain reaction - beta12orEarlier - Simulated polymerase chain reaction (PCR). - PCR - true - - - - - - - - - - Sequence assembly - - true - Assembly - The assembly of fragments of a DNA sequence to reconstruct the original sequence. - beta12orEarlier - Assembly has two broad types, de-novo and re-sequencing. 
Re-sequencing is a specialized case of assembly, where an assembled (typically de-novo assembled) reference genome is available and is about 95% identical to the re-sequenced genome. All other cases of assembly are 'de-novo'. - - - - - - - - - - Genetic variation - - Mutation - beta12orEarlier - Polymorphism - Somatic mutations - Stable, naturally occurring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. - http://purl.bioontology.org/ontology/MSH/D014644 - DNA variation - true - - - - - - - - - - Microarrays - - true - http://purl.bioontology.org/ontology/MSH/D046228 - Microarrays, for example, to process microarray data or design probes and experiments. - 1.3 - DNA microarrays - beta12orEarlier - - - - - - - - - - Pharmacology - - Computational pharmacology - beta12orEarlier - Pharmacoinformatics - The study of drugs and their effects or responses in living systems. - VT 3.1.7 Pharmacology and pharmacy - true - - - - - - - - - - - Gene expression - - This includes the study of codon usage in nucleotide sequence(s), genetic codes and so on. - Transcription - Gene expression profiling - Expression profiling - beta12orEarlier - http://edamontology.org/topic_0197 - Gene expression levels are analysed by identifying, quantifying or comparing mRNA transcripts, for example using microarrays, RNA-seq, northern blots, gene-indexed expression profiles etc. - http://purl.bioontology.org/ontology/MSH/D015870 - Gene expression analysis - DNA microarrays - The analysis of levels and patterns of synthesis of gene products (proteins and functional RNA) including interpretation in functional terms of gene expression data. - Codon usage - true - - - - - - - - - - - Gene regulation - - true - Regulatory genomics - beta12orEarlier - The regulation of gene expression.
- - - - - - - - - - Pharmacogenomics - - - true - beta12orEarlier - Pharmacogenetics - The influence of genotype on drug response, for example by correlating gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - - - - - - - - - - - Medicinal chemistry - - - VT 3.1.4 Medicinal chemistry - The design and chemical synthesis of bioactive molecules, for example drugs or potential drug compounds, for medicinal purposes. - This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - true - Drug design - beta12orEarlier - - - - - - - - - - - Fish - - beta12orEarlier - true - 1.3 - Information on a specific fish genome including molecular sequences, genes and annotation. - - - - - - - - - - Flies - - 1.3 - true - beta12orEarlier - Information on a specific fly genome including molecular sequences, genes and annotation. - - - - - - - - - - Mice or rats - - Information on a specific mouse or rat genome including molecular sequences, genes and annotation. - The resource may be specific to a group of mice / rats or all mice / rats. - beta12orEarlier - - - - - - - - - - Worms - - true - 1.3 - beta12orEarlier - Information on a specific worm genome including molecular sequences, genes and annotation. - - - - - - - - - - Literature analysis - - beta12orEarlier - 1.3 - The processing and analysis of the bioinformatics literature and bibliographic data, such as literature search and query. - true - - - - - - - - - - Text mining - - beta12orEarlier - The analysis of the biomedical and informatics literature. - Literature analysis - Literature mining - Text data mining - - - - - - - - - - - Data submission, annotation and curation - - Database curation - Deposition and curation of database accessions, including annotation, typically with terms from a controlled vocabulary. 
- beta12orEarlier - - - - - - - - - - - Document, record and content management - - true - The management and manipulation of digital documents, including database records, files and reports. - VT 1.3.6 Multimedia, hypermedia - 1.13 - beta12orEarlier - - - - - - - - - - Sequence annotation - - beta12orEarlier - beta12orEarlier - true - Annotation of a molecular sequence. - - - - - - - - - - Genome annotation - - Annotation of a genome. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - - NMR - - - ROESY - NOESY - Nuclear Overhauser Effect Spectroscopy - An analytical technique that exploits the magnetic properties of certain atomic nuclei to provide information on the structure, dynamics, reaction state and chemical environment of molecules. - HOESY - beta12orEarlier - Heteronuclear Overhauser Effect Spectroscopy - Nuclear magnetic resonance spectroscopy - Spectroscopy - NMR spectroscopy - Rotational Frame Nuclear Overhauser Effect Spectroscopy - - - - - - - - - - - Sequence classification - - 1.12 - true - beta12orEarlier - The classification of molecular sequences based on some measure of their similarity. - Methods including sequence motifs, profile and other diagnostic elements which (typically) represent conserved patterns (of residues or properties) in molecular sequences. - - - - - - - - - - Protein classification - - 1.3 - true - beta12orEarlier - primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc. - - - - - - - - - - Sequence motif or profile - - beta12orEarlier - true - Sequence motifs, or sequence profiles derived from an alignment of molecular sequences of a particular type. - This includes comparison, discovery, recognition etc. of sequence motifs. - beta12orEarlier - - - - - - - - - - Protein modifications - - GO:0006464 - Protein chemical modifications - Protein post-translational modification - Protein chemical modifications, e.g. post-translational modifications.
- true - EDAM does not describe all possible protein modifications. For fine-grained annotation of protein modification use the Gene Ontology (children of concept GO:0006464) and/or the Protein Modifications ontology (children of concept MOD:00000) - Protein post-translational modifications - Post-translation modifications - MOD:00000 - beta12orEarlier - - - - - - - - - - Molecular interactions, pathways and networks - - Networks - Environmental information processing pathways - Pathways - Biological networks - Disease pathways - true - Signal transduction pathways - beta13 - Biological models - Cellular process pathways - Molecular interactions - Gene regulatory networks - Molecular interactions, biological pathways, networks and other models. - Biological pathways - Interactions - Genetic information processing pathways - Signaling pathways - http://edamontology.org/topic_3076 - - - - - - - - - - - Informatics - - true - The study and practice of information processing and use of computer information systems. - VT 1.3.99 Other - Knowledge management - VT 1.3.4 Information management - beta12orEarlier - Information management - VT 1.3.5 Knowledge management - VT 1.3.3 Information retrieval - VT 1.3 Information sciences - Information science - - - - - - - - - Literature data resources - - Data resources for the biological or biomedical literature, either a primary source of literature or some derivative. - true - 1.3 - beta12orEarlier - - - - - - - - - - Laboratory information management - - true - Laboratory management and resources, for example, catalogues of biological resources for use in the lab including cell lines, viruses, plasmids, phages, DNA probes and primers and so on. - beta12orEarlier - Laboratory resources - - - - - - - - - - - - Cell and tissue culture - - Tissue culture - 1.3 - true - General cell culture or data on specific cell lines.
- Cell culture - beta12orEarlier - - - - - - - - - - Ecology - - true - The ecological and environmental sciences and especially the application of information technology (ecoinformatics). - http://purl.bioontology.org/ontology/MSH/D004777 - Ecological informatics - VT 1.5.15 Ecology - Computational ecology - beta12orEarlier - Ecoinformatics - Environmental science - - - - - - - - - - - Electron microscopy - - - SEM - Scanning electron microscopy - TEM - The study of matter by studying the interference pattern from firing electrons at a sample, to analyse structures at resolutions higher than can be achieved using light. - - Transmission electron microscopy - beta12orEarlier - Electron crystallography - Electron diffraction experiment - Single particle electron microscopy - - - - - - - - - - - Cell cycle - - beta13 - beta12orEarlier - true - The cell cycle including key genes and proteins. - - - - - - - - - - Peptides and amino acids - - beta12orEarlier - The physicochemical, biochemical or structural properties of amino acids or peptides. - 1.13 - true - - - - - - - - - - Organelles - - Cell membrane - Cytoplasm - Organelle genes and proteins - Smooth endoplasmic reticulum - beta12orEarlier - Lysosome - Centriole - Ribosome - Nucleus - true - A specific organelle, or organelles in general, typically the genes and proteins (or genome and proteome). - Mitochondria - Golgi apparatus - Rough endoplasmic reticulum - 1.3 - - - - - - - - - - Ribosomes - - beta12orEarlier - Ribosomes, typically of ribosome-related genes and proteins. - Ribosome genes and proteins - 1.3 - true - - - - - - - - - - Scents - - A database about scents. - beta12orEarlier - beta13 - true - - - - - - - - - - Drugs and target structures - - beta12orEarlier - The structures of drugs, drug target, their interactions and binding affinities. 
- true - 1.13 - - - - - - - - - - Model organisms - - This may include information on the genome (including molecular sequences and map, genes and annotation), proteome, as well as more general information about an organism. - beta12orEarlier - A specific organism, or group of organisms, used to study a particular aspect of biology. - true - Organisms - - - - - - - - - - - Genomics - - http://purl.bioontology.org/ontology/MSH/D023281 - Personal genomics - beta12orEarlier - Whole genomes of one or more organisms, or genomes in general, such as meta-information on genomes, genome projects, gene names etc. - true - - - - - - - - - - - Gene families - - beta12orEarlier - Gene family - Gene system - Gene and protein families - Particular gene(s), gene family or other gene group or system and their encoded proteins. - Genes, gene family or system - true - - - - - - - - - - - Chromosomes - - beta12orEarlier - Study of chromosomes. - 1.13 - true - - - - - - - - - - Genotype and phenotype - - Genotype and phenotype resources - The study of genetic constitution of a living entity, such as an individual, an organism, a cell and so on, typically with respect to particular observable phenotypic traits, or resources concerning such traits, which might be an aspect of biochemistry, physiology, morphology, anatomy, development and so on. - Genotyping - Phenotyping - true - beta12orEarlier - - - - - - - - - - - Gene expression and microarray - - true - beta12orEarlier - beta12orEarlier - Gene expression e.g. microarray data, northern blots, gene-indexed expression profiles etc. - - - - - - - - - - Probes and primers - - Probes - This includes the design of primers for PCR and DNA amplification or the design of molecular probes. - http://purl.bioontology.org/ontology/MSH/D015335 - Primers - true - beta12orEarlier - Molecular probes (e.g. a peptide probe or DNA microarray probe) or PCR primers and hybridization oligos in a nucleic acid sequence.
- - - - - - - - - - - Pathology - - Disease - Diseases, including diseases in general and the genes, gene variations and proteins involved in one or more specific diseases. - true - beta12orEarlier - VT 3.1.6 Pathology - - - - - - - - - - - Specific protein resources - - 1.3 - A particular protein, protein family or other group of proteins. - true - Specific protein - beta12orEarlier - - - - - - - - - - Taxonomy - - true - beta12orEarlier - VT 1.5.25 Taxonomy - Organism classification, identification and naming. - - - - - - - - - - Protein sequence analysis - - beta12orEarlier - Archival, processing and analysis of protein sequences and sequence-based entities such as alignments, motifs and profiles. - 1.8 - true - - - - - - - - - - Nucleic acid sequence analysis - - beta12orEarlier - 1.8 - true - The archival, processing and analysis of nucleotide sequences and sequence-based entities such as alignments, motifs and profiles. - - - - - - - - - - - Repeat sequences - - true - The repetitive nature of molecular sequences. - beta12orEarlier - 1.3 - - - - - - - - - - Low complexity sequences - - true - The (character) complexity of molecular sequences, particularly regions of low complexity. - 1.3 - beta12orEarlier - - - - - - - - - - Proteome - - A specific proteome including protein sequences and annotation. - beta12orEarlier - beta13 - true - - - - - - - - - - DNA - - DNA analysis - beta12orEarlier - Ancient DNA - Chromosomes - DNA sequences and structure, including processes such as methylation and replication. - The DNA sequences might be coding or non-coding sequences.
- - - - - - - - - - Coding RNA - - Protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames - 1.13 - beta12orEarlier - true - - - - - - - - - - Functional, regulatory and non-coding RNA - - - true - small interfering RNA - small nucleolar RNA - ncRNA - Non-coding RNA - Functional RNA - snRNA - Non-coding or functional RNA sequences, including regulatory RNA sequences, ribosomal RNA (rRNA) and transfer RNA (tRNA). - Non-coding RNA includes piwi-interacting RNA (piRNA), small nuclear RNA (snRNA) and small nucleolar RNA (snoRNA). Regulatory RNA includes microRNA (miRNA) - short single stranded RNA molecules that regulate gene expression, and small interfering RNA (siRNA). - Regulatory RNA - siRNA - piRNA - snoRNA - small nuclear RNA - beta12orEarlier - miRNA - microRNA - piwi-interacting RNA - - - - - - - - - - rRNA - - 1.3 - One or more ribosomal RNA (rRNA) sequences. - true - - - - - - - - - - tRNA - - 1.3 - true - One or more transfer RNA (tRNA) sequences. - - - - - - - - - - Protein secondary structure - - true - beta12orEarlier - 1.8 - Protein secondary structure or secondary structure alignments. - This includes assignment, analysis, comparison, prediction, rendering etc. of secondary structure data. - - - - - - - - - - RNA structure - - 1.3 - RNA secondary or tertiary structure and alignments. - beta12orEarlier - true - - - - - - - - - - Protein tertiary structure - - 1.8 - true - Protein tertiary structures. - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - Classification of nucleic acid sequences and structures. - 1.3 - true - beta12orEarlier - - - - - - - - - - Protein families - - true - beta12orEarlier - Protein sequence classification - Protein secondary databases - A protein families database might include the classifier (e.g. a sequence profile) used to build the classification. 
- Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc., curation of a particular protein or protein family, or any other proteins that have been classified as members of a common group. - - - - - - - - - - - Protein folds and structural domains - - Protein tertiary structural domains and folds in a protein or polypeptide chain. - This includes topological domains such as cytoplasmic regions in a protein. - Protein transmembrane regions - Protein domains - Protein membrane regions - Intramembrane regions - beta12orEarlier - Protein topological domains - true - This includes trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. For example, the location and size of the membrane spanning segments and intervening loop regions, transmembrane region IN/OUT orientation relative to the membrane, plus the following data for each amino acid: A Z-coordinate (the distance to the membrane center), the free energy of membrane insertion (calculated in a sliding window over the sequence) and a reliability score. The z-coordinate implies information about re-entrant helices, interfacial helices, the tilt of a transmembrane helix and loop lengths. - Protein folds - Transmembrane regions - Protein structural domains - - - - - - - - - - Nucleic acid sequence alignment - - beta12orEarlier - true - 1.3 - Nucleotide sequence alignments. - - - - - - - - - - Protein sequence alignment - - 1.3 - Protein sequence alignments. - beta12orEarlier - true - A sequence profile typically represents a sequence alignment. - - - - - - - - - - Nucleic acid sites and features - - beta12orEarlier - 1.3 - true - The archival, detection, prediction and analysis of -positional features such as functional sites in nucleotide sequences. 
- - - - - - - - - - - Protein sites and features - - beta12orEarlier - The detection, identification and analysis of positional features in proteins, such as functional sites. - 1.3 - true - - - - - - - - - - - Transcription factors and regulatory sites - - - - CpG islands - Proteins that bind to DNA and control transcription of DNA to mRNA (transcription factors) and also transcriptional regulatory sites, elements and regions (such as promoters, enhancers, silencers and boundary elements / insulators) in nucleotide sequences. - Attenuators - Enhancers - CAAT signals - Transcriptional regulatory sites - TFBS - CAT box - CCAAT box - This includes CpG rich regions (isochores) in a nucleotide sequence. - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - -10 signals - Transcription factor proteins either promote (as an activator) or block (as a repressor) the binding to DNA of RNA polymerase. Regulatory sites including transcription factor binding site as well as promoters, enhancers, silencers and boundary elements / insulators. - Terminators - TATA signals - GC signals - Promoters - -35 signals - Transcription factors - Isochores - beta12orEarlier - Transcription factor binding sites - - - - - - - - - - Phosphorylation sites - - 1.0 - Protein phosphorylation and phosphorylation sites in protein sequences. - true - beta12orEarlier - - - - - - - - - - - Metabolic pathways - - beta12orEarlier - 1.13 - true - Metabolic pathways. - - - - - - - - - - Signaling pathways - - true - Signaling pathways. - 1.13 - beta12orEarlier - - - - - - - - - - Protein and peptide identification - - 1.3 - beta12orEarlier - true - - - - - - - - - - Workflows - - Pipelines - Biological or biomedical analytical workflows or pipelines. 
- beta12orEarlier - - - - - - - - - Data types and objects - - Structuring data into basic types and (computational) objects. - beta12orEarlier - 1.0 - true - - - - - - - - - - Theoretical biology - - 1.3 - true - - - - - - - - - - Mitochondria - - beta12orEarlier - true - Mitochondria, typically of mitochondrial genes and proteins. - 1.3 - - - - - - - - - - Plants - - The resource may be specific to a plant, a group of plants or all plants. - Plant science - Plants, e.g. information on a specific plant genome including molecular sequences, genes and annotation. - Plant biology - Botany - VT 1.5.22 Plant science - Plant - VT 1.5.10 Botany - beta12orEarlier - - - - - - - - - - Viruses - - Virology - VT 1.5.28 Virology - beta12orEarlier - Viruses, e.g. sequence and structural data, interactions of viral proteins, or a viral genome including molecular sequences, genes and annotation. - The resource may be specific to a virus, a group of viruses or all viruses. - - - - - - - - - - Fungi - - Mycology - beta12orEarlier - The resource may be specific to a fungus, a group of fungi or all fungi. - Yeast - VT 1.5.21 Mycology - Fungi and molds, e.g. information on a specific fungal genome including molecular sequences, genes and annotation. - - - - - - - - - - Pathogens - - Pathogens, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a pathogen, a group of pathogens or all pathogens. - - - - - - - - - - Arabidopsis - - beta12orEarlier - Arabidopsis-specific data. - 1.3 - true - - - - - - - - - - Rice - - Rice-specific data. 
- true - 1.3 - beta12orEarlier - - - - - - - - - - Genetic mapping and linkage - - Linkage mapping - beta12orEarlier - 1.3 - true - Genetic linkage - Informatics resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. - - - - - - - - - - Comparative genomics - - The study (typically comparison) of the sequence, structure or function of multiple genomes. - true - beta12orEarlier - - - - - - - - - - - Mobile genetic elements - - Transposons - beta12orEarlier - Mobile genetic elements, such as transposons, Plasmids, Bacteriophage elements and Group II introns. - - - - - - - - - - Human disease - - Human diseases, typically describing the genes, mutations and proteins implicated in disease. - beta13 - true - beta12orEarlier - - - - - - - - - - Immunology - - VT 3.1.3 Immunology - Immunoinformatics - http://purl.bioontology.org/ontology/MSH/D007120 - http://purl.bioontology.org/ontology/MSH/D007125 - beta12orEarlier - true - Computational immunology - The application of information technology to immunology such as immunological processes, immunological genes, proteins and peptide ligands, antigens and so on. - - - - - - - - - - - Membrane and lipoproteins - - Lipoproteins (protein-lipid assemblies), and proteins or region of a protein that spans or are associated with a membrane. - true - beta12orEarlier - Membrane proteins - Lipoproteins - Transmembrane proteins - - - - - - - - - - Enzymes - - Proteins that catalyze chemical reaction, the kinetics of enzyme-catalysed reactions, enzyme nomenclature etc. - beta12orEarlier - Enzymology - true - - - - - - - - - - Primers - - true - 1.13 - PCR primers and hybridization oligos in a nucleic acid sequence. 
- beta12orEarlier - - - - - - - - - - PolyA signal or sites - - beta12orEarlier - 1.13 - true - Regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. - - - - - - - - - - CpG island and isochores - - beta12orEarlier - 1.13 - true - CpG rich regions (isochores) in a nucleotide sequence. - - - - - - - - - - Restriction sites - - Restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Splice sites - - beta12orEarlier - Splice sites in a nucleotide sequence or alternative RNA splicing events. - 1.13 - true - - - - - - - - - - - Matrix/scaffold attachment sites - - 1.13 - true - beta12orEarlier - Matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - - - - - - - - - - Operon - - beta12orEarlier - 1.13 - true - Operons (operators, promoters and genes) from a bacterial genome. - - - - - - - - - - Promoters - - true - 1.13 - Whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - beta12orEarlier - - - - - - - - - - Structural biology - - Structural assignment - Structure determination - This includes experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - 1.3 - This includes Informatics concerning data generated from the use of microscopes, including optical, electron and scanning probe microscopy. Includes methods for digitizing microscope images and viewing the produced virtual slides and associated data on a computer screen. - The molecular structure of biological molecules, particularly macromolecules such as proteins and nucleic acids. 
- true - VT 1.5.24 Structural biology - Structural determination - - - - - - - - - - - Protein membrane regions - - 1.8 - 1.13 - true - Trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Structure comparison - - This might involve comparison of secondary or tertiary (3D) structural information. - true - The comparison of two or more molecular structures, for example structure alignment and clustering. - 1.13 - beta12orEarlier - - - - - - - - - - Function analysis - - true - Protein function prediction - The study of gene and protein function including the prediction of functional properties of a protein. - Protein function analysis - beta12orEarlier - - - - - - - - - - - Prokaryotes and archae - - The resource may be specific to a prokaryote, a group of prokaryotes or all prokaryotes. - VT 1.5.2 Bacteriology - Bacteriology - beta12orEarlier - Specific bacteria or archaea, e.g. information on a specific prokaryote genome including molecular sequences, genes and annotation. - - - - - - - - - - Protein databases - - true - 1.3 - Protein data resources. - beta12orEarlier - Protein data resources - - - - - - - - - - Structure determination - - Experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - beta12orEarlier - true - 1.3 - - - - - - - - - - Cell biology - - beta12orEarlier - true - VT 1.5.11 Cell biology - Cellular processes - Cells, such as key genes and proteins involved in the cell cycle. - - - - - - - - - - Classification - - beta13 - beta12orEarlier - Topic focused on identifying, grouping, or naming things in a structured way according to some schema based on observable relationships. 
- true - - - - - - - - - - Lipoproteins - - true - 1.3 - beta12orEarlier - Lipoproteins (protein-lipid assemblies). - - - - - - - - - - Phylogeny visualisation - - true - Visualise a phylogeny, for example, render a phylogenetic tree. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cheminformatics - - The application of information technology to chemistry in biological research environment. - Chemical informatics - beta12orEarlier - Chemoinformatics - true - - - - - - - - - - - Systems biology - - http://en.wikipedia.org/wiki/Systems_biology - This includes databases of models and methods to construct or analyse a model. - Biological models - http://purl.bioontology.org/ontology/MSH/D049490 - true - beta12orEarlier - Biological modelling - Biological system modelling - The holistic modelling and analysis of complex biological systems and the interactions therein. - - - - - - - - - - - Statistics and probability - - Biostatistics - Probability - http://en.wikipedia.org/wiki/Biostatistics - beta12orEarlier - The application of statistical methods to biological problems. - Statistics - http://purl.bioontology.org/ontology/MSH/D056808 - - - - - - - - - - - Structure database search - - The query is a structure-based entity such as another structure, a 3D (structural) motif, 3D profile or template. - beta12orEarlier - Search for and retrieve molecular structures that are similar to a structure-based query (typically another structure or part of a structure). - beta12orEarlier - true - - - - - - - - - - Molecular modelling - - Molecular docking - Homology modeling - beta12orEarlier - Comparative modelling - Homology modelling - Molecular modeling - Comparative modeling - true - The construction, analysis, evaluation, refinement etc. of models of a molecules properties or behaviour, including the modelling the structure of proteins in complex with small molecules or other macromolecules (docking). 
- - - - - - - - - - Protein function prediction - - 1.2 - beta12orEarlier - true - The prediction of functional properties of a protein. - - - - - - - - - - SNP - - true - Single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. - beta12orEarlier - 1.13 - - - - - - - - - - Transmembrane protein prediction - - Predict transmembrane domains and topology in protein sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - - Nucleic acid structure comparison - - The comparison of two or more nucleic acid (typically RNA) secondary or tertiary structures. - beta12orEarlier - true - beta12orEarlier - Use this concept for methods that are exclusively for nucleic acid structures. - - - - - - - - - - - Exons - - beta12orEarlier - true - Exons in a nucleotide sequence. - 1.13 - - - - - - - - - - Gene transcription - - Transcription of DNA into RNA including the regulation of transcription. - true - 1.13 - beta12orEarlier - - - - - - - - - - DNA mutation - - - beta12orEarlier - DNA mutation. - - - - - - - - - - Oncology - - beta12orEarlier - VT 3.2.16 Oncology - Cancer - true - The study of cancer, for example, genes and proteins implicated in cancer. - Cancer biology - - - - - - - - - - - Toxins and targets - - 1.13 - beta12orEarlier - true - Structural and associated data for toxic chemical substances. - - - - - - - - - - Introns - - 1.13 - Introns in a nucleotide sequence. - beta12orEarlier - true - - - - - - - - - - Tool topic - - beta12orEarlier - A topic concerning primarily bioinformatics software tools, typically the broad function or purpose of a tool. - true - beta12orEarlier - - - - - - - - - - Study topic - - A general area of bioinformatics study, typically the broad scope or category of content of a bioinformatics journal or conference proceeding.
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Nomenclature - - true - 1.3 - beta12orEarlier - Biological nomenclature (naming), symbols and terminology. - - - - - - - - - - Disease genes and proteins - - 1.3 - true - beta12orEarlier - The genes, gene variations and proteins involved in one or more specific diseases. - - - - - - - - - - Protein structure analysis - - - Protein structure - true - Protein secondary or tertiary structural data and/or associated annotation. - http://edamontology.org/topic_3040 - beta12orEarlier - - - - - - - - - - - Humans - - beta12orEarlier - The human genome, including molecular sequences, genes, annotation, maps and viewers, the human proteome or human beings in general. - - - - - - - - - - Gene resources - - Gene resource - beta12orEarlier - 1.3 - Informatics resource (typically a database) primarily focussed on genes. - Gene database - true - - - - - - - - - - Yeast - - beta12orEarlier - Yeast, e.g. information on a specific yeast genome including molecular sequences, genes and annotation. - true - 1.3 - - - - - - - - - - Eukaryotes - - Eukaryote - Eukaryotes or data concerning eukaryotes, e.g. information on a specific eukaryote genome including molecular sequences, genes and annotation. - The resource may be specific to a eukaryote, a group of eukaryotes or all eukaryotes. - beta12orEarlier - - - - - - - - - - Invertebrates - - The resource may be specific to an invertebrate, a group of invertebrates or all invertebrates. - beta12orEarlier - Invertebrates, e.g. information on a specific invertebrate genome including molecular sequences, genes and annotation. - - - - - - - - - - Vertebrates - - The resource may be specific to a vertebrate, a group of vertebrates or all vertebrates. - Vertebrates, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - - - - - - - - - - Unicellular eukaryotes - - Unicellular eukaryotes, e.g. 
information on a unicellular eukaryote genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a unicellular eukaryote, a group of unicellular eukaryotes or all unicellular eukaryotes. - - - - - - - - - - Protein structure alignment - - Protein secondary or tertiary structure alignments. - beta12orEarlier - true - 1.3 - - - - - - - - - - X-ray diffraction - - - The study of matter and their structure by means of the diffraction of X-rays, typically the diffraction pattern caused by the regularly spaced atoms of a crystalline sample. - beta12orEarlier - X-ray microscopy - Crystallography - X-ray crystallography - - - - - - - - - - - Ontologies, nomenclature and classification - - true - Conceptualisation, categorisation and naming of entities or phenomena within biology or bioinformatics. - 1.3 - http://purl.bioontology.org/ontology/MSH/D002965 - beta12orEarlier - - - - - - - - - - Immunoproteins, genes and antigens - - - Immunopeptides - Immunity-related genes, proteins and their ligands. - Antigens - This includes T cell receptors (TR), major histocompatibility complex (MHC), immunoglobulin superfamily (IgSF) / antibodies, major histocompatibility complex superfamily (MhcSF), etc." - beta12orEarlier - Immunoproteins - Immunogenes - - - - - - - - - - - Molecules - - CHEBI:23367 - beta12orEarlier - beta12orEarlier - Specific molecules, including large molecules built from repeating subunits (macromolecules) and small molecules of biological significance. - true - - - - - - - - - - Toxicology - - - Toxins and the adverse effects of these chemical substances on living organisms. - VT 3.1.9 Toxicology - Toxicoinformatics - true - beta12orEarlier - Computational toxicology - - - - - - - - - - - High-throughput sequencing - - Next-generation sequencing - beta13 - true - beta12orEarlier - Parallelized sequencing processes that are capable of sequencing many thousands of sequences simultaneously. 
- - - - - - - - - - Structural clustering - - The comparison and grouping together of molecular structures on the basis of similarity; generate, process or analyse structural clusters. - 1.7 - Structure classification - true - beta12orEarlier - - - - - - - - - - Gene regulatory networks - - Gene regulatory networks. - true - 1.13 - beta12orEarlier - - - - - - - - - - Disease (specific) - - Informatics resources dedicated to one or more specific diseases (not diseases in general). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - VNTR - - Variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Microsatellites - - true - 1.13 - beta12orEarlier - Microsatellite polymorphism in a DNA sequence. - - - - - - - - - - - RFLP - - Restriction fragment length polymorphisms (RFLP) in a DNA sequence. - true - 1.13 - beta12orEarlier - - - - - - - - - - - DNA polymorphism - - - Includes restriction fragment length polymorphisms (RFLP) in a DNA sequence. An RFLP is defined by the presence or absence of a specific restriction site of a bacterial restriction enzyme. - true - RFLP - Single nucleotide polymorphism - Microsatellites - VNTR - SNP - Includes microsatellite polymorphism in a DNA sequence. A microsatellite polymorphism is a very short subsequence that is repeated a variable number of times between individuals. These repeats consist of the nucleotides cytosine and adenosine. - DNA polymorphism. - Variable number of tandem repeat polymorphism - Includes single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. A SNP is a DNA sequence variation where a single nucleotide differs between members of a species or paired chromosomes in an individual. - beta12orEarlier - Includes variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. 
VNTRs occur in non-coding regions of DNA and consists sub-sequence that is repeated a multiple (and varied) number of times. - - - - - - - - - - Nucleic acid design - - Topic for the design of nucleic acid sequences with specific conformations. - 1.3 - beta12orEarlier - true - - - - - - - - - - Primer or probe design - - 1.3 - true - beta13 - The design of primers for PCR and DNA amplification or the design of molecular probes. - - - - - - - - - - Structure databases - - beta13 - true - 1.2 - Structure data resources - Molecular secondary or tertiary (3D) structural data resources, typically of proteins and nucleic acids. - - - - - - - - - - Nucleic acid structure - - true - beta13 - Nucleic acid (secondary or tertiary) structure, such as whole structures, structural features and associated annotation. - 1.2 - - - - - - - - - - Sequence databases - - Molecular sequence data resources, including sequence sites, alignments, motifs and profiles. - true - beta13 - Sequence data resources - Sequence data - Sequence data resource - 1.3 - - - - - - - - - - Nucleic acid sequences - - Nucleotide sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - Nucleotide sequences - - - - - - - - - - Protein sequences - - Protein sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - - - - - - - - - - Protein interaction networks - - 1.3 - true - - - - - - - - - - Molecular biology - - true - VT 1.5.4 Biochemistry and molecular biology - beta13 - The molecular basis of biological activity, particularly the macromolecules (e.g. proteins and nucleic acids) that are essential to life. - - - - - - - - - - - Mammals - - true - beta13 - 1.3 - Mammals, e.g. information on a specific mammal genome including molecular sequences, genes and annotation. - - - - - - - - - - Biodiversity - - The degree of variation of life forms within a given ecosystem, biome or an entire planet. 
- beta13 - VT 1.5.5 Biodiversity conservation - true - http://purl.bioontology.org/ontology/MSH/D044822 - - - - - - - - - - - Sequence clusters and classification - - This includes the results of sequence clustering, ortholog identification, assignment to families, annotation etc. - The comparison, grouping together and classification of macromolecules on the basis of sequence similarity. - Sequence families - 1.3 - true - Sequence clusters - beta13 - - - - - - - - - - Genetics - - http://purl.bioontology.org/ontology/MSH/D005823 - true - The study of genes, genetic variation and heredity in living organisms. - beta13 - Heredity - - - - - - - - - - - Quantitative genetics - - beta13 - The genes and genetic mechanisms such as Mendelian inheritance that underly continuous phenotypic traits (such as height or weight). - true - - - - - - - - - - Population genetics - - The distribution of allele frequencies in a population of organisms and its change subject to evolutionary processes including natural selection, genetic drift, mutation and gene flow. - true - beta13 - - - - - - - - - - - Regulatory RNA - - 1.3 - Regulatory RNA sequences including microRNA (miRNA) and small interfering RNA (siRNA). - true - beta13 - - - - - - - - - - Documentation and help - - The documentation of resources such as tools, services and databases and how to get help. - true - beta13 - 1.13 - - - - - - - - - - Genetic organisation - - The structural and functional organisation of genes and other genetic elements. - 1.3 - beta13 - true - - - - - - - - - - Medical informatics - - true - Health informatics - Clinical informatics - Biomedical informatics - Translational medicine - The application of information technology to health, disease and biomedicine. - Healthcare informatics - beta13 - Health and disease - Molecular medicine - - - - - - - - - - - Developmental biology - - VT 1.5.14 Developmental biology - true - beta13 - How organisms grow and develop. 
- - - - - - - - - - - Embryology - - true - beta13 - The development of organisms between the one-cell stage (typically the zygote) and the end of the embryonic stage. - - - - - - - - - - - Anatomy - - VT 3.1.1 Anatomy and morphology - beta13 - The form and function of the structures of living organisms. - true - - - - - - - - - - - Literature and reference - - beta13 - true - http://purl.bioontology.org/ontology/MSH/D011642 - The scientific literature, reference information and documentation. - Literature sources - Bibliography - This includes the documentation of resources such as tools, services and databases, user support, how to get help etc. - Documentation - - - - - - - - - - - Biology - - VT 1.5.8 Biology - beta13 - VT 1.5 Biological sciences - VT 1.5.23 Reproductive biology - Cryobiology - Biological rhythms - A particular biological science, especially observable traits such as aspects of biochemistry, physiology, morphology, anatomy, development and so on. - VT 1.5.7 Biological rhythm - Biological science - Aerobiology - VT 1.5.99 Other - Chronobiology - true - VT 1.5.13 Cryobiology - - VT 1.5.1 Aerobiology - VT 1.5.3 Behavioural biology - Reproductive biology - Behavioural biology - - - - - - - - - - - Data management - - The development and use of architectures, policies, practices and procedures for management of data. - true - beta13 - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - VT 1.3.1 Data management - - - - - - - - - - - Sequence feature detection - - 1.3 - true - beta13 - The detection of the positional features, such as functional and other key sites, in molecular sequences. - http://purl.bioontology.org/ontology/MSH/D058977 - - - - - - - - - - Nucleic acid feature detection - - The detection of positional features such as functional sites in nucleotide sequences. 
- true - beta13 - 1.3 - - - - - - - - - - Protein feature detection - - The detection, identification and analysis of positional protein sequence features, such as functional sites. - beta13 - 1.3 - true - - - - - - - - - - Biological system modelling - - 1.2 - true - beta13 - Topic for modelling biological systems in mathematical terms. - - - - - - - - - - Data acquisition - - The acquisition of data, typically measurements of physical systems using any type of sampling system, or by another other means. - beta13 - - - - - - - - - - Genes and proteins resources - - 1.3 - Gene family - beta13 - Gene and protein families - Specific genes and/or their encoded proteins or a family or other grouping of related genes and proteins. - true - - - - - - - - - - Protein topological domains - - 1.13 - Topological domains such as cytoplasmic regions in a protein. - true - 1.8 - - - - - - - - - - Protein variants - - beta13 - true - Protein sequence variants produced e.g. from alternative splicing, alternative promoter usage, alternative initiation and ribosomal frameshifting. - - - - - - - - - - - Expression signals - - beta13 - true - 1.12 - Regions within a nucleic acid sequence containing a signal that alters a biological function. - - - - - - - - - - DNA binding sites - - - Matrix-attachment region - beta13 - Nucleosome exclusion sequences - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes), restriction enzyme recognition sites (restriction sites) etc. - Restriction sites - Ribosome binding sites - Scaffold-attachment region - This includes sites involved with DNA replication and recombination. This includes binding sites for initiation of replication (origin of replication), regions where transfer is initiated during the conjugation or mobilization (origin of transfer), starting sites for DNA duplication (origin of replication) and regions which are eliminated through any of kind of recombination. Also nucleosome exclusion regions, i.e. 
specific patterns or regions which exclude nucleosomes (the basic structural units of eukaryotic chromatin which play a significant role in regulating gene expression). - Nucleic acids binding to some other molecule. - Matrix/scaffold attachment region - - - - - - - - - - - Nucleic acid repeats - - true - beta13 - This includes long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a defined sequence and other types of repeating unit. - Repetitive elements within a nucleic acid sequence. - 1,13 - - - - - - - - - - DNA replication and recombination - - DNA replication or recombination. - beta13 - true - - - - - - - - - - Signal or transit peptide - - beta13 - 1.13 - true - Coding sequences for a signal or transit peptide. - - - - - - - - - - Sequence tagged sites - - beta13 - 1.13 - Sequence tagged sites (STS) in nucleic acid sequences. - true - - - - - - - - - - Sequencing - - Resequencing - true - http://purl.bioontology.org/ontology/MSH/D059014 - Chromosome walking - NGS - Next gen sequencing - DNA-Seq - High throughput sequencing - 1.1 - Primer walking - Next generation sequencing - The determination of complete (typically nucleotide) sequences, including those of genomes (full genome sequencing, de novo sequencing and resequencing), amplicons and transcriptomes. - - - - - - - - - - - ChIP-seq - - - Chip sequencing - 1.1 - The analysis of protein-DNA interactions where chromatin immunoprecipitation (ChIP) is used in combination with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. - Chip Seq - Chip-sequencing - - - - - - - - - RNA-Seq - - Small RNA-seq - Whole transcriptome shotgun sequencing - RNA-seq - miRNA-seq - 1.1 - A topic concerning high-throughput sequencing of cDNA to measure the RNA content (transcriptome) of a sample, for example, to investigate how different alleles of a gene are expressed, detect post-transcriptional mutations or identify gene fusions. 
- Small RNA-Seq - WTSS - This includes small RNA profiling (small RNA-Seq), for example to find novel small RNAs, characterize mutations and analyze expression of small RNAs. - - - - - - - - - DNA methylation - - true - DNA methylation including bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - 1.3 - http://purl.bioontology.org/ontology/MSH/D019175 - 1.1 - - - - - - - - - - Metabolomics - - The systematic study of metabolites, the chemical processes they are involved, and the chemical fingerprints of specific cellular processes in a whole cell, tissue, organ or organism. - true - http://purl.bioontology.org/ontology/MSH/D055432 - 1.1 - - - - - - - - - - - Epigenomics - - - Epigenetics concerns the heritable changes in gene expression owing to mechanisms other than DNA sequence variation. - 1.1 - http://purl.bioontology.org/ontology/MSH/D057890 - The study of the epigenetic modifications of a whole cell, tissue, organism etc. - true - - - - - - - - - - - Metagenomics - - - http://purl.bioontology.org/ontology/MSH/D056186 - Ecogenomics - Community genomics - Environmental genomics - true - 1.1 - The study of genetic material recovered from environmental samples, and associated environmental data. - - - - - - - - - - - DNA structural variation - - - 1.1 - Variation in chromosome structure including microscopic and submicroscopic types of variation such as deletions, duplications, copy-number variants, insertions, inversions and translocations. - Structural variation - Genomic structural variation - - - - - - - - - - DNA packaging - - Nucleosome positioning - beta12orEarlier - DNA-histone complexes (chromatin), organisation of chromatin into nucleosomes and packaging into higher-order structures. 
- http://purl.bioontology.org/ontology/MSH/D042003 - - - - - - - - - - DNA-Seq - - 1.1 - A topic concerning high-throughput sequencing of randomly fragmented genomic DNA, for example, to investigate whole-genome sequencing and resequencing, SNP discovery, identification of copy number variations and chromosomal rearrangements. - 1.3 - DNA-seq - true - - - - - - - - - - RNA-Seq alignment - - true - 1.3 - RNA-seq alignment - The alignment of sequences of (typically millions) of short reads to a reference genome. This is a specialised topic within sequence alignment, especially because of complications arising from RNA splicing. - beta12orEarlier - - - - - - - - - - ChIP-on-chip - - ChiP - ChIP-Chip - 1.1 - Experimental techniques that combine chromatin immunoprecipitation ('ChIP') with microarray ('chip'). ChIP-on-chip is used for high-throughput study protein-DNA interactions. - ChIP-chip - - - - - - - - - Data security - - 1.3 - Data privacy - The protection of data, such as patient health data, from damage or unwanted access from unauthorized users. - - - - - - - - - - Sample collections - - samples - biobanking - 1.3 - biosamples - Biological samples and specimens. - Specimen collections - - - - - - - - - - - Biochemistry - - - VT 1.5.4 Biochemistry and molecular biology - Chemical biology - 1.3 - Biological chemistry - true - Chemical substances and physico-chemical processes and that occur within living organisms. - - - - - - - - - - - Phylogenetics - - - The study of evolutionary relationships amongst organisms from analysis of genetic information (typically gene or protein sequences). - 1.3 - http://purl.bioontology.org/ontology/MSH/D010802 - true - - - - - - - - - - Epigenetics - - Topic concerning the study of heritable changes, for example in gene expression or phenotype, caused by mechanisms other than changes in the DNA sequence. - This includes sub-topics such as histone modification and DNA methylation. 
DNA methylation includes bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - http://purl.bioontology.org/ontology/MSH/D019175 - DNA methylation - Bisulfite sequencing - Histone modification - true - 1.3 - - - - - - - - - - - Biotechnology - - true - 1.3 - The exploitation of biological process, structure and function for industrial purposes, for example the genetic manipulation of microorganisms for the antibody production. - - - - - - - - - - - Phenomics - - - - Phenomes, or the study of the change in phenotype (the physical and biochemical traits of organisms) in response to genetic and environmental factors. - 1.3 - true - - - - - - - - - - - Evolutionary biology - - VT 1.5.16 Evolutionary biology - true - 1.3 - The evolutionary processes, from the genetic to environmental scale, that produced life in all its diversity. - - - - - - - - - - - Physiology - - The functions of living organisms and their constituent parts. - 1.3 - VT 3.1.8 Physiology - true - - - - - - - - - - - Microbiology - - true - The biology of microorganisms. - 1.3 - VT 1.5.20 Microbiology - - - - - - - - - - - Parasitology - - true - 1.3 - The biology of parasites. - - - - - - - - - - - Medicine - - General medicine - Research in support of healing by diagnosis, treatment, and prevention of disease. - true - 1.3 - VT 3.1 Basic medicine - VT 3.2.9 General and internal medicine - Experimental medicine - Biomedical research - Clinical medicine - VT 3.2 Clinical medicine - Internal medicine - - - - - - - - - - - Neurobiology - - Neuroscience - 1.3 - true - The study of the nervous system and brain; its anatomy, physiology and function. - VT 3.1.5 Neuroscience - - - - - - - - - - - Public health and epidemiology - - VT 3.3.1 Epidemiology - Topic concerning the the patterns, cause, and effect of disease within populations. 
- true - 1.3 - Public health - Epidemiology - - - - - - - - - - - Biophysics - - - 1.3 - true - VT 1.5.9 Biophysics - The use of physics to study biological system. - - - - - - - - - - - Computational biology - - VT 1.5.19 Mathematical biology - VT 1.5.12 Computational biology - This includes the modeling and treatment of biological processes and systems in mathematical terms (theoretical biology). - Mathematical biology - VT 1.5.26 Theoretical biology - Theoretical biology - 1.3 - The development and application of theory, analytical methods, mathematical models and computational simulation of biological systems. - true - Biomathematics - - - - - - - - - - - Transcriptomics - - - Comparative transcriptomics - Metatranscriptomics - The analysis of transcriptomes, or a set of all the RNA molecules in a specific cell, tissue etc. - Transcriptome - 1.3 - true - - - - - - - - - - - Chemistry - - VT 1.7.10 Polymer science - VT 1.7.7 Mathematical chemistry - VT 1.7.3 Colloid chemistry - 1.3 - Mathematical chemistry - Physical chemistry - VT 1.7.9 Physical chemistry - Polymer science - Chemical science - Organic chemistry - VT 1.7.6 Inorganic and nuclear chemistry - VT 1.7 Chemical sciences - VT 1.7.5 Electrochemistry - Inorganic chemistry - VT 1.7.2 Chemistry - Nuclear chemistry - VT 1.7.8 Organic chemistry - The composition and properties of matter, reactions, and the use of reactions to create new substances. - - - - - - - - - - - Mathematics - - The study of numbers (quantity) and other topics including structure, space, and change. - VT:1.1 Mathematics - Maths - VT 1.1.99 Other - 1.3 - - - - - - - - - - - Computer science - - 1.3 - VT 1.2 Computer sciences - VT 1.2.99 Other - The theory and practical use of computer systems. - - - - - - - - - - - Physics - - The study of matter, space and time, and related concepts such as energy and force. 
- 1.3 - - - - - - - - - - - RNA splicing - - - This includes the study of splice sites, splicing patterns, alternative splicing events and variants, isoforms, etc.. - Splice sites - RNA splicing; post-transcription RNA modification involving the removal of introns and joining of exons. - 1.3 - Alternative splicing - true - - - - - - - - - - Molecular genetics - - - 1.3 - The structure and function of genes at a molecular level. - true - - - - - - - - - - - Respiratory medicine - - true - VT 3.2.25 Respiratory systems - Pulmonology - The study of respiratory system. - Pulmonary medicine - Respiratory disease - 1.3 - Pulmonary disorders - - - - - - - - - - - Metabolic disease - - The study of metabolic diseases. - 1.4 - 1.3 - true - - - - - - - - - - Infectious disease - - Transmissable disease - VT 3.3.4 Infectious diseases - Communicable disease - The branch of medicine that deals with the prevention, diagnosis and management of transmissable disease with clinically evident illness resulting from infection with pathogenic biological agents (viruses, bacteria, fungi, protozoa, parasites and prions). - 1.3 - - - - - - - - - - - Rare diseases - - 1.3 - The study of rare diseases. - - - - - - - - - - - Computational chemistry - - - 1.3 - VT 1.7.4 Computational chemistry - true - Topic concerning the development and application of theory, analytical methods, mathematical models and computational simulation of chemical systems. - - - - - - - - - - - Neurology - - Neurological disorders - true - 1.3 - The branch of medicine that deals with the anatomy, functions and disorders of the nervous system. - - - - - - - - - - - Cardiology - - true - Cardiovascular disease - VT 3.2.4 Cardiac and Cardiovascular systems - 1.3 - Cardiovascular medicine - Heart disease - VT 3.2.22 Peripheral vascular disease - The diseases and abnormalities of the heart and circulatory system. - - - - - - - - - - - Drug discovery - - - The discovery and design of drugs or potential drug compounds. 
- This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - 1.3 - true - - - - - - - - - - - Biobank - - true - biobanking - 1.3 - Repositories of biological samples, typically human, for basic biological and clinical research. - Tissue collection - - - - - - - - - - - Mouse clinic - - 1.3 - Laboratory study of mice, for example, phenotyping, and mutagenesis of mouse cell lines. - - - - - - - - - - - Microbial collection - - Collections of microbial cells including bacteria, yeasts and moulds. - 1.3 - - - - - - - - - - - Cell culture collection - - 1.3 - Collections of cells grown under laboratory conditions, specifically, cells from multi-cellular eukaryotes and especially animal cells. - - - - - - - - - - - Clone library - - 1.3 - Collections of DNA, including both collections of cloned molecules, and populations of micro-organisms that store and propagate cloned DNA. - - - - - - - - - - - Translational medicine - - 'translating' the output of basic and biomedical research into better diagnostic tools, medicines, medical procedures, policies and advice. - true - 1.3 - - - - - - - - - - - Compound libraries and screening - - Translational medicine - Chemical library - Collections of chemicals, typically for use in high-throughput screening experiments. - Compound library - Chemical screening - 1.3 - - - - - - - - - - - Biomedical science - - Topic concerning biological science that is (typically) performed in the context of medicine. - true - VT 3.3 Health sciences - Health science - 1.3 - - - - - - - - - - - Data identity and mapping - - Topic concerning the identity of biological entities, or reports on such entities, and the mapping of entities and records in different databases. - 1.3 - - - - - - - - - - - Sequence search - - 1.3 - Sequence database search - true - 1.12 - The search and retrieval from a database on the basis of molecular sequence similarity. 
- - - - - - - - - - Biomarkers - - Diagnostic markers - 1.4 - Objective indicators of biological state often used to assess health, and determinate treatment. - true - - - - - - - - - - Laboratory techniques - - The procedures used to conduct an experiment. - Lab techniques - 1.4 - - - - - - - - - - - Data architecture, analysis and design - - The development of policies, models and standards that cover data acquisitioin, storage and integration, such that it can be put to use, typically through a process of systematically applying statistical and / or logical techniques to describe, illustrate, summarise or evaluate data. - Data analysis - Data design - 1.4 - Data architecture - - - - - - - - - - - Data integration and warehousing - - The combination and integration of data from different sources, for example into a central repository or warehouse, to provide users with a unified view of these data. - - - Data integration - 1.4 - Data warehousing - - - - - - - - - - - Biomaterials - - Any matter, surface or construct that interacts with a biological system. - Diagnostic markers - 1.4 - - - - - - - - - - - Chemical biology - - - true - 1.4 - The use of synthetic chemistry to study and manipulate biological systems. - - - - - - - - - - - Analytical chemistry - - 1.4 - The study of the separation, identification, and quantification of the chemical components of natural and artificial materials. - VT 1.7.1 Analytical chemistry - - - - - - - - - - - Synthetic chemistry - - Synthetic organic chemistry - The use of chemistry to create new compounds. - 1.4 - - - - - - - - - - - Software engineering - - VT 1.2.1 Algorithms - Programming languages - VT 1.2.7 Data structures - Software development - Software engineering - Computer programming - 1.4 - 1.2.12 Programming languages - The process that leads from an original formulation of a computing problem to executable programs. 
- Data structures - Algorithms - VT 1.2.14 Software engineering - - - - - - - - - - - Drug development - - 1.4 - Medicine development - The process of bringing a new drug to market once a lead compounds has been identified through drug discovery. - Drug development science - Medicines development - true - - - - - - - - - - - Drug formulation and delivery - - The process of formulating abd administering a pharmaceutical compound to achieve a therapeutic effect. - Drug delivery - Drug formulation - 1.4 - - - - - - - - - - - Pharmacokinetics and pharmacodynamics - - Pharmacodynamics - Pharmacokinetics - Drug distribution - true - 1.4 - Drug excretion - The study of how a drug interacts with the body. - Drug absorption - ADME - Drug metabolism - Drug metabolism - - - - - - - - - - - Medicines research and development - Medicine research and development - - The discovery, development and approval of medicines. - Health care research - Drug discovery and development - 1.4 - Health care science - - - - - - - - - - - Safety sciences - - 1.4 - Drug safety - The safety (or lack) of drugs and other medical interventions. - - - - - - - - - - - Pharmacovigilence - - 1.4 - Pharmacovigilence concerns safety once a drug has gone to market. - The detection, assesment, understanding and prevention of adverse effects of medicines. - - - - - - - - - - - Preclinical and clinical studies - - - The testing of new medicines, vaccines or procedures on animals (preclinical) and humans (clinical) prior to their approval by regulatory authorities. - Preclinical studies - 1.4 - Clinical study - Preclinical study - Clinical studies - - - - - - - - - - - Imaging - - true - Microscopy imaging - Microscopy - Diffraction experiment - The visual representation of an object. 
- This includes diffraction experiments that are based upon the interference of waves, typically electromagnetic waves such as X-rays or visible light, by some object being studied, typical in order to produce an image of the object or determine its structure. - 1.4 - - - - - - - - - - - Biological imaging - - The use of imaging techniques to understand biology. - 1.4 - - - - - - - - - - - Medical imaging - - VT 3.2.24 Radiology - The use of imaging techniques for clinical purposes for medical research. - 1.4 - Radiology - VT 3.2.14 Nuclear medicine - Nuclear medicine - VT 3.2.13 Medical imaging - - - - - - - - - - - Light microscopy - - The use of optical instruments to magnify the image of an object. - 1.4 - - - - - - - - - - - Laboratory animal science - - 1.4 - The use of animals and alternatives in experimental research. - - - - - - - - - - - Marine biology - - 1.4 - VT 1.5.18 Marine and Freshwater biology - true - The study of organisms in the ocean or brackish waters. - - - - - - - - - - - Molecular medicine - - The identification of molecular and genetic causes of disease and the development of interventions to correct them. - 1.4 - true - - - - - - - - - - - Nutritional science - - 1.4 - VT 3.3.7 Nutrition and Dietetics - Dietetics - The study of the effects of food components on the metabolism, health, performance and disease resistance of humans and animals. It also includes the study of human behaviours related to food choices. - Nutrition science - - - - - - - - - - - Omics - - true - The collective characterisation and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms. - 1.4 - - - - - - - - - - - Quality affairs - - The processes that need to be in place to ensure the quality of products for human or animal use. 
- Good clinical practice - Good manufacturing practice - Quality assurance - Good laboratory practice - 1.4 - - - - - - - - - - - Regulatory affairs - - The protection of public health by controlling the safety and efficacy of products in areas including pharmaceuticals, veterinary medicine, medical devices, pesticides, agrochemicals, cosmetics, and complementary medicines. - 1.4 - - - - - - - - - - - Regnerative medicine - - Stem cell research - Biomedical approaches to clinical interventions that involve the use of stem cells. - true - 1.4 - - - - - - - - - - - Systems medicine - - true - 1.4 - An interdisciplinary field of study that looks at the dynamic systems of the human body as part of an integrted whole, incoporating biochemical, physiological, and environmental interactions that sustain life. - - - - - - - - - - - Veterinary medicine - - 1.4 - Topic concerning the branch of medicine that deals with the prevention, diagnosis, and treatment of disease, disorder and injury in animals. - - - - - - - - - - - Bioengineering - - 1.4 - The application of biological concepts and methods to the analytical and synthetic methodologies of engineering. - Diagnostic markers - - - - - - - - - - - Geriatric medicine - - The branch of medicine dealing with the diagnosis, treatment and prevention of disease in older people, and the problems specific to aging. - VT 3.2.10 Geriatrics and gerontology - true - Ageing - Gerontology - Aging - 1.4 - Geriatrics - - - - - - - - - - - Allergy, clinical immunology and immunotherapeutics. - - VT 3.2.1 Allergy - Health issues related to the immune system and their prevention, diagnosis and mangement. - 1.4 - true - Immune disorders - Clinical immunology - Immunomodulators - Allergy - Immunotherapeutics - - - - - - - - - - - Pain medicine - - 1.4 - Algiatry - true - The prevention of pain and the evaluation, treatment and rehabilitation of persons in pain. 
- - - - - - - - - - - Anaesthesiology - - Anaesthetics - Anaesthesia and anaesthetics. - 1.4 - VT 3.2.2 Anaesthesiology - - - - - - - - - - - Critical care medicine - - Acute medicine - VT 3.2.5 Critical care/Emergency medicine - Emergency medicine - 1.4 - The multidisciplinary that cares for patients with acute, life-threatening illness or injury. - - - - - - - - - - - Dermatology - - The branch of medicine that deals with prevention, diagnosis and treatment of disorders of the skin, scalp, hair and nails. - Dermatological disorders - 1.4 - VT 3.2.7 Dermatology and venereal diseases - - - - - - - - - - - Dentistry - - 1.4 - The study, diagnosis, prevention and treatments of disorders of the oral cavity, maxillofacial area and adjacent structures. - - - - - - - - - - - Ear, nose and throat medicine - - Otolaryngology - 1.4 - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the ear, nose and throat. - Otorhinolaryngology - Head and neck disorders - VT 3.2.20 Otorhinolaryngology - Audiovestibular medicine - - - - - - - - - - - Endocrinology and metabolism - - 1.4 - Metabolic disorders - true - The branch of medicine dealing with diseases of endocrine organs, hormone systems, their target organs, and disorders of the pathways of glucose and lipid metabolism. - Metabolism - Endocrinology - Endocrine disorders - - - - - - - - - - - Haematology - - VT 3.2.11 Hematology - true - The branch of medicine that deals with the blood, blood-forming organs and blood diseases. - Haematological disorders - 1.4 - Blood disorders - - - - - - - - - - - Gastroenterology - - true - The branch of medicine that deals with disorders of the oesophagus, stomach, duodenum, jejenum, ileum, large intestine, sigmoid colon and rectum. 
- Gastrointestinal disorders - VT 3.2.8 Gastroenterology and hepatology - 1.4 - - - - - - - - - - - Gender medicine - - The study of the biological and physiological differences between males and females and how they effect differences in disease presentation and management. - 1.4 - - - - - - - - - - - Gynaecology and obstetrics - - The branch of medicine that deals with the health of the female reproductive system, pregnancy and birth. - true - 1.4 - VT 3.2.15 Obstetrics and gynaecology - Gynaecology - Gynaecological disorders - Obstetrics - - - - - - - - - - - Hepatic and biliary medicine - - Hepatobiliary medicine - Liver disorders - 1.4 - true - The branch of medicine that deals with the liver, gallbladder, bile ducts and bile. - - - - - - - - - - - Infectious tropical disease - - The branch of medicine that deals with the infectious diseases of the tropics. - 1.13 - true - 1.4 - - - - - - - - - - Trauma medicine - - 1.4 - The branch of medicine that treats body wounds or shock produced by sudden physical injury, as from violence or accident. - - - - - - - - - - - Medical toxicology - - true - The branch of medicine that deals with the diagnosis, management and prevention of poisoning and other adverse health effects caused by medications, occupational and environmental toxins, and biological agents. - 1.4 - - - - - - - - - - - Musculoskeletal medicine - - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the muscle, bone and connective tissue. It incorporates aspects of orthopaedics, rheumatology, rehabilitation medicine and pain medicine. 
- VT 3.2.26 Rheumatology - VT 3.2.19 Orthopaedics - Musculoskeletal disorders - Orthopaedics - Rheumatology - 1.4 - - - - - - - - - - - Opthalmology - - Eye disoders - VT 3.2.18 Optometry - 1.4 - Optometry - VT 3.2.17 Ophthalmology - Audiovestibular medicine - The branch of medicine that deals with disorders of the eye, including eyelid, optic nerve/visual pathways and occular muscles. - - - - - - - - - - - Paediatrics - - 1.4 - The branch of medicine that deals with the medical care of infants, children and adolescents. - VT 3.2.21 Paediatrics - Child health - - - - - - - - - - - Psychiatry - - The branch of medicine that deals with the mangement of mental illness, emotional disturbance and abnormal behaviour. - 1.4 - Psychiatric disorders - VT 3.2.23 Psychiatry - Mental health - - - - - - - - - - - Reproductive health - - Reproductive disorders - Audiovestibular medicine - VT 3.2.3 Andrology - Andrology - 1.4 - Family planning - The health of the reproductive processes, functions and systems at all stages of life. - Fertility medicine - - - - - - - - - - - Surgery - - Transplantation - VT 3.2.28 Transplantation - The use of operative, manual and instrumental techniques on a patient to investigate and/or treat a pathological condition or help improve bodily function or appearance. - 1.4 - - - - - - - - - - - Urology and nephrology - - The branches of medicine and physiology focussing on the function and disorders of the urinary system in males and females, the reproductive system in males, and the kidney. - VT 3.2.29 Urology and nephrology - 1.4 - Urology - Kidney disease - Urological disorders - Nephrology - - - - - - - - - - - Complementary medicine - - Medical therapies that fall beyond the scope of conventional medicine but may be used alongside it in the treatment of disease and ill health. 
- VT 3.2.12 Integrative and Complementary medicine - Holistic medicine - 1.4 - Alternative medicine - Integrative medicine - - - - - - - - - - - MRI - - Nuclear magnetic resonance imaging - 1.7 - MRT - Magnetic resonance tomography - Techniques that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - NMRI - Magnetic resonance imaging - - - - - - - - - - - Neutron diffraction - - - The study of matter by studying the diffraction pattern from firing neutrons at a sample, typically to determine atomic and/or magnetic structure. - Neutron microscopy - Elastic neutron scattering - 1.7 - Neutron diffraction experiment - - - - - - - - - - Tomography - - X-ray tomography - Imaging in sections (sectioning), through the use of a wave-generating device (tomograph) that generates an image (a tomogram). - Electron tomography - 1.7 - - - - - - - - - - Data mining - - 1.7 - VT 1.3.2 Data mining - The discovery of patterns in large data sets and the extraction and trasnsformation of those patterns into a useful format. - true - KDD - Knowledge discovery in databases - - - - - - - - - - Machine learning - - A topic concerning the application of artificial intelligence methods to algorithms, in order to create methods that can learn from data in order to generate an ouput, rather than relying on explicitly encoded information only. - Artificial Intelligence - 1.7 - VT 1.2.2 Artificial Intelligence (expert systems, machine learning, robotics) - - - - - - - - - - Database management - - File management - Document, record and content management - Database administration - This includes databases for the results of scientific experiments, the application of high-throughput technology, computational analysis and the scientific literature. It covers the management and manipulation of digital documents, including database records, files and reports. 
- Document management - Content management - 1.8 - Databases - Data maintenance - The general handling of data stored in digital archives such as databanks, databases proper, web portals and other data resources. - - Record management - Biological databases - - - - - - - - - - Animals - - 1.8 - Animal biology - Animals, e.g. information on a specific animal genome including molecular sequences, genes and annotation. - Zoology - Animal - VT 1.5.29 Zoology - The resource may be specific to a plant, a group of plants or all plants. - Metazoa - - - - - - - - - - Protein sites, features and motifs - - - A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Protein sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in protein sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Signal peptide cleavage sites - - - - - - - - - - Nucleic acid sites, features and motifs - - - Primer binding sites - Nucleic acid functional sites - Sequence tagged sites - Nucleic acid sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in nucleic acid sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Sequence tagged sites are short DNA sequences that are unique within a genome and serve as a mapping landmark, detectable by PCR they allow a genome to be mapped via an ordering of STSs. 
- - - - - - - - - - Gene transcripts - - - EST - This includes Introns, and protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. Also expressed sequence tag (EST) or complementary DNA (cDNA) sequences. - Transcription - mRNA features - This includes regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. A polyA signal is required for endonuclease cleavage of an RNA transcript that is followed by polyadenylation. A polyA site is a site on an RNA transcript to which adenine residues will be added during post-transcriptional polyadenylation. - cDNA - Introns - PolyA site - Fusion transcripts - Exons - Signal peptide coding sequence - This includes coding sequences for a signal or transit peptide. A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Transcription of DNA into RNA and features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. - 1.8 - PolyA signal - mRNA - Transit peptide coding sequence - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Coding RNA - Gene transcript features - - - - - - - - - - Protein-ligand interactions - - true - 1.8 - Protein-ligand (small molecule) interaction(s). - 1.13 - Protein-drug interactions - - - - - - - - - - Protein-drug interactions - - 1.13 - 1.8 - true - Protein-drug interaction(s). - - - - - - - - - - Genotyping experiment - - 1.8 - Genotype experiment including case control, population, and family studies. 
These might use array based methods and re-sequencing methods. - - - - - - - - - - GWAS study - - 1.8 - Genome-wide association study experiments. - Genome-wide association study - - - - - - - - - - Microarray experiment - - ChIP-chip - Microarray experiments including conditions, protocol, sample:data relationships etc. - Microarrays - Tissue microarray - Reverse phase protein array - Methylation array - mRNA microarray - Multichannel microarray - Proprietary platform micoarray - MicroRNA array - 1.8 - Two channel microarray - miRNA array - This might specify which raw data file relates to which sample and information on hybridisations, e.g. which are technical and which are biological replicates. - One channel microarray - ChIP-on-chip - Genotyping array - - - - - - - - - - PCR experiment - - 1.8 - PCR experiments, e.g. quantitative real-time PCR. - - - - - - - - - - Proteomics experiment - - Proteomics experiments. - Northern blot experiment - 2D PAGE experiment - 1.8 - This includes two-dimensional gel electrophoresis (2D PAGE) experiments, gels or spots in a gel. Also mass spectrometry - an analytical chemistry technique that measures the mass-to-charge ratio and abundance of irons in the gas phase. Also Northern blot experiments. - Mass spectrometry - - - - - - - - - - 2D PAGE experiment - - true - Two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - 1.13 - - - - - - - - - - Northern blot experiment - - Northern Blot experiments. - true - 1.13 - 1.8 - - - - - - - - - - RNAi experiment - - 1.8 - RNAi experiments. - - - - - - - - - - Simulation experiment - - 1.8 - Biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - - - - - - - - - - Protein-nucleic acid interactions - - true - 1.8 - Protein-DNA/RNA interaction(s). 
- 1.13 - - - - - - - - - - Protein-protein interactions - - 1.13 - Protein-protein interaction(s), including interactions between protein domains. - 1.8 - true - - - - - - - - - - Cellular process pathways - - 1.8 - Cellular process pathways. - true - 1.13 - - - - - - - - - - Disease pathways - - 1.13 - Disease pathways, typically of human disease. - true - 1.8 - - - - - - - - - - Environmental information processing pathways - - true - Environmental information processing pathways. - 1.8 - 1.13 - - - - - - - - - - Genetic information processing pathways - - true - 1.8 - Genetic information processing pathways. - 1.13 - - - - - - - - - - Protein super-secondary structure - - Super-secondary structure of protein sequence(s). - true - 1.8 - 1.13 - - - - - - - - - - Protein active sites - - 1.8 - 1.13 - true - Catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein binding sites - - Protein functional sites - Enzyme active site - Binding sites in proteins, including cleavage sites (for a proteolytic enzyme or agent), key residues involved in protein folding, catalytic residues (active site) of an enzyme, ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids, RNA and DNA-binding proteins and binding sites etc. - Protein-nucleic acid binding sites - 1.8 - Protein cleavage sites - Protein key folding sites - - - - - - - - - - Protein-nucleic acid binding sites - - RNA and DNA-binding proteins and binding sites in protein sequences. - 1.13 - 1.8 - true - - - - - - - - - - Protein cleavage sites - - Cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. - true - 1.8 - 1.13 - - - - - - - - - - Protein chemical modifications - - true - Chemical modification of a protein. - 1.13 - 1.8 - - - - - - - - - - Protein disordered structure - - Disordered structure in a protein. 
- 1.8 - Protein features (disordered structure) - - - - - - - - - - Protein domains - - true - 1.13 - Structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - - - - - - - - - - Protein key folding sites - - 1.8 - 1.13 - true - Key residues involved in protein folding. - - - - - - - - - - Protein post-translational modifications - - true - 1.13 - Post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein secondary structure - - The location and size of the secondary structure elements and intervening loop regions is typically given. The report can include disulphide bonds and post-translationally formed peptide bonds (crosslinks). - Secondary structure (predicted or real) of a protein, including super-secondary structure. - Protein super-secondary structure - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - Protein features (secondary structure) - 1.8 - - - - - - - - - - Protein sequence repeats - - true - 1.8 - Short repetitive subsequences (repeat sequences) in a protein sequence. - 1.13 - - - - - - - - - - Protein signal peptides - - 1.13 - Signal peptides or signal peptide cleavage sites in protein sequences. - true - 1.8 - - - - - - - - - - Protein interaction experiment - - 1.12 - Yeast one-hybrid - Co-immunoprecipitation - An experiment for studying protein-protein interactions. - Yeast two-hybrid - Phage display - - - - - - - - - - Applied mathematics - - VT 1.1.1 Applied mathematics - The application of mathematics to specific problems in science, typically by the formulation and analysis of mathematical models. - 1.10 - - - - - - - - - - Pure mathematics - - VT 1.1.1 Pure mathematics - The study of abstract mathematical concepts. 
- 1.10 - - - - - - - - - - Data governance - - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - The control of data entry and maintenance to ensure the data meets defined standards, qualities or constraints. - 1.10 - Data stewardship - - - - - - - - - - Data quality management - - http://purl.bioontology.org/ontology/MSH/D030541 - 1.10 - Data quality - Data integrity - Data clean-up - Data enrichment - The quality, integrity, cleaning up and enrichment of data. - - - - - - - - - - Freshwater biology - - 1.10 - VT 1.5.18 Marine and Freshwater biology - The study of organisms in freshwater ecosystems. - - - - - - - - - - - Human genetics - - true - The study of inheritance in human beings. - VT 3.1.2 Human genetics - 1.10 - - - - - - - - - - - Tropical medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - VT 3.3.14 Tropical medicine - - - - - - - - - - - Medical biotechnology - - VT 3.4.1 Biomedical devices - 1.10 - true - VT 3.4.2 Health-related biotechnology - VT 3.4 Medical biotechnology - VT 3.3.14 Tropical medicine - Pharmaceutical biotechnology - Biotechnology applied to the medical sciences and the development of medicines. - - - - - - - - - - - Personalized medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - Molecular diagnostics - true - VT 3.4.5 Molecular diagnostics - - - - - - - - - - - Immunoprecipitation experiment - - - - Chromatin immunoprecipitation - Experimental techniques to purify a protein-DNA crosslinked complex. Usually sequencing follows e.g. in the techniques ChIP-chip, ChIP-seq and MeDIP-seq. - 1.12 - - - - - - - - - - Whole genome sequencing - - 1.12 - Laboratory technique to sequence the complete DNA sequence of an organism's genome at a single time. 
- WGS - Whole genome resequencing - - - - - - - - - - Methylated DNA immunoprecipitation - - 1.12 - MeDIP-seq - Methylated DNA immunoprecipitation (MeDIP) - Methylation sequencing - Laboratory technique to sequence the methylated regions in DNA. - MeDIP-chip - Bisulfite sequencing - MeDIP - mDIP - - - - - - - - - - Exome sequencing - - 1.1 - Exome capture - Exome sequencing is considered a cheap alternative to whole genome sequencing. - Targeted exome capture - Exome sequence analysis - Laboratory technique to sequence all the protein-coding regions in a genome, i.e., the exome. - Exome analysis - - - - - - - - - - - Experimental design and studies - - Design of experiments - 1.12 - Experimental design - Studies - The design of an experiment intended to test a hypothesis, and describe or explain empirical data obtained under various experimental conditions. - true - - - - - - - - - - - Animal study - - - Challenge study - 1.12 - The design of an experiment involving non-human animals. - - - - - - - - - - Microbial ecology - - - 1.13 - The ecology of microorganisms including their relationship with one another and their environment. - Microbiome - true - Environmental microbiology - - - - - - - - - - Obsolete concept (EDAM) - - 1.2 - Needed for conversion to the OBO format. - An obsolete concept (redefined in EDAM). - true - - - - - - - - - - - - - - diff --git a/releases/EDAM_1.14.owl b/releases/EDAM_1.14.owl deleted file mode 100644 index b0b3ee1..0000000 --- a/releases/EDAM_1.14.owl +++ /dev/null @@ -1,53249 +0,0 @@ - - - - - - - - - - - - - -]> - - - - - EDAM_topic http://edamontology.org/topic_ "EDAM topics" - EDAM_operation http://edamontology.org/operation_ "EDAM operations" - formats "EDAM data formats" - EDAM - Jon Ison, Matus Kalas, Hervé Ménager - identifiers "EDAM types of identifiers" - data "EDAM types of data" - relations "EDAM relations" - edam "EDAM" - EDAM editors: Jon Ison, Matus Kalas, and Herve Menager. 
Contributors: Inge Jonassen, Dan Bolser, Hamish McWilliam, Mahmut Uludag, James Malone, Rodrigo Lopez, Steve Pettifer, and Peter Rice. Contibutions from these projects: EMBRACE, ELIXIR, and BioMedBridges (EU); EMBOSS (BBSRC, UK); eSysbio, FUGE Bioinformatics Platform, and ELIXIR.NO/Norwegian Bioinformatics Platform (Research Council of Norway). See http://edamontology.org for documentation and licence. - operations "EDAM operations" - Bioinformatics operations, data types, formats, identifiers and topics - EDAM http://edamontology.org/ "EDAM relations and concept properties" - application/rdf+xml - EDAM_data http://edamontology.org/data_ "EDAM types of data" - concept_properties "EDAM concept properties" - Jon Ison - 3730 - Matúš Kalaš - EDAM_format http://edamontology.org/format_ "EDAM data formats" - 1.14 - topics "EDAM topics" - 24:02:2016 21:54GMT - Hervé Ménager - EDAM is an ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM is a simple ontology - essentially a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use by curators, software developers and end-users. EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources. EDAM is also suitable for diverse application including for example within workbenches and workflow-management systems, software distributions, and resource registries. - - - - - - - - - - - - - - - Citation - concept_properties - 1.13 - Publication reference - Publication - 'Citation' concept property ('citation' metadata tag) contains a dereferenceable URI, preferrably including a DOI, pointing to a citeable publication of the given data format. - true - - - - - - - - Created in - Version in which a concept was created. 
- true - concept_properties - - - - - - - - Documentation - Specification - 'Documentation' trailing modifier (qualifier, 'documentation') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to a page with explanation, description, documentation, or specification of the given data format. - true - concept_properties - - - - - - - - Example - 'Example' concept property ('example' metadata tag) lists examples of valid values of types of identifiers (accessions). Applicable to some other types of data, too. - true - Separated by bar ('|'). - concept_properties - - - - - - - - File extension - 'File extension' concept property ('file_extension' metadata tag) lists examples of usual file extensions of formats. - Separated by bar ('|'), without a dot ('.') prefix, preferrably not all capital characters. - concept_properties - true - - - - - - - - isdebtag - When 'true', the term has been proposed or is supported within Debian Med as a tag. - concept_properties - true - - - - - - - - Media type - MIME type - 'Media type' trailing modifier (qualifier, 'media_type') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to a page specifying a media type of the given data format. - true - concept_properties - - - - - - - - - - - - - - Obsolete since - true - concept_properties - Version in which a concept was made obsolete. - - - - - - - - Regular expression - 'Regular expression' concept property ('regex' metadata tag) specifies the allowed values of types of identifiers (accessions). Applicable to some other types of data, too. 
- concept_properties - true - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - has format - "http://purl.obolibrary.org/obo/OBI_0000298" - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. Object B can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology that is a 'Format' or is in the role of a 'Format'. In EDAM, 'has_format' is not explicitly defined between EDAM concepts, only the inverse 'is_format_of'. - false - OBO_REL:is_a - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - false - false - edam - 'A has_format B' defines for the subject A, that it has the object B as its data format. - false - - - - - - - - - - has function - http://wsio.org/has_function - false - OBO_REL:is_a - OBO_REL:bearer_of - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is (or is in a role of) a function, or an entity outside of an ontology that is (or is in a role of) a function specification. In the scope of EDAM, 'has_function' serves only for relating annotated entities outside of EDAM with 'Operation' concepts. - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality" - true - 'A has_function B' defines for the subject A, that it has the object B as its function. 
- "http://purl.obolibrary.org/obo/OBI_0000306" - relations - false - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:bearer_of' is narrower in the sense that it only relates ontological categories (concepts) that are an 'independent_continuant' (snap:IndependentContinuant) with ontological categories that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant), and broader in the sense that it relates with any borne objects not just functions of the subject. - OBO_REL:bearer_of - - - - - In very unusual cases. - true - - - - - - - - - - has identifier - false - false - relations - OBO_REL:is_a - edam - 'A has_identifier B' defines for the subject A, that it has the object B as its identifier. - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is an 'Identifier', or an entity outside of an ontology that is an 'Identifier' or is in the role of an 'Identifier'. In EDAM, 'has_identifier' is not explicitly defined between EDAM concepts, only the inverse 'is_identifier_of'. - false - false - - - - - - - - - - has input - OBO_REL:has_participant - "http://purl.obolibrary.org/obo/OBI_0000293" - false - http://wsio.org/has_input - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_input' is explicitly defined between EDAM concepts ('Operation' 'has_input' 'Data'). The inverse, 'is_input_of', is not explicitly defined. - relations - OBO_REL:is_a - false - 'A has_input B' defines for the subject A, that it has the object B as a necessary or actual input or input argument. - false - true - edam - - - - - true - In very unusual cases. 
- - - - - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just inputs or input arguments of the subject. - OBO_REL:has_participant - - - - - - - - - - has output - http://wsio.org/has_output - Subject A can either be concept that is or has an 'Operation' function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that has an 'Operation' function or is an 'Operation'. Object B can be any concept or entity. In EDAM, only 'has_output' is explicitly defined between EDAM concepts ('Operation' 'has_output' 'Data'). The inverse, 'is_output_of', is not explicitly defined. - edam - "http://purl.obolibrary.org/obo/OBI_0000299" - OBO_REL:is_a - relations - OBO_REL:has_participant - true - 'A has_output B' defines for the subject A, that it has the object B as a necessary or actual output or output argument. - false - false - false - - - - - 'OBO_REL:has_participant' is narrower in the sense that it only relates ontological categories (concepts) that are a 'process' (span:Process) with ontological categories that are a 'continuant' (snap:Continuant), and broader in the sense that it relates with any participating objects not just outputs or output arguments of the subject. It is also not clear whether an output (result) actually participates in the process that generates it. - OBO_REL:has_participant - - - - - true - In very unusual cases. - - - - - - - - - - has topic - relations - true - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). 
Object B can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology that is a 'Topic' or is in the role of a 'Topic'. In EDAM, only 'has_topic' is explicitly defined between EDAM concepts ('Operation' or 'Data' 'has_topic' 'Topic'). The inverse, 'is_topic_of', is not explicitly defined. - false - 'A has_topic B' defines for the subject A, that it has the object B as its topic (A is in the scope of a topic B). - edam - OBO_REL:is_a - http://annotation-ontology.googlecode.com/svn/trunk/annotation-core.owl#hasTopic - false - "http://purl.obolibrary.org/obo/IAO_0000136" - false - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#has-quality - "http://purl.obolibrary.org/obo/OBI_0000298" - - - - - - - - - - - - true - In very unusual cases. - - - - - - - - - - is format of - false - OBO_REL:is_a - false - false - false - 'A is_format_of B' defines for the subject A, that it is a data format of the object B. - edam - relations - Subject A can either be a concept that is a 'Format', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Format' or is in the role of a 'Format'. Object B can be any concept or entity outside of an ontology that is (or is in a role of) 'Data', or an input, output, input or output argument of an 'Operation'. In EDAM, only 'is_format_of' is explicitly defined between EDAM concepts ('Format' 'is_format_of' 'Data'). The inverse, 'has_format', is not explicitly defined. - OBO_REL:quality_of - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - - - - - - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. 
- OBO_REL:quality_of - - - - - - - - - - is function of - Subject A can either be concept that is (or is in a role of) a function, or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is (or is in a role of) a function specification. Object B can be any concept or entity. Within EDAM itself, 'is_function_of' is not used. - OBO_REL:inheres_in - true - OBO_REL:is_a - false - 'A is_function_of B' defines for the subject A, that it is a function of the object B. - OBO_REL:function_of - edam - http://wsio.org/is_function_of - relations - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - false - - - - - In very unusual cases. - true - - - - - OBO_REL:function_of - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:function_of' only relates subjects that are a 'function' (snap:Function) with objects that are an 'independent_continuant' (snap:IndependentContinuant), so for example no processes. It does not define explicitly that the subject is a function of the object. - - - - - OBO_REL:inheres_in - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:inheres_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'specifically_dependent_continuant' (snap:SpecificallyDependentContinuant) with ontological categories that are an 'independent_continuant' (snap:IndependentContinuant), and broader in the sense that it relates any borne subjects not just functions. - - - - - - - - - - is identifier of - false - false - edam - false - relations - Subject A can either be a concept that is an 'Identifier', or an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is an 'Identifier' or is in the role of an 'Identifier'. Object B can be any concept or entity outside of an ontology. 
In EDAM, only 'is_identifier_of' is explicitly defined between EDAM concepts (only 'Identifier' 'is_identifier_of' 'Data'). The inverse, 'has_identifier', is not explicitly defined. - 'A is_identifier_of B' defines for the subject A, that it is an identifier of the object B. - OBO_REL:is_a - false - - - - - - - - - - - is input of - false - http://wsio.org/is_input_of - relations - true - false - OBO_REL:participates_in - OBO_REL:is_a - "http://purl.obolibrary.org/obo/OBI_0000295" - edam - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_input_of' is not explicitly defined between EDAM concepts, only the inverse 'has_input'. - false - 'A is_input_of B' defines for the subject A, that it is a necessary or actual input or input argument of the object B. - - - - - - true - In very unusual cases. - - - - - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just inputs or input arguments. - OBO_REL:participates_in - - - - - - - - - - is output of - OBO_REL:is_a - false - false - Subject A can be any concept or entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated). Object B can either be a concept that is or has an 'Operation' function, or an entity outside of an ontology that has an 'Operation' function or is an 'Operation'. In EDAM, 'is_output_of' is not explicitly defined between EDAM concepts, only the inverse 'has_output'. 
- edam - false - 'A is_output_of B' defines for the subject A, that it is a necessary or actual output or output argument of the object B. - OBO_REL:participates_in - http://wsio.org/is_output_of - true - relations - "http://purl.obolibrary.org/obo/OBI_0000312" - - - - - - 'OBO_REL:participates_in' is narrower in the sense that it only relates ontological categories (concepts) that are a 'continuant' (snap:Continuant) with ontological categories that are a 'process' (span:Process), and broader in the sense that it relates any participating subjects not just outputs or output arguments. It is also not clear whether an output (result) actually participates in the process that generates it. - OBO_REL:participates_in - - - - - In very unusual cases. - true - - - - - - - - - - is topic of - 'A is_topic_of B' defines for the subject A, that it is a topic of the object B (a topic A is the scope of B). - relations - OBO_REL:quality_of - false - true - false - Subject A can either be a concept that is a 'Topic', or in unexpected cases an entity outside of an ontology (or an ontology concept in a role of an entity being semantically annotated) that is a 'Topic' or is in the role of a 'Topic'. Object B can be any concept or entity outside of an ontology. In EDAM, 'is_topic_of' is not explicitly defined between EDAM concepts, only the inverse 'has_topic'. - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#inherent-in - false - OBO_REL:is_a - edam - - - - - - - - - - - - - OBO_REL:quality_of - Is defined anywhere? Not in the 'unknown' version of RO. 'OBO_REL:quality_of' might be seen narrower in the sense that it only relates subjects that are a 'quality' (snap:Quality) with objects that are an 'independent_continuant' (snap:IndependentContinuant), and is broader in the sense that it relates any qualities of the object. - - - - - In very unusual cases. 
- true - - - - - - - - - - - - - - - Resource type - - beta12orEarlier - beta12orEarlier - A type of computational resource used in bioinformatics. - true - - - - - - - - - - Data - - - - - Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - http://semanticscience.org/resource/SIO_000088 - http://semanticscience.org/resource/SIO_000069 - "http://purl.obolibrary.org/obo/IAO_0000030" - "http://purl.obolibrary.org/obo/IAO_0000027" - Data set - Data record - beta12orEarlier - http://wsio.org/data_002 - http://purl.org/biotop/biotop.owl#DigitalEntity - http://www.ifomis.org/bfo/1.1/snap#Continuant - Datum - - - - - EDAM does not distinguish a data record (a tool-understandable information artefact) from data or datum (its content, the tool-understandable encoding of an information). - Data record - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Datum - - - - - EDAM does not distinguish the multiplicity of data, such as one data item (datum) versus a collection of data (data set). - Data set - - - - - - - - - - Tool - - beta12orEarlier - A bioinformatics package or tool, e.g. a standalone application or web service. - beta12orEarlier - true - - - - - - - - - - Database - - A digital data archive typically based around a relational model but sometimes using an object-oriented, tree or graph-based model. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ontology - - - - - - - - beta12orEarlier - Ontologies - An ontology of biological or bioinformatics concepts and relations, a controlled vocabulary, structured glossary etc. - - - - - - - - - - Directory metadata - - 1.5 - A directory on disk from which files are read. 
- beta12orEarlier - true - - - - - - - - - - MeSH vocabulary - - beta12orEarlier - true - Controlled vocabulary from National Library of Medicine. The MeSH thesaurus is used to index articles in biomedical journals for the Medline/PubMED databases. - beta12orEarlier - - - - - - - - - - HGNC vocabulary - - beta12orEarlier - beta12orEarlier - Controlled vocabulary for gene names (symbols) from HUGO Gene Nomenclature Committee. - true - - - - - - - - - - UMLS vocabulary - - Compendium of controlled vocabularies for the biomedical domain (Unified Medical Language System). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Identifier - - - - - - - - - - http://semanticscience.org/resource/SIO_000115 - beta12orEarlier - ID - "http://purl.org/dc/elements/1.1/identifier" - http://wsio.org/data_005 - A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things). - - - - - - - Almost exact but limited to identifying resources. - - - - - - - - - - - Database entry - - beta12orEarlier - beta12orEarlier - An entry (retrievable via URL) from a biological database. - true - - - - - - - - - - Molecular mass - - Mass of a molecule. - beta12orEarlier - - - - - - - - - - Molecular charge - - Net charge of a molecule. - beta12orEarlier - PDBML:pdbx_formal_charge - - - - - - - - - - Chemical formula - - Chemical structure specification - A specification of a chemical structure. - beta12orEarlier - - - - - - - - - - QSAR descriptor - - A QSAR quantitative descriptor (name-value pair) of chemical structure. - QSAR descriptors have numeric values that quantify chemical information encoded in a symbolic representation of a molecule. They are used in quantitative structure activity relationship (QSAR) applications. Many subtypes of individual descriptors (not included in EDAM) cover various types of protein properties. 
- beta12orEarlier - - - - - - - - - - Raw sequence - - beta12orEarlier - A raw molecular sequence (string of characters) which might include ambiguity, unknown positions and non-sequence characters. - Non-sequence characters may be used for example for gaps and translation stop. - - - - - - - - - - Sequence record - - http://purl.bioontology.org/ontology/MSH/D058977 - beta12orEarlier - A molecular sequence and associated metadata. - SO:2000061 - - - - - - - - - - Sequence set - - A collection of multiple molecular sequences and associated metadata that do not (typically) correspond to molecular sequence database records or entries and which (typically) are derived from some analytical method. - This concept may be used for arbitrary sequence sets and associated data arising from processing. - beta12orEarlier - SO:0001260 - - - - - - - - - - Sequence mask character - - true - beta12orEarlier - 1.5 - A character used to replace (mask) other characters in a molecular sequence. - - - - - - - - - - Sequence mask type - - A label (text token) describing the type of sequence masking to perform. - Sequence masking is where specific characters or positions in a molecular sequence are masked (replaced) with an another (mask character). The mask type indicates what is masked, for example regions that are not of interest or which are information-poor including acidic protein regions, basic protein regions, proline-rich regions, low compositional complexity regions, short-periodicity internal repeats, simple repeats and low complexity regions. Masked sequences are used in database search to eliminate statistically significant but biologically uninteresting hits. - beta12orEarlier - 1.5 - true - - - - - - - - - - DNA sense specification - - DNA strand specification - beta12orEarlier - Strand - The strand of a DNA sequence (forward or reverse). 
- The forward or 'top' strand might specify a sequence is to be used as given, the reverse or 'bottom' strand specifying the reverse complement of the sequence is to be used. - - - - - - - - - - Sequence length specification - - true - A specification of sequence length(s). - beta12orEarlier - 1.5 - - - - - - - - - - Sequence metadata - - beta12orEarlier - Basic or general information concerning molecular sequences. - This is used for such things as a report including the sequence identifier, type and length. - 1.5 - true - - - - - - - - - - Sequence feature source - - This might be the name and version of a software tool, the name of a database, or 'curated' to indicate a manual annotation (made by a human). - How the annotation of a sequence feature (for example in EMBL or Swiss-Prot) was derived. - beta12orEarlier - - - - - - - - - - Sequence search results - - beta12orEarlier - Database hits (sequence) - - Sequence database hits - Sequence search hits - The score list includes the alignment score, percentage of the query sequence matched, length of the database sequence entry in this alignment, identifier of the database sequence entry, excerpt of the database sequence entry description etc. - A report of sequence hits and associated data from searching a database of sequences (for example a BLAST search). This will typically include a list of scores (often with statistical evaluation) and a set of alignments for the hits. - Sequence database search results - - - - - - - - - - Sequence signature matches - - Sequence motif matches - Protein secondary database search results - beta12orEarlier - Report on the location of matches in one or more sequences to profiles, motifs (conserved or functional patterns) or other signatures. - Sequence profile matches - This includes reports of hits from a search of a protein secondary or domain database. 
- Search results (protein secondary database) - - - - - - - - - - Sequence signature model - - Data files used by motif or profile methods. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence signature data - - - - - - - - beta12orEarlier - This can include metadata about a motif or sequence profile such as its name, length, technical details about the profile construction, and so on. - Data concerning specific or conserved pattern in molecular sequences and the classifiers used for their identification, including sequence motifs, profiles or other diagnostic element. - - - - - - - - - - Sequence alignment (words) - - 1.5 - beta12orEarlier - true - Sequence word alignment - Alignment of exact matches between subsequences (words) within two or more molecular sequences. - - - - - - - - - - Dotplot - - A dotplot of sequence similarities identified from word-matching or character comparison. - beta12orEarlier - - - - - - - - - - Sequence alignment - - - - - - - - http://en.wikipedia.org/wiki/Sequence_alignment - http://purl.bioontology.org/ontology/MSH/D016415 - http://semanticscience.org/resource/SIO_010066 - beta12orEarlier - Alignment of multiple molecular sequences. - - - - - - - - - - Sequence alignment parameter - - Some simple value controlling a sequence alignment (or similar 'match') operation. - true - 1.5 - beta12orEarlier - - - - - - - - - - Sequence similarity score - - A value representing molecular sequence similarity. - beta12orEarlier - - - - - - - - - - Sequence alignment metadata - - Report of general information on a sequence alignment, typically including a description, sequence identifiers and alignment score. - beta12orEarlier - true - 1.5 - - - - - - - - - - Sequence alignment report - - Use this for any computer-generated reports on sequence alignments, and for general information (metadata) on a sequence alignment, such as a description, sequence identifiers and alignment score. 
- An informative report of molecular sequence alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Profile-profile alignment - - beta12orEarlier - A profile-profile alignment (each profile typically representing a sequence alignment). - Sequence profile alignment - - - - - - - - - - Sequence-profile alignment - - beta12orEarlier - Alignment of one or more molecular sequence(s) to one or more sequence profile(s) (each profile typically representing a sequence alignment). - Data associated with the alignment might also be included, e.g. ranked list of best-scoring sequences and a graphical representation of scores. - - - - - - - - - - Sequence distance matrix - - beta12orEarlier - Moby:phylogenetic_distance_matrix - A matrix of estimated evolutionary distance between molecular sequences, such as is suitable for phylogenetic tree calculation. - Phylogenetic distance matrix - Methods might perform character compatibility analysis or identify patterns of similarity in an alignment or data matrix. - - - - - - - - - - Phylogenetic character data - - Basic character data from which a phylogenetic tree may be generated. - As defined, this concept would also include molecular sequences, microsatellites, polymorphisms (RAPDs, RFLPs, or AFLPs), restriction sites and fragments - http://www.evolutionaryontology.org/cdao.owl#Character - beta12orEarlier - - - - - - - - - - Phylogenetic tree - - - - - - - - Phylogeny - Moby:Tree - http://www.evolutionaryontology.org/cdao.owl#Tree - A phylogenetic tree is usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. See also 'Phylogenetic tree image'. - http://purl.bioontology.org/ontology/MSH/D010802 - Moby:phylogenetic_tree - The raw data (not just an image) from which a phylogenetic tree is directly generated or plotted, such as topology, lengths (in time or in expected amounts of variance) and a confidence interval for each length. 
- beta12orEarlier - Moby:myTree - - - - - - - - - - Comparison matrix - - beta12orEarlier - The comparison matrix might include matrix name, optional comment, height and width (or size) of matrix, an index row/column (of characters) and data rows/columns (of integers or floats). - Matrix of integer or floating point numbers for amino acid or nucleotide sequence comparison. - Substitution matrix - - - - - - - - - - Protein topology - - beta12orEarlier - beta12orEarlier - Predicted or actual protein topology represented as a string of protein secondary structure elements. - true - The location and size of the secondary structure elements and intervening loop regions is usually indicated. - - - - - - - - - - Protein features report (secondary structure) - - beta12orEarlier - 1.8 - true - Secondary structure (predicted or real) of a protein. - - - - - - - - - - Protein features report (super-secondary) - - 1.8 - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - true - beta12orEarlier - Super-secondary structure of protein sequence(s). - - - - - - - - - - Secondary structure alignment (protein) - - - Alignment of the (1D representations of) secondary structure of two or more proteins. - beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (protein) - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RNA secondary structure - - - - - - - - An informative report of secondary structure (predicted or real) of an RNA molecule. - This includes thermodynamically stable or evolutionarily conserved structures such as knots, pseudoknots etc. - Moby:RNAStructML - Secondary structure (RNA) - beta12orEarlier - - - - - - - - - - Secondary structure alignment (RNA) - - Moby:RNAStructAlignmentML - Alignment of the (1D representations of) secondary structure of two or more RNA molecules. 
- beta12orEarlier - - - - - - - - - - Secondary structure alignment metadata (RNA) - - true - beta12orEarlier - An informative report of RNA secondary structure alignment-derived data or metadata. - beta12orEarlier - - - - - - - - - - Structure - - - - - - - - beta12orEarlier - Coordinate model - Structure data - The coordinate data may be predicted or real. - http://purl.bioontology.org/ontology/MSH/D015394 - 3D coordinate and associated data for a macromolecular tertiary (3D) structure or part of a structure. - - - - - - - - - - Tertiary structure record - - true - beta12orEarlier - beta12orEarlier - An entry from a molecular tertiary (3D) structure database. - - - - - - - - - - Structure database search results - - 1.8 - Results (hits) from searching a database of tertiary structure. - beta12orEarlier - true - - - - - - - - - - Structure alignment - - - - - - - - Alignment (superimposition) of molecular tertiary (3D) structures. - A tertiary structure alignment will include the untransformed coordinates of one macromolecule, followed by the second (or subsequent) structure(s) with all the coordinates transformed (by rotation / translation) to give a superposition. - beta12orEarlier - - - - - - - - - - Structure alignment report - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - An informative report of molecular tertiary structure alignment-derived data. - - - - - - - - - - Structure similarity score - - beta12orEarlier - A value representing molecular structure similarity, measured from structure alignment or some other type of structure comparison. - - - - - - - - - - Structural profile - - - - - - - - beta12orEarlier - 3D profile - Some type of structural (3D) profile or template (representing a structure or structure alignment). 
- Structural (3D) profile - - - - - - - - - - Structural (3D) profile alignment - - beta12orEarlier - Structural profile alignment - A 3D profile-3D profile alignment (each profile representing structures or a structure alignment). - - - - - - - - - - Sequence-3D profile alignment - - Sequence-structural profile alignment - 1.5 - An alignment of a sequence to a 3D profile (representing structures or a structure alignment). - beta12orEarlier - true - - - - - - - - - - Protein sequence-structure scoring matrix - - beta12orEarlier - Matrix of values used for scoring sequence-structure compatibility. - - - - - - - - - - Sequence-structure alignment - - beta12orEarlier - An alignment of molecular sequence to structure (from threading sequence(s) through 3D structure or representation of structure(s)). - - - - - - - - - - Amino acid annotation - - An informative report about a specific amino acid. - 1.4 - true - beta12orEarlier - - - - - - - - - - Peptide annotation - - 1.4 - true - An informative report about a specific peptide. - beta12orEarlier - - - - - - - - - - Protein report - - Gene product annotation - beta12orEarlier - An informative human-readable report about one or more specific protein molecules or protein structural domains, derived from analysis of primary (sequence or structural) data. - - - - - - - - - - Protein property - - Protein physicochemical property - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a protein molecule or model. - beta12orEarlier - Protein sequence statistics - Protein properties - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Protein structural motifs and surfaces - - true - 1.8 - 3D structural motifs in a protein. 
- beta12orEarlier - Protein 3D motifs - - - - - - - - - Protein domain classification - - true - Data concerning the classification of the sequences and/or structures of protein structural domain(s). - 1.5 - beta12orEarlier - - - - - - - - - - Protein features report (domains) - - true - structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - beta12orEarlier - - - - - - - - - - Protein architecture report - - 1.4 - An informative report on architecture (spatial arrangement of secondary structure) of a protein structure. - Protein property (architecture) - Protein structure report (architecture) - beta12orEarlier - true - - - - - - - - - - Protein folding report - - beta12orEarlier - A report on an analysis or model of protein folding properties, folding pathways, residues or sites that are key to protein folding, nucleation or stabilization centers etc. - true - 1.8 - - - - - - - - - - Protein features (mutation) - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Data on the effect of (typically point) mutation on protein folding, stability, structure and function. - true - beta12orEarlier - Protein property (mutation) - Protein structure report (mutation) - beta13 - Protein report (mutation) - - - - - - - - - - Protein interaction raw data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. 
- beta12orEarlier - - - - - - - - - - Protein interaction report - - - - - - - - Protein report (interaction) - beta12orEarlier - Protein interaction record - Residue interaction data - Atom interaction data - Protein non-covalent interactions report - An informative report on interactions (predicted or known) within or between a protein, structural domain or part of a protein. This includes intra- and inter-residue contacts and distances, as well as interactions with other proteins and non-protein entities such as nucleic acid, metal atoms, water, ions etc. - - - - - - - - - - - - - Protein family report - - - - - - - - beta12orEarlier - An informative report on a specific protein family or other classification or group of protein sequences or structures. - Protein family annotation - Protein classification data - - - - - - - - - - Vmax - - beta12orEarlier - The maximum initial velocity or rate of a reaction. It is the limiting velocity as substrate concentrations get very large. - - - - - - - - - - Km - - Km is the concentration (usually in Molar units) of substrate that leads to half-maximal velocity of an enzyme-catalysed reaction. - beta12orEarlier - - - - - - - - - - Nucleotide base annotation - - beta12orEarlier - true - An informative report about a specific nucleotide base. - 1.4 - - - - - - - - - - Nucleic acid property - - A report of primarily non-positional data describing intrinsic physical, chemical or other properties of a nucleic acid molecule. - The report may be based on analysis of nucleic acid sequence or structural data. This is a broad data type and is used a placeholder for other, more specific types. - Nucleic acid physicochemical property - beta12orEarlier - GC-content - - - - - - - - - - Codon usage data - - - - - - - - beta12orEarlier - Data derived from analysis of codon usage (typically a codon usage table) of DNA sequences. - This is a broad data type and is used a placeholder for other, more specific types. 
- - - - - - - - - - Gene report - - Gene structure (report) - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - Gene and transcript structure (report) - Gene features report - Nucleic acid features (gene and transcript structure) - Moby:gene - This includes any report on a particular locus or gene. This might include the gene name, description, summary and so on. It can include details about the function of a gene, such as its encoded protein or a functional classification of the gene sequence according to the encoded protein(s). - Gene annotation - beta12orEarlier - Moby_namespace:Human_Readable_Description - Gene function (report) - Moby:GeneInfo - - - - - - - - - - Gene classification - - beta12orEarlier - true - A report on the classification of nucleic acid / gene sequences according to the functional classification of their gene products. - beta12orEarlier - - - - - - - - - - DNA variation - - stable, naturally occurring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. - true - 1.8 - beta12orEarlier - - - - - - - - - - Chromosome report - - beta12orEarlier - An informative report on a specific chromosome. - This includes basic information. e.g. chromosome number, length, karyotype features, chromosome sequence etc. - - - - - - - - - - Genotype/phenotype report - - An informative report on the set of genes (or allelic forms) present in an individual, organism or cell and associated with a specific physical characteristic, or a report concerning an organism's traits and phenotypes. - Genotype/phenotype annotation - beta12orEarlier - - - - - - - - - - Nucleic acid features report (primers) - - true - 1.8 - beta12orEarlier - PCR primers and hybridization oligos in a nucleic acid sequence. 
- - - - - - - - - - PCR experiment report - - true - beta12orEarlier - PCR experiments, e.g. quantitative real-time PCR. - 1.8 - - - - - - - - - - Sequence trace - - - Fluorescence trace data generated by an automated DNA sequencer, which can be interpreted as a molecular sequence (reads), given associated sequencing metadata such as base-call quality scores. - This is the raw data produced by a DNA sequencing machine. - beta12orEarlier - - - - - - - - - - Sequence assembly - - beta12orEarlier - An assembly of fragments of a (typically genomic) DNA sequence. - Contigs - http://en.wikipedia.org/wiki/Sequence_assembly - SO:0001248 - Typically, an assembly is a collection of contigs (for example ESTs and genomic DNA fragments) that are ordered, aligned and merged. Annotation of the assembled sequence might be included. - SO:0000353 - - - - - SO:0001248 - Perhaps surprisingly, the definition of 'SO:assembly' is narrower than the 'SO:sequence_assembly'. - - - - - - - - - - Radiation Hybrid (RH) scores - - beta12orEarlier - Radiation Hybrid (RH) scores are used in Radiation Hybrid mapping. - Radiation hybrid (RH) scores for one or more markers. - - - - - - - - - - Genetic linkage report - - beta12orEarlier - Gene annotation (linkage) - Linkage disequilibrium (report) - An informative report on the linkage of alleles. - This includes linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Gene expression profile - - Data quantifying the level of expression of (typically) multiple genes, derived for example from microarray experiments. - beta12orEarlier - Gene expression pattern - - - - - - - - - - Microarray experiment report - - true - microarray experiments including conditions, protocol, sample:data relationships etc. 
- 1.8 - beta12orEarlier - - - - - - - - - - Oligonucleotide probe data - - beta12orEarlier - beta13 - true - Data on oligonucleotide probes (typically for use with DNA microarrays). - - - - - - - - - - SAGE experimental data - - beta12orEarlier - true - Output from a serial analysis of gene expression (SAGE) experiment. - Serial analysis of gene expression (SAGE) experimental data - beta12orEarlier - - - - - - - - - - MPSS experimental data - - beta12orEarlier - Massively parallel signature sequencing (MPSS) data. - beta12orEarlier - Massively parallel signature sequencing (MPSS) experimental data - true - - - - - - - - - - SBS experimental data - - beta12orEarlier - beta12orEarlier - true - Sequencing by synthesis (SBS) experimental data - Sequencing by synthesis (SBS) data. - - - - - - - - - - Sequence tag profile (with gene assignment) - - 1.14 - beta12orEarlier - true - Tag to gene assignments (tag mapping) of SAGE, MPSS and SBS data. Typically this is the sequencing-based expression profile annotated with gene identifiers. - - - - - - - - - - Protein X-ray crystallographic data - - X-ray crystallography data. - beta12orEarlier - - - - - - - - - - Protein NMR data - - Protein nuclear magnetic resonance (NMR) raw data. - beta12orEarlier - - - - - - - - - - Protein circular dichroism (CD) spectroscopic data - - beta12orEarlier - Protein secondary structure from protein coordinate or circular dichroism (CD) spectroscopic data. - - - - - - - - - - Electron microscopy volume map - - - - - - - - beta12orEarlier - Volume map data from electron microscopy. - EM volume map - - - - - - - - - - Electron microscopy model - - - - - - - - beta12orEarlier - Annotation on a structural 3D model (volume map) from electron microscopy. - This might include the location in the model of the known features of a particular macromolecule. 
- - - - - - - - - - 2D PAGE image - - - - - - - - beta12orEarlier - Two-dimensional gel electrophoresis image - - - - - - - - - - Mass spectrometry spectra - - - - - - - - beta12orEarlier - Spectra from mass spectrometry. - - - - - - - - - - Peptide mass fingerprint - - - - - - - - - Peak list - Protein fingerprint - A molecular weight standard fingerprint is standard protonated molecular masses e.g. from trypsin (modified porcine trypsin, Promega) and keratin peptides. - A set of peptide masses (peptide mass fingerprint) from mass spectrometry. - beta12orEarlier - Molecular weights standard fingerprint - - - - - - - - - - Peptide identification - - - - - - - - Protein or peptide identifications with evidence supporting the identifications, typically from comparing a peptide mass fingerprint (from mass spectrometry) to a sequence database. - beta12orEarlier - - - - - - - - - - Pathway or network annotation - - beta12orEarlier - true - An informative report about a specific biological pathway or network, typically including a map (diagram) of the pathway. - beta12orEarlier - - - - - - - - - - Biological pathway map - - beta12orEarlier - true - A map (typically a diagram) of a biological pathway. - beta12orEarlier - - - - - - - - - - Data resource definition - - beta12orEarlier - true - 1.5 - A definition of a data resource serving one or more types of data, including metadata and links to the resource or data proper. - - - - - - - - - - Workflow metadata - - Basic information, annotation or documentation concerning a workflow (but not the workflow itself). - beta12orEarlier - - - - - - - - - - Mathematical model - - - - - - - - Biological model - beta12orEarlier - A biological model represented in mathematical terms. - - - - - - - - - - Statistical estimate score - - beta12orEarlier - A value representing estimated statistical significance of some observed data; typically sequence database hits. 
- - - - - - - - - - EMBOSS database resource definition - - beta12orEarlier - Resource definition for an EMBOSS database. - true - 1.5 - - - - - - - - - - Version information - - "http://purl.obolibrary.org/obo/IAO_0000129" - 1.5 - Development status / maturity may be part of the version information, for example in case of tools, standards, or some data records. - http://www.ebi.ac.uk/swo/maturity/SWO_9000061 - beta12orEarlier - Information on a version of software or data, for example name, version number and release date. - http://semanticscience.org/resource/SIO_000653 - true - http://usefulinc.com/ns/doap#Version - - - - - - - - - - Database cross-mapping - - beta12orEarlier - A mapping of the accession numbers (or other database identifier) of entries between (typically) two biological or biomedical databases. - The cross-mapping is typically a table where each row is an accession number and each column is a database being cross-referenced. The cells give the accession number or identifier of the corresponding entry in a database. If a cell in the table is not filled then no mapping could be found for the database. Additional information might be given on version, date etc. - - - - - - - - - - Data index - - - - - - - - An index of data of biological relevance. - beta12orEarlier - - - - - - - - - - Data index report - - - - - - - - A report of an analysis of an index of biological data. - Database index annotation - beta12orEarlier - - - - - - - - - - Database metadata - - Basic information on bioinformatics database(s) or other data sources such as name, type, description, URL etc. - beta12orEarlier - - - - - - - - - - Tool metadata - - beta12orEarlier - Basic information about one or more bioinformatics applications or packages, such as name, type, description, or other documentation. - - - - - - - - - - Job metadata - - beta12orEarlier - true - 1.5 - Moby:PDGJOB - Textual metadata on a submitted or completed job. 
- - - - - - - - - - User metadata - - beta12orEarlier - Textual metadata on a software author or end-user, for example a person or other software. - - - - - - - - - - Small molecule report - - - - - - - - Small molecule annotation - Chemical structure report - An informative report on a specific chemical compound. - beta12orEarlier - Chemical compound annotation - - - - - - - - - - Cell line report - - Organism strain data - Cell line annotation - Report on a particular strain of organism cell line including plants, virus, fungi and bacteria. The data typically includes strain number, organism type, growth conditions, source and so on. - beta12orEarlier - - - - - - - - - - Scent annotation - - beta12orEarlier - An informative report about a specific scent. - 1.4 - true - - - - - - - - - - Ontology term - - Ontology class name - beta12orEarlier - A term (name) from an ontology. - Ontology terms - - - - - - - - - - Ontology concept data - - beta12orEarlier - Ontology class metadata - Ontology term metadata - Data concerning or derived from a concept from a biological ontology. - - - - - - - - - - Keyword - - Phrases - Keyword(s) or phrase(s) used (typically) for text-searching purposes. - Boolean operators (AND, OR and NOT) and wildcard characters may be allowed. - Moby:QueryString - beta12orEarlier - Moby:BooleanQueryString - Moby:Wildcard_Query - Moby:Global_Keyword - Terms - Text - - - - - - - - - - Citation - - Bibliographic data that uniquely identifies a scientific article, book or other published material. - A bibliographic reference might include information such as authors, title, journal name, date and (possibly) a link to the abstract or full-text of the article if available. - Moby:GCP_SimpleCitation - Reference - Bibliographic reference - Moby:Publication - beta12orEarlier - - - - - - - - - - Article - - - - - - - - A document of scientific text, typically a full text article from a scientific journal. 
- beta12orEarlier - - - - - - - - - - Text mining report - - An abstract of the results of text mining. - beta12orEarlier - Text mining output - A text mining abstract will typically include an annotated list of words or sentences extracted from one or more scientific articles. - - - - - - - - - - Entity identifier - - beta12orEarlier - true - beta12orEarlier - An identifier of a biological entity or phenomenon. - - - - - - - - - - Data resource identifier - - true - An identifier of a data resource. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Identifier (typed) - - beta12orEarlier - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - An identifier that identifies a particular type of data. - - - - - - - - - - - Tool identifier - - An identifier of a bioinformatics tool, e.g. an application or web service. - beta12orEarlier - - - - - - - - - - - Discrete entity identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a discrete entity (any biological thing with a distinct, discrete physical existence). - - - - - - - - - - Entity feature identifier - - true - beta12orEarlier - Name or other identifier of an entity feature (a physical part or region of a discrete biological entity, or a feature that can be mapped to such a thing). - beta12orEarlier - - - - - - - - - - Entity collection identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a collection of discrete biological entities. - - - - - - - - - - Phenomenon identifier - - beta12orEarlier - true - beta12orEarlier - Name or other identifier of a physical, observable biological occurrence or event. - - - - - - - - - - Molecule identifier - - Name or other identifier of a molecule. 
- beta12orEarlier - - - - - - - - - - - Atom ID - - Atom identifier - Identifier (e.g. character symbol) of a specific atom. - beta12orEarlier - - - - - - - - - - - Molecule name - - - Name of a specific molecule. - beta12orEarlier - - - - - - - - - - - Molecule type - - For example, 'Protein', 'DNA', 'RNA' etc. - true - 1.5 - beta12orEarlier - A label (text token) describing the type of a molecule. - Protein|DNA|RNA - - - - - - - - - - Chemical identifier - - true - beta12orEarlier - beta12orEarlier - Unique identifier of a chemical compound. - - - - - - - - - - Chromosome name - - - - - - - - - beta12orEarlier - Name of a chromosome. - - - - - - - - - - - Peptide identifier - - Identifier of a peptide chain. - beta12orEarlier - - - - - - - - - - - Protein identifier - - - - - - - - beta12orEarlier - Identifier of a protein. - - - - - - - - - - - Compound name - - - Chemical name - Unique name of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number - - beta12orEarlier - Unique registry number of a chemical compound. - - - - - - - - - - - Ligand identifier - - true - beta12orEarlier - Code word for a ligand, for example from a PDB file. - beta12orEarlier - - - - - - - - - - Drug identifier - - - - - - - - beta12orEarlier - Identifier of a drug. - - - - - - - - - - - Amino acid identifier - - - - - - - - Identifier of an amino acid. - beta12orEarlier - Residue identifier - - - - - - - - - - - Nucleotide identifier - - beta12orEarlier - Name or other identifier of a nucleotide. - - - - - - - - - - - Monosaccharide identifier - - beta12orEarlier - Identifier of a monosaccharide. - - - - - - - - - - - Chemical name (ChEBI) - - ChEBI chemical name - Unique name from Chemical Entities of Biological Interest (ChEBI) of a chemical compound. - beta12orEarlier - This is the recommended chemical name for use for example in database annotation. - - - - - - - - - - - Chemical name (IUPAC) - - IUPAC recommended name of a chemical compound. 
- IUPAC chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (INN) - - INN chemical name - beta12orEarlier - International Non-proprietary Name (INN or 'generic name') of a chemical compound, assigned by the World Health Organization (WHO). - - - - - - - - - - - Chemical name (brand) - - Brand name of a chemical compound. - Brand chemical name - beta12orEarlier - - - - - - - - - - - Chemical name (synonymous) - - beta12orEarlier - Synonymous chemical name - Synonymous name of a chemical compound. - - - - - - - - - - - Chemical registry number (CAS) - - CAS chemical registry number - CAS registry number of a chemical compound. - beta12orEarlier - - - - - - - - - - - Chemical registry number (Beilstein) - - Beilstein chemical registry number - beta12orEarlier - Beilstein registry number of a chemical compound. - - - - - - - - - - - Chemical registry number (Gmelin) - - Gmelin chemical registry number - beta12orEarlier - Gmelin registry number of a chemical compound. - - - - - - - - - - - HET group name - - 3-letter code word for a ligand (HET group) from a PDB file, for example ATP. - Short ligand name - Component identifier code - beta12orEarlier - - - - - - - - - - - Amino acid name - - String of one or more ASCII characters representing an amino acid. - beta12orEarlier - - - - - - - - - - - Nucleotide code - - - beta12orEarlier - String of one or more ASCII characters representing a nucleotide. - - - - - - - - - - - Polypeptide chain ID - - - - - - - - beta12orEarlier - WHATIF: chain - Chain identifier - Identifier of a polypeptide chain from a protein. - PDBML:pdbx_PDB_strand_id - Protein chain identifier - PDB strand id - PDB chain identifier - This is typically a character (for the chain) appended to a PDB identifier, e.g. 1cukA - Polypeptide chain identifier - - - - - - - - - - - Protein name - - - Name of a protein. 
- beta12orEarlier - - - - - - - - - - - Enzyme identifier - - beta12orEarlier - Name or other identifier of an enzyme or record from a database of enzymes. - - - - - - - - - - - EC number - - [0-9]+\.-\.-\.-|[0-9]+\.[0-9]+\.-\.-|[0-9]+\.[0-9]+\.[0-9]+\.-|[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+ - EC code - Moby:EC_Number - An Enzyme Commission (EC) number of an enzyme. - EC - Moby:Annotated_EC_Number - beta12orEarlier - Enzyme Commission number - - - - - - - - - - - Enzyme name - - - Name of an enzyme. - beta12orEarlier - - - - - - - - - - - Restriction enzyme name - - Name of a restriction enzyme. - beta12orEarlier - - - - - - - - - - - Sequence position specification - - 1.5 - A specification (partial or complete) of one or more positions or regions of a molecular sequence or map. - beta12orEarlier - true - - - - - - - - - - Sequence feature ID - - - A unique identifier of a molecular sequence feature, for example an ID of a feature that is unique within the scope of the GFF file. - beta12orEarlier - - - - - - - - - - - Sequence position - - WHATIF: number - WHATIF: PDBx_atom_site - beta12orEarlier - PDBML:_atom_site.id - SO:0000735 - A position of one or more points (base or residue) in a sequence, or part of such a specification. - - - - - - - - - - Sequence range - - beta12orEarlier - Specification of range(s) of sequence positions. - - - - - - - - - - Nucleic acid feature identifier - - beta12orEarlier - beta12orEarlier - Name or other identifier of a nucleic acid feature. - true - - - - - - - - - - Protein feature identifier - - Name or other identifier of a protein feature. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence feature key - - Sequence feature method - The type of a sequence feature, typically a term or accession from the Sequence Ontology, for example an EMBL or Swiss-Prot sequence feature key. 
- Sequence feature type - beta12orEarlier - A feature key indicates the biological nature of the feature or information about changes to or versions of the sequence. - - - - - - - - - - Sequence feature qualifier - - beta12orEarlier - Typically one of the EMBL or Swiss-Prot feature qualifiers. - Feature qualifiers hold information about a feature beyond that provided by the feature key and location. - - - - - - - - - - Sequence feature label - - Sequence feature name - Typically an EMBL or Swiss-Prot feature label. - A feature label identifies a feature of a sequence database entry. When used with the database name and the entry's primary accession number, it is a unique identifier of that feature. - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Feature Object - - - beta12orEarlier - UFO - The name of a sequence feature-containing entity adhering to the standard feature naming scheme used by all EMBOSS applications. - - - - - - - - - - Codon name - - beta12orEarlier - beta12orEarlier - String of one or more ASCII characters representing a codon. - true - - - - - - - - - - Gene identifier - - - - - - - - Moby:GeneAccessionList - An identifier of a gene, such as a name/symbol or a unique identifier of a gene in a database. - beta12orEarlier - - - - - - - - - - - Gene symbol - - Moby_namespace:Global_GeneSymbol - beta12orEarlier - Moby_namespace:Global_GeneCommonName - The short name of a gene; a single word that does not contain white space characters. It is typically derived from the gene name. - - - - - - - - - - - Gene ID (NCBI) - - - NCBI geneid - Gene identifier (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs:NCBI_Gene - Entrez gene ID - Gene identifier (Entrez) - http://www.geneontology.org/doc/GO.xrf_abbs:LocusID - An NCBI unique identifier of a gene. - NCBI gene ID - beta12orEarlier - - - - - - - - - - - Gene identifier (NCBI RefSeq) - - beta12orEarlier - true - beta12orEarlier - An NCBI RefSeq unique identifier of a gene. 
- - - - - - - - - - Gene identifier (NCBI UniGene) - - beta12orEarlier - An NCBI UniGene unique identifier of a gene. - beta12orEarlier - true - - - - - - - - - - Gene identifier (Entrez) - - An Entrez unique identifier of a gene. - beta12orEarlier - true - [0-9]+ - beta12orEarlier - - - - - - - - - - Gene ID (CGD) - - CGD ID - Identifier of a gene or feature from the CGD database. - beta12orEarlier - - - - - - - - - - - Gene ID (DictyBase) - - beta12orEarlier - Identifier of a gene from DictyBase. - - - - - - - - - - - Ensembl gene ID - - - beta12orEarlier - Gene ID (Ensembl) - Unique identifier for a gene (or other feature) from the Ensembl database. - - - - - - - - - - - Gene ID (SGD) - - - Identifier of an entry from the SGD database. - S[0-9]+ - SGD identifier - beta12orEarlier - - - - - - - - - - - Gene ID (GeneDB) - - Moby_namespace:GeneDB - GeneDB identifier - beta12orEarlier - [a-zA-Z_0-9\.-]* - Identifier of a gene from the GeneDB database. - - - - - - - - - - - TIGR identifier - - - beta12orEarlier - Identifier of an entry from the TIGR database. - - - - - - - - - - - TAIR accession (gene) - - - Gene:[0-9]{7} - beta12orEarlier - Identifier of an gene from the TAIR database. - - - - - - - - - - - Protein domain ID - - - - - - - - - beta12orEarlier - Identifier of a protein structural domain. - This is typically a character or string concatenated with a PDB identifier and a chain identifier. - - - - - - - - - - - SCOP domain identifier - - Identifier of a protein domain (or other node) from the SCOP database. - beta12orEarlier - - - - - - - - - - - CATH domain ID - - 1nr3A00 - beta12orEarlier - CATH domain identifier - Identifier of a protein domain from CATH. - - - - - - - - - - - SCOP concise classification string (sccs) - - A SCOP concise classification string (sccs) is a compact representation of a SCOP domain classification. 
- beta12orEarlier - An scss includes the class (alphabetical), fold, superfamily and family (all numerical) to which a given domain belongs. - - - - - - - - - - - SCOP sunid - - Unique identifier (number) of an entry in the SCOP hierarchy, for example 33229. - beta12orEarlier - A sunid uniquely identifies an entry in the SCOP hierarchy, including leaves (the SCOP domains) and higher level nodes including entries corresponding to the protein level. - sunid - SCOP unique identifier - 33229 - - - - - - - - - - - CATH node ID - - 3.30.1190.10.1.1.1.1.1 - CATH code - A code number identifying a node from the CATH database. - CATH node identifier - beta12orEarlier - - - - - - - - - - - Kingdom name - - The name of a biological kingdom (Bacteria, Archaea, or Eukaryotes). - beta12orEarlier - - - - - - - - - - - Species name - - The name of a species (typically a taxonomic group) of organism. - Organism species - beta12orEarlier - - - - - - - - - - - Strain name - - - beta12orEarlier - The name of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - URI - - A string of characters that name or otherwise identify a resource on the Internet. - URIs - beta12orEarlier - - - - - - - - - - Database ID - - - - - - - - An identifier of a biological or bioinformatics database. - Database identifier - beta12orEarlier - - - - - - - - - - - Directory name - - beta12orEarlier - The name of a directory. - - - - - - - - - - - File name - - The name (or part of a name) of a file (of any type). - beta12orEarlier - - - - - - - - - - - Ontology name - - - - - - - - - beta12orEarlier - Name of an ontology of biological or bioinformatics concepts and relations. - - - - - - - - - - - URL - - A Uniform Resource Locator (URL). - Moby:URL - Moby:Link - beta12orEarlier - - - - - - - - - - URN - - beta12orEarlier - A Uniform Resource Name (URN). - - - - - - - - - - LSID - - beta12orEarlier - LSIDs provide a standard way to locate and describe data. 
An LSID is represented as a Uniform Resource Name (URN) with the following format: URN:LSID:<Authority>:<Namespace>:<ObjectID>[:<Version>] - Life Science Identifier - A Life Science Identifier (LSID) - a unique identifier of some data. - - - - - - - - - - Database name - - - The name of a biological or bioinformatics database. - beta12orEarlier - - - - - - - - - - - Sequence database name - - The name of a molecular sequence database. - true - beta13 - beta12orEarlier - - - - - - - - - - Enumerated file name - - beta12orEarlier - The name of a file (of any type) with restricted possible values. - - - - - - - - - - - File name extension - - The extension of a file name. - A file extension is the characters appearing after the final '.' in the file name. - beta12orEarlier - - - - - - - - - - - File base name - - beta12orEarlier - The base name of a file. - A file base name is the file name stripped of its directory specification and extension. - - - - - - - - - - - QSAR descriptor name - - - - - - - - - beta12orEarlier - Name of a QSAR descriptor. - - - - - - - - - - - Database entry identifier - - true - This concept is required for completeness. It should never have child concepts. - beta12orEarlier - An identifier of an entry from a database where the same type of identifier is used for objects (data) of different semantic type. - beta12orEarlier - - - - - - - - - - Sequence identifier - - - - - - - - An identifier of molecular sequence(s) or entries from a molecular sequence database. - beta12orEarlier - - - - - - - - - - - Sequence set ID - - - - - - - - - An identifier of a set of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence signature identifier - - beta12orEarlier - beta12orEarlier - true - Identifier of a sequence signature (motif or profile) for example from a database of sequence patterns. 
- - - - - - - - - - - Sequence alignment ID - - - - - - - - - Identifier of a molecular sequence alignment, for example a record from an alignment database. - beta12orEarlier - - - - - - - - - - - Phylogenetic distance matrix identifier - - beta12orEarlier - Identifier of a phylogenetic distance matrix. - true - beta12orEarlier - - - - - - - - - - Phylogenetic tree ID - - - - - - - - - beta12orEarlier - Identifier of a phylogenetic tree for example from a phylogenetic tree database. - - - - - - - - - - - Comparison matrix identifier - - - - - - - - An identifier of a comparison matrix. - Substitution matrix identifier - beta12orEarlier - - - - - - - - - - - Structure ID - - - beta12orEarlier - A unique and persistent identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Structural (3D) profile ID - - - - - - - - - Structural profile identifier - Identifier or name of a structural (3D) profile or template (representing a structure or structure alignment). - beta12orEarlier - - - - - - - - - - - Structure alignment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of tertiary structure alignments. - - - - - - - - - - - Amino acid index ID - - - - - - - - - Identifier of an index of amino acid physicochemical and biochemical property data. - beta12orEarlier - - - - - - - - - - - Protein interaction ID - - - - - - - - - beta12orEarlier - Molecular interaction ID - Identifier of a report of protein interactions from a protein interaction database (typically). - - - - - - - - - - - Protein family identifier - - - - - - - - Protein secondary database record identifier - Identifier of a protein family. - beta12orEarlier - - - - - - - - - - - Codon usage table name - - - - - - - - - - - - - - - Unique name of a codon usage table. - beta12orEarlier - - - - - - - - - - - Transcription factor identifier - - - Identifier of a transcription factor (or a TF binding site). 
- beta12orEarlier - - - - - - - - - - - Experiment annotation ID - - - - - - - - beta12orEarlier - Identifier of an entry from a database of microarray data. - - - - - - - - - - - Electron microscopy model ID - - - - - - - - - Identifier of an entry from a database of electron microscopy data. - beta12orEarlier - - - - - - - - - - - Gene expression report ID - - - - - - - - - Accession of a report of gene expression (e.g. a gene expression profile) from a database. - beta12orEarlier - Gene expression profile identifier - - - - - - - - - - - Genotype and phenotype annotation ID - - - - - - - - - Identifier of an entry from a database of genotypes and phenotypes. - beta12orEarlier - - - - - - - - - - - Pathway or network identifier - - - - - - - - Identifier of an entry from a database of biological pathways or networks. - beta12orEarlier - - - - - - - - - - - Workflow ID - - - beta12orEarlier - Identifier of a biological or biomedical workflow, typically from a database of workflows. - - - - - - - - - - - Data resource definition ID - - beta12orEarlier - Identifier of a data type definition from some provider. - Data resource definition identifier - - - - - - - - - - - Biological model ID - - - - - - - - Biological model identifier - beta12orEarlier - Identifier of a mathematical model, typically an entry from a database. - - - - - - - - - - - Compound identifier - - - - - - - - beta12orEarlier - Chemical compound identifier - Identifier of an entry from a database of chemicals. - Small molecule identifier - - - - - - - - - - - Ontology concept ID - - - A unique (typically numerical) identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - Article ID - - - - - - - - - beta12orEarlier - Unique identifier of a scientific article. - Article identifier - - - - - - - - - - - FlyBase ID - - - Identifier of an object from the FlyBase database. 
- FB[a-zA-Z_0-9]{2}[0-9]{7} - beta12orEarlier - - - - - - - - - - - WormBase name - - - Name of an object from the WormBase database, usually a human-readable name. - beta12orEarlier - - - - - - - - - - - WormBase class - - beta12orEarlier - Class of an object from the WormBase database. - A WormBase class describes the type of object such as 'sequence' or 'protein'. - - - - - - - - - - - Sequence accession - - - beta12orEarlier - A persistent, unique identifier of a molecular sequence database entry. - Sequence accession number - - - - - - - - - - - Sequence type - - 1.5 - Sequence type might reflect the molecule (protein, nucleic acid etc) or the sequence itself (gapped, ambiguous etc). - A label (text token) describing a type of molecular sequence. - true - beta12orEarlier - - - - - - - - - - EMBOSS Uniform Sequence Address - - - EMBOSS USA - beta12orEarlier - The name of a sequence-based entity adhering to the standard sequence naming scheme used by all EMBOSS applications. - - - - - - - - - - - Sequence accession (protein) - - - - - - - - Accession number of a protein sequence database entry. - Protein sequence accession number - beta12orEarlier - - - - - - - - - - - Sequence accession (nucleic acid) - - - - - - - - Accession number of a nucleotide sequence database entry. - beta12orEarlier - Nucleotide sequence accession number - - - - - - - - - - - RefSeq accession - - Accession number of a RefSeq database entry. - beta12orEarlier - RefSeq ID - (NC|AC|NG|NT|NW|NZ|NM|NR|XM|XR|NP|AP|XP|YP|ZP)_[0-9]+ - - - - - - - - - - - UniProt accession (extended) - - true - Accession number of a UniProt (protein sequence) database entry. May contain version or isoform number. 
- [A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9].[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9].[0-9]+|[A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9]-[0-9]+|[OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]-[0-9]+ - beta12orEarlier - Q7M1G0|P43353-2|P01012.107 - 1.0 - - - - - - - - - - PIR identifier - - - - - - - - An identifier of PIR sequence database entry. - beta12orEarlier - PIR ID - PIR accession number - - - - - - - - - - - TREMBL accession - - beta12orEarlier - Identifier of a TREMBL sequence database entry. - true - 1.2 - - - - - - - - - - Gramene primary identifier - - beta12orEarlier - Gramene primary ID - Primary identifier of a Gramene database entry. - - - - - - - - - - - EMBL/GenBank/DDBJ ID - - Identifier of a (nucleic acid) entry from the EMBL/GenBank/DDBJ databases. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniGene) - - UniGene identifier - UniGene cluster id - UniGene ID - UniGene cluster ID - beta12orEarlier - A unique identifier of an entry (gene cluster) from the NCBI UniGene database. - - - - - - - - - - - dbEST accession - - - dbEST ID - Identifier of a dbEST database entry. - beta12orEarlier - - - - - - - - - - - dbSNP ID - - beta12orEarlier - dbSNP identifier - Identifier of a dbSNP database entry. - - - - - - - - - - - EMBOSS sequence type - - beta12orEarlier - true - See the EMBOSS documentation (http://emboss.sourceforge.net/) for a definition of what this includes. - beta12orEarlier - The EMBOSS type of a molecular sequence. - - - - - - - - - - EMBOSS listfile - - 1.5 - List of EMBOSS Uniform Sequence Addresses (EMBOSS listfile). - true - beta12orEarlier - - - - - - - - - - Sequence cluster ID - - - - - - - - An identifier of a cluster of molecular sequence(s). - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (COG) - - COG ID - beta12orEarlier - Unique identifier of an entry from the COG database. 
- - - - - - - - - - - Sequence motif identifier - - - - - - - - Identifier of a sequence motif, for example an entry from a motif database. - beta12orEarlier - - - - - - - - - - - Sequence profile ID - - - - - - - - - Identifier of a sequence profile. - beta12orEarlier - A sequence profile typically represents a sequence alignment. - - - - - - - - - - - ELM ID - - Identifier of an entry from the ELMdb database of protein functional sites. - beta12orEarlier - - - - - - - - - - - Prosite accession number - - beta12orEarlier - Accession number of an entry from the Prosite database. - PS[0-9]{5} - Prosite ID - - - - - - - - - - - HMMER hidden Markov model ID - - - - - - - - Unique identifier or name of a HMMER hidden Markov model. - beta12orEarlier - - - - - - - - - - - JASPAR profile ID - - beta12orEarlier - Unique identifier or name of a profile from the JASPAR database. - - - - - - - - - - - Sequence alignment type - - beta12orEarlier - 1.5 - true - Possible values include for example the EMBOSS alignment types, BLAST alignment types and so on. - A label (text token) describing the type of a sequence alignment. - - - - - - - - - - BLAST sequence alignment type - - true - beta12orEarlier - beta12orEarlier - The type of a BLAST sequence alignment. - - - - - - - - - - Phylogenetic tree type - - For example 'nj', 'upgmp' etc. - beta12orEarlier - true - A label (text token) describing the type of a phylogenetic tree. - 1.5 - nj|upgmp - - - - - - - - - - TreeBASE study accession number - - Accession number of an entry from the TreeBASE database. - beta12orEarlier - - - - - - - - - - - TreeFam accession number - - beta12orEarlier - Accession number of an entry from the TreeFam database. - - - - - - - - - - - Comparison matrix type - - 1.5 - true - beta12orEarlier - blosum|pam|gonnet|id - A label (text token) describing the type of a comparison matrix. - Substitution matrix type - For example 'blosum', 'pam', 'gonnet', 'id' etc. 
Comparison matrix type may be required where a series of matrices of a certain type are used. - - - - - - - - - - Comparison matrix name - - - - - - - - - beta12orEarlier - Substitution matrix name - See for example http://www.ebi.ac.uk/Tools/webservices/help/matrix. - Unique name or identifier of a comparison matrix. - - - - - - - - - - - PDB ID - - An identifier of an entry from the PDB database. - [a-zA-Z_0-9]{4} - PDBID - PDB identifier - beta12orEarlier - - - - - - - - - - - AAindex ID - - beta12orEarlier - Identifier of an entry from the AAindex database. - - - - - - - - - - - BIND accession number - - Accession number of an entry from the BIND database. - beta12orEarlier - - - - - - - - - - - IntAct accession number - - EBI\-[0-9]+ - beta12orEarlier - Accession number of an entry from the IntAct database. - - - - - - - - - - - Protein family name - - - beta12orEarlier - Name of a protein family. - - - - - - - - - - - InterPro entry name - - - - - - - - beta12orEarlier - Name of an InterPro entry, usually indicating the type of protein matches for that entry. - - - - - - - - - - - InterPro accession - - - - - - - - Primary accession number of an InterPro entry. - InterPro primary accession - Every InterPro entry has a unique accession number to provide a persistent citation of database records. - beta12orEarlier - InterPro primary accession number - IPR015590 - IPR[0-9]{6} - - - - - - - - - - - InterPro secondary accession - - - - - - - - Secondary accession number of an InterPro entry. - beta12orEarlier - InterPro secondary accession number - - - - - - - - - - - Gene3D ID - - beta12orEarlier - Unique identifier of an entry from the Gene3D database. - - - - - - - - - - - PIRSF ID - - PIRSF[0-9]{6} - beta12orEarlier - Unique identifier of an entry from the PIRSF database. - - - - - - - - - - - PRINTS code - - beta12orEarlier - PR[0-9]{5} - The unique identifier of an entry in the PRINTS database. 
- - - - - - - - - - - Pfam accession number - - PF[0-9]{5} - Accession number of a Pfam entry. - beta12orEarlier - - - - - - - - - - - SMART accession number - - Accession number of an entry from the SMART database. - beta12orEarlier - SM[0-9]{5} - - - - - - - - - - - Superfamily hidden Markov model number - - Unique identifier (number) of a hidden Markov model from the Superfamily database. - beta12orEarlier - - - - - - - - - - - TIGRFam ID - - TIGRFam accession number - Accession number of an entry (family) from the TIGRFam database. - beta12orEarlier - - - - - - - - - - - ProDom accession number - - A ProDom domain family accession number. - PD[0-9]+ - beta12orEarlier - ProDom is a protein domain family database. - - - - - - - - - - - TRANSFAC accession number - - beta12orEarlier - Identifier of an entry from the TRANSFAC database. - - - - - - - - - - - ArrayExpress accession number - - Accession number of an entry from the ArrayExpress database. - beta12orEarlier - [AEP]-[a-zA-Z_0-9]{4}-[0-9]+ - ArrayExpress experiment ID - - - - - - - - - - - PRIDE experiment accession number - - [0-9]+ - beta12orEarlier - PRIDE experiment accession number. - - - - - - - - - - - EMDB ID - - beta12orEarlier - Identifier of an entry from the EMDB electron microscopy database. - - - - - - - - - - - GEO accession number - - Accession number of an entry from the GEO database. - o^GDS[0-9]+ - beta12orEarlier - - - - - - - - - - - GermOnline ID - - beta12orEarlier - Identifier of an entry from the GermOnline database. - - - - - - - - - - - EMAGE ID - - Identifier of an entry from the EMAGE database. - beta12orEarlier - - - - - - - - - - - Disease ID - - - Accession number of an entry from a database of disease. - beta12orEarlier - - - - - - - - - - - HGVbase ID - - Identifier of an entry from the HGVbase database. - beta12orEarlier - - - - - - - - - - - HIVDB identifier - - true - beta12orEarlier - Identifier of an entry from the HIVDB database. 
- beta12orEarlier - - - - - - - - - - OMIM ID - - beta12orEarlier - [*#+%^]?[0-9]{6} - Identifier of an entry from the OMIM database. - - - - - - - - - - - KEGG object identifier - - - beta12orEarlier - Unique identifier of an object from one of the KEGG databases (excluding the GENES division). - - - - - - - - - - - Pathway ID (reactome) - - Identifier of an entry from the Reactome database. - Reactome ID - beta12orEarlier - REACT_[0-9]+(\.[0-9]+)? - - - - - - - - - - - Pathway ID (aMAZE) - - beta12orEarlier - aMAZE ID - true - beta12orEarlier - Identifier of an entry from the aMAZE database. - - - - - - - - - - Pathway ID (BioCyc) - - - BioCyc pathway ID - beta12orEarlier - Identifier of an pathway from the BioCyc biological pathways database. - - - - - - - - - - - Pathway ID (INOH) - - beta12orEarlier - INOH identifier - Identifier of an entry from the INOH database. - - - - - - - - - - - Pathway ID (PATIKA) - - Identifier of an entry from the PATIKA database. - PATIKA ID - beta12orEarlier - - - - - - - - - - - Pathway ID (CPDB) - - This concept refers to identifiers used by the databases collated in CPDB; CPDB identifiers are not independently defined. - CPDB ID - Identifier of an entry from the CPDB (ConsensusPathDB) biological pathways database, which is an identifier from an external database integrated into CPDB. - beta12orEarlier - - - - - - - - - - - Pathway ID (Panther) - - Identifier of a biological pathway from the Panther Pathways database. - beta12orEarlier - PTHR[0-9]{5} - Panther Pathways ID - - - - - - - - - - - MIRIAM identifier - - - - - - - - Unique identifier of a MIRIAM data resource. - MIR:00100005 - MIR:[0-9]{8} - beta12orEarlier - This is the identifier used internally by MIRIAM for a data type. - - - - - - - - - - - MIRIAM data type name - - - - - - - - beta12orEarlier - The name of a data type from the MIRIAM database. 
- - - - - - - - - - - MIRIAM URI - - - - - - - - - beta12orEarlier - The URI (URL or URN) of a data entity from the MIRIAM database. - identifiers.org synonym - urn:miriam:pubmed:16333295|urn:miriam:obo.go:GO%3A0045202 - A MIRIAM URI consists of the URI of the MIRIAM data type (PubMed, UniProt etc) followed by the identifier of an element of that data type, for example PMID for a publication or an accession number for a GO term. - - - - - - - - - - - MIRIAM data type primary name - - beta12orEarlier - The primary name of a MIRIAM data type is taken from a controlled vocabulary. - UniProt|Enzyme Nomenclature - The primary name of a data type from the MIRIAM database. - - - - - - A protein entity has the MIRIAM data type 'UniProt', and an enzyme has the MIRIAM data type 'Enzyme Nomenclature'. - UniProt|Enzyme Nomenclature - - - - - - - - - - MIRIAM data type synonymous name - - A synonymous name of a data type from the MIRIAM database. - A synonymous name for a MIRIAM data type taken from a controlled vocabulary. - beta12orEarlier - - - - - - - - - - - Taverna workflow ID - - beta12orEarlier - Unique identifier of a Taverna workflow. - - - - - - - - - - - Biological model name - - - beta12orEarlier - Name of a biological (mathematical) model. - - - - - - - - - - - BioModel ID - - Unique identifier of an entry from the BioModel database. - beta12orEarlier - (BIOMD|MODEL)[0-9]{10} - - - - - - - - - - - PubChem CID - - - [0-9]+ - PubChem compound accession identifier - Chemical structure specified in PubChem Compound Identification (CID), a non-zero integer identifier for a unique chemical structure. - beta12orEarlier - - - - - - - - - - - ChemSpider ID - - Identifier of an entry from the ChemSpider database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ChEBI ID - - Identifier of an entry from the ChEBI database. 
- ChEBI IDs - ChEBI identifier - CHEBI:[0-9]+ - beta12orEarlier - - - - - - - - - - - BioPax concept ID - - beta12orEarlier - An identifier of a concept from the BioPax ontology. - - - - - - - - - - - GO concept ID - - GO concept identifier - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a concept from The Gene Ontology. - - - - - - - - - - - MeSH concept ID - - beta12orEarlier - An identifier of a concept from the MeSH vocabulary. - - - - - - - - - - - HGNC concept ID - - beta12orEarlier - An identifier of a concept from the HGNC controlled vocabulary. - - - - - - - - - - - NCBI taxonomy ID - - - NCBI taxonomy identifier - [1-9][0-9]{0,8} - NCBI tax ID - A stable unique identifier for each taxon (for a species, a family, an order, or any other group in the NCBI taxonomy database. - 9662|3483|182682 - beta12orEarlier - - - - - - - - - - - Plant Ontology concept ID - - An identifier of a concept from the Plant Ontology (PO). - beta12orEarlier - - - - - - - - - - - UMLS concept ID - - An identifier of a concept from the UMLS vocabulary. - beta12orEarlier - - - - - - - - - - - FMA concept ID - - An identifier of a concept from Foundational Model of Anatomy. - FMA:[0-9]+ - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia). Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - - - - - - - - - - - EMAP concept ID - - beta12orEarlier - An identifier of a concept from the EMAP mouse ontology. - - - - - - - - - - - ChEBI concept ID - - beta12orEarlier - An identifier of a concept from the ChEBI ontology. - - - - - - - - - - - MGED concept ID - - beta12orEarlier - An identifier of a concept from the MGED ontology. 
- - - - - - - - - - - myGrid concept ID - - beta12orEarlier - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology acts provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - An identifier of a concept from the myGrid ontology. - - - - - - - - - - - PubMed ID - - PMID - [1-9][0-9]{0,8} - PubMed unique identifier of an article. - beta12orEarlier - 4963447 - - - - - - - - - - - DOI - - beta12orEarlier - (doi\:)?[0-9]{2}\.[0-9]{4}/.* - Digital Object Identifier - Digital Object Identifier (DOI) of a published article. - - - - - - - - - - - Medline UI - - beta12orEarlier - Medline UI (unique identifier) of an article. - The use of Medline UI has been replaced by the PubMed unique identifier. - Medline unique identifier - - - - - - - - - - - Tool name - - The name of a computer package, application, method or function. - beta12orEarlier - - - - - - - - - - - Tool name (signature) - - beta12orEarlier - The unique name of a signature (sequence classifier) method. - Signature methods from http://www.ebi.ac.uk/Tools/InterProScan/help.html#results include BlastProDom, FPrintScan, HMMPIR, HMMPfam, HMMSmart, HMMTigr, ProfileScan, ScanRegExp, SuperFamily and HAMAP. - - - - - - - - - - - Tool name (BLAST) - - This include 'blastn', 'blastp', 'blastx', 'tblastn' and 'tblastx'. - The name of a BLAST tool. - beta12orEarlier - BLAST name - - - - - - - - - - - Tool name (FASTA) - - beta12orEarlier - The name of a FASTA tool. - This includes 'fasta3', 'fastx3', 'fasty3', 'fastf3', 'fasts3' and 'ssearch'. - - - - - - - - - - - Tool name (EMBOSS) - - The name of an EMBOSS application. - beta12orEarlier - - - - - - - - - - - Tool name (EMBASSY package) - - The name of an EMBASSY package. - beta12orEarlier - - - - - - - - - - - QSAR descriptor (constitutional) - - A QSAR constitutional descriptor. 
- beta12orEarlier - QSAR constitutional descriptor - - - - - - - - - - QSAR descriptor (electronic) - - beta12orEarlier - A QSAR electronic descriptor. - QSAR electronic descriptor - - - - - - - - - - QSAR descriptor (geometrical) - - QSAR geometrical descriptor - A QSAR geometrical descriptor. - beta12orEarlier - - - - - - - - - - QSAR descriptor (topological) - - beta12orEarlier - QSAR topological descriptor - A QSAR topological descriptor. - - - - - - - - - - QSAR descriptor (molecular) - - A QSAR molecular descriptor. - QSAR molecular descriptor - beta12orEarlier - - - - - - - - - - Sequence set (protein) - - Any collection of multiple protein sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. - beta12orEarlier - - - - - - - - - - Sequence set (nucleic acid) - - beta12orEarlier - Any collection of multiple nucleotide sequences and associated metadata that do not (typically) correspond to common sequence database records or database entries. - - - - - - - - - - Sequence cluster - - - - - - - - A set of sequences that have been clustered or otherwise classified as belonging to a group including (typically) sequence cluster information. - The cluster might include sequences identifiers, short descriptions, alignment and summary information. - beta12orEarlier - - - - - - - - - - Psiblast checkpoint file - - beta12orEarlier - A Psiblast checkpoint file uses ASN.1 Binary Format and usually has the extension '.asn'. - beta12orEarlier - true - A file of intermediate results from a PSIBLAST search that is used for priming the search in the next PSIBLAST iteration. - - - - - - - - - - HMMER synthetic sequences set - - Sequences generated by HMMER package in FASTA-style format. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Proteolytic digest - - - - - - - - beta12orEarlier - A protein sequence cleaved into peptide fragments (by enzymatic or chemical cleavage) with fragment masses. 
- - - - - - - - - - Restriction digest - - Restriction digest fragments from digesting a nucleotide sequence with restriction sites using a restriction endonuclease. - SO:0000412 - beta12orEarlier - - - - - - - - - - PCR primers - - beta12orEarlier - Oligonucleotide primer(s) for PCR and DNA amplification, for example a minimal primer set. - - - - - - - - - - vectorstrip cloning vector definition file - - beta12orEarlier - true - File of sequence vectors used by EMBOSS vectorstrip application, or any file in same format. - beta12orEarlier - - - - - - - - - - Primer3 internal oligo mishybridizing library - - true - beta12orEarlier - A library of nucleotide sequences to avoid during hybridization events. Hybridization of the internal oligo to sequences in this library is avoided, rather than priming from them. The file is in a restricted FASTA format. - beta12orEarlier - - - - - - - - - - Primer3 mispriming library file - - true - A nucleotide sequence library of sequences to avoid during amplification (for example repetitive sequences, or possibly the sequences of genes in a gene family that should not be amplified. The file must is in a restricted FASTA format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - primersearch primer pairs sequence record - - true - beta12orEarlier - beta12orEarlier - File of one or more pairs of primer sequences, as used by EMBOSS primersearch application. - - - - - - - - - - Sequence cluster (protein) - - - Protein sequence cluster - The sequences are typically related, for example a family of sequences. - beta12orEarlier - A cluster of protein sequences. - - - - - - - - - - Sequence cluster (nucleic acid) - - - A cluster of nucleotide sequences. - Nucleotide sequence cluster - beta12orEarlier - The sequences are typically related, for example a family of sequences. - - - - - - - - - - Sequence length - - beta12orEarlier - The size (length) of a sequence, subsequence or region in a sequence, or range(s) of lengths. 
- - - - - - - - - - Word size - - Word size is used for example in word-based sequence database search methods. - Word length - 1.5 - Size of a sequence word. - true - beta12orEarlier - - - - - - - - - - Window size - - 1.5 - true - A window is a region of fixed size but not fixed position over a molecular sequence. It is typically moved (computationally) over a sequence during scoring. - beta12orEarlier - Size of a sequence window. - - - - - - - - - - Sequence length range - - true - Specification of range(s) of length of sequences. - beta12orEarlier - 1.5 - - - - - - - - - - Sequence information report - - Report on basic information about a molecular sequence such as name, accession number, type (nucleic or protein), length, description etc. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence property - - beta12orEarlier - An informative report about non-positional sequence features, typically a report on general molecular sequence properties derived from sequence analysis. - Sequence properties report - - - - - - - - - - Sequence features - - Sequence features report - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D058977 - SO:0000110 - This includes annotation of positional sequence features, organized into a standard feature table, or any other report of sequence features. General feature reports are a source of sequence feature table information although internal conversion would be required. - General sequence features - Annotation of positional features of molecular sequence(s), i.e. that can be mapped to position(s) in the sequence. - Features - Feature record - - - - - - - - - - Sequence features (comparative) - - Comparative data on sequence features such as statistics, intersections (and data on intersections), differences etc. - beta13 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
- true - beta12orEarlier - - - - - - - - - - Sequence property (protein) - - true - A report of general sequence properties derived from protein sequence data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence property (nucleic acid) - - A report of general sequence properties derived from nucleotide sequence data. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence complexity report - - A report on sequence complexity, for example low-complexity or repeat regions in sequences. - beta12orEarlier - Sequence property (complexity) - - - - - - - - - - Sequence ambiguity report - - A report on ambiguity in molecular sequence(s). - Sequence property (ambiguity) - beta12orEarlier - - - - - - - - - - Sequence composition report - - beta12orEarlier - A report (typically a table) on character or word composition / frequency of a molecular sequence(s). - Sequence property (composition) - - - - - - - - - - Peptide molecular weight hits - - A report on peptide fragments of certain molecular weight(s) in one or more protein sequences. - beta12orEarlier - - - - - - - - - - Base position variability plot - - beta12orEarlier - A plot of third base position variability in a nucleotide sequence. - - - - - - - - - - Sequence composition table - - A table of character or word composition / frequency of a molecular sequence. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Base frequencies table - - - beta12orEarlier - A table of base frequencies of a nucleotide sequence. - - - - - - - - - - Base word frequencies table - - - A table of word composition of a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Amino acid frequencies table - - - Sequence composition (amino acid frequencies) - A table of amino acid frequencies of a protein sequence. - beta12orEarlier - - - - - - - - - - Amino acid word frequencies table - - - A table of amino acid word composition of a protein sequence. 
- Sequence composition (amino acid words) - beta12orEarlier - - - - - - - - - - DAS sequence feature annotation - - beta12orEarlier - Annotation of a molecular sequence in DAS format. - beta12orEarlier - true - - - - - - - - - - Feature table - - Sequence feature table - beta12orEarlier - Annotation of positional sequence features, organized into a standard feature table. - - - - - - - - - - Map - - - - - - - - DNA map - beta12orEarlier - A map of (typically one) DNA sequence annotated with positional or non-positional features. - - - - - - - - - - Nucleic acid features - - - An informative report on intrinsic positional features of a nucleotide sequence. - beta12orEarlier - Genome features - This includes nucleotide sequence feature annotation in any known sequence feature table format and any other report of nucleic acid features. - Genomic features - Nucleic acid feature table - Feature table (nucleic acid) - - - - - - - - - - Protein features - - - An informative report on intrinsic positional features of a protein sequence. - beta12orEarlier - This includes protein sequence feature annotation in any known sequence feature table format and any other report of protein features. - Feature table (protein) - Protein feature table - - - - - - - - - - Genetic map - - A map showing the relative positions of genetic markers in a nucleic acid sequence, based on estimation of non-physical distance such as recombination frequencies. - beta12orEarlier - A genetic (linkage) map indicates the proximity of two genes on a chromosome, whether two genes are linked and the frequency they are transmitted together to an offspring. They are limited to genetic markers of traits observable only in whole organisms. - Linkage map - Moby:GeneticMap - - - - - - - - - - Sequence map - - A sequence map typically includes annotation on significant subsequences such as contigs, haplotypes and genes. 
The contigs shown will (typically) be a set of small overlapping clones representing a complete chromosomal segment. - beta12orEarlier - A map of genetic markers in a contiguous, assembled genomic sequence, with the sizes and separation of markers measured in base pairs. - - - - - - - - - - Physical map - - A map of DNA (linear or circular) annotated with physical features or landmarks such as restriction sites, cloned DNA fragments, genes or genetic markers, along with the physical distances between them. - Distance in a physical map is measured in base pairs. A physical map might be ordered relative to a reference map (typically a genetic map) in the process of genome sequencing. - beta12orEarlier - - - - - - - - - - Sequence signature map - - true - Image of a sequence with matches to signatures, motifs or profiles. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cytogenetic map - - beta12orEarlier - A map showing banding patterns derived from direct observation of a stained chromosome. - Cytologic map - Chromosome map - Cytogenic map - This is the lowest-resolution physical map and can provide only rough estimates of physical (base pair) distances. Like a genetic map, they are limited to genetic markers of traits observable only in whole organisms. - - - - - - - - - - DNA transduction map - - beta12orEarlier - A gene map showing distances between loci based on relative cotransduction frequencies. - - - - - - - - - - Gene map - - Sequence map of a single gene annotated with genetic features such as introns, exons, untranslated regions, polyA signals, promoters, enhancers and (possibly) mutations defining alleles of a gene. - beta12orEarlier - - - - - - - - - - Plasmid map - - Sequence map of a plasmid (circular DNA). - beta12orEarlier - - - - - - - - - - Genome map - - beta12orEarlier - Sequence map of a whole genome. - - - - - - - - - - Restriction map - - - Image of the restriction enzyme cleavage sites (restriction sites) in a nucleic acid sequence. 
- beta12orEarlier - - - - - - - - - - InterPro compact match image - - beta12orEarlier - Image showing matches between protein sequence(s) and InterPro Entries. - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Each protein is represented as a scaled horizontal line with colored bars indicating the position of the matches. - beta12orEarlier - true - - - - - - - - - - InterPro detailed match image - - beta12orEarlier - beta12orEarlier - Image showing detailed information on matches between protein sequence(s) and InterPro Entries. - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. - true - - - - - - - - - - InterPro architecture image - - beta12orEarlier - beta12orEarlier - true - The sequence(s) might be screened against InterPro, or be the sequences from the InterPro entry itself. Domain architecture is shown as a series of non-overlapping domains in the protein. - Image showing the architecture of InterPro domains in a protein sequence. - - - - - - - - - - SMART protein schematic - - true - beta12orEarlier - beta12orEarlier - SMART protein schematic in PNG format. - - - - - - - - - - GlobPlot domain image - - beta12orEarlier - beta12orEarlier - true - Images based on GlobPlot prediction of intrinsic disordered regions and globular domains in protein sequences. - - - - - - - - - - Sequence motif matches - - beta12orEarlier - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more sequences. - 1.8 - true - - - - - - - - - - Sequence features (repeats) - - beta12orEarlier - true - 1.5 - Repeat sequence map - The report might include derived data map such as classification, annotation, organization, periodicity etc. - Location of short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. 
- - - - - - - - - - Gene and transcript structure (report) - - 1.5 - beta12orEarlier - A report on predicted or actual gene structure, regions which make an RNA product and features such as promoters, coding regions, splice sites etc. - true - - - - - - - - - - Mobile genetic elements - - true - beta12orEarlier - regions of a nucleic acid sequence containing mobile genetic elements. - 1.8 - - - - - - - - - - Nucleic acid features report (PolyA signal or site) - - true - regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. - 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features (quadruplexes) - - true - 1.5 - A report on quadruplex-forming motifs in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Nucleic acid features report (CpG island and isochore) - - 1.8 - CpG rich regions (isochores) in a nucleotide sequence. - beta12orEarlier - true - - - - - - - - - - Nucleic acid features report (restriction sites) - - beta12orEarlier - true - 1.8 - restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - - - - - - - - - - Nucleosome exclusion sequences - - beta12orEarlier - true - Report on nucleosome formation potential or exclusion sequence(s). - 1.8 - - - - - - - - - - Nucleic acid features report (splice sites) - - splice sites in a nucleotide sequence or alternative RNA splicing events. - beta12orEarlier - true - 1.8 - - - - - - - - - - Nucleic acid features report (matrix/scaffold attachment sites) - - 1.8 - matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Gene features (exonic splicing enhancer) - - beta12orEarlier - beta13 - true - A report on exonic splicing enhancers (ESE) in an exon. 
- - - - - - - - - - Nucleic acid features (microRNA) - - true - beta12orEarlier - A report on microRNA sequence (miRNA) or precursor, microRNA targets, miRNA binding sites in an RNA sequence etc. - 1.5 - - - - - - - - - - Gene features report (operon) - - true - operons (operators, promoters and genes) from a bacterial genome. - 1.8 - beta12orEarlier - - - - - - - - - - Nucleic acid features report (promoters) - - 1.8 - whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - true - beta12orEarlier - - - - - - - - - - Coding region - - beta12orEarlier - protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. - 1.8 - true - - - - - - - - - - Gene features (SECIS element) - - beta12orEarlier - beta13 - A report on selenocysteine insertion sequence (SECIS) element in a DNA sequence. - true - - - - - - - - - - Transcription factor binding sites - - transcription factor binding sites (TFBS) in a DNA sequence. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein features (sites) - - true - beta12orEarlier - Use this concept for collections of specific sites which are not necessarily contiguous, rather than contiguous stretches of amino acids. - beta12orEarlier - A report on predicted or known key residue positions (sites) in a protein sequence, such as binding or functional sites. - - - - - - - - - - Protein features report (signal peptides) - - true - signal peptides or signal peptide cleavage sites in protein sequences. - 1.8 - beta12orEarlier - - - - - - - - - - Protein features report (cleavage sites) - - true - 1.8 - cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. 
- beta12orEarlier - - - - - - - - - - Protein features (post-translation modifications) - - true - beta12orEarlier - post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein features report (active sites) - - 1.8 - true - beta12orEarlier - catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein features report (binding sites) - - beta12orEarlier - ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids. - true - 1.8 - - - - - - - - - - Protein features (epitopes) - - A report on antigenic determinant sites (epitopes) in proteins, from sequence and / or structural data. - beta13 - beta12orEarlier - Epitope mapping is commonly done during vaccine design. - true - - - - - - - - - - Protein features report (nucleic acid binding sites) - - true - beta12orEarlier - 1.8 - RNA and DNA-binding proteins and binding sites in protein sequences. - - - - - - - - - - MHC Class I epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on epitopes that bind to MHC class I molecules. - - - - - - - - - - MHC Class II epitopes report - - beta12orEarlier - beta12orEarlier - true - A report on predicted epitopes that bind to MHC class II molecules. - - - - - - - - - - Protein features (PEST sites) - - beta12orEarlier - A report or plot of PEST sites in a protein sequence. - true - beta13 - 'PEST' motifs target proteins for proteolytic degradation and reduce the half-lives of proteins dramatically. - - - - - - - - - - Sequence database hits scores list - - Scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database hits alignments list - - beta12orEarlier - Alignments from a sequence database search (for example a BLAST search). 
- beta12orEarlier - true - - - - - - - - - - Sequence database hits evaluation data - - beta12orEarlier - A report on the evaluation of the significance of sequence similarity scores from a sequence database search (for example a BLAST search). - beta12orEarlier - true - - - - - - - - - - MEME motif alphabet - - Alphabet for the motifs (patterns) that MEME will search for. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - MEME background frequencies file - - MEME background frequencies file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - MEME motifs directive file - - beta12orEarlier - true - File of directives for ordering and spacing of MEME motifs. - beta12orEarlier - - - - - - - - - - Dirichlet distribution - - Dirichlet distribution used by hidden Markov model analysis programs. - beta12orEarlier - - - - - - - - - - HMM emission and transition counts - - Emission and transition counts of a hidden Markov model, generated once HMM has been determined, for example after residues/gaps have been assigned to match, delete and insert states. - true - 1.4 - beta12orEarlier - - - - - - - - - - - Regular expression - - Regular expression pattern. - beta12orEarlier - - - - - - - - - - Sequence motif - - - - - - - - beta12orEarlier - Any specific or conserved pattern (typically expressed as a regular expression) in a molecular sequence. - - - - - - - - - - Sequence profile - - - - - - - - Some type of statistical model representing a (typically multiple) sequence alignment. - http://semanticscience.org/resource/SIO_010531 - beta12orEarlier - - - - - - - - - - Protein signature - - An informative report about a specific or conserved protein sequence pattern. - InterPro entry - Protein repeat signature - Protein region signature - Protein site signature - beta12orEarlier - Protein family signature - Protein domain signature - - - - - - - - - - Prosite nucleotide pattern - - A nucleotide regular expression pattern from the Prosite database. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Prosite protein pattern - - A protein regular expression pattern from the Prosite database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Position frequency matrix - - beta12orEarlier - PFM - A profile (typically representing a sequence alignment) that is a simple matrix of nucleotide (or amino acid) counts per position. - - - - - - - - - - Position weight matrix - - PWM - beta12orEarlier - A profile (typically representing a sequence alignment) that is weighted matrix of nucleotide (or amino acid) counts per position. - Contributions of individual sequences to the matrix might be uneven (weighted). - - - - - - - - - - Information content matrix - - beta12orEarlier - ICM - A profile (typically representing a sequence alignment) derived from a matrix of nucleotide (or amino acid) counts per position that reflects information content at each position. - - - - - - - - - - Hidden Markov model - - HMM - beta12orEarlier - A hidden Markov model representation of a set or alignment of sequences. - - - - - - - - - - Fingerprint - - beta12orEarlier - One or more fingerprints (sequence classifiers) as used in the PRINTS database. - - - - - - - - - - Domainatrix signature - - A protein signature of the type used in the EMBASSY Signature package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - HMMER NULL hidden Markov model - - beta12orEarlier - beta12orEarlier - true - NULL hidden Markov model representation used by the HMMER package. - - - - - - - - - - Protein family signature - - Protein family signatures cover all domains in the matching proteins and span >80% of the protein length and with no adjacent protein domain signatures or protein region signatures. - beta12orEarlier - true - 1.5 - A protein family signature (sequence classifier) from the InterPro database. 
- - - - - - - - - - Protein domain signature - - beta12orEarlier - 1.5 - true - A protein domain signature (sequence classifier) from the InterPro database. - Protein domain signatures identify structural or functional domains or other units with defined boundaries. - - - - - - - - - - Protein region signature - - A protein region signature (sequence classifier) from the InterPro database. - true - beta12orEarlier - 1.5 - A protein region signature defines a region which cannot be described as a protein family or domain signature. - - - - - - - - - - Protein repeat signature - - true - 1.5 - A protein repeat signature is a repeated protein motif, that is not in single copy expected to independently fold into a globular domain. - beta12orEarlier - A protein repeat signature (sequence classifier) from the InterPro database. - - - - - - - - - - Protein site signature - - A protein site signature is a classifier for a specific site in a protein. - beta12orEarlier - A protein site signature (sequence classifier) from the InterPro database. - true - 1.5 - - - - - - - - - - Protein conserved site signature - - 1.4 - true - A protein conserved site signature is any short sequence pattern that may contain one or more unique residues and is cannot be described as a active site, binding site or post-translational modification. - A protein conserved site signature (sequence classifier) from the InterPro database. - beta12orEarlier - - - - - - - - - - Protein active site signature - - A protein active site signature (sequence classifier) from the InterPro database. - A protein active site signature corresponds to an enzyme catalytic pocket. An active site typically includes non-contiguous residues, therefore multiple signatures may be required to describe an active site. ; residues involved in enzymatic reactions for which mutational data is typically available. 
- true - 1.4 - beta12orEarlier - - - - - - - - - - Protein binding site signature - - 1.4 - A protein binding site signature (sequence classifier) from the InterPro database. - true - A protein binding site signature corresponds to a site that reversibly binds chemical compounds, which are not themselves substrates of the enzymatic reaction. This includes enzyme cofactors and residues involved in electron transport or protein structure modification. - beta12orEarlier - - - - - - - - - - Protein post-translational modification signature - - A protein post-translational modification signature (sequence classifier) from the InterPro database. - A protein post-translational modification signature corresponds to sites that undergo modification of the primary structure, typically to activate or de-activate a function. For example, methylation, sumoylation, glycosylation etc. The modification might be permanent or reversible. - 1.4 - beta12orEarlier - true - - - - - - - - - - Sequence alignment (pair) - - http://semanticscience.org/resource/SIO_010068 - beta12orEarlier - Alignment of exactly two molecular sequences. - - - - - - - - - - Sequence alignment (multiple) - - beta12orEarlier - beta12orEarlier - Alignment of more than two molecular sequences. - true - - - - - - - - - - Sequence alignment (nucleic acid) - - beta12orEarlier - Alignment of multiple nucleotide sequences. - - - - - - - - - - Sequence alignment (protein) - - - Alignment of multiple protein sequences. - beta12orEarlier - - - - - - - - - - Sequence alignment (hybrid) - - Alignment of multiple molecular sequences of different types. - Hybrid sequence alignments include for example genomic DNA to EST, cDNA or mRNA. - beta12orEarlier - - - - - - - - - - Sequence alignment (nucleic acid pair) - - beta12orEarlier - Alignment of exactly two nucleotide sequences. - true - 1.12 - - - - - - - - - - - Sequence alignment (protein pair) - - true - 1.12 - Alignment of exactly two protein sequences. 
- beta12orEarlier - - - - - - - - - - - Hybrid sequence alignment (pair) - - true - beta12orEarlier - beta12orEarlier - Alignment of exactly two molecular sequences of different types. - - - - - - - - - - Multiple nucleotide sequence alignment - - beta12orEarlier - Alignment of more than two nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - Multiple protein sequence alignment - - true - beta12orEarlier - beta12orEarlier - Alignment of more than two protein sequences. - - - - - - - - - - Alignment score or penalty - - beta12orEarlier - A simple floating point number defining the penalty for opening or extending a gap in an alignment. - - - - - - - - - - Score end gaps control - - beta12orEarlier - beta12orEarlier - Whether end gaps are scored or not. - true - - - - - - - - - - Aligned sequence order - - beta12orEarlier - beta12orEarlier - true - Controls the order of sequences in an output sequence alignment. - - - - - - - - - - Gap opening penalty - - A penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap extension penalty - - A penalty for extending a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap separation penalty - - beta12orEarlier - A penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap penalty - - beta12orEarlier - A penalty for gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - true - beta12orEarlier - - - - - - - - - - - Match reward score - - beta12orEarlier - The score for a 'match' used in various sequence database search applications with simple scoring schemes. - - - - - - - - - - Mismatch penalty score - - beta12orEarlier - The score (penalty) for a 'mismatch' used in various alignment and sequence database search applications with simple scoring schemes. - - - - - - - - - - Drop off score - - This is the threshold drop in score at which extension of word alignment is halted. 
- beta12orEarlier - - - - - - - - - - Gap opening penalty (integer) - - beta12orEarlier - true - A simple floating point number defining the penalty for opening a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap opening penalty (float) - - beta12orEarlier - beta12orEarlier - A simple floating point number defining the penalty for opening a gap in an alignment. - true - - - - - - - - - - Gap extension penalty (integer) - - true - A simple floating point number defining the penalty for extending a gap in an alignment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gap extension penalty (float) - - beta12orEarlier - true - A simple floating point number defining the penalty for extending a gap in an alignment. - beta12orEarlier - - - - - - - - - - Gap separation penalty (integer) - - A simple floating point number defining the penalty for gaps that are close together in an alignment. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gap separation penalty (float) - - beta12orEarlier - true - beta12orEarlier - A simple floating point number defining the penalty for gaps that are close together in an alignment. - - - - - - - - - - Terminal gap opening penalty - - beta12orEarlier - A number defining the penalty for opening gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - - - - - - - - - - Terminal gap extension penalty - - A number defining the penalty for extending gaps at the termini of an alignment, either from the N/C terminal of protein or 5'/3' terminal of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Sequence identity - - Sequence identity is the number (%) of matches (identical characters) in positions from an alignment of two molecular sequences. 
- beta12orEarlier - - - - - - - - - - Sequence similarity - - beta12orEarlier - Sequence similarity is the similarity (expressed as a percentage) of two molecular sequences calculated from their alignment, a scoring matrix for scoring characters substitutions and penalties for gap insertion and extension. - Data Type is float probably. - - - - - - - - - - Sequence alignment metadata (quality report) - - beta12orEarlier - true - beta12orEarlier - Data on molecular sequence alignment quality (estimated accuracy). - - - - - - - - - - Sequence alignment report (site conservation) - - beta12orEarlier - Data on character conservation in a molecular sequence alignment. - 1.4 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - true - - - - - - - - - - Sequence alignment report (site correlation) - - 1.4 - beta12orEarlier - Data on correlations between sites in a molecular sequence alignment, typically to identify possible covarying positions and predict contacts or structural constraints in protein structures. - true - - - - - - - - - - Sequence-profile alignment (Domainatrix signature) - - beta12orEarlier - Alignment of molecular sequences to a Domainatrix signature (representing a sequence alignment). - beta12orEarlier - true - - - - - - - - - - Sequence-profile alignment (HMM) - - beta12orEarlier - 1.5 - true - Alignment of molecular sequence(s) to a hidden Markov model(s). - - - - - - - - - - Sequence-profile alignment (fingerprint) - - Alignment of molecular sequences to a protein fingerprint from the PRINTS database. 
- 1.5 - beta12orEarlier - true - - - - - - - - - - Phylogenetic continuous quantitative data - - beta12orEarlier - Phylogenetic continuous quantitative characters - Quantitative traits - Continuous quantitative data that may be read during phylogenetic tree calculation. - - - - - - - - - - Phylogenetic discrete data - - Discrete characters - Character data with discrete states that may be read during phylogenetic tree calculation. - Phylogenetic discrete states - beta12orEarlier - Discretely coded characters - - - - - - - - - - Phylogenetic character cliques - - One or more cliques of mutually compatible characters that are generated, for example from analysis of discrete character data, and are used to generate a phylogeny. - Phylogenetic report (cliques) - beta12orEarlier - - - - - - - - - - Phylogenetic invariants - - - - - - - - Phylogenetic invariants data for testing alternative tree topologies. - beta12orEarlier - Phylogenetic report (invariants) - - - - - - - - - - Phylogenetic report - - Phylogenetic tree-derived report - This is a broad data type and is used for example for reports on confidence, shape or stratigraphic (age) data derived from phylogenetic tree analysis. - beta12orEarlier - A report of data concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - Phylogenetic tree report - 1.5 - true - - - - - - - - - - DNA substitution model - - Substitution model - Phylogenetic tree report (DNA substitution model) - Sequence alignment report (DNA substitution model) - beta12orEarlier - A model of DNA substitution that explains a DNA sequence alignment, derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic tree report (tree shape) - - beta12orEarlier - true - 1.4 - Data about the shape of a phylogenetic tree. - - - - - - - - - - Phylogenetic tree report (tree evaluation) - - beta12orEarlier - true - 1.4 - Data on the confidence of a phylogenetic tree. 
- - - - - - - - - - Phylogenetic tree distances - - beta12orEarlier - Phylogenetic tree report (tree distances) - Distances, such as Branch Score distance, between two or more phylogenetic trees. - - - - - - - - - - Phylogenetic tree report (tree stratigraphic) - - beta12orEarlier - 1.4 - true - Molecular clock and stratigraphic (age) data derived from phylogenetic tree analysis. - - - - - - - - - - Phylogenetic character contrasts - - Phylogenetic report (character contrasts) - Independent contrasts for characters used in a phylogenetic tree, or covariances, regressions and correlations between characters for those contrasts. - beta12orEarlier - - - - - - - - - - Comparison matrix (integers) - - beta12orEarlier - Substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for sequence comparison. - true - - - - - - - - - - Comparison matrix (floats) - - beta12orEarlier - beta12orEarlier - true - Matrix of floating point numbers for sequence comparison. - Substitution matrix (floats) - - - - - - - - - - Comparison matrix (nucleotide) - - Matrix of integer or floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix - - - - - - - - - - Comparison matrix (amino acid) - - - Amino acid comparison matrix - beta12orEarlier - Matrix of integer or floating point numbers for amino acid comparison. - Amino acid substitution matrix - - - - - - - - - - Nucleotide comparison matrix (integers) - - Nucleotide substitution matrix (integers) - beta12orEarlier - Matrix of integer numbers for nucleotide comparison. - true - beta12orEarlier - - - - - - - - - - Nucleotide comparison matrix (floats) - - beta12orEarlier - true - Matrix of floating point numbers for nucleotide comparison. - beta12orEarlier - Nucleotide substitution matrix (floats) - - - - - - - - - - Amino acid comparison matrix (integers) - - beta12orEarlier - Matrix of integer numbers for amino acid comparison. 
- Amino acid substitution matrix (integers) - true - beta12orEarlier - - - - - - - - - - Amino acid comparison matrix (floats) - - beta12orEarlier - Amino acid substitution matrix (floats) - beta12orEarlier - true - Matrix of floating point numbers for amino acid comparison. - - - - - - - - - - Protein features report (membrane regions) - - true - beta12orEarlier - 1.8 - trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Nucleic acid structure - - - - - - - - 3D coordinate and associated data for a nucleic acid tertiary (3D) structure. - beta12orEarlier - - - - - - - - - - Protein structure - - - - - - - - Protein structures - 3D coordinate and associated data for a protein tertiary (3D) structure. - beta12orEarlier - - - - - - - - - - Protein-ligand complex - - The structure of a protein in complex with a ligand, typically a small molecule such as an enzyme substrate or cofactor, but possibly another macromolecule. - beta12orEarlier - This includes interactions of proteins with atoms, ions and small molecules or macromolecules such as nucleic acids or other polypeptides. For stable inter-polypeptide interactions use 'Protein complex' instead. - - - - - - - - - - Carbohydrate structure - - - - - - - - - - - - - - beta12orEarlier - 3D coordinate and associated data for a carbohydrate (3D) structure. - - - - - - - - - - Small molecule structure - - - - - - - - 3D coordinate and associated data for the (3D) structure of a small molecule, such as any common chemical compound. - CHEBI:23367 - beta12orEarlier - - - - - - - - - - DNA structure - - beta12orEarlier - 3D coordinate and associated data for a DNA tertiary (3D) structure. - - - - - - - - - - RNA structure - - - - - - - - beta12orEarlier - 3D coordinate and associated data for an RNA tertiary (3D) structure. 
- - - - - - - - - - tRNA structure - - 3D coordinate and associated data for a tRNA tertiary (3D) structure, including tmRNA, snoRNAs etc. - beta12orEarlier - - - - - - - - - - Protein chain - - beta12orEarlier - 3D coordinate and associated data for the tertiary (3D) structure of a polypeptide chain. - - - - - - - - - - Protein domain - - - - - - - - 3D coordinate and associated data for the tertiary (3D) structure of a protein domain. - beta12orEarlier - - - - - - - - - - Protein structure (all atoms) - - beta12orEarlier - 1.5 - true - 3D coordinate and associated data for a protein tertiary (3D) structure (all atoms). - - - - - - - - - - C-alpha trace - - 3D coordinate and associated data for a protein tertiary (3D) structure (typically C-alpha atoms only). - C-beta atoms from amino acid side-chains may be included. - Protein structure (C-alpha atoms) - beta12orEarlier - - - - - - - - - - Protein chain (all atoms) - - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (all atoms). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Protein chain (C-alpha atoms) - - true - 3D coordinate and associated data for a polypeptide chain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Protein domain (all atoms) - - 3D coordinate and associated data for a protein domain tertiary (3D) structure (all atoms). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Protein domain (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - 3D coordinate and associated data for a protein domain tertiary (3D) structure (typically C-alpha atoms only). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Structure alignment (pair) - - Alignment (superimposition) of exactly two molecular tertiary (3D) structures. 
- beta12orEarlier - Pair structure alignment - - - - - - - - - - Structure alignment (multiple) - - beta12orEarlier - beta12orEarlier - true - Alignment (superimposition) of more than two molecular tertiary (3D) structures. - - - - - - - - - - Structure alignment (protein) - - - Protein structure alignment - beta12orEarlier - Alignment (superimposition) of protein tertiary (3D) structures. - - - - - - - - - - Structure alignment (nucleic acid) - - beta12orEarlier - Alignment (superimposition) of nucleic acid tertiary (3D) structures. - Nucleic acid structure alignment - - - - - - - - - - Structure alignment (protein pair) - - 1.12 - Protein pair structural alignment - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures. - - - - - - - - - - - Multiple protein tertiary structure alignment - - Alignment (superimposition) of more than two protein tertiary (3D) structures. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structure alignment (protein all atoms) - - 1.5 - Alignment (superimposition) of protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - true - - - - - - - - - - Structure alignment (protein C-alpha atoms) - - Alignment (superimposition) of protein tertiary (3D) structures (typically C-alpha atoms only considered). - C-beta atoms from amino acid side-chains may be considered. - 1.5 - C-alpha trace - true - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (all atoms) - - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Pairwise protein tertiary structure alignment (C-alpha atoms) - - C-beta atoms from amino acid side-chains may be included. - true - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). 
- beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (all atoms) - - beta12orEarlier - true - Alignment (superimposition) of exactly two protein tertiary (3D) structures (all atoms considered). - beta12orEarlier - - - - - - - - - - Multiple protein tertiary structure alignment (C-alpha atoms) - - beta12orEarlier - Alignment (superimposition) of exactly two protein tertiary (3D) structures (typically C-alpha atoms only considered). - true - beta12orEarlier - C-beta atoms from amino acid side-chains may be included. - - - - - - - - - - Structure alignment (nucleic acid pair) - - beta12orEarlier - 1.12 - true - Nucleic acid pair structure alignment - Alignment (superimposition) of exactly two nucleic acid tertiary (3D) structures. - - - - - - - - - - - Multiple nucleic acid tertiary structure alignment - - beta12orEarlier - Alignment (superimposition) of more than two nucleic acid tertiary (3D) structures. - true - beta12orEarlier - - - - - - - - - - Structure alignment (RNA) - - RNA structure alignment - Alignment (superimposition) of RNA tertiary (3D) structures. - beta12orEarlier - - - - - - - - - Structural transformation matrix - - Matrix to transform (rotate/translate) 3D coordinates, typically the transformation necessary to superimpose two molecular structures. - beta12orEarlier - - - - - - - - - - DaliLite hit table - - DaliLite hit table of protein chain tertiary structure alignment data. - The significant and top-scoring hits for regions of the compared structures is shown. Data such as Z-Scores, number of aligned residues, root-mean-square deviation (RMSD) of atoms and sequence identity are given. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Molecular similarity score - - beta12orEarlier - A score reflecting structural similarities of two molecules. 
- true - beta12orEarlier - - - - - - - - - - Root-mean-square deviation - - RMSD - beta12orEarlier - Root-mean-square deviation (RMSD) is calculated to measure the average distance between superimposed macromolecular coordinates. - - - - - - - - - - Tanimoto similarity score - - beta12orEarlier - A measure of the similarity between two ligand fingerprints. - A ligand fingerprint is derived from ligand structural data from a Protein DataBank file. It reflects the elements or groups present or absent, covalent bonds and bond orders and the bonded environment in terms of SATIS codes and BLEEP atom types. - - - - - - - - - - 3D-1D scoring matrix - - A matrix of 3D-1D scores reflecting the probability of amino acids to occur in different tertiary structural environments. - beta12orEarlier - - - - - - - - - - Amino acid index - - - beta12orEarlier - A table of 20 numerical values which quantify a property (e.g. physicochemical or biochemical) of the common amino acids. - - - - - - - - - - Amino acid index (chemical classes) - - Chemical classes (amino acids) - Chemical classification (small, aliphatic, aromatic, polar, charged etc) of amino acids. - beta12orEarlier - - - - - - - - - - Amino acid pair-wise contact potentials - - Contact potentials (amino acid pair-wise) - Statistical protein contact potentials. - beta12orEarlier - - - - - - - - - - Amino acid index (molecular weight) - - Molecular weights of amino acids. - Molecular weight (amino acids) - beta12orEarlier - - - - - - - - - - Amino acid index (hydropathy) - - Hydrophobic, hydrophilic or charge properties of amino acids. - beta12orEarlier - Hydropathy (amino acids) - - - - - - - - - - Amino acid index (White-Wimley data) - - beta12orEarlier - White-Wimley data (amino acids) - Experimental free energy values for the water-interface and water-octanol transitions for the amino acids. 
- - - - - - - - - - Amino acid index (van der Waals radii) - - van der Waals radii (amino acids) - Van der Waals radii of atoms for different amino acid residues. - beta12orEarlier - - - - - - - - - - Enzyme report - - true - 1.5 - Protein report (enzyme) - beta12orEarlier - An informative report on a specific enzyme. - - - - - - - - - - Restriction enzyme report - - An informative report on a specific restriction enzyme such as enzyme reference data. - This might include name of enzyme, organism, isoschizomers, methylation, source, suppliers, literature references, or data on restriction enzyme patterns such as name of enzyme, recognition site, length of pattern, number of cuts made by enzyme, details of blunt or sticky end cut etc. - Restriction enzyme pattern data - Protein report (restriction enzyme) - beta12orEarlier - true - 1.5 - - - - - - - - - - Peptide molecular weights - - beta12orEarlier - List of molecular weight(s) of one or more proteins or peptides, for example cut by proteolytic enzymes or reagents. - The report might include associated data such as frequency of peptide fragment molecular weights. - - - - - - - - - - Peptide hydrophobic moment - - beta12orEarlier - Report on the hydrophobic moment of a polypeptide sequence. - Hydrophobic moment is a peptides hydrophobicity measured for different angles of rotation. - - - - - - - - - - Protein aliphatic index - - The aliphatic index of a protein. - beta12orEarlier - The aliphatic index is the relative protein volume occupied by aliphatic side chains. - - - - - - - - - - Protein sequence hydropathy plot - - Hydrophobic moment is a peptides hydrophobicity measured for different angles of rotation. - A protein sequence with annotation on hydrophobic or hydrophilic / charged regions, hydrophobicity plot etc. 
- beta12orEarlier - - - - - - - - - - Protein charge plot - - beta12orEarlier - A plot of the mean charge of the amino acids within a window of specified length as the window is moved along a protein sequence. - - - - - - - - - - Protein solubility - - beta12orEarlier - The solubility or atomic solvation energy of a protein sequence or structure. - Protein solubility data - - - - - - - - - - Protein crystallizability - - beta12orEarlier - Protein crystallizability data - Data on the crystallizability of a protein sequence. - - - - - - - - - - Protein globularity - - Protein globularity data - beta12orEarlier - Data on the stability, intrinsic disorder or globularity of a protein sequence. - - - - - - - - - - Protein titration curve - - - The titration curve of a protein. - beta12orEarlier - - - - - - - - - - Protein isoelectric point - - beta12orEarlier - The isoelectric point of one proteins. - - - - - - - - - - Protein pKa value - - The pKa value of a protein. - beta12orEarlier - - - - - - - - - - Protein hydrogen exchange rate - - beta12orEarlier - The hydrogen exchange rate of a protein. - - - - - - - - - - Protein extinction coefficient - - The extinction coefficient of a protein. - beta12orEarlier - - - - - - - - - - Protein optical density - - The optical density of a protein. - beta12orEarlier - - - - - - - - - - Protein subcellular localization - - Protein report (subcellular localization) - An informative report on protein subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or destination (exported / extracellular proteins). - beta12orEarlier - true - beta13 - - - - - - - - - - Peptide immunogenicity data - - An report on allergenicity / immunogenicity of peptides and proteins. 
- Peptide immunogenicity report - beta12orEarlier - Peptide immunogenicity - This includes data on peptide ligands that elicit an immune response (immunogens), allergic cross-reactivity, predicted antigenicity (Hopp and Woods plot) etc. These data are useful in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. - - - - - - - - - - MHC peptide immunogenicity report - - A report on the immunogenicity of MHC class I or class II binding peptides. - beta13 - true - beta12orEarlier - - - - - - - - - - Protein structure report - - - Protein structural property - Protein structure-derived report - This includes for example reports on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure, protein flexibility or motion, and protein architecture (spatial arrangement of secondary structure). - Protein property (structural) - Annotation about, or structural information derived from, one or more specific protein 3D structure(s) or structural domains. - beta12orEarlier - Protein report (structure) - Protein structure report (domain) - - - - - - - - - - Protein structural quality report - - Report on the quality of a protein three-dimensional model. - Protein structure report (quality evaluation) - Protein structure validation report - Protein property (structural quality) - Model validation might involve checks for atomic packing, steric clashes, agreement with electron density maps etc. - Protein report (structural quality) - beta12orEarlier - - - - - - - - - - Protein non-covalent interactions report - - Data on inter-atomic or inter-residue contacts, distances and interactions in protein structure(s) or on the interactions of protein atoms or residues with non-protein groups. 
- beta12orEarlier - true - 1.12 - - - - - - - - - - Protein flexibility or motion report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein property (flexibility or motion) - Informative report on flexibility or motion of a protein structure. - Protein flexibility or motion - beta12orEarlier - true - 1.4 - Protein structure report (flexibility or motion) - - - - - - - - - - Protein solvent accessibility report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. This concept covers definitions of the protein surface, interior and interfaces, accessible and buried residues, surface accessible pockets, interior inaccessible cavities etc. - beta12orEarlier - Data on the solvent accessible or buried surface area of a protein structure. - - - - - - - - - - Protein surface report - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Protein structure report (surface) - 1.4 - Data on the surface properties (shape, hydropathy, electrostatic patches etc) of a protein structure. - beta12orEarlier - true - - - - - - - - - - Ramachandran plot - - beta12orEarlier - Phi/psi angle data or a Ramachandran plot of a protein structure. - - - - - - - - - - Protein dipole moment - - Data on the net charge distribution (dipole moment) of a protein structure. - beta12orEarlier - - - - - - - - - - Protein distance matrix - - - beta12orEarlier - A matrix of distances between amino acid residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map - - An amino acid residue contact map for a protein structure. 
- beta12orEarlier - - - - - - - - - - Protein residue 3D cluster - - beta12orEarlier - Report on clusters of contacting residues in protein structures such as a key structural residue network. - - - - - - - - - - Protein hydrogen bonds - - Patterns of hydrogen bonding in protein structures. - beta12orEarlier - - - - - - - - - - Protein non-canonical interactions - - Protein non-canonical interactions report - true - Non-canonical atomic interactions in protein structures. - 1.4 - beta12orEarlier - - - - - - - - - - CATH node - - Information on a node from the CATH database. - The report (for example http://www.cathdb.info/cathnode/1.10.10.10) includes CATH code (of the node and upper levels in the hierarchy), classification text (of appropriate levels in hierarchy), list of child nodes, representative domain and other relevant data and links. - 1.5 - beta12orEarlier - true - CATH classification node report - - - - - - - - - - SCOP node - - true - SCOP classification node - Information on a node from the SCOP database. - 1.5 - beta12orEarlier - - - - - - - - - - EMBASSY domain classification - - beta12orEarlier - beta12orEarlier - true - An EMBASSY domain classification file (DCF) of classification and other data for domains from SCOP or CATH, in EMBL-like format. - - - - - - - - - - CATH class - - beta12orEarlier - 1.5 - Information on a protein 'class' node from the CATH database. - true - - - - - - - - - - CATH architecture - - beta12orEarlier - 1.5 - Information on a protein 'architecture' node from the CATH database. - true - - - - - - - - - - CATH topology - - true - 1.5 - Information on a protein 'topology' node from the CATH database. - beta12orEarlier - - - - - - - - - - CATH homologous superfamily - - 1.5 - true - beta12orEarlier - Information on a protein 'homologous superfamily' node from the CATH database. 
- - - - - - - - - - CATH structurally similar group - - 1.5 - true - beta12orEarlier - Information on a protein 'structurally similar group' node from the CATH database. - - - - - - - - - - CATH functional category - - Information on a protein 'functional category' node from the CATH database. - true - 1.5 - beta12orEarlier - - - - - - - - - - Protein fold recognition report - - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - beta12orEarlier - A report on known protein structural domains or folds that are recognized (identified) in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Protein-protein interaction report - - protein-protein interaction(s), including interactions between protein domains. - beta12orEarlier - true - 1.8 - - - - - - - - - - Protein-ligand interaction report - - Protein-drug interaction report - beta12orEarlier - An informative report on protein-ligand (small molecule) interaction(s). - - - - - - - - - - Protein-nucleic acid interactions report - - true - protein-DNA/RNA interaction(s). - beta12orEarlier - 1.8 - - - - - - - - - - Nucleic acid melting profile - - Nucleic acid stability profile - A melting (stability) profile calculated the free energy required to unwind and separate the nucleic acid strands, plotted for sliding windows over a sequence. - Data on the dissociation characteristics of a double-stranded nucleic acid molecule (DNA or a DNA/RNA hybrid) during heating. - beta12orEarlier - - - - - - - - - - Nucleic acid enthalpy - - beta12orEarlier - Enthalpy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - - - - - - - - - - Nucleic acid entropy - - Entropy of hybridized or double stranded nucleic acid (DNA or RNA/DNA). 
- beta12orEarlier - - - - - - - - - - Nucleic acid melting temperature - - Melting temperature of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Nucleic acid stitch profile - - beta12orEarlier - Stitch profile of hybridized or double stranded nucleic acid (DNA or RNA/DNA). - A stitch profile diagram shows partly melted DNA conformations (with probabilities) at a range of temperatures. For example, a stitch profile might show possible loop openings with their location, size, probability and fluctuations at a given temperature. - - - - - - - - - - DNA base pair stacking energies data - - DNA base pair stacking energies data. - beta12orEarlier - - - - - - - - - - DNA base pair twist angle data - - beta12orEarlier - DNA base pair twist angle data. - - - - - - - - - - DNA base trimer roll angles data - - beta12orEarlier - DNA base trimer roll angles data. - - - - - - - - - - Vienna RNA parameters - - RNA parameters used by the Vienna package. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA structure constraints - - true - Structure constraints used by the Vienna package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Vienna RNA concentration data - - RNA concentration data used by the Vienna package. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Vienna RNA calculated energy - - beta12orEarlier - beta12orEarlier - true - RNA calculated energy data generated by the Vienna package. - - - - - - - - - - Base pairing probability matrix dotplot - - - beta12orEarlier - Such as generated by the Vienna package. - Dotplot of RNA base pairing probability matrix. 
- - - - - - - - - - Nucleic acid folding report - - Nucleic acid report (folding) - beta12orEarlier - Nucleic acid report (folding model) - RNA secondary structure folding probablities - A report on an analysis of RNA/DNA folding, minimum folding energies for DNA or RNA sequences, energy landscape of RNA mutants etc. - RNA secondary structure folding classification - - - - - - - - - - Codon usage table - - - - - - - - Table of codon usage data calculated from one or more nucleic acid sequences. - A codon usage table might include the codon usage table name, optional comments and a table with columns for codons and corresponding codon usage data. A genetic code can be extracted from or represented by a codon usage table. - beta12orEarlier - - - - - - - - - - Genetic code - - beta12orEarlier - A genetic code for an organism. - A genetic code need not include detailed codon usage information. - - - - - - - - - - Codon adaptation index - - true - A simple measure of synonymous codon usage bias often used to predict gene expression levels. - CAI - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon usage bias plot - - Synonymous codon usage statistic plot - beta12orEarlier - A plot of the synonymous codon usage calculated for windows over a nucleotide sequence. - - - - - - - - - - Nc statistic - - true - beta12orEarlier - The effective number of codons used in a gene sequence. This reflects how far codon usage of a gene departs from equal usage of synonymous codons. - beta12orEarlier - - - - - - - - - - Codon usage fraction difference - - The differences in codon usage fractions between two codon usage tables. - beta12orEarlier - - - - - - - - - - Pharmacogenomic test report - - beta12orEarlier - The report might correlate gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - Data on the influence of genotype on drug response. - - - - - - - - - - Disease report - - - - - - - - An informative report on a specific disease. 
- For example, an informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Linkage disequilibrium (report) - - true - A report on linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - 1.8 - beta12orEarlier - - - - - - - - - - Heat map - - - A graphical 2D tabular representation of gene expression data, typically derived from a DNA microarray experiment. - beta12orEarlier - A heat map is a table where rows and columns correspond to different genes and contexts (for example, cells or samples) and the cell color represents the level of expression of a gene that context. - - - - - - - - - - Affymetrix probe sets library file - - true - Affymetrix library file of information about which probes belong to which probe set. - CDF file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Affymetrix probe sets information library file - - true - Affymetrix library file of information about the probe sets such as the gene name with which the probe set is associated. - GIN file - beta12orEarlier - beta12orEarlier - - - - - - - - - - Molecular weights standard fingerprint - - beta12orEarlier - true - 1.12 - Standard protonated molecular masses from trypsin (modified porcine trypsin, Promega) and keratin peptides, used in EMBOSS. - - - - - - - - - - Metabolic pathway report - - This includes carbohydrate, energy, lipid, nucleotide, amino acid, glycan, PK/NRP, cofactor/vitamin, secondary metabolite, xenobiotics etc. - beta12orEarlier - A report typically including a map (diagram) of a metabolic pathway. - 1.8 - true - - - - - - - - - - Genetic information processing pathway report - - beta12orEarlier - 1.8 - true - genetic information processing pathways. 
- - - - - - - - - - Environmental information processing pathway report - - true - environmental information processing pathways. - beta12orEarlier - 1.8 - - - - - - - - - - Signal transduction pathway report - - A report typically including a map (diagram) of a signal transduction pathway. - 1.8 - true - beta12orEarlier - - - - - - - - - - Cellular process pathways report - - 1.8 - Topic concernning cellular process pathways. - true - beta12orEarlier - - - - - - - - - - Disease pathway or network report - - true - beta12orEarlier - disease pathways, typically of human disease. - 1.8 - - - - - - - - - - Drug structure relationship map - - A report typically including a map (diagram) of drug structure relationships. - beta12orEarlier - - - - - - - - - - Protein interaction networks - - 1.8 - networks of protein interactions. - true - beta12orEarlier - - - - - - - - - - MIRIAM datatype - - A MIRIAM entry describes a MIRIAM data type including the official name, synonyms, root URI, identifier pattern (regular expression applied to a unique identifier of the data type) and documentation. Each data type can be associated with several resources. Each resource is a physical location of a service (typically a database) providing information on the elements of a data type. Several resources may exist for each data type, provided the same (mirrors) or different information. MIRIAM provides a stable and persistent reference to its data types. - An entry (data type) from the Minimal Information Requested in the Annotation of Biochemical Models (MIRIAM) database of data resources. - beta12orEarlier - true - 1.5 - - - - - - - - - - E-value - - An expectation value (E-Value) is the expected number of observations which are at least as extreme as observations expected to occur by random chance. The E-value describes the number of hits with a given score or better that are expected to occur at random when searching a database of a particular size. 
It decreases exponentially with the score (S) of a hit. A low E value indicates a more significant score. - beta12orEarlier - A simple floating point number defining the lower or upper limit of an expectation value (E-value). - Expectation value - - - - - - - - - - Z-value - - beta12orEarlier - The z-value is the number of standard deviations a data value is above or below a mean value. - A z-value might be specified as a threshold for reporting hits from database searches. - - - - - - - - - - P-value - - beta12orEarlier - A z-value might be specified as a threshold for reporting hits from database searches. - The P-value is the probability of obtaining by random chance a result that is at least as extreme as an observed result, assuming a NULL hypothesis is true. - - - - - - - - - - Database version information - - true - Ontology version information - 1.5 - Information on a database (or ontology) version, for example name, version number and release date. - beta12orEarlier - - - - - - - - - - Tool version information - - beta12orEarlier - Information on an application version, for example name, version number and release date. - true - 1.5 - - - - - - - - - - CATH version information - - beta12orEarlier - beta12orEarlier - true - Information on a version of the CATH database. - - - - - - - - - - Swiss-Prot to PDB mapping - - Cross-mapping of Swiss-Prot codes to PDB identifiers. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence database cross-references - - Cross-references from a sequence record to other databases. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Job status - - Metadata on the status of a submitted job. 
- beta12orEarlier - 1.5 - true - Values for EBI services are 'DONE' (job has finished and the results can then be retrieved), 'ERROR' (the job failed or no results where found), 'NOT_FOUND' (the job id is no longer available; job results might be deleted, 'PENDING' (the job is in a queue waiting processing), 'RUNNING' (the job is currently being processed). - - - - - - - - - - Job ID - - 1.0 - The (typically numeric) unique identifier of a submitted job. - beta12orEarlier - true - - - - - - - - - - Job type - - 1.5 - true - beta12orEarlier - A label (text token) describing the type of job, for example interactive or non-interactive. - - - - - - - - - - Tool log - - 1.5 - A report of tool-specific metadata on some analysis or process performed, for example a log of diagnostic or error messages. - true - beta12orEarlier - - - - - - - - - - DaliLite log file - - true - beta12orEarlier - DaliLite log file describing all the steps taken by a DaliLite alignment of two protein structures. - beta12orEarlier - - - - - - - - - - STRIDE log file - - STRIDE log file. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - NACCESS log file - - beta12orEarlier - beta12orEarlier - true - NACCESS log file. - - - - - - - - - - EMBOSS wordfinder log file - - EMBOSS wordfinder log file. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS domainatrix log file - - beta12orEarlier - EMBOSS (EMBASSY) domainatrix application log file. - beta12orEarlier - true - - - - - - - - - - EMBOSS sites log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS (EMBASSY) sites application log file. - - - - - - - - - - EMBOSS supermatcher error file - - EMBOSS (EMBASSY) supermatcher error file. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - EMBOSS megamerger log file - - beta12orEarlier - beta12orEarlier - EMBOSS megamerger log file. - true - - - - - - - - - - EMBOSS whichdb log file - - beta12orEarlier - true - EMBOSS megamerger log file. 
- beta12orEarlier - - - - - - - - - - EMBOSS vectorstrip log file - - true - beta12orEarlier - beta12orEarlier - EMBOSS vectorstrip log file. - - - - - - - - - - Username - - A username on a computer system. - beta12orEarlier - - - - - - - - - - - Password - - beta12orEarlier - A password on a computer system. - - - - - - - - - - - Email address - - beta12orEarlier - Moby:Email - A valid email address of an end-user. - Moby:EmailAddress - - - - - - - - - - - Person name - - beta12orEarlier - The name of a person. - - - - - - - - - - - Number of iterations - - 1.5 - Number of iterations of an algorithm. - true - beta12orEarlier - - - - - - - - - - Number of output entities - - Number of entities (for example database hits, sequences, alignments etc) to write to an output file. - 1.5 - beta12orEarlier - true - - - - - - - - - - Hit sort order - - Controls the order of hits (reported matches) in an output file from a database search. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Drug report - - - - - - - - An informative report on a specific drug. - beta12orEarlier - Drug annotation - - - - - - - - - - - Phylogenetic tree image - - beta12orEarlier - An image (for viewing or printing) of a phylogenetic tree including (typically) a plot of rooted or unrooted phylogenies, cladograms, circular trees or phenograms and associated information. - See also 'Phylogenetic tree' - - - - - - - - - - RNA secondary structure image - - beta12orEarlier - Image of RNA secondary structure, knots, pseudoknots etc. - - - - - - - - - - Protein secondary structure image - - Image of protein secondary structure. - beta12orEarlier - - - - - - - - - - Structure image - - beta12orEarlier - Image of one or more molecular tertiary (3D) structures. - - - - - - - - - - Sequence alignment image - - beta12orEarlier - Image of two or more aligned molecular sequences possibly annotated with alignment features. 
- - - - - - - - - - Chemical structure image - - An image of the structure of a small chemical compound. - The molecular identifier and formula are typically included. - Small molecule structure image - beta12orEarlier - - - - - - - - - - Fate map - - - - - - - - - beta12orEarlier - A fate map is a plan of early stage of an embryo such as a blastula, showing areas that are significance to development. - - - - - - - - - - Microarray spots image - - - beta12orEarlier - An image of spots from a microarray experiment. - - - - - - - - - - BioPax term - - beta12orEarlier - A term from the BioPax ontology. - beta12orEarlier - true - - - - - - - - - - GO - - beta12orEarlier - Gene Ontology term - Moby:Annotated_GO_Term - Moby:Annotated_GO_Term_With_Probability - true - A term definition from The Gene Ontology (GO). - beta12orEarlier - Moby:GO_Term - Moby:GOTerm - - - - - - - - - - MeSH - - true - A term from the MeSH vocabulary. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HGNC - - beta12orEarlier - true - A term from the HGNC controlled vocabulary. - beta12orEarlier - - - - - - - - - - NCBI taxonomy vocabulary - - beta12orEarlier - beta12orEarlier - true - A term from the NCBI taxonomy vocabulary. - - - - - - - - - - Plant ontology term - - beta12orEarlier - true - beta12orEarlier - A term from the Plant Ontology (PO). - - - - - - - - - - UMLS - - beta12orEarlier - beta12orEarlier - A term from the UMLS vocabulary. - true - - - - - - - - - - FMA - - beta12orEarlier - Classifies anatomical entities according to their shared characteristics (genus) and distinguishing characteristics (differentia). Specifies the part-whole and spatial relationships of the entities, morphological transformation of the entities during prenatal development and the postnatal life cycle and principles, rules and definitions according to which classes and relationships in the other three components of FMA are represented. - beta12orEarlier - A term from Foundational Model of Anatomy. 
- true - - - - - - - - - - EMAP - - A term from the EMAP mouse ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - ChEBI - - beta12orEarlier - A term from the ChEBI ontology. - true - beta12orEarlier - - - - - - - - - - MGED - - beta12orEarlier - true - A term from the MGED ontology. - beta12orEarlier - - - - - - - - - - myGrid - - The ontology is provided as two components, the service ontology and the domain ontology. The domain ontology acts provides concepts for core bioinformatics data types and their relations. The service ontology describes the physical and operational features of web services. - beta12orEarlier - true - A term from the myGrid ontology. - beta12orEarlier - - - - - - - - - - GO (biological process) - - beta12orEarlier - true - beta12orEarlier - Data Type is an enumerated string. - A term definition for a biological process from the Gene Ontology (GO). - - - - - - - - - - GO (molecular function) - - A term definition for a molecular function from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - true - beta12orEarlier - - - - - - - - - - GO (cellular component) - - beta12orEarlier - true - A term definition for a cellular component from the Gene Ontology (GO). - beta12orEarlier - Data Type is an enumerated string. - - - - - - - - - - Ontology relation type - - 1.5 - beta12orEarlier - true - A relation type defined in an ontology. - - - - - - - - - - Ontology concept definition - - beta12orEarlier - Ontology class definition - The definition of a concept from an ontology. - - - - - - - - - - Ontology concept comment - - beta12orEarlier - 1.4 - true - A comment on a concept from an ontology. - - - - - - - - - - Ontology concept reference - - beta12orEarlier - true - Reference for a concept from an ontology. 
- beta12orEarlier - - - - - - - - - - doc2loc document information - - beta12orEarlier - true - The doc2loc output includes the url, format, type and availability code of a document for every service provider. - beta12orEarlier - Information on a published article provided by the doc2loc program. - - - - - - - - - - PDB residue number - - WHATIF: pdb_number - PDBML:PDB_residue_no - beta12orEarlier - A residue identifier (a string) from a PDB file. - - - - - - - - - - Atomic coordinate - - Cartesian coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian coordinate - - - - - - - - - - Atomic x coordinate - - WHATIF: PDBx_Cartn_x - Cartesian x coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_x in PDBML - Cartesian x coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic y coordinate - - WHATIF: PDBx_Cartn_y - Cartesian y coordinate - beta12orEarlier - PDBML:_atom_site.Cartn_y in PDBML - Cartesian y coordinate of an atom (in a molecular structure). - - - - - - - - - - Atomic z coordinate - - PDBML:_atom_site.Cartn_z - WHATIF: PDBx_Cartn_z - Cartesian z coordinate of an atom (in a molecular structure). - beta12orEarlier - Cartesian z coordinate - - - - - - - - - - PDB atom name - - WHATIF: PDBx_type_symbol - beta12orEarlier - WHATIF: PDBx_auth_atom_id - WHATIF: alternate_atom - PDBML:pdbx_PDB_atom_name - WHATIF: atom_type - Identifier (a string) of a specific atom from a PDB file for a molecular structure. - - - - - - - - - - - Protein atom - - Atom data - CHEBI:33250 - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Data on a single atom from a protein structure. - beta12orEarlier - - - - - - - - - - Protein residue - - beta12orEarlier - Data on a single amino acid residue position in a protein structure. 
- This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Residue - - - - - - - - - - Atom name - - - Name of an atom. - beta12orEarlier - - - - - - - - - - - PDB residue name - - Three-letter amino acid residue names as used in PDB files. - WHATIF: type - beta12orEarlier - - - - - - - - - - - PDB model number - - Identifier of a model structure from a PDB file. - beta12orEarlier - PDBML:pdbx_PDB_model_num - Model number - WHATIF: model_number - - - - - - - - - - - CATH domain report - - beta12orEarlier - true - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - Summary of domain classification information for a CATH domain. - - - - - - - - - - CATH representative domain sequences (ATOM) - - beta12orEarlier - beta12orEarlier - FASTA sequence database (based on ATOM records in PDB) for CATH domains (clustered at different levels of sequence identity). - true - - - - - - - - - - CATH representative domain sequences (COMBS) - - true - FASTA sequence database (based on COMBS sequence data) for CATH domains (clustered at different levels of sequence identity). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (ATOM) - - true - FASTA sequence database for all CATH domains (based on PDB ATOM records). - beta12orEarlier - beta12orEarlier - - - - - - - - - - CATH domain sequences (COMBS) - - FASTA sequence database for all CATH domains (based on COMBS sequence data). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Sequence version - - beta12orEarlier - Information on an molecular sequence version. - Sequence version information - - - - - - - - - - Score - - A numerical value, that is some type of scored value arising for example from a prediction method. 
- beta12orEarlier - - - - - - - - - - Protein report (function) - - true - For properties that can be mapped to a sequence, use 'Sequence report' instead. - beta13 - Report on general functional properties of specific protein(s). - beta12orEarlier - - - - - - - - - - Gene name (ASPGD) - - 1.3 - beta12orEarlier - true - Name of a gene from Aspergillus Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs:ASPGD_LOCUS - - - - - - - - - - Gene name (CGD) - - Name of a gene from Candida Genome Database. - true - http://www.geneontology.org/doc/GO.xrf_abbs:CGD_LOCUS - beta12orEarlier - 1.3 - - - - - - - - - - Gene name (dictyBase) - - http://www.geneontology.org/doc/GO.xrf_abbs:dictyBase - beta12orEarlier - 1.3 - true - Name of a gene from dictyBase database. - - - - - - - - - - Gene name (EcoGene primary) - - http://www.geneontology.org/doc/GO.xrf_abbs:ECOGENE_G - Primary name of a gene from EcoGene Database. - EcoGene primary gene name - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (MaizeGDB) - - http://www.geneontology.org/doc/GO.xrf_abbs:MaizeGDB_Locus - 1.3 - Name of a gene from MaizeGDB (maize genes) database. - true - beta12orEarlier - - - - - - - - - - Gene name (SGD) - - true - 1.3 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs:SGD_LOCUS - Name of a gene from Saccharomyces Genome Database. - - - - - - - - - - Gene name (TGD) - - beta12orEarlier - 1.3 - Name of a gene from Tetrahymena Genome Database. - true - http://www.geneontology.org/doc/GO.xrf_abbs:TGD_LOCUS - - - - - - - - - - Gene name (CGSC) - - beta12orEarlier - 1.3 - true - http://www.geneontology.org/doc/GO.xrf_abbs: CGSC - Symbol of a gene from E.coli Genetic Stock Center. 
- - - - - - - - - - Gene name (HGNC) - - beta12orEarlier - HUGO symbol - 1.3 - true - HGNC symbol - Official gene name - HUGO gene name - http://www.geneontology.org/doc/GO.xrf_abbs: HGNC_gene - HGNC gene name - HUGO gene symbol - HGNC:[0-9]{1,5} - Gene name (HUGO) - HGNC gene symbol - Symbol of a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - Gene name (MGD) - - MGI:[0-9]+ - Symbol of a gene from the Mouse Genome Database. - http://www.geneontology.org/doc/GO.xrf_abbs: MGD - 1.3 - true - beta12orEarlier - - - - - - - - - - Gene name (Bacillus subtilis) - - http://www.geneontology.org/doc/GO.xrf_abbs: SUBTILISTG - Symbol of a gene from Bacillus subtilis Genome Sequence Project. - beta12orEarlier - 1.3 - true - - - - - - - - - - Gene ID (PlasmoDB) - - Identifier of a gene from PlasmoDB Plasmodium Genome Resource. - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ApiDB_PlasmoDB - - - - - - - - - - - Gene ID (EcoGene) - - Identifier of a gene from EcoGene Database. - EcoGene Accession - EcoGene ID - beta12orEarlier - - - - - - - - - - - Gene ID (FlyBase) - - beta12orEarlier - Gene identifier from FlyBase database. - http://www.geneontology.org/doc/GO.xrf_abbs: FB - http://www.geneontology.org/doc/GO.xrf_abbs: FlyBase - - - - - - - - - - - Gene ID (GeneDB Glossina morsitans) - - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Gmorsitans - beta13 - Gene identifier from Glossina morsitans GeneDB database. - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Leishmania major) - - Gene identifier from Leishmania major GeneDB database. - true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Lmajor - beta12orEarlier - beta13 - - - - - - - - - - Gene ID (GeneDB Plasmodium falciparum) - - Gene identifier from Plasmodium falciparum GeneDB database. 
- true - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Pfalciparum - beta13 - beta12orEarlier - - - - - - - - - - Gene ID (GeneDB Schizosaccharomyces pombe) - - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Spombe - beta12orEarlier - true - beta13 - Gene identifier from Schizosaccharomyces pombe GeneDB database. - - - - - - - - - - Gene ID (GeneDB Trypanosoma brucei) - - Gene identifier from Trypanosoma brucei GeneDB database. - true - beta13 - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GeneDB_Tbrucei - - - - - - - - - - Gene ID (Gramene) - - http://www.geneontology.org/doc/GO.xrf_abbs: GR_gene - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: GR_GENE - Gene identifier from Gramene database. - - - - - - - - - - - Gene ID (Virginia microbial) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: PAMGO_VMD - Gene identifier from Virginia Bioinformatics Institute microbial database. - http://www.geneontology.org/doc/GO.xrf_abbs: VMD - - - - - - - - - - - Gene ID (SGN) - - http://www.geneontology.org/doc/GO.xrf_abbs: SGN - Gene identifier from Sol Genomics Network. - beta12orEarlier - - - - - - - - - - - Gene ID (WormBase) - - - Gene identifier used by WormBase database. - WBGene[0-9]{8} - http://www.geneontology.org/doc/GO.xrf_abbs: WB - http://www.geneontology.org/doc/GO.xrf_abbs: WormBase - beta12orEarlier - - - - - - - - - - - Gene synonym - - Gene name synonym - true - Any name (other than the recommended one) for a gene. - beta12orEarlier - beta12orEarlier - - - - - - - - - - ORF name - - - beta12orEarlier - The name of an open reading frame attributed by a sequencing project. - - - - - - - - - - - Sequence assembly component - - A component of a larger sequence assembly. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Chromosome annotation (aberration) - - beta12orEarlier - beta12orEarlier - true - A report on a chromosome aberration such as abnormalities in chromosome structure. 
- - - - - - - - - - Clone ID - - beta12orEarlier - An identifier of a clone (cloned molecular sequence) from a database. - - - - - - - - - - - PDB insertion code - - beta12orEarlier - WHATIF: insertion_code - PDBML:pdbx_PDB_ins_code - An insertion code (part of the residue number) for an amino acid residue from a PDB file. - - - - - - - - - - Atomic occupancy - - WHATIF: PDBx_occupancy - The fraction of an atom type present at a site in a molecular structure. - beta12orEarlier - The sum of the occupancies of all the atom types at a site should not normally significantly exceed 1.0. - - - - - - - - - - Isotropic B factor - - Isotropic B factor (atomic displacement parameter) for an atom from a PDB file. - WHATIF: PDBx_B_iso_or_equiv - beta12orEarlier - - - - - - - - - - Deletion map - - A cytogenetic map is built from a set of mutant cell lines with sub-chromosomal deletions and a reference wild-type line ('genome deletion panel'). The panel is used to map markers onto the genome by comparing mutant to wild-type banding patterns. Markers are linked (occur in the same deleted region) if they share the same banding pattern (presence or absence) as the deletion panel. - beta12orEarlier - A cytogenetic map showing chromosome banding patterns in mutant cell lines relative to the wild type. - Deletion-based cytogenetic map - - - - - - - - - - QTL map - - A genetic map which shows the approximate location of quantitative trait loci (QTL) between two or more markers. - beta12orEarlier - Quantitative trait locus map - - - - - - - - - - Haplotype map - - beta12orEarlier - Moby:Haplotyping_Study_obj - A map of haplotypes in a genome or other sequence, describing common patterns of genetic variation. - - - - - - - - - - Map set data - - beta12orEarlier - Data describing a set of multiple genetic or physical maps, typically sharing a common set of features which are mapped. 
- Moby:GCP_CorrelatedLinkageMapSet - Moby:GCP_CorrelatedMapSet - - - - - - - - - - Map feature - - beta12orEarlier - true - A feature which may mapped (positioned) on a genetic or other type of map. - Moby:MapFeature - beta12orEarlier - Mappable features may be based on Gramene's notion of map features; see http://www.gramene.org/db/cmap/feature_type_info. - - - - - - - - - - - - Map type - - A designation of the type of map (genetic map, physical map, sequence map etc) or map set. - Map types may be based on Gramene's notion of a map type; see http://www.gramene.org/db/cmap/map_type_info. - 1.5 - true - beta12orEarlier - - - - - - - - - - Protein fold name - - The name of a protein fold. - beta12orEarlier - - - - - - - - - - - Taxon - - Moby:PotentialTaxon - Taxonomy rank - beta12orEarlier - Taxonomic rank - For a complete list of taxonomic ranks see https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - The name of a group of organisms belonging to the same taxonomic rank. - Moby:BriefTaxonConcept - - - - - - - - - - - Organism identifier - - - - - - - - beta12orEarlier - A unique identifier of a (group of) organisms. - - - - - - - - - - - Genus name - - beta12orEarlier - The name of a genus of organism. - - - - - - - - - - - Taxonomic classification - - Moby:TaxonName - Moby:GCP_Taxon - beta12orEarlier - The full name for a group of organisms, reflecting their biological classification and (usually) conforming to a standard nomenclature. - Moby:iANT_organism-xml - Taxonomic name - Name components correspond to levels in a taxonomic hierarchy (e.g. 'Genus', 'Species', etc.) Meta information such as a reference where the name was defined and a date might be included. - Taxonomic information - Moby:TaxonScientificName - Moby:TaxonTCS - - - - - - - - - - - iHOP organism ID - - beta12orEarlier - Moby_namespace:iHOPorganism - A unique identifier for an organism used in the iHOP database. 
- - - - - - - - - - - Genbank common name - - Common name for an organism as used in the GenBank database. - beta12orEarlier - - - - - - - - - - - NCBI taxon - - The name of a taxon from the NCBI taxonomy database. - beta12orEarlier - - - - - - - - - - - Synonym - - beta12orEarlier - Alternative name - beta12orEarlier - true - An alternative for a word. - - - - - - - - - - Misspelling - - A common misspelling of a word. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Acronym - - true - An abbreviation of a phrase or word. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Misnomer - - A term which is likely to be misleading of its meaning. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Author ID - - Information on the authors of a published work. - Moby:Author - beta12orEarlier - - - - - - - - - - - DragonDB author identifier - - An identifier representing an author in the DragonDB database. - beta12orEarlier - - - - - - - - - - - Annotated URI - - beta12orEarlier - A URI along with annotation describing the data found at the address. - Moby:DescribedLink - - - - - - - - - - UniProt keywords - - true - beta12orEarlier - beta12orEarlier - A controlled vocabulary for words and phrases that can appear in the keywords field (KW line) of entries from the UniProt database. - - - - - - - - - - Gene ID (GeneFarm) - - Moby_namespace:GENEFARM_GeneID - Identifier of a gene from the GeneFarm database. - beta12orEarlier - - - - - - - - - - - Blattner number - - beta12orEarlier - Moby_namespace:Blattner_number - The blattner identifier for a gene. - - - - - - - - - - - Gene ID (MIPS Maize) - - MIPS genetic element identifier (Maize) - Identifier for genetic elements in MIPS Maize database. 
- beta12orEarlier - Moby_namespace:MIPS_GE_Maize - beta13 - true - - - - - - - - - - Gene ID (MIPS Medicago) - - MIPS genetic element identifier (Medicago) - beta12orEarlier - beta13 - true - Moby_namespace:MIPS_GE_Medicago - Identifier for genetic elements in MIPS Medicago database. - - - - - - - - - - Gene name (DragonDB) - - true - The name of an Antirrhinum Gene from the DragonDB database. - beta12orEarlier - Moby_namespace:DragonDB_Gene - 1.3 - - - - - - - - - - Gene name (Arabidopsis) - - Moby_namespace:ArabidopsisGeneSymbol - true - A unique identifier for an Arabidopsis gene, which is an acronym or abbreviation of the gene name. - beta12orEarlier - 1.3 - - - - - - - - - - iHOP symbol - - - - A unique identifier of a protein or gene used in the iHOP database. - Moby_namespace:iHOPsymbol - beta12orEarlier - - - - - - - - - - - Gene name (GeneFarm) - - 1.3 - true - Name of a gene from the GeneFarm database. - Moby_namespace:GENEFARM_GeneName - GeneFarm gene ID - beta12orEarlier - - - - - - - - - - Locus ID - - - - - - - - - A unique name or other identifier of a genetic locus, typically conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - Locus name - beta12orEarlier - Locus identifier - - - - - - - - - - - Locus ID (AGI) - - AT[1-5]G[0-9]{5} - AGI ID - Locus identifier for Arabidopsis Genome Initiative (TAIR, TIGR and MIPS databases) - http://www.geneontology.org/doc/GO.xrf_abbs:AGI_LocusCode - Arabidopsis gene loci number - AGI locus code - beta12orEarlier - AGI identifier - - - - - - - - - - - Locus ID (ASPGD) - - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGD - http://www.geneontology.org/doc/GO.xrf_abbs: ASPGDID - Identifier for loci from ASPGD (Aspergillus Genome Database). - - - - - - - - - - - Locus ID (MGG) - - Identifier for loci from Magnaporthe grisea Database at the Broad Institute. 
- http://www.geneontology.org/doc/GO.xrf_abbs: Broad_MGG - beta12orEarlier - - - - - - - - - - - Locus ID (CGD) - - Identifier for loci from CGD (Candida Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: CGDID - beta12orEarlier - CGDID - CGD locus identifier - http://www.geneontology.org/doc/GO.xrf_abbs: CGD - - - - - - - - - - - Locus ID (CMR) - - http://www.geneontology.org/doc/GO.xrf_abbs: TIGR_CMR - Locus identifier for Comprehensive Microbial Resource at the J. Craig Venter Institute. - http://www.geneontology.org/doc/GO.xrf_abbs: JCVI_CMR - beta12orEarlier - - - - - - - - - - - NCBI locus tag - - beta12orEarlier - Moby_namespace:LocusID - Locus ID (NCBI) - http://www.geneontology.org/doc/GO.xrf_abbs: NCBI_locus_tag - Identifier for loci from NCBI database. - - - - - - - - - - - Locus ID (SGD) - - - Identifier for loci from SGD (Saccharomyces Genome Database). - http://www.geneontology.org/doc/GO.xrf_abbs: SGDID - beta12orEarlier - http://www.geneontology.org/doc/GO.xrf_abbs: SGD - SGDID - - - - - - - - - - - Locus ID (MMP) - - Identifier of loci from Maize Mapping Project. - Moby_namespace:MMP_Locus - beta12orEarlier - - - - - - - - - - - Locus ID (DictyBase) - - Moby_namespace:DDB_gene - Identifier of locus from DictyBase (Dictyostelium discoideum). - beta12orEarlier - - - - - - - - - - - Locus ID (EntrezGene) - - Identifier of a locus from EntrezGene database. - beta12orEarlier - Moby_namespace:EntrezGene_ID - Moby_namespace:EntrezGene_EntrezGeneID - - - - - - - - - - - Locus ID (MaizeGDB) - - Identifier of locus from MaizeGDB (Maize genome database). - Moby_namespace:MaizeGDB_Locus - beta12orEarlier - - - - - - - - - - - Quantitative trait locus - - QTL - A QTL sometimes but does not necessarily correspond to a gene. 
- true - beta12orEarlier - beta12orEarlier - A stretch of DNA that is closely linked to the genes underlying a quantitative trait (a phenotype that varies in degree and depends upon the interactions between multiple genes and their environment). - Moby:SO_QTL - - - - - - - - - - Gene ID (KOME) - - Identifier of a gene from the KOME database. - beta12orEarlier - Moby_namespace:GeneId - - - - - - - - - - - Locus ID (Tropgene) - - Identifier of a locus from the Tropgene database. - Moby:Tropgene_locus - beta12orEarlier - - - - - - - - - - - Alignment - - An alignment of molecular sequences, structures or profiles derived from them. - beta12orEarlier - - - - - - - - - - Atomic property - - General atomic property - Data for an atom (in a molecular structure). - beta12orEarlier - - - - - - - - - - UniProt keyword - - beta12orEarlier - A word or phrase that can appear in the keywords field (KW line) of entries from the UniProt database. - Moby_namespace:SP_KW - http://www.geneontology.org/doc/GO.xrf_abbs: SP_KW - - - - - - - - - - Ordered locus name - - beta12orEarlier - true - A name for a genetic locus conforming to a scheme that names loci (such as predicted genes) depending on their position in a molecular sequence, for example a completely sequenced genome or chromosome. - beta12orEarlier - - - - - - - - - - Sequence coordinates - - - - Map position - Moby:Position - Locus - Sequence co-ordinates - A position in a map (for example a genetic map), either a single position (point) or a region / interval. - Moby:GenePosition - This includes positions in genomes based on a reference sequence. A position may be specified for any mappable object, i.e. anything that may have positional information such as a physical position in a chromosome. Data might include sequence region name, strand, coordinate system name, assembly name, start position and end position. 
- Moby:HitPosition - beta12orEarlier - Moby:MapPosition - Moby:Locus - Moby:GCP_MapInterval - Moby:GCP_MapPosition - Moby:GCP_MapPoint - PDBML:_atom_site.id - - - - - - - - - - Amino acid property - - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all amino acids. - Amino acid data - beta12orEarlier - - - - - - - - - - Annotation - - beta12orEarlier - true - beta13 - This is a broad data type and is used a placeholder for other, more specific types. - A human-readable collection of information which (typically) is generated or collated by hand and which describes a biological entity, phenomena or associated primary (e.g. sequence or structural) data, as distinct from the primary data itself and computer-generated reports derived from it. - - - - - - - - - - Map data - - - - - - - - Map attribute - A molecular map (genetic or physical), an attribute of such a map, or data extracted from or derived from the analysis of such a map. - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. It includes concepts that are best described as scientific text or closely concerned with or derived from text. - - - - - - - - - - Vienna RNA structural data - - true - Data used by the Vienna RNA analysis package. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence mask parameter - - beta12orEarlier - 1.5 - true - Data used to replace (mask) characters in a molecular sequence. - - - - - - - - - - Enzyme kinetics data - - - Data concerning chemical reaction(s) catalysed by enzyme(s). - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Michaelis Menten plot - - A plot giving an approximation of the kinetics of an enzyme-catalysed reaction, assuming simple kinetics (i.e. 
no intermediate or product inhibition, allostericity or cooperativity). It plots initial reaction rate to the substrate concentration (S) from which the maximum rate (vmax) is apparent. - beta12orEarlier - - - - - - - - - - Hanes Woolf plot - - beta12orEarlier - A plot based on the Michaelis Menten equation of enzyme kinetics plotting the ratio of the initial substrate concentration (S) against the reaction velocity (v). - - - - - - - - - - Experimental data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - true - Raw data from or annotation on laboratory experiments. - beta12orEarlier - Experimental measurement data - beta13 - - - - - - - - - - - Genome version information - - beta12orEarlier - true - Information on a genome version. - 1.5 - - - - - - - - - - Evidence - - Typically a statement about some data or results, including evidence or the source of a statement, which may include computational prediction, laboratory experiment, literature reference etc. - beta12orEarlier - - - - - - - - - - Sequence record lite - - beta12orEarlier - A molecular sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - 1.8 - - - - - - - - - - Sequence - - - - - - - - http://purl.bioontology.org/ontology/MSH/D008969 - Sequences - http://purl.org/biotop/biotop.owl#BioMolecularSequenceInformation - This concept is a placeholder of concepts for primary sequence data including raw sequences and sequence records. It should not normally be used for derivatives such as sequence alignments, motifs or profiles. - beta12orEarlier - One or more molecular sequences, possibly with associated annotation. - - - - - - - - - - Nucleic acid sequence record (lite) - - beta12orEarlier - 1.8 - true - A nucleic acid sequence and minimal metadata, typically an identifier of the sequence and/or a comment. 
- - - - - - - - - - Protein sequence record (lite) - - 1.8 - Sequence record lite (protein) - beta12orEarlier - A protein sequence and minimal metadata, typically an identifier of the sequence and/or a comment. - true - - - - - - - - - - Report - - You can use this term by default for any textual report, in case you can't find another, more specific term. Reports may be generated automatically or collated by hand and can include metadata on the origin, source, history, ownership or location of some thing. - http://semanticscience.org/resource/SIO_000148 - Document - A human-readable collection of information including annotation on a biological entity or phenomena, computer-generated reports of analysis of primary data (e.g. sequence or structural), and metadata (data about primary data) or any other free (essentially unformatted) text, as distinct from the primary data itself. - beta12orEarlier - - - - - - - - - - Molecular property (general) - - General molecular property - General data for a molecule. - beta12orEarlier - - - - - - - - - - Structural data - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - true - Data concerning molecular structural data. - beta13 - - - - - - - - - - - Sequence motif (nucleic acid) - - Nucleic acid sequence motif - DNA sequence motif - A nucleotide sequence motif. - beta12orEarlier - RNA sequence motif - - - - - - - - - - Sequence motif (protein) - - beta12orEarlier - An amino acid sequence motif. - Protein sequence motif - - - - - - - - - - Search parameter - - beta12orEarlier - 1.5 - true - Some simple value controlling a search operation, typically a search of a database. - - - - - - - - - - Database search results - - beta12orEarlier - A report of hits from searching a database of some type. 
- Search results - Database hits - - - - - - - - - - Secondary structure - - 1.5 - true - beta12orEarlier - The secondary structure assignment (predicted or real) of a nucleic acid or protein. - - - - - - - - - - Matrix - - beta12orEarlier - Array - This is a broad data type and is used a placeholder for other, more specific types. - An array of numerical values. - - - - - - - - - - Alignment data - - beta12orEarlier - 1.8 - true - Data concerning, extracted from, or derived from the analysis of molecular alignment of some type. - This is a broad data type and is used a placeholder for other, more specific types. - Alignment report - - - - - - - - - - Nucleic acid report - - An informative human-readable report about one or more specific nucleic acid molecules, derived from analysis of primary (sequence or structural) data. - beta12orEarlier - - - - - - - - - - Structure report - - An informative report on general information, properties or features of one or more molecular tertiary (3D) structures. - beta12orEarlier - Structure-derived report - - - - - - - - - - Nucleic acid structure data - - Nucleic acid property (structural) - This includes reports on the stiffness, curvature, twist/roll data or other conformational parameters or properties. - Nucleic acid structural property - beta12orEarlier - A report on nucleic acid structure-derived data, describing structural properties of a DNA molecule, or any other annotation or information about specific nucleic acid 3D structure(s). - - - - - - - - - - Molecular property - - beta12orEarlier - SO:0000400 - A report on the physical (e.g. structural) or chemical properties of molecules, or parts of a molecule. - Physicochemical property - - - - - - - - - - DNA base structural data - - Structural data for DNA base pairs or runs of bases, such as energy or angle data. 
- beta12orEarlier - - - - - - - - - - Database entry version information - - true - beta12orEarlier - 1.5 - Information on a database (or ontology) entry version, such as name (or other identifier) or parent database, unique identifier of entry, data, author and so on. - - - - - - - - - - Accession - - beta12orEarlier - http://semanticscience.org/resource/SIO_000731 - A persistent (stable) and unique identifier, typically identifying an object (entry) from a database. - http://semanticscience.org/resource/SIO_000675 - - - - - - - - - - - SNP - - single nucleotide polymorphism (SNP) in a DNA sequence. - true - beta12orEarlier - 1.8 - - - - - - - - - - Data reference - - A list of database accessions or identifiers are usually included. - Reference to a dataset (or a cross-reference between two datasets), typically one or more entries in a biological database or ontology. - beta12orEarlier - - - - - - - - - - Job identifier - - http://wsio.org/data_009 - An identifier of a submitted job. - beta12orEarlier - - - - - - - - - - - Name - - http://semanticscience.org/resource/SIO_000116 - http://usefulinc.com/ns/doap#name - "http://www.w3.org/2000/01/rdf-schema#label - beta12orEarlier - A name of a thing, which need not necessarily uniquely identify it. - Symbolic name - - - - - - - Closely related, but focusing on labeling and human readability but not on identification. - - - - - - - - - - - Type - - A label (text token) describing the type of a thing, typically an enumerated string (a string with one of a limited set of values). - http://purl.org/dc/elements/1.1/type - 1.5 - beta12orEarlier - true - - - - - - - - - - User ID - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - KEGG organism code - - - A three-letter code used in the KEGG databases to uniquely identify organisms. 
- beta12orEarlier - - - - - - - - - - - Gene name (KEGG GENES) - - beta12orEarlier - KEGG GENES entry name - [a-zA-Z_0-9]+:[a-zA-Z_0-9\.-]* - Name of an entry (gene) from the KEGG GENES database. - Moby_namespace:GeneId - true - 1.3 - - - - - - - - - - BioCyc ID - - - Identifier of an object from one of the BioCyc databases. - beta12orEarlier - - - - - - - - - - - Compound ID (BioCyc) - - - BioCyc compound identifier - Identifier of a compound from the BioCyc chemical compounds database. - BioCyc compound ID - beta12orEarlier - - - - - - - - - - - Reaction ID (BioCyc) - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from the BioCyc reactions database. - - - - - - - - - - - Enzyme ID (BioCyc) - - - BioCyc enzyme ID - beta12orEarlier - Identifier of an enzyme from the BioCyc enzymes database. - - - - - - - - - - - Reaction ID - - - - - - - - - beta12orEarlier - Identifier of a biological reaction from a database. - - - - - - - - - - - Identifier (hybrid) - - An identifier that is re-used for data objects of fundamentally different types (typically served from a single database). - beta12orEarlier - This branch provides an alternative organisation of the concepts nested under 'Accession' and 'Name'. All concepts under here are already included under 'Accession' or 'Name'. - - - - - - - - - - - Molecular property identifier - - - - - - - - beta12orEarlier - Identifier of a molecular property. - - - - - - - - - - - Codon usage table ID - - - - - - - - - - - - - - Identifier of a codon usage table, for example a genetic code. - Codon usage table identifier - beta12orEarlier - - - - - - - - - - - FlyBase primary identifier - - beta12orEarlier - Primary identifier of an object from the FlyBase database. - - - - - - - - - - - WormBase identifier - - beta12orEarlier - Identifier of an object from the WormBase database. - - - - - - - - - - - WormBase wormpep ID - - - Protein identifier used by WormBase database. 
- CE[0-9]{5} - beta12orEarlier - - - - - - - - - - - Nucleic acid features (codon) - - beta12orEarlier - true - An informative report on a trinucleotide sequence that encodes an amino acid including the triplet sequence, the encoded amino acid or whether it is a start or stop codon. - beta12orEarlier - - - - - - - - - - Map identifier - - - - - - - - An identifier of a map of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Person identifier - - An identifier of a software end-user (typically a person). - beta12orEarlier - - - - - - - - - - - Nucleic acid identifier - - - - - - - - Name or other identifier of a nucleic acid molecule. - beta12orEarlier - - - - - - - - - - - Translation frame specification - - beta12orEarlier - Frame for translation of DNA (3 forward and 3 reverse frames relative to a chromosome). - - - - - - - - - - Genetic code identifier - - - - - - - - An identifier of a genetic code. - beta12orEarlier - - - - - - - - - - - Genetic code name - - - Informal name for a genetic code, typically an organism name. - beta12orEarlier - - - - - - - - - - - File format name - - - Name of a file format such as HTML, PNG, PDF, EMBL, GenBank and so on. - beta12orEarlier - - - - - - - - - - - Sequence profile type - - true - 1.5 - A label (text token) describing a type of sequence profile such as frequency matrix, Gribskov profile, hidden Markov model etc. - beta12orEarlier - - - - - - - - - - Operating system name - - beta12orEarlier - Name of a computer operating system such as Linux, PC or Mac. - - - - - - - - - - - Mutation type - - beta12orEarlier - true - beta12orEarlier - A type of point or block mutation, including insertion, deletion, change, duplication and moves. - - - - - - - - - - Logical operator - - beta12orEarlier - A logical operator such as OR, AND, XOR, and NOT. - - - - - - - - - - - Results sort order - - Possible options including sorting by score, rank, by increasing P-value (probability, i.e. 
most statistically significant hits given first) and so on. - beta12orEarlier - true - 1.5 - A control of the order of data that is output, for example the order of sequences in an alignment. - - - - - - - - - - Toggle - - beta12orEarlier - A simple parameter that is a toggle (boolean value), typically a control for a modal tool. - true - beta12orEarlier - - - - - - - - - - Sequence width - - true - beta12orEarlier - beta12orEarlier - The width of an output sequence or alignment. - - - - - - - - - - Gap penalty - - beta12orEarlier - A penalty for introducing or extending a gap in an alignment. - - - - - - - - - - Nucleic acid melting temperature - - beta12orEarlier - A temperature concerning nucleic acid denaturation, typically the temperature at which the two strands of a hybridized or double stranded nucleic acid (DNA or RNA/DNA) molecule separate. - Melting temperature - - - - - - - - - - Concentration - - beta12orEarlier - The concentration of a chemical compound. - - - - - - - - - - Window step size - - 1.5 - beta12orEarlier - true - Size of the incremental 'step' a sequence window is moved over a sequence. - - - - - - - - - - EMBOSS graph - - beta12orEarlier - true - beta12orEarlier - An image of a graph generated by the EMBOSS suite. - - - - - - - - - - EMBOSS report - - An application report generated by the EMBOSS suite. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence offset - - true - beta12orEarlier - 1.5 - An offset for a single-point sequence position. - - - - - - - - - - Threshold - - 1.5 - beta12orEarlier - true - A value that serves as a threshold for a tool (usually to control scoring or output). - - - - - - - - - - Protein report (transcription factor) - - beta13 - true - This might include conformational or physicochemical properties, as well as sequence information for transcription factor(s) binding sites. - An informative report on a transcription factor protein. 
- Transcription factor binding site data - beta12orEarlier - - - - - - - - - - Database category name - - true - The name of a category of biological or bioinformatics database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Sequence profile name - - beta12orEarlier - Name of a sequence profile. - true - beta12orEarlier - - - - - - - - - - Color - - Specification of one or more colors. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Rendering parameter - - true - beta12orEarlier - 1.5 - A parameter that is used to control rendering (drawing) to a device or image. - Graphics parameter - Graphical parameter - - - - - - - - - - Sequence name - - - Any arbitrary name of a molecular sequence. - beta12orEarlier - - - - - - - - - - - Date - - 1.5 - A temporal date. - beta12orEarlier - true - - - - - - - - - - Word composition - - beta12orEarlier - Word composition data for a molecular sequence. - true - beta12orEarlier - - - - - - - - - - - Fickett testcode plot - - A plot of Fickett testcode statistic (identifying protein coding regions) in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Sequence similarity plot - - - Use this concept for calculated substitution rates, relative site variability, data on sites with biased properties, highly conserved or very poorly conserved sites, regions, blocks etc. - beta12orEarlier - Sequence conservation report - A plot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Helical wheel - - beta12orEarlier - An image of a peptide sequence looking down the axis of the helix for highlighting amphipathicity and other properties. - - - - - - - - - - Helical net - - beta12orEarlier - Useful for highlighting amphipathicity and other properties. - An image of a peptide sequence in a simple 3,4,3,4 repeating pattern that emulates at a simple level the arrangement of residues around an alpha helix.
- - - - - - - - - - Protein sequence properties plot - - true - beta12orEarlier - beta12orEarlier - A plot of general physicochemical properties of a protein sequence. - - - - - - - - - - Protein ionization curve - - - beta12orEarlier - A plot of pK versus pH for a protein. - - - - - - - - - - Sequence composition plot - - - beta12orEarlier - A plot of character or word composition / frequency of a molecular sequence. - - - - - - - - - - Nucleic acid density plot - - - beta12orEarlier - Density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence trace image - - Image of a sequence trace (nucleotide sequence versus probabilities of each of the 4 bases). - beta12orEarlier - - - - - - - - - - Nucleic acid features (siRNA) - - true - 1.5 - beta12orEarlier - A report on siRNA duplexes in mRNA. - - - - - - - - - - Sequence set (stream) - - beta12orEarlier - true - This concept may be used for sequence sets that are expected to be read and processed a single sequence at a time. - A collection of multiple molecular sequences and (typically) associated metadata that is intended for sequential processing. - beta12orEarlier - - - - - - - - - - FlyBase secondary identifier - - Secondary identifier of an object from the FlyBase database. - Secondary identifier are used to handle entries that were merged with or split from other entries in the database. - beta12orEarlier - - - - - - - - - - - Cardinality - - The number of a certain thing. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 1 - - beta12orEarlier - beta12orEarlier - A single thing. - true - - - - - - - - - - 1 or more - - One or more things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Exactly 2 - - Exactly two things. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 2 or more - - Two or more things. 
- beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence checksum - - A fixed-size datum calculated (by using a hash function) for a molecular sequence, typically for purposes of error detection or indexing. - beta12orEarlier - Hash code - Hash sum - Hash - Hash value - - - - - - - - - - Protein features report (chemical modifications) - - 1.8 - beta12orEarlier - chemical modification of a protein. - true - - - - - - - - - - Error - - beta12orEarlier - Data on an error generated by a computer system or tool. - 1.5 - true - - - - - - - - - - Database entry metadata - - beta12orEarlier - Basic information on any arbitrary database entry. - - - - - - - - - - Gene cluster - - beta13 - true - beta12orEarlier - A cluster of similar genes. - - - - - - - - - - Sequence record full - - true - beta12orEarlier - A molecular sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - - - - - - - - - - Plasmid identifier - - An identifier of a plasmid in a database. - beta12orEarlier - - - - - - - - - - - Mutation ID - - - beta12orEarlier - A unique identifier of a specific mutation catalogued in a database. - - - - - - - - - - - Mutation annotation (basic) - - Information describing the mutation itself, the organ site, tissue and type of lesion where the mutation has been identified, description of the patient origin and life-style. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Mutation annotation (prevalence) - - beta12orEarlier - true - An informative report on the prevalence of mutation(s), including data on samples and mutation prevalence (e.g. by tumour type). - beta12orEarlier - - - - - - - - - - Mutation annotation (prognostic) - - beta12orEarlier - An informative report on mutation prognostic data, such as information on patient cohort, the study settings and the results of the study.
- beta12orEarlier - true - - - - - - - - - - Mutation annotation (functional) - - An informative report on the functional properties of mutant proteins including transcriptional activities, promotion of cell growth and tumorigenicity, dominant negative effects, capacity to induce apoptosis, cell-cycle arrest or checkpoints in human cells and so on. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Codon number - - beta12orEarlier - The number of a codon, for instance, at which a mutation is located. - - - - - - - - - - Tumor annotation - - true - 1.4 - An informative report on a specific tumor including nature and origin of the sample, anatomic site, organ or tissue, tumor type, including morphology and/or histologic type, and so on. - beta12orEarlier - - - - - - - - - - Server metadata - - Basic information about a server on the web, such as an SRS server. - beta12orEarlier - 1.5 - true - - - - - - - - - - Database field name - - The name of a field in a database. - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (SYSTERS) - - SYSTERS cluster ID - Unique identifier of a sequence cluster from the SYSTERS database. - beta12orEarlier - - - - - - - - - - - Ontology metadata - - - - - - - - beta12orEarlier - Data concerning a biological ontology. - - - - - - - - - - Raw SCOP domain classification - - true - beta12orEarlier - Raw SCOP domain classification data files. - beta13 - These are the parsable data files provided by SCOP. - - - - - - - - - - Raw CATH domain classification - - Raw CATH domain classification data files. - These are the parsable data files provided by CATH. - true - beta13 - beta12orEarlier - - - - - - - - - - Heterogen annotation - - 1.4 - true - beta12orEarlier - An informative report on the types of small molecules or 'heterogens' (non-protein groups) that are represented in PDB files. - - - - - - - - - - Phylogenetic property values - - beta12orEarlier - Phylogenetic property values data. 
- true - beta12orEarlier - - - - - - - - - - Sequence set (bootstrapped) - - 1.5 - beta12orEarlier - Bootstrapping is often performed in phylogenetic analysis. - true - A collection of sequences output from a bootstrapping (resampling) procedure. - - - - - - - - - - Phylogenetic consensus tree - - true - A consensus phylogenetic tree derived from comparison of multiple trees. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Schema - - beta12orEarlier - true - A data schema for organising or transforming data of some type. - 1.5 - - - - - - - - - - DTD - - A DTD (document type definition). - true - beta12orEarlier - 1.5 - - - - - - - - - - XML Schema - - beta12orEarlier - XSD - An XML Schema. - true - 1.5 - - - - - - - - - - Relax-NG schema - - beta12orEarlier - 1.5 - A relax-NG schema. - true - - - - - - - - - - XSLT stylesheet - - 1.5 - beta12orEarlier - An XSLT stylesheet. - true - - - - - - - - - - Data resource definition name - - - beta12orEarlier - The name of a data type. - - - - - - - - - - - OBO file format name - - Name of an OBO file format such as OBO-XML, plain and so on. - beta12orEarlier - - - - - - - - - - - Gene ID (MIPS) - - Identifier for genetic elements in MIPS database. - beta12orEarlier - MIPS genetic element identifier - - - - - - - - - - - Sequence identifier (protein) - - An identifier of protein sequence(s) or protein sequence database entries. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Sequence identifier (nucleic acid) - - An identifier of nucleotide sequence(s) or nucleotide sequence database entries. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - EMBL accession - - EMBL ID - beta12orEarlier - EMBL accession number - EMBL identifier - An accession number of an entry from the EMBL sequence database. - - - - - - - - - - - UniProt ID - - - - - - - - UniProtKB identifier - An identifier of a polypeptide in the UniProt database. 
- UniProtKB entry name - beta12orEarlier - UniProt identifier - UniProt entry name - - - - - - - - - - - GenBank accession - - GenBank ID - GenBank identifier - Accession number of an entry from the GenBank sequence database. - beta12orEarlier - GenBank accession number - - - - - - - - - - - Gramene secondary identifier - - beta12orEarlier - Gramene internal identifier - Gramene internal ID - Secondary (internal) identifier of a Gramene database entry. - Gramene secondary ID - - - - - - - - - - - Sequence variation ID - - - An identifier of an entry from a database of molecular sequence variation. - beta12orEarlier - - - - - - - - - - - Gene ID - - - Gene accession - beta12orEarlier - A unique (and typically persistent) identifier of a gene in a database, that is (typically) different to the gene name/symbol. - Gene code - - - - - - - - - - - Gene name (AceView) - - AceView gene name - 1.3 - true - Name of an entry (gene) from the AceView genes database. - beta12orEarlier - - - - - - - - - - Gene ID (ECK) - - ECK accession - beta12orEarlier - E. coli K-12 gene identifier - Identifier of an E. coli K-12 gene from EcoGene Database. - http://www.geneontology.org/doc/GO.xrf_abbs: ECK - - - - - - - - - - - Gene ID (HGNC) - - HGNC ID - beta12orEarlier - Identifier for a gene approved by the HUGO Gene Nomenclature Committee. - - - - - - - - - - - Gene name - - - The name of a gene, (typically) assigned by a person and/or according to a naming scheme. It may contain white space characters and is typically more intuitive and readable than a gene symbol. It (typically) may be used to identify similar genes in different species and to derive a gene symbol. - Allele name - beta12orEarlier - - - - - - - - - - - Gene name (NCBI) - - beta12orEarlier - 1.3 - NCBI gene name - Name of an entry (gene) from the NCBI genes database. - true - - - - - - - - - - SMILES string - - A specification of a chemical structure in SMILES format. 
- beta12orEarlier - - - - - - - - - - STRING ID - - Unique identifier of an entry from the STRING database of protein-protein interactions. - beta12orEarlier - - - - - - - - - - - Virus annotation - - An informative report on a specific virus. - true - 1.4 - beta12orEarlier - - - - - - - - - - Virus annotation (taxonomy) - - An informative report on the taxonomy of a specific virus. - beta12orEarlier - true - 1.4 - - - - - - - - - - Reaction ID (SABIO-RK) - - Identifier of a biological reaction from the SABIO-RK reactions database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - Carbohydrate report - - Annotation on or information derived from one or more specific carbohydrate 3D structure(s). - beta12orEarlier - - - - - - - - - - GI number - - beta12orEarlier - NCBI GI number - gi number - A series of digits that are assigned consecutively to each sequence record processed by NCBI. The GI number bears no resemblance to the Accession number of the sequence record. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - - - - - - - - - - - NCBI version - - beta12orEarlier - NCBI accession.version - Nucleotide sequence version contains two letters followed by six digits, a dot, and a version number (or for older nucleotide sequence records, the format is one letter followed by five digits, a dot, and a version number). Protein sequence version contains three letters followed by five digits, a dot, and a version number. - An identifier assigned to sequence records processed by NCBI, made of the accession number of the database record followed by a dot and a version number. - accession.version - - - - - - - - - - - Cell line name - - beta12orEarlier - The name of a cell line. - - - - - - - - - - - Cell line name (exact) - - beta12orEarlier - The name of a cell line. 
- - - - - - - - - - - Cell line name (truncated) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (no punctuation) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Cell line name (assonant) - - The name of a cell line. - beta12orEarlier - - - - - - - - - - - Enzyme ID - - - beta12orEarlier - A unique, persistent identifier of an enzyme. - Enzyme accession - - - - - - - - - - - REBASE enzyme number - - Identifier of an enzyme from the REBASE enzymes database. - beta12orEarlier - - - - - - - - - - - DrugBank ID - - beta12orEarlier - DB[0-9]{5} - Unique identifier of a drug from the DrugBank database. - - - - - - - - - - - GI number (protein) - - beta12orEarlier - protein gi number - A unique identifier assigned to NCBI protein sequence records. - Nucleotide sequence GI number is shown in the VERSION field of the database record. Protein sequence GI number is shown in the CDS/db_xref field of a nucleotide database record, and the VERSION field of a protein database record. - protein gi - - - - - - - - - - - Bit score - - A score derived from the alignment of two sequences, which is then normalized with respect to the scoring system. - Bit scores are normalized with respect to the scoring system and therefore can be used to compare alignment scores from different searches. - beta12orEarlier - - - - - - - - - - Translation phase specification - - beta12orEarlier - Phase for translation of DNA (0, 1 or 2) relative to a fragment of the coding sequence. - Phase - - - - - - - - - - Resource metadata - - Data concerning or describing some core computational resource, as distinct from primary data. This includes metadata on the origin, source, history, ownership or location of some thing. - This is a broad data type and is used a placeholder for other, more specific types. 
- Provenance metadata - beta12orEarlier - - - - - - - - - - Ontology identifier - - - - - - - - beta12orEarlier - Any arbitrary identifier of an ontology. - - - - - - - - - - - Ontology concept name - - - The name of a concept in an ontology. - beta12orEarlier - - - - - - - - - - - Genome build identifier - - beta12orEarlier - An identifier of a build of a particular genome. - - - - - - - - - - - Pathway or network name - - The name of a biological pathway or network. - beta12orEarlier - - - - - - - - - - - Pathway ID (KEGG) - - - Identifier of a pathway from the KEGG pathway database. - beta12orEarlier - [a-zA-Z_0-9]{2,3}[0-9]{5} - KEGG pathway ID - - - - - - - - - - - Pathway ID (NCI-Nature) - - beta12orEarlier - [a-zA-Z_0-9]+ - Identifier of a pathway from the NCI-Nature pathway database. - - - - - - - - - - - Pathway ID (ConsensusPathDB) - - - beta12orEarlier - Identifier of a pathway from the ConsensusPathDB pathway database. - - - - - - - - - - - Sequence cluster ID (UniRef) - - Unique identifier of an entry from the UniRef database. - UniRef cluster id - UniRef entry accession - beta12orEarlier - - - - - - - - - - - Sequence cluster ID (UniRef100) - - UniRef100 cluster id - beta12orEarlier - UniRef100 entry accession - Unique identifier of an entry from the UniRef100 database. - - - - - - - - - - - Sequence cluster ID (UniRef90) - - UniRef90 entry accession - beta12orEarlier - UniRef90 cluster id - Unique identifier of an entry from the UniRef90 database. - - - - - - - - - - - Sequence cluster ID (UniRef50) - - beta12orEarlier - UniRef50 cluster id - UniRef50 entry accession - Unique identifier of an entry from the UniRef50 database. - - - - - - - - - - - Ontology data - - - - - - - - Data concerning or derived from an ontology. - Ontological data - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. 
- - - - - - - - - - RNA family report - - beta12orEarlier - An informative report on a specific RNA family or other group of classified RNA sequences. - RNA family annotation - - - - - - - - - - RNA family identifier - - - - - - - - beta12orEarlier - Identifier of an RNA family, typically an entry from a RNA sequence classification database. - - - - - - - - - - - RFAM accession - - - Stable accession number of an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Protein signature type - - beta12orEarlier - true - A label (text token) describing a type of protein family signature (sequence classifier) from the InterPro database. - 1.5 - - - - - - - - - - Domain-nucleic acid interaction report - - 1.5 - true - An informative report on protein domain-DNA/RNA interaction(s). - beta12orEarlier - - - - - - - - - - Domain-domain interactions - - 1.8 - An informative report on protein domain-protein domain interaction(s). - beta12orEarlier - true - - - - - - - - - - Domain-domain interaction (indirect) - - true - beta12orEarlier - beta12orEarlier - Data on indirect protein domain-protein domain interaction(s). - - - - - - - - - - Sequence accession (hybrid) - - - - - - - - Accession number of a nucleotide or protein sequence database entry. - beta12orEarlier - - - - - - - - - - - 2D PAGE data - - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - beta13 - beta12orEarlier - true - Data concerning two-dimensional polygel electrophoresis. - - - - - - - - - - 2D PAGE report - - beta12orEarlier - two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - true - - - - - - - - - - Pathway or network accession - - - A persistent, unique identifier of a biological pathway or network (typically a database entry). 
- beta12orEarlier - - - - - - - - - - - Secondary structure alignment - - Alignment of the (1D representations of) secondary structure of two or more molecules. - beta12orEarlier - - - - - - - - - - ASTD ID - - - beta12orEarlier - Identifier of an object from the ASTD database. - - - - - - - - - - - ASTD ID (exon) - - beta12orEarlier - Identifier of an exon from the ASTD database. - - - - - - - - - - - ASTD ID (intron) - - beta12orEarlier - Identifier of an intron from the ASTD database. - - - - - - - - - - - ASTD ID (polya) - - Identifier of a polyA signal from the ASTD database. - beta12orEarlier - - - - - - - - - - - ASTD ID (tss) - - Identifier of a transcription start site from the ASTD database. - beta12orEarlier - - - - - - - - - - - 2D PAGE spot report - - 2D PAGE spot annotation - beta12orEarlier - An informative report on individual spot(s) from a two-dimensional (2D PAGE) gel. - 1.8 - true - - - - - - - - - - Spot ID - - - beta12orEarlier - Unique identifier of a spot from a two-dimensional (protein) gel. - - - - - - - - - - - Spot serial number - - Unique identifier of a spot from a two-dimensional (protein) gel in the SWISS-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Spot ID (HSC-2DPAGE) - - Unique identifier of a spot from a two-dimensional (protein) gel from a HSC-2DPAGE database. - beta12orEarlier - - - - - - - - - - - Protein-motif interaction - - beta13 - true - Data on the interaction of a protein (or protein domain) with specific structural (3D) and/or sequence motifs. - beta12orEarlier - - - - - - - - - - Strain identifier - - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - beta12orEarlier - - - - - - - - - - - CABRI accession - - - A unique identifier of an item from the CABRI database. - beta12orEarlier - - - - - - - - - - - Experiment report (genotyping) - - true - Report of genotype experiment including case control, population, and family studies. 
These might use array based methods and re-sequencing methods. - 1.8 - beta12orEarlier - - - - - - - - - - Genotype experiment ID - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of genotype experiment metadata. - - - - - - - - - - - EGA accession - - beta12orEarlier - Identifier of an entry from the EGA database. - - - - - - - - - - - IPI protein ID - - Identifier of a protein entry catalogued in the International Protein Index (IPI) database. - IPI[0-9]{8} - beta12orEarlier - - - - - - - - - - - RefSeq accession (protein) - - RefSeq protein ID - Accession number of a protein from the RefSeq database. - beta12orEarlier - - - - - - - - - - - EPD ID - - beta12orEarlier - Identifier of an entry (promoter) from the EPD database. - EPD identifier - - - - - - - - - - - TAIR accession - - - beta12orEarlier - Identifier of an entry from the TAIR database. - - - - - - - - - - - TAIR accession (At gene) - - beta12orEarlier - Identifier of an Arabidopsis thaliana gene from the TAIR database. - - - - - - - - - - - UniSTS accession - - beta12orEarlier - Identifier of an entry from the UniSTS database. - - - - - - - - - - - UNITE accession - - beta12orEarlier - Identifier of an entry from the UNITE database. - - - - - - - - - - - UTR accession - - beta12orEarlier - Identifier of an entry from the UTR database. - - - - - - - - - - - UniParc accession - - beta12orEarlier - UPI[A-F0-9]{10} - Accession number of a UniParc (protein sequence) database entry. - UniParc ID - UPI - - - - - - - - - - - mFLJ/mKIAA number - - beta12orEarlier - Identifier of an entry from the Rouge or HUGE databases. - - - - - - - - - - - Fungi annotation - - true - beta12orEarlier - 1.4 - An informative report on a specific fungus. - - - - - - - - - - Fungi annotation (anamorph) - - beta12orEarlier - An informative report on a specific fungus anamorph. - 1.4 - true - - - - - - - - - - Gene features report (exon) - - true - exons in a nucleotide sequences. 
- 1.8 - beta12orEarlier - - - - - - - - - - Ensembl protein ID - - - Ensembl ID (protein) - beta12orEarlier - Protein ID (Ensembl) - Unique identifier for a protein from the Ensembl database. - - - - - - - - - - - Gene transcriptional features report - - 1.8 - beta12orEarlier - transcription of DNA into RNA including the regulation of transcription. - true - - - - - - - - - - Toxin annotation - - beta12orEarlier - An informative report on a specific toxin. - 1.4 - true - - - - - - - - - - Protein report (membrane protein) - - beta12orEarlier - true - An informative report on a membrane protein. - beta12orEarlier - - - - - - - - - - Protein-drug interaction report - - true - An informative report on tentative or known protein-drug interaction(s). - 1.12 - beta12orEarlier - - - - - - - - - - Map data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - true - beta13 - Data concerning a map of molecular sequence(s). - - - - - - - - - - - Phylogenetic data - - Data concerning phylogeny, typically of molecular sequences, including reports of information concerning or derived from a phylogenetic tree, or from comparing two or more phylogenetic trees. - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - - - - - - - - - - Protein data - - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning one or more protein molecules. - true - beta12orEarlier - - - - - - - - - - Nucleic acid data - - true - Data concerning one or more nucleic acid molecules. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article data - - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. 
It includes concepts that are best described as scientific text or closely concerned with or derived from text. - Article report - Data concerning, extracted from, or derived from the analysis of a scientific text (or texts) such as a full text article from a scientific journal. - - - - - - - - - - - Parameter - - http://semanticscience.org/resource/SIO_000144 - Tool-specific parameter - beta12orEarlier - http://www.e-lico.eu/ontologies/dmo/DMOP/DMOP.owl#Parameter - Typically a simple numerical or string value that controls the operation of a tool. - Parameters - Tool parameter - - - - - - - - - - Molecular data - - Molecule-specific data - true - Data concerning a specific type of molecule. - beta13 - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Molecule report - - An informative report on a specific molecule. - beta12orEarlier - Molecular report - 1.5 - true - - - - - - - - - - - Organism report - - An informative report on a specific organism. - beta12orEarlier - Organism annotation - - - - - - - - - - Experiment report - - Experiment metadata - beta12orEarlier - Experiment annotation - Annotation on a wet lab experiment, such as experimental conditions. - - - - - - - - - - Nucleic acid features report (mutation) - - DNA mutation. - 1.8 - true - beta12orEarlier - - - - - - - - - - Sequence attribute - - An attribute of a molecular sequence, possibly in reference to some other sequence. - Sequence parameter - beta12orEarlier - - - - - - - - - - Sequence tag profile - - SAGE, MPSS and SBS experiments are usually performed to study gene expression. The sequence tags are typically subsequently annotated (after a database search) with the mRNA (and therefore gene) the tag was extracted from. - beta12orEarlier - Sequencing-based expression profile - This includes tag to gene assignments (tag mapping) of SAGE, MPSS and SBS data. 
Typically this is the sequencing-based expression profile annotated with gene identifiers. - Sequence tag profile (with gene assignment) - Output from a serial analysis of gene expression (SAGE), massively parallel signature sequencing (MPSS) or sequencing by synthesis (SBS) experiment. In all cases this is a list of short sequence tags and the number of times it is observed. - - - - - - - - - - Mass spectrometry data - - beta12orEarlier - Data concerning a mass spectrometry measurement. - - - - - - - - - - Protein structure raw data - - beta12orEarlier - Raw data from experimental methods for determining protein structure. - This is a broad data type and is used a placeholder for other, more specific types. It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - - - - - - - - - - Mutation identifier - - An identifier of a mutation. - beta12orEarlier - - - - - - - - - - - Alignment data - - This is a broad data type and is used a placeholder for other, more specific types. This includes entities derived from sequences and structures such as motifs and profiles. - true - beta13 - Data concerning an alignment of two or more molecular sequences, structures or derived data. - beta12orEarlier - - - - - - - - - - - Data index data - - true - Data concerning an index of data. - beta12orEarlier - beta13 - Database index - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Amino acid name (single letter) - - beta12orEarlier - Single letter amino acid identifier, e.g. G. - - - - - - - - - - - Amino acid name (three letter) - - beta12orEarlier - Three letter amino acid identifier, e.g. GLY. - - - - - - - - - - - Amino acid name (full name) - - beta12orEarlier - Full name of an amino acid, e.g. Glycine. - - - - - - - - - - - Toxin identifier - - - - - - - - beta12orEarlier - Identifier of a toxin. 
- - - - - - - - - - - ArachnoServer ID - - Unique identifier of a toxin from the ArachnoServer database. - beta12orEarlier - - - - - - - - - - - Expressed gene list - - beta12orEarlier - true - 1.5 - Gene annotation (expressed gene list) - A simple summary of expressed genes. - - - - - - - - - - BindingDB Monomer ID - - Unique identifier of a monomer from the BindingDB database. - beta12orEarlier - - - - - - - - - - - GO concept name - - true - beta12orEarlier - beta12orEarlier - The name of a concept from the GO ontology. - - - - - - - - - - GO concept ID (biological process) - - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - An identifier of a 'biological process' concept from the the Gene Ontology. - - - - - - - - - - - GO concept ID (molecular function) - - beta12orEarlier - [0-9]{7}|GO:[0-9]{7} - An identifier of a 'molecular function' concept from the the Gene Ontology. - - - - - - - - - - - GO concept name (cellular component) - - The name of a concept for a cellular component from the GO ontology. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Northern blot image - - beta12orEarlier - An image arising from a Northern Blot experiment. - - - - - - - - - - Blot ID - - - Unique identifier of a blot from a Northern Blot. - beta12orEarlier - - - - - - - - - - - BlotBase blot ID - - beta12orEarlier - Unique identifier of a blot from a Northern Blot from the BlotBase database. - - - - - - - - - - - Hierarchy - - beta12orEarlier - Raw data on a biological hierarchy, describing the hierarchy proper, hierarchy components and possibly associated annotation. - Hierarchy annotation - - - - - - - - - - Hierarchy identifier - - Identifier of an entry from a database of biological hierarchies. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Brite hierarchy ID - - beta12orEarlier - Identifier of an entry from the Brite database of biological hierarchies. - - - - - - - - - - - Cancer type - - true - A type (represented as a string) of cancer. 
- beta12orEarlier - beta12orEarlier - - - - - - - - - - BRENDA organism ID - - A unique identifier for an organism used in the BRENDA database. - beta12orEarlier - - - - - - - - - - - UniGene taxon - - The name of a taxon using the controlled vocabulary of the UniGene database. - UniGene organism abbreviation - beta12orEarlier - - - - - - - - - - - UTRdb taxon - - beta12orEarlier - The name of a taxon using the controlled vocabulary of the UTRdb database. - - - - - - - - - - - Catalogue ID - - beta12orEarlier - An identifier of a catalogue of biological resources. - Catalogue identifier - - - - - - - - - - - CABRI catalogue name - - - The name of a catalogue of biological resources from the CABRI database. - beta12orEarlier - - - - - - - - - - - Secondary structure alignment metadata - - An informative report on protein secondary structure alignment-derived data or metadata. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Molecule interaction report - - An informative report on the physical, chemical or other information concerning the interaction of two or more molecules (or parts of molecules). - beta12orEarlier - Molecular interaction report - Molecular interaction data - - - - - - - - - Pathway or network - - - - - - - - Network - beta12orEarlier - Pathway - Primary data about a specific biological pathway or network (the nodes and connections within the pathway or network). - - - - - - - - - - Small molecule data - - true - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - beta13 - Data concerning one or more small molecules. - - - - - - - - - - Genotype and phenotype data - - beta12orEarlier - true - beta13 - Data concerning a particular genotype, phenotype or a genotype / phenotype relation. - - - - - - - - - - Gene expression data - - - - - - - - beta12orEarlier - Image or hybridisation data for a microarray, typically a study of gene expression. 
- Microarray data - This is a broad data type and is used a placeholder for other, more specific types. See also http://edamontology.org/data_0931 - - - - - - - - - - Compound ID (KEGG) - - - C[0-9]+ - Unique identifier of a chemical compound from the KEGG database. - beta12orEarlier - KEGG compound ID - KEGG compound identifier - - - - - - - - - - - RFAM name - - - Name (not necessarily stable) an entry (RNA family) from the RFAM database. - beta12orEarlier - - - - - - - - - - - Reaction ID (KEGG) - - - Identifier of a biological reaction from the KEGG reactions database. - R[0-9]+ - beta12orEarlier - - - - - - - - - - - Drug ID (KEGG) - - - beta12orEarlier - Unique identifier of a drug from the KEGG Drug database. - D[0-9]+ - - - - - - - - - - - Ensembl ID - - - beta12orEarlier - ENS[A-Z]*[FPTG][0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl database. - Ensembl IDs - - - - - - - - - - - ICD identifier - - - - - - - - An identifier of a disease from the International Classification of Diseases (ICD) database. - beta12orEarlier - [A-Z][0-9]+(\.[-[0-9]+])? - - - - - - - - - - - Sequence cluster ID (CluSTr) - - Unique identifier of a sequence cluster from the CluSTr database. - [0-9A-Za-z]+:[0-9]+:[0-9]{1,5}(\.[0-9])? - CluSTr ID - beta12orEarlier - CluSTr cluster ID - - - - - - - - - - - KEGG Glycan ID - - - G[0-9]+ - Unique identifier of a glycan ligand from the KEGG GLYCAN database (a subset of KEGG LIGAND). - beta12orEarlier - - - - - - - - - - - TCDB ID - - beta12orEarlier - OBO file for regular expression. - TC number - [0-9]+\.[A-Z]\.[0-9]+\.[0-9]+\.[0-9]+ - A unique identifier of a family from the transport classification database (TCDB) of membrane transport proteins. - - - - - - - - - - - MINT ID - - MINT\-[0-9]{1,5} - Unique identifier of an entry from the MINT database of protein-protein interactions. 
- beta12orEarlier - - - - - - - - - - - DIP ID - - Unique identifier of an entry from the DIP database of protein-protein interactions. - beta12orEarlier - DIP[\:\-][0-9]{3}[EN] - - - - - - - - - - - Signaling Gateway protein ID - - beta12orEarlier - Unique identifier of a protein listed in the UCSD-Nature Signaling Gateway Molecule Pages database. - A[0-9]{6} - - - - - - - - - - - Protein modification ID - - - beta12orEarlier - Identifier of a protein modification catalogued in a database. - - - - - - - - - - - RESID ID - - Identifier of a protein modification catalogued in the RESID database. - AA[0-9]{4} - beta12orEarlier - - - - - - - - - - - RGD ID - - - [0-9]{4,7} - beta12orEarlier - Identifier of an entry from the RGD database. - - - - - - - - - - - TAIR accession (protein) - - - - - - - - - AASequence:[0-9]{10} - Identifier of a protein sequence from the TAIR database. - beta12orEarlier - - - - - - - - - - - Compound ID (HMDB) - - HMDB[0-9]{5} - beta12orEarlier - HMDB ID - Identifier of a small molecule metabolite from the Human Metabolome Database (HMDB). - - - - - - - - - - - LIPID MAPS ID - - beta12orEarlier - LM ID - Identifier of an entry from the LIPID MAPS database. - LM(FA|GL|GP|SP|ST|PR|SL|PK)[0-9]{4}([0-9a-zA-Z]{4})? - - - - - - - - - - - PeptideAtlas ID - - Identifier of a peptide from the PeptideAtlas peptide databases. - PDBML:pdbx_PDB_strand_id - beta12orEarlier - PAp[0-9]{8} - - - - - - - - - - - Molecular interaction ID - - Identifier of a report of molecular interactions from a database (typically). - true - beta12orEarlier - 1.7 - - - - - - - - - - BioGRID interaction ID - - [0-9]+ - beta12orEarlier - A unique identifier of an interaction from the BioGRID database. - - - - - - - - - - - Enzyme ID (MEROPS) - - MEROPS ID - Unique identifier of a peptidase enzyme from the MEROPS database. - beta12orEarlier - S[0-9]{2}\.[0-9]{3} - - - - - - - - - - - Mobile genetic element ID - - - An identifier of a mobile genetic element. 
- beta12orEarlier - - - - - - - - - - - ACLAME ID - - beta12orEarlier - mge:[0-9]+ - An identifier of a mobile genetic element from the Aclame database. - - - - - - - - - - - SGD ID - - - PWY[a-zA-Z_0-9]{2}\-[0-9]{3} - beta12orEarlier - Identifier of an entry from the Saccharomyces genome database (SGD). - - - - - - - - - - - Book ID - - - beta12orEarlier - Unique identifier of a book. - - - - - - - - - - - ISBN - - beta12orEarlier - (ISBN)?(-13|-10)?[:]?[ ]?([0-9]{2,3}[ -]?)?[0-9]{1,5}[ -]?[0-9]{1,7}[ -]?[0-9]{1,6}[ -]?([0-9]|X) - The International Standard Book Number (ISBN) is for identifying printed books. - - - - - - - - - - - Compound ID (3DMET) - - B[0-9]{5} - 3DMET ID - beta12orEarlier - Identifier of a metabolite from the 3DMET database. - - - - - - - - - - - MatrixDB interaction ID - - ([A-NR-Z][0-9][A-Z][A-Z0-9][A-Z0-9][0-9])_.*|([OPQ][0-9][A-Z0-9][A-Z0-9][A-Z0-9][0-9]_.*)|(GAG_.*)|(MULT_.*)|(PFRAG_.*)|(LIP_.*)|(CAT_.*) - A unique identifier of an interaction from the MatrixDB database. - beta12orEarlier - - - - - - - - - - - cPath ID - - - [0-9]+ - These identifiers are unique within the cPath database, however, they are not stable between releases. - beta12orEarlier - A unique identifier for pathways, reactions, complexes and small molecules from the cPath (Pathway Commons) database. - - - - - - - - - - - PubChem bioassay ID - - - Identifier of an assay from the PubChem database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PubChem ID - - - PubChem identifier - beta12orEarlier - Identifier of an entry from the PubChem database. - - - - - - - - - - - Reaction ID (MACie) - - beta12orEarlier - M[0-9]{4} - MACie entry number - Identifier of an enzyme reaction mechanism from the MACie database. - - - - - - - - - - - Gene ID (miRBase) - - beta12orEarlier - miRNA name - miRNA ID - Identifier for a gene from the miRBase database. 
- MI[0-9]{7} - miRNA identifier - - - - - - - - - - - Gene ID (ZFIN) - - Identifier for a gene from the Zebrafish information network genome (ZFIN) database. - beta12orEarlier - ZDB\-GENE\-[0-9]+\-[0-9]+ - - - - - - - - - - - Reaction ID (Rhea) - - [0-9]{5} - Identifier of an enzyme-catalysed reaction from the Rhea database. - beta12orEarlier - - - - - - - - - - - Pathway ID (Unipathway) - - UPA[0-9]{5} - upaid - beta12orEarlier - Identifier of a biological pathway from the Unipathway database. - - - - - - - - - - - Compound ID (ChEMBL) - - Identifier of a small molecular from the ChEMBL database. - ChEMBL ID - beta12orEarlier - [0-9]+ - - - - - - - - - - - LGICdb identifier - - Unique identifier of an entry from the Ligand-gated ion channel (LGICdb) database. - beta12orEarlier - [a-zA-Z_0-9]+ - - - - - - - - - - - Reaction kinetics ID (SABIO-RK) - - Identifier of a biological reaction (kinetics entry) from the SABIO-RK reactions database. - [0-9]+ - beta12orEarlier - - - - - - - - - - - PharmGKB ID - - - beta12orEarlier - Identifier of an entry from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Pathway ID (PharmGKB) - - - PA[0-9]+ - Identifier of a pathway from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - - - - - - - - - - - Disease ID (PharmGKB) - - - Identifier of a disease from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - beta12orEarlier - PA[0-9]+ - - - - - - - - - - - Drug ID (PharmGKB) - - - beta12orEarlier - Identifier of a drug from the pharmacogenetics and pharmacogenomics knowledge base (PharmGKB). - PA[0-9]+ - - - - - - - - - - - Drug ID (TTD) - - DAP[0-9]+ - Identifier of a drug from the Therapeutic Target Database (TTD). - beta12orEarlier - - - - - - - - - - - Target ID (TTD) - - TTDS[0-9]+ - Identifier of a target protein from the Therapeutic Target Database (TTD). 
- beta12orEarlier - - - - - - - - - - - Cell type identifier - - beta12orEarlier - A unique identifier of a type or group of cells. - - - - - - - - - - - NeuronDB ID - - [0-9]+ - beta12orEarlier - A unique identifier of a neuron from the NeuronDB database. - - - - - - - - - - - NeuroMorpho ID - - beta12orEarlier - A unique identifier of a neuron from the NeuroMorpho database. - [a-zA-Z_0-9]+ - - - - - - - - - - - Compound ID (ChemIDplus) - - Identifier of a chemical from the ChemIDplus database. - ChemIDplus ID - [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (SMPDB) - - beta12orEarlier - Identifier of a pathway from the Small Molecule Pathway Database (SMPDB). - SMP[0-9]{5} - - - - - - - - - - - BioNumbers ID - - Identifier of an entry from the BioNumbers database of key numbers and associated data in molecular biology. - [0-9]+ - beta12orEarlier - - - - - - - - - - - T3DB ID - - beta12orEarlier - T3D[0-9]+ - Unique identifier of a toxin from the Toxin and Toxin Target Database (T3DB) database. - - - - - - - - - - - Carbohydrate identifier - - - - - - - - - - - - - - beta12orEarlier - Identifier of a carbohydrate. - - - - - - - - - - - GlycomeDB ID - - Identifier of an entry from the GlycomeDB database. - beta12orEarlier - [0-9]+ - - - - - - - - - - - LipidBank ID - - beta12orEarlier - [a-zA-Z_0-9]+[0-9]+ - Identifier of an entry from the LipidBank database. - - - - - - - - - - - CDD ID - - beta12orEarlier - cd[0-9]{5} - Identifier of a conserved domain from the Conserved Domain Database. - - - - - - - - - - - MMDB ID - - [0-9]{1,5} - beta12orEarlier - An identifier of an entry from the MMDB database. - MMDB accession - - - - - - - - - - - iRefIndex ID - - Unique identifier of an entry from the iRefIndex database of protein-protein interactions. - beta12orEarlier - [0-9]+ - - - - - - - - - - - ModelDB ID - - Unique identifier of an entry from the ModelDB database. 
- [0-9]+ - beta12orEarlier - - - - - - - - - - - Pathway ID (DQCS) - - [0-9]+ - Identifier of a signaling pathway from the Database of Quantitative Cellular Signaling (DQCS). - beta12orEarlier - - - - - - - - - - - Ensembl ID (Homo sapiens) - - beta12orEarlier - true - beta12orEarlier - ENS([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database (Homo sapiens division). - - - - - - - - - - Ensembl ID ('Bos taurus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Bos taurus' division). - true - beta12orEarlier - ENSBTA([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Canis familiaris') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Canis familiaris' division). - true - ENSCAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Cavia porcellus') - - ENSCPO([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Cavia porcellus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Ciona intestinalis') - - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona intestinalis' division). - beta12orEarlier - beta12orEarlier - ENSCIN([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Ciona savignyi') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ciona savignyi' division). - ENSCSAV([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Ensembl ID ('Danio rerio') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Danio rerio' division). 
- true - beta12orEarlier - beta12orEarlier - ENSDAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Dasypus novemcinctus') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Dasypus novemcinctus' division). - beta12orEarlier - beta12orEarlier - ENSDNO([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Echinops telfairi') - - ENSETE([EGTP])[0-9]{11} - true - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Echinops telfairi' division). - - - - - - - - - - Ensembl ID ('Erinaceus europaeus') - - true - ENSEEU([EGTP])[0-9]{11} - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Erinaceus europaeus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Felis catus') - - beta12orEarlier - true - ENSFCA([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Felis catus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gallus gallus') - - ENSGAL([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gallus gallus' division). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Gasterosteus aculeatus') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Gasterosteus aculeatus' division). - true - ENSGAC([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Homo sapiens') - - ENSHUM([EGTP])[0-9]{11} - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Homo sapiens' division). 
- true - - - - - - - - - - Ensembl ID ('Loxodonta africana') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Loxodonta africana' division). - ENSLAF([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Macaca mulatta') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Macaca mulatta' division). - beta12orEarlier - ENSMMU([EGTP])[0-9]{11} - true - beta12orEarlier - - - - - - - - - - Ensembl ID ('Monodelphis domestica') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Monodelphis domestica' division). - true - ENSMOD([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Mus musculus') - - ENSMUS([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Mus musculus' division). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Ensembl ID ('Myotis lucifugus') - - beta12orEarlier - ENSMLU([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Myotis lucifugus' division). - - - - - - - - - - Ensembl ID ("Ornithorhynchus anatinus") - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Ornithorhynchus anatinus' division). - ENSOAN([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryctolagus cuniculus') - - beta12orEarlier - ENSOCU([EGTP])[0-9]{11} - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryctolagus cuniculus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Oryzias latipes') - - ENSORL([EGTP])[0-9]{11} - true - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Oryzias latipes' division). 
- beta12orEarlier - - - - - - - - - - Ensembl ID ('Otolemur garnettii') - - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Otolemur garnettii' division). - true - beta12orEarlier - ENSSAR([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Pan troglodytes') - - beta12orEarlier - beta12orEarlier - ENSPTR([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Pan troglodytes' division). - true - - - - - - - - - - Ensembl ID ('Rattus norvegicus') - - beta12orEarlier - true - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Rattus norvegicus' division). - ENSRNO([EGTP])[0-9]{11} - beta12orEarlier - - - - - - - - - - Ensembl ID ('Spermophilus tridecemlineatus') - - true - beta12orEarlier - ENSSTO([EGTP])[0-9]{11} - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Spermophilus tridecemlineatus' division). - beta12orEarlier - - - - - - - - - - Ensembl ID ('Takifugu rubripes') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Takifugu rubripes' division). - ENSFRU([EGTP])[0-9]{11} - true - - - - - - - - - - Ensembl ID ('Tupaia belangeri') - - beta12orEarlier - beta12orEarlier - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Tupaia belangeri' division). - true - ENSTBE([EGTP])[0-9]{11} - - - - - - - - - - Ensembl ID ('Xenopus tropicalis') - - Identifier of an entry (exon, gene, transcript or protein) from the Ensembl 'core' database ('Xenopus tropicalis' division). - beta12orEarlier - beta12orEarlier - true - ENSXET([EGTP])[0-9]{11} - - - - - - - - - - CATH identifier - - beta12orEarlier - Identifier of a protein domain (or other node) from the CATH database. 
- - - - - - - - - - - CATH node ID (family) - - beta12orEarlier - A code number identifying a family from the CATH database. - 2.10.10.10 - - - - - - - - - - - Enzyme ID (CAZy) - - Identifier of an enzyme from the CAZy enzymes database. - beta12orEarlier - CAZy ID - - - - - - - - - - - Clone ID (IMAGE) - - I.M.A.G.E. cloneID - IMAGE cloneID - A unique identifier assigned by the I.M.A.G.E. consortium to a clone (cloned molecular sequence). - beta12orEarlier - - - - - - - - - - - GO concept ID (cellular compartment) - - An identifier of a 'cellular compartment' concept from the Gene Ontology. - [0-9]{7}|GO:[0-9]{7} - beta12orEarlier - GO concept identifier (cellular compartment) - - - - - - - - - - - Chromosome name (BioCyc) - - Name of a chromosome as used in the BioCyc database. - beta12orEarlier - - - - - - - - - - - CleanEx entry name - - beta12orEarlier - An identifier of a gene expression profile from the CleanEx database. - - - - - - - - - - - CleanEx dataset code - - beta12orEarlier - An identifier of (typically a list of) gene expression experiments catalogued in the CleanEx database. - - - - - - - - - - - Genome report - - An informative report of general information concerning a genome as a whole. - beta12orEarlier - - - - - - - - - - Protein ID (CORUM) - - beta12orEarlier - CORUM complex ID - Unique identifier for a protein complex from the CORUM database. - - - - - - - - - - - CDD PSSM-ID - - beta12orEarlier - Unique identifier of a position-specific scoring matrix from the CDD database. - - - - - - - - - - - Protein ID (CuticleDB) - - CuticleDB ID - beta12orEarlier - Unique identifier for a protein from the CuticleDB database. - - - - - - - - - - - DBD ID - - Identifier of a predicted transcription factor from the DBD database. - beta12orEarlier - - - - - - - - - - - Oligonucleotide probe annotation - - - - - - - - General annotation on an oligonucleotide probe, or a set of probes. 
- beta12orEarlier - Oligonucleotide probe sets annotation - - - - - - - - - - Oligonucleotide ID - - - Identifier of an oligonucleotide from a database. - beta12orEarlier - - - - - - - - - - - dbProbe ID - - Identifier of an oligonucleotide probe from the dbProbe database. - beta12orEarlier - - - - - - - - - - - Dinucleotide property - - beta12orEarlier - Physicochemical property data for one or more dinucleotides. - - - - - - - - - - DiProDB ID - - beta12orEarlier - Identifier of an dinucleotide property from the DiProDB database. - - - - - - - - - - - Protein features report (disordered structure) - - 1.8 - true - beta12orEarlier - disordered structure in a protein. - - - - - - - - - - Protein ID (DisProt) - - DisProt ID - beta12orEarlier - Unique identifier for a protein from the DisProt database. - - - - - - - - - - - Embryo report - - Annotation on an embryo or concerning embryological development. - true - Embryo annotation - beta12orEarlier - 1.5 - - - - - - - - - - Ensembl transcript ID - - - beta12orEarlier - Transcript ID (Ensembl) - Unique identifier for a gene transcript from the Ensembl database. - - - - - - - - - - - Inhibitor annotation - - 1.4 - beta12orEarlier - An informative report on one or more small molecules that are enzyme inhibitors. - true - - - - - - - - - - Promoter ID - - - beta12orEarlier - An identifier of a promoter of a gene that is catalogued in a database. - Moby:GeneAccessionList - - - - - - - - - - - EST accession - - Identifier of an EST sequence. - beta12orEarlier - - - - - - - - - - - COGEME EST ID - - beta12orEarlier - Identifier of an EST sequence from the COGEME database. - - - - - - - - - - - COGEME unisequence ID - - Identifier of a unisequence from the COGEME database. - A unisequence is a single sequence assembled from ESTs. - beta12orEarlier - - - - - - - - - - - Protein family ID (GeneFarm) - - GeneFarm family ID - beta12orEarlier - Accession number of an entry (family) from the TIGRFam database. 
- - - - - - - - - - - Family name - - beta12orEarlier - The name of a family of organism. - - - - - - - - - - - Genus name (virus) - - true - The name of a genus of viruses. - beta13 - beta12orEarlier - - - - - - - - - - Family name (virus) - - beta13 - The name of a family of viruses. - true - beta12orEarlier - - - - - - - - - - Database name (SwissRegulon) - - true - beta13 - The name of a SwissRegulon database. - beta12orEarlier - - - - - - - - - - Sequence feature ID (SwissRegulon) - - beta12orEarlier - A feature identifier as used in the SwissRegulon database. - This can be name of a gene, the ID of a TFBS, or genomic coordinates in form "chr:start..end". - - - - - - - - - - - FIG ID - - A FIG ID consists of four parts: a prefix, genome id, locus type and id number. - A unique identifier of gene in the NMPDR database. - beta12orEarlier - - - - - - - - - - - Gene ID (Xenbase) - - A unique identifier of gene in the Xenbase database. - beta12orEarlier - - - - - - - - - - - Gene ID (Genolist) - - beta12orEarlier - A unique identifier of gene in the Genolist database. - - - - - - - - - - - Gene name (Genolist) - - beta12orEarlier - true - Genolist gene name - 1.3 - Name of an entry (gene) from the Genolist genes database. - - - - - - - - - - ABS ID - - ABS identifier - beta12orEarlier - Identifier of an entry (promoter) from the ABS database. - - - - - - - - - - - AraC-XylS ID - - Identifier of a transcription factor from the AraC-XylS database. - beta12orEarlier - - - - - - - - - - - Gene name (HUGO) - - beta12orEarlier - beta12orEarlier - true - Name of an entry (gene) from the HUGO database. - - - - - - - - - - Locus ID (PseudoCAP) - - beta12orEarlier - Identifier of a locus from the PseudoCAP database. - - - - - - - - - - - Locus ID (UTR) - - beta12orEarlier - Identifier of a locus from the UTR database. - - - - - - - - - - - MonosaccharideDB ID - - Unique identifier of a monosaccharide from the MonosaccharideDB database. 
- beta12orEarlier - - - - - - - - - - - Database name (CMD) - - beta12orEarlier - true - The name of a subdivision of the Collagen Mutation Database (CMD) database. - beta13 - - - - - - - - - - Database name (Osteogenesis) - - beta12orEarlier - true - beta13 - The name of a subdivision of the Osteogenesis database. - - - - - - - - - - Genome identifier - - An identifier of a particular genome. - beta12orEarlier - - - - - - - - - - - GenomeReviews ID - - beta12orEarlier - An identifier of a particular genome. - - - - - - - - - - - GlycoMap ID - - [0-9]+ - beta12orEarlier - Identifier of an entry from the GlycosciencesDB database. - - - - - - - - - - - Carbohydrate conformational map - - beta12orEarlier - A conformational energy map of the glycosidic linkages in a carbohydrate molecule. - - - - - - - - - - Gene features report (intron) - - introns in a nucleotide sequences. - true - beta12orEarlier - 1.8 - - - - - - - - - - Transcription factor name - - - The name of a transcription factor. - beta12orEarlier - - - - - - - - - - - TCID - - Identifier of a membrane transport proteins from the transport classification database (TCDB). - beta12orEarlier - - - - - - - - - - - Pfam domain name - - beta12orEarlier - Name of a domain from the Pfam database. - PF[0-9]{5} - - - - - - - - - - - Pfam clan ID - - beta12orEarlier - CL[0-9]{4} - Accession number of a Pfam clan. - - - - - - - - - - - Gene ID (VectorBase) - - VectorBase ID - beta12orEarlier - Identifier for a gene from the VectorBase database. - - - - - - - - - - - UTRSite ID - - Identifier of an entry from the UTRSite database of regulatory motifs in eukaryotic UTRs. - beta12orEarlier - - - - - - - - - - - Sequence signature report - - - - - - - - Sequence motif report - Sequence profile report - An informative report about a specific or conserved pattern in a molecular sequence, such as its context in genes or proteins, its role, origin or method of construction, etc. 
- beta12orEarlier - - - - - - - - - - Locus annotation - - Locus report - true - beta12orEarlier - An informative report on a particular locus. - beta12orEarlier - - - - - - - - - - Protein name (UniProt) - - Official name of a protein as used in the UniProt database. - beta12orEarlier - - - - - - - - - - - Term ID list - - One or more terms from one or more controlled vocabularies which are annotations on an entity. - beta12orEarlier - true - The concepts are typically provided as a persistent identifier or some other link the source ontologies. Evidence of the validity of the annotation might be included. - 1.5 - - - - - - - - - - HAMAP ID - - Name of a protein family from the HAMAP database. - beta12orEarlier - - - - - - - - - - - Identifier with metadata - - Basic information concerning an identifier of data (typically including the identifier itself). For example, a gene symbol with information concerning its provenance. - beta12orEarlier - true - 1.12 - - - - - - - - - - Gene symbol annotation - - true - beta12orEarlier - Annotation about a gene symbol. - beta12orEarlier - - - - - - - - - - Transcript ID - - - - - - - - - Identifier of a RNA transcript. - beta12orEarlier - - - - - - - - - - - HIT ID - - Identifier of an RNA transcript from the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HIX ID - - A unique identifier of gene cluster in the H-InvDB database. - beta12orEarlier - - - - - - - - - - - HPA antibody id - - beta12orEarlier - Identifier of a antibody from the HPA database. - - - - - - - - - - - IMGT/HLA ID - - Identifier of a human major histocompatibility complex (HLA) or other protein from the IMGT/HLA database. - beta12orEarlier - - - - - - - - - - - Gene ID (JCVI) - - A unique identifier of gene assigned by the J. Craig Venter Institute (JCVI). - beta12orEarlier - - - - - - - - - - - Kinase name - - beta12orEarlier - The name of a kinase protein. 
- - - - - - - - - - - ConsensusPathDB entity ID - - - Identifier of a physical entity from the ConsensusPathDB database. - beta12orEarlier - - - - - - - - - - - ConsensusPathDB entity name - - - beta12orEarlier - Name of a physical entity from the ConsensusPathDB database. - - - - - - - - - - - CCAP strain number - - The number of a strain of algae and protozoa from the CCAP database. - beta12orEarlier - - - - - - - - - - - Stock number - - - beta12orEarlier - An identifier of stock from a catalogue of biological resources. - - - - - - - - - - - Stock number (TAIR) - - beta12orEarlier - A stock number from The Arabidopsis information resource (TAIR). - - - - - - - - - - - REDIdb ID - - beta12orEarlier - Identifier of an entry from the RNA editing database (REDIdb). - - - - - - - - - - - SMART domain name - - Name of a domain from the SMART database. - beta12orEarlier - - - - - - - - - - - Protein family ID (PANTHER) - - beta12orEarlier - Panther family ID - Accession number of an entry (family) from the PANTHER database. - - - - - - - - - - - RNAVirusDB ID - - beta12orEarlier - Could list (or reference) other taxa here from https://www.phenoscape.org/wiki/Taxonomic_Rank_Vocabulary. - A unique identifier for a virus from the RNAVirusDB database. - - - - - - - - - - - Virus ID - - - beta12orEarlier - An accession of annotation on a (group of) viruses (catalogued in a database). - - - - - - - - - - - NCBI Genome Project ID - - An identifier of a genome project assigned by NCBI. - beta12orEarlier - - - - - - - - - - - NCBI genome accession - - A unique identifier of a whole genome assigned by the NCBI. - beta12orEarlier - - - - - - - - - - - Sequence profile data - - 1.8 - Data concerning, extracted from, or derived from the analysis of a sequence profile, such as its name, length, technical details about the profile or it's construction, the biological role or annotation, and so on. 
- true - beta12orEarlier - - - - - - - - - - Protein ID (TopDB) - - beta12orEarlier - TopDB ID - Unique identifier for a membrane protein from the TopDB database. - - - - - - - - - - - Gel ID - - Gel identifier - Identifier of a two-dimensional (protein) gel. - beta12orEarlier - - - - - - - - - - - Reference map name (SWISS-2DPAGE) - - - beta12orEarlier - Name of a reference map gel from the SWISS-2DPAGE database. - - - - - - - - - - - Protein ID (PeroxiBase) - - PeroxiBase ID - beta12orEarlier - Unique identifier for a peroxidase protein from the PeroxiBase database. - - - - - - - - - - - SISYPHUS ID - - beta12orEarlier - Identifier of an entry from the SISYPHUS database of tertiary structure alignments. - - - - - - - - - - - ORF ID - - - beta12orEarlier - Accession of an open reading frame (catalogued in a database). - - - - - - - - - - - ORF identifier - - An identifier of an open reading frame. - beta12orEarlier - - - - - - - - - - - Linucs ID - - Identifier of an entry from the GlycosciencesDB database. - beta12orEarlier - - - - - - - - - - - Protein ID (LGICdb) - - beta12orEarlier - LGICdb ID - Unique identifier for a ligand-gated ion channel protein from the LGICdb database. - - - - - - - - - - - MaizeDB ID - - beta12orEarlier - Identifier of an EST sequence from the MaizeDB database. - - - - - - - - - - - Gene ID (MfunGD) - - beta12orEarlier - A unique identifier of gene in the MfunGD database. - - - - - - - - - - - Orpha number - - - - - - - - beta12orEarlier - An identifier of a disease from the Orpha database. - - - - - - - - - - - Protein ID (EcID) - - beta12orEarlier - Unique identifier for a protein from the EcID database. - - - - - - - - - - - Clone ID (RefSeq) - - - A unique identifier of a cDNA molecule catalogued in the RefSeq database. - beta12orEarlier - - - - - - - - - - - Protein ID (ConoServer) - - beta12orEarlier - Unique identifier for a cone snail toxin protein from the ConoServer database. 
- - - - - - - - - - - GeneSNP ID - - Identifier of a GeneSNP database entry. - beta12orEarlier - - - - - - - - - - - Lipid identifier - - - - - - - - - - - - - - Identifier of a lipid. - beta12orEarlier - - - - - - - - - - - Databank - - true - beta12orEarlier - A flat-file (textual) data archive. - beta12orEarlier - - - - - - - - - - Web portal - - A web site providing data (web pages) on a common theme to a HTTP client. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Gene ID (VBASE2) - - Identifier for a gene from the VBASE2 database. - beta12orEarlier - VBASE2 ID - - - - - - - - - - - DPVweb ID - - DPVweb virus ID - beta12orEarlier - A unique identifier for a virus from the DPVweb database. - - - - - - - - - - - Pathway ID (BioSystems) - - beta12orEarlier - Identifier of a pathway from the BioSystems pathway database. - [0-9]+ - - - - - - - - - - - Experimental data (proteomics) - - true - Data concerning a proteomics experiment. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Abstract - - beta12orEarlier - An abstract of a scientific article. - - - - - - - - - - Lipid structure - - beta12orEarlier - 3D coordinate and associated data for a lipid structure. - - - - - - - - - - Drug structure - - beta12orEarlier - 3D coordinate and associated data for the (3D) structure of a drug. - - - - - - - - - - Toxin structure - - 3D coordinate and associated data for the (3D) structure of a toxin. - beta12orEarlier - - - - - - - - - - Position-specific scoring matrix - - - beta12orEarlier - PSSM - A simple matrix of numbers, where each value (or column of values) is derived derived from analysis of the corresponding position in a sequence alignment. - - - - - - - - - - Distance matrix - - A matrix of distances between molecular entities, where a value (distance) is (typically) derived from comparison of two entities and reflects their similarity. 
- beta12orEarlier - - - - - - - - - - Structural distance matrix - - Distances (values representing similarity) between a group of molecular structures. - beta12orEarlier - - - - - - - - - - Article metadata - - true - beta12orEarlier - Bibliographic data concerning scientific article(s). - 1.5 - - - - - - - - - - Ontology concept - - beta12orEarlier - This includes any fields from the concept definition such as concept name, definition, comments and so on. - A concept from a biological ontology. - - - - - - - - - - Codon usage bias - - A numerical measure of differences in the frequency of occurrence of synonymous codons in DNA sequences. - beta12orEarlier - - - - - - - - - - Northern blot report - - true - beta12orEarlier - 1.8 - Northern Blot experiments. - - - - - - - - - - Nucleic acid features report (VNTR) - - 1.8 - beta12orEarlier - true - variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - - - - - - - - - - Nucleic acid features report (microsatellite) - - true - microsatellite polymorphism in a DNA sequence. - 1.8 - beta12orEarlier - - - - - - - - - - - Nucleic acid features report (RFLP) - - beta12orEarlier - true - 1.8 - restriction fragment length polymorphisms (RFLP) in a DNA sequence. - - - - - - - - - - Radiation hybrid map - - The radiation method can break very closely linked markers providing a more detailed map. Most genetic markers and subsequences may be located to a defined map position and with a more precise estimates of distance than a linkage map. - A map showing distance between genetic markers estimated by radiation-induced breaks in a chromosome. - beta12orEarlier - RH map - - - - - - - - - - ID list - - A simple list of data identifiers (such as database accessions), possibly with additional basic information on the addressed data. - beta12orEarlier - - - - - - - - - - Phylogenetic gene frequencies data - - beta12orEarlier - Gene frequencies data that may be read during phylogenetic tree calculation. 
- - - - - - - - - - Sequence set (polymorphic) - - beta13 - beta12orEarlier - true - A set of sub-sequences displaying some type of polymorphism, typically indicating the sequence in which they occur, their position and other metadata. - - - - - - - - - - DRCAT resource - - 1.5 - An entry (resource) from the DRCAT bioinformatics resource catalogue. - beta12orEarlier - true - - - - - - - - - - Protein complex - - beta12orEarlier - 3D coordinate and associated data for a multi-protein complex; two or more polypeptides chains in a stable, functional association with one another. - - - - - - - - - - Protein structural motif - - beta12orEarlier - 3D coordinate and associated data for a protein (3D) structural motif; any group of contiguous or non-contiguous amino acid residues but typically those forming a feature with a structural or functional role. - - - - - - - - - - Lipid report - - beta12orEarlier - Annotation on or information derived from one or more specific lipid 3D structure(s). - - - - - - - - - - Secondary structure image - - 1.4 - beta12orEarlier - Image of one or more molecular secondary structures. - true - - - - - - - - - - Secondary structure report - - Secondary structure-derived report - beta12orEarlier - true - An informative report on general information, properties or features of one or more molecular secondary structures. - 1.5 - - - - - - - - - - DNA features - - beta12orEarlier - DNA sequence-specific feature annotation (not in a feature table). - true - beta12orEarlier - - - - - - - - - - RNA features report - - true - beta12orEarlier - 1.5 - Features concerning RNA or regions of DNA that encode an RNA molecule. - RNA features - Nucleic acid features (RNA features) - - - - - - - - - - Plot - - beta12orEarlier - Biological data that has been plotted as a graph of some type. - - - - - - - - - - Nucleic acid features report (polymorphism) - - true - DNA polymorphism. 
- beta12orEarlier - - - - - - - - - - Protein sequence record - - - A protein sequence and associated metadata. - beta12orEarlier - Sequence record (protein) - - - - - - - - - - Nucleic acid sequence record - - - RNA sequence record - Nucleotide sequence record - A nucleic acid sequence and associated metadata. - beta12orEarlier - DNA sequence record - Sequence record (nucleic acid) - - - - - - - - - - Protein sequence record (full) - - A protein sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - 1.8 - beta12orEarlier - true - - - - - - - - - - Nucleic acid sequence record (full) - - true - A nucleic acid sequence and comprehensive metadata (such as a feature table), typically corresponding to a full entry from a molecular sequence database. - beta12orEarlier - 1.8 - - - - - - - - - - Biological model accession - - - beta12orEarlier - Accession of a mathematical model, typically an entry from a database. - - - - - - - - - - - Cell type name - - - The name of a type or group of cells. - beta12orEarlier - - - - - - - - - - - Cell type accession - - - Cell type ID - beta12orEarlier - Accession of a type or group of cells (catalogued in a database). - - - - - - - - - - - Compound accession - - - Small molecule accession - Accession of an entry from a database of chemicals. - beta12orEarlier - Chemical compound accession - - - - - - - - - - - Drug accession - - - Accession of a drug. - beta12orEarlier - - - - - - - - - - - Toxin name - - - Name of a toxin. - beta12orEarlier - - - - - - - - - - - Toxin accession - - - beta12orEarlier - Accession of a toxin (catalogued in a database). - - - - - - - - - - - Monosaccharide accession - - - Accession of a monosaccharide (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Drug name - - - beta12orEarlier - Common name of a drug. 
- - - - - - - - - - - Carbohydrate accession - - - Accession of an entry from a database of carbohydrates. - beta12orEarlier - - - - - - - - - - - Molecule accession - - - Accession of a specific molecule (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Data resource definition accession - - - beta12orEarlier - Accession of a data definition (catalogued in a database). - - - - - - - - - - - Genome accession - - - An accession of a particular genome (in a database). - beta12orEarlier - - - - - - - - - - - Map accession - - - An accession of a map of a molecular sequence (deposited in a database). - beta12orEarlier - - - - - - - - - - - Lipid accession - - - beta12orEarlier - Accession of an entry from a database of lipids. - - - - - - - - - - - Peptide ID - - - beta12orEarlier - Accession of a peptide deposited in a database. - - - - - - - - - - - Protein accession - - - Protein accessions - beta12orEarlier - Accession of a protein deposited in a database. - - - - - - - - - - - Organism accession - - - An accession of annotation on a (group of) organisms (catalogued in a database). - beta12orEarlier - - - - - - - - - - - Organism name - - - Moby:Organism_Name - Moby:OrganismsShortName - Moby:OccurrenceRecord - Moby:BriefOccurrenceRecord - Moby:FirstEpithet - Moby:InfraspecificEpithet - beta12orEarlier - Moby:OrganismsLongName - The name of an organism (or group of organisms). - - - - - - - - - - - Protein family accession - - - beta12orEarlier - Accession of a protein family (that is deposited in a database). - - - - - - - - - - - Transcription factor accession - - - - beta12orEarlier - Accession of an entry from a database of transcription factors or binding sites. - - - - - - - - - - - Strain accession - - - - - - - - - beta12orEarlier - Identifier of a strain of an organism variant, typically a plant, virus or bacterium. - - - - - - - - - - - Virus identifier - - An accession of annotation on a (group of) viruses (catalogued in a database). 
- beta12orEarlier - - - - - - - - - - - Sequence features metadata - - beta12orEarlier - Metadata on sequence features. - - - - - - - - - - Gramene identifier - - beta12orEarlier - Identifier of a Gramene database entry. - - - - - - - - - - - DDBJ accession - - beta12orEarlier - DDBJ accession number - DDBJ identifier - DDBJ ID - An identifier of an entry from the DDBJ sequence database. - - - - - - - - - - - ConsensusPathDB identifier - - beta12orEarlier - An identifier of an entity from the ConsensusPathDB database. - - - - - - - - - - - Sequence data - - This is a broad data type and is used a placeholder for other, more specific types. - 1.8 - beta12orEarlier - true - Data concerning, extracted from, or derived from the analysis of molecular sequence(s). - - - - - - - - - - Codon usage - - beta12orEarlier - true - beta13 - Data concerning codon usage. - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Article report - - beta12orEarlier - 1.5 - Data derived from the analysis of a scientific text such as a full text article from a scientific journal. - true - - - - - - - - - - Sequence report - - An informative report of information about molecular sequence(s), including basic information (metadata), and reports generated from molecular sequence analysis, including positional features and non-positional properties. - beta12orEarlier - Sequence-derived report - - - - - - - - - - Protein secondary structure report - - An informative report about the properties or features of one or more protein secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plot - - - A Hopp and Woods plot of predicted antigenicity of a peptide or protein. - beta12orEarlier - - - - - - - - - - Nucleic acid melting curve - - - Shows the proportion of nucleic acid which are double-stranded versus temperature. - A melting curve of a double-stranded nucleic acid molecule (DNA or DNA/RNA). 
- beta12orEarlier - - - - - - - - - - Nucleic acid probability profile - - A probability profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - beta12orEarlier - Shows the probability of a base pair not being melted (i.e. remaining as double-stranded DNA) at a specified temperature - - - - - - - - - - Nucleic acid temperature profile - - A temperature profile of a double-stranded nucleic acid molecule (DNA or DNA/RNA). - Plots melting temperature versus base position. - beta12orEarlier - Melting map - - - - - - - - - - Gene regulatory network report - - 1.8 - A report typically including a map (diagram) of a gene regulatory network. - true - beta12orEarlier - - - - - - - - - - 2D PAGE gel report - - An informative report on a two-dimensional (2D PAGE) gel. - 2D PAGE image report - 1.8 - true - 2D PAGE gel annotation - beta12orEarlier - 2D PAGE image annotation - - - - - - - - - - Oligonucleotide probe sets annotation - - beta12orEarlier - 1.14 - true - General annotation on a set of oligonucleotide probes, such as the gene name with which the probe set is associated and which probes belong to the set. - - - - - - - - - - Microarray image - - 1.5 - beta12orEarlier - Gene expression image - An image from a microarray experiment which (typically) allows a visualisation of probe hybridisation and gene-expression data. - true - - - - - - - - - - Image - - http://semanticscience.org/resource/SIO_000081 - Biological or biomedical data has been rendered into an image, typically for display on screen. - http://semanticscience.org/resource/SIO_000079 - Image data - beta12orEarlier - - - - - - - - - - Sequence image - - - Image of a molecular sequence, possibly with sequence features or properties shown. - beta12orEarlier - - - - - - - - - - Protein hydropathy data - - Protein hydropathy report - A report on protein properties concerning hydropathy. 
- beta12orEarlier - - - - - - - - - - Workflow data - - beta12orEarlier - beta13 - Data concerning a computational workflow. - true - - - - - - - - - - Workflow - - true - beta12orEarlier - 1.5 - A computational workflow. - - - - - - - - - - Secondary structure data - - beta13 - true - beta12orEarlier - Data concerning molecular secondary structure data. - - - - - - - - - - Protein sequence (raw) - - - Raw protein sequence - beta12orEarlier - Raw sequence (protein) - A raw protein sequence (string of characters). - - - - - - - - - - Nucleic acid sequence (raw) - - - Nucleic acid raw sequence - beta12orEarlier - Nucleotide sequence (raw) - Raw sequence (nucleic acid) - A raw nucleic acid sequence. - - - - - - - - - - Protein sequence - - One or more protein sequences, possibly with associated annotation. - Protein sequences - beta12orEarlier - http://purl.org/biotop/biotop.owl#AminoAcidSequenceInformation - - - - - - - - - - Nucleic acid sequence - - One or more nucleic acid sequences, possibly with associated annotation. - beta12orEarlier - DNA sequence - Nucleotide sequence - Nucleotide sequences - Nucleic acid sequences - http://purl.org/biotop/biotop.owl#NucleotideSequenceInformation - - - - - - - - - - Reaction data - - Enzyme kinetics annotation - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Reaction annotation - Data concerning a biochemical reaction, typically data and more general annotation on the kinetics of enzyme-catalysed reaction. - - - - - - - - - - Peptide property - - beta12orEarlier - Peptide data - Data concerning small peptides. - - - - - - - - - - Protein classification - - This is a broad data type and is used a placeholder for other, more specific types. - Protein classification data - An informative report concerning the classification of protein sequences or structures. 
- beta12orEarlier - - - - - - - - - Sequence motif data - - true - 1.8 - Data concerning specific or conserved pattern in molecular sequences. - beta12orEarlier - This is a broad data type and is used a placeholder for other, more specific types. - - - - - - - - - - Sequence profile data - - beta12orEarlier - true - This is a broad data type and is used a placeholder for other, more specific types. - beta13 - Data concerning models representing a (typically multiple) sequence alignment. - - - - - - - - - - Pathway or network data - - Data concerning a specific biological pathway or network. - beta13 - true - beta12orEarlier - - - - - - - - - - - Pathway or network report - - - - - - - - beta12orEarlier - An informative report concerning or derived from the analysis of a biological pathway or network, such as a map (diagram) or annotation. - - - - - - - - - - Nucleic acid thermodynamic data - - Nucleic acid property (thermodynamic or kinetic) - A thermodynamic or kinetic property of a nucleic acid molecule. - Nucleic acid thermodynamic property - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification of nucleic acid sequences or structures. - Nucleic acid classification data - - - - - - - - - Classification report - - This can include an entire classification, components such as classifiers, assignments of entities to a classification and so on. - beta12orEarlier - true - Classification data - A report on a classification of molecular sequences, structures or other entities. - 1.5 - - - - - - - - - - Protein features report (key folding sites) - - beta12orEarlier - key residues involved in protein folding. 
- 1.8 - true - - - - - - - - - - Protein geometry report - - Torsion angle data - beta12orEarlier - Geometry data for a protein structure, for example bond lengths, bond angles, torsion angles, chiralities, planaraties etc. - - - - - - - - - - Protein structure image - - - An image of protein structure. - beta12orEarlier - Structure image (protein) - - - - - - - - - - Phylogenetic character weights - - Weights for sequence positions or characters in phylogenetic analysis where zero is defined as unweighted. - beta12orEarlier - - - - - - - - - - Annotation track - - beta12orEarlier - Genomic track - Annotation of one particular positional feature on a biomolecular (typically genome) sequence, suitable for import and display in a genome browser. - Genome annotation track - Genome-browser track - Genome track - Sequence annotation track - - - - - - - - - - UniProt accession - - - - - - - - UniProtKB accession number - beta12orEarlier - P43353|Q7M1G0|Q9C199|A5A6J6 - UniProt entry accession - [OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2} - Swiss-Prot entry accession - TrEMBL entry accession - Accession number of a UniProt (protein sequence) database entry. - UniProtKB accession - UniProt accession number - - - - - - - - - - - NCBI genetic code ID - - - Identifier of a genetic code in the NCBI list of genetic codes. - [1-9][0-9]? - 16 - beta12orEarlier - - - - - - - - - - - Ontology concept identifier - - - - - - - - Identifier of a concept in an ontology of biological or bioinformatics concepts and relations. - beta12orEarlier - - - - - - - - - - - GO concept name (biological process) - - true - The name of a concept for a biological process from the GO ontology. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GO concept name (molecular function) - - true - beta12orEarlier - The name of a concept for a molecular function from the GO ontology. 
- beta12orEarlier - - - - - - - - - - Taxonomy - - - - - - - - This is a broad data type and is used a placeholder for other, more specific types. - beta12orEarlier - Data concerning the classification, identification and naming of organisms. - Taxonomic data - - - - - - - - - - Protein ID (EMBL/GenBank/DDBJ) - - beta13 - EMBL/GENBANK/DDBJ coding feature protein identifier, issued by International collaborators. - This qualifier consists of a stable ID portion (3+5 format with 3 position letters and 5 numbers) plus a version number after the decimal point. When the protein sequence encoded by the CDS changes, only the version number of the /protein_id value is incremented; the stable part of the /protein_id remains unchanged and as a result will permanently be associated with a given protein; this qualifier is valid only on CDS features which translate into a valid protein. - - - - - - - - - - - Core data - - Core data entities typically have a format and may be identified by an accession number. - A type of data that (typically) corresponds to entries from the primary biological databases and which is (typically) the primary input or output of a tool, i.e. the data the tool processes or generates, as distinct from metadata and identifiers which describe and identify such core data, parameters that control the behaviour of tools, reports of derivative data generated by tools and annotation. - 1.5 - true - beta13 - - - - - - - - - - Sequence feature identifier - - - - - - - - beta13 - Name or other identifier of molecular sequence feature(s). - - - - - - - - - - - Structure identifier - - - - - - - - beta13 - An identifier of a molecular tertiary structure, typically an entry from a structure database. - - - - - - - - - - - Matrix identifier - - - - - - - - An identifier of an array of numerical values, such as a comparison matrix. 
- beta13 - - - - - - - - - - - Protein sequence composition - - beta13 - 1.8 - true - A report (typically a table) on character or word composition / frequency of protein sequence(s). - - - - - - - - - - Nucleic acid sequence composition (report) - - 1.8 - A report (typically a table) on character or word composition / frequency of nucleic acid sequence(s). - true - beta13 - - - - - - - - - - Protein domain classification node - - beta13 - A node from a classification of protein structural domain(s). - true - 1.5 - - - - - - - - - - CAS number - - beta13 - CAS registry number - Unique numerical identifier of chemicals in the scientific literature, as assigned by the Chemical Abstracts Service. - - - - - - - - - - - ATC code - - Unique identifier of a drug conforming to the Anatomical Therapeutic Chemical (ATC) Classification System, a drug classification system controlled by the WHO Collaborating Centre for Drug Statistics Methodology (WHOCC). - beta13 - - - - - - - - - - - UNII - - beta13 - A unique, unambiguous, alphanumeric identifier of a chemical substance as catalogued by the Substance Registration System of the Food and Drug Administration (FDA). - Unique Ingredient Identifier - - - - - - - - - - - Geotemporal metadata - - 1.5 - beta13 - true - Basic information concerning geographical location or time. - - - - - - - - - - System metadata - - Metadata concerning the software, hardware or other aspects of a computer system. - beta13 - - - - - - - - - - Sequence feature name - - - A name of a sequence feature, e.g. the name of a feature to be displayed to an end-user. - beta13 - - - - - - - - - - - Experimental measurement - - beta13 - Raw data such as measurements or other results from laboratory experiments, as generated from laboratory hardware. - Experimental measurement data - Measurement - This is a broad data type and is used a placeholder for other, more specific types. 
It is primarily intended to help navigation of EDAM and would not typically be used for annotation. - Measured data - Experimentally measured data - Measurement metadata - Measurement data - Raw experimental data - - - - - - - - - - Raw microarray data - - - beta13 - Raw data (typically MIAME-compliant) for hybridisations from a microarray experiment. - Such data as found in Affymetrix CEL or GPR files. - - - - - - - - - - Processed microarray data - - - - - - - - Data generated from processing and analysis of probe set data from a microarray experiment. - Gene annotation (expression) - Microarray probe set data - beta13 - Gene expression report - Such data as found in Affymetrix .CHP files or data from other software such as RMA or dChip. - - - - - - - - - - Gene expression matrix - - - This combines data from all hybridisations. - beta13 - Normalised microarray data - The final processed (normalised) data for a set of hybridisations in a microarray experiment. - Gene expression data matrix - - - - - - - - - - Sample annotation - - Annotation on a biological sample, for example experimental factors and their values. - This might include compound and dose in a dose response experiment. - beta13 - - - - - - - - - - Microarray metadata - - This might include gene identifiers, genomic coordinates, probe oligonucleotide sequences etc. - Annotation on the array itself used in a microarray experiment. - beta13 - - - - - - - - - - Microarray protocol annotation - - true - This might describe e.g. the normalisation methods used to process the raw data. - beta13 - 1.8 - Annotation on laboratory and/or data processing protocols used in an microarray experiment. - - - - - - - - - - Microarray hybridisation data - - Data concerning the hybridisations measured during a microarray experiment. - beta13 - - - - - - - - - - Protein features report (topological domains) - - 1.8 - beta13 - topological domains such as cytoplasmic regions in a protein. 
- true - - - - - - - - - - Sequence features (compositionally-biased regions) - - 1.5 - beta13 - true - A report of regions in a molecular sequence that are biased to certain characters. - - - - - - - - - - Nucleic acid features (difference and change) - - beta13 - A report on features in a nucleic acid sequence that indicate changes to or differences between sequences. - 1.5 - true - - - - - - - - - - Nucleic acid features report (expression signal) - - true - beta13 - regions within a nucleic acid sequence containing a signal that alters a biological function. - 1.8 - - - - - - - - - - Nucleic acid features report (binding) - - nucleic acids binding to some other molecule. - 1.8 - true - beta13 - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes). - - - - - - - - - - Nucleic acid repeats (report) - - true - repetitive elements within a nucleic acid sequence. - 1.8 - beta13 - - - - - - - - - - Nucleic acid features report (replication and recombination) - - beta13 - true - 1.8 - DNA replication or recombination. - - - - - - - - - - Nucleic acid structure report - - - A report on regions within a nucleic acid sequence which form secondary or tertiary (3D) structures. - Stem loop (report) - d-loop (report) - Nucleic acid features (structure) - Quadruplexes (report) - beta13 - - - - - - - - - - Protein features report (repeats) - - 1.8 - short repetitive subsequences (repeat sequences) in a protein sequence. - beta13 - true - - - - - - - - - - Sequence motif matches (protein) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more protein sequences. - 1.8 - beta13 - true - - - - - - - - - - Sequence motif matches (nucleic acid) - - Report on the location of matches to profiles, motifs (conserved or functional patterns) or other signatures in one or more nucleic acid sequences. 
- beta13 - true - 1.8 - - - - - - - - - - Nucleic acid features (d-loop) - - beta13 - true - 1.5 - A report on displacement loops in a mitochondrial DNA sequence. - A displacement loop is a region of mitochondrial DNA in which one of the strands is displaced by an RNA molecule. - - - - - - - - - - Nucleic acid features (stem loop) - - beta13 - true - A report on stem loops in a DNA sequence. - 1.5 - A stem loop is a hairpin structure; a double-helical structure formed when two complementary regions of a single strand of RNA or DNA molecule form base-pairs. - - - - - - - - - - Gene transcript report - - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Nucleic acid features (mRNA features) - beta13 - Transcript (report) - mRNA features - Gene transcript annotation - Clone or EST (report) - mRNA (report) - An informative report on features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. This includes reports on a specific gene transcript, clone or EST. - - - - - - - - - - - Nucleic acid features report (signal or transit peptide) - - true - coding sequences for a signal or transit peptide. - 1.8 - beta13 - - - - - - - - - - Non-coding RNA - - beta13 - true - features of non-coding or functional RNA molecules, including tRNA and rRNA. - 1.8 - - - - - - - - - - Transcriptional features (report) - - 1.5 - true - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - Features concerning transcription of DNA into RNA including the regulation of transcription. - beta13 - - - - - - - - - - Nucleic acid features report (STS) - - sequence tagged sites (STS) in nucleic acid sequences. 
- 1.8 - true - beta13 - - - - - - - - - - Nucleic acid features (immunoglobulin gene structure) - - true - beta13 - 1.5 - A report on predicted or actual immunoglobulin gene structure including constant, switch and variable regions and diversity, joining and variable segments. - - - - - - - - - - SCOP class - - 1.5 - beta13 - true - Information on a 'class' node from the SCOP database. - - - - - - - - - - SCOP fold - - beta13 - Information on a 'fold' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP superfamily - - beta13 - Information on a 'superfamily' node from the SCOP database. - 1.5 - true - - - - - - - - - - SCOP family - - 1.5 - true - Information on a 'family' node from the SCOP database. - beta13 - - - - - - - - - - SCOP protein - - Information on a 'protein' node from the SCOP database. - true - beta13 - 1.5 - - - - - - - - - - SCOP species - - 1.5 - true - beta13 - Information on a 'species' node from the SCOP database. - - - - - - - - - - Mass spectrometry experiment - - 1.8 - true - mass spectrometry experiments. - beta13 - - - - - - - - - - Gene family report - - An informative report on a particular family of genes, typically a set of genes with similar sequence that originate from duplication of a common ancestor gene, or any other classification of nucleic acid sequences or structures that reflects gene structure. - This includes reports on on gene homologues between species. - beta13 - Gene annotation (homology information) - Homology information - Gene annotation (homology) - Nucleic acid classification - Gene family annotation - Gene homology (report) - - - - - - - - - - Protein image - - beta13 - An image of a protein. - - - - - - - - - - Protein alignment - - An alignment of protein sequences and/or structures. - beta13 - - - - - - - - - - NGS experiment - - 1.8 - 1.0 - sequencing experiment, including samples, sampling, preparation, sequencing, and analysis. 
- true - - - - - - - - - - Sequence assembly report - - An informative report about a DNA sequence assembly. - 1.1 - This might include an overall quality assessment of the assembly and summary statistics including counts, average length and number of bases for reads, matches and non-matches, contigs, reads in pairs etc. - Assembly report - - - - - - - - - - Genome index - - 1.1 - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. - An index of a genome sequence. - - - - - - - - - - GWAS report - - 1.8 - 1.1 - Report concerning genome-wide association study experiments. - true - Genome-wide association study - - - - - - - - - - Cytoband position - - 1.2 - The position of a cytogenetic band in a genome. - Information might include start and end position in a chromosome sequence, chromosome identifier, name of band and so on. - - - - - - - - - - Cell type ontology ID - - - CL ID - Cell type ontology concept ID. - CL_[0-9]{7} - 1.2 - beta12orEarlier - - - - - - - - - - - Kinetic model - - 1.2 - Mathematical model of a network, that contains biochemical kinetics. - - - - - - - - - - COSMIC ID - - COSMIC identifier - cosmic ID - Identifier of a COSMIC database entry. - cosmic identifier - cosmic id - 1.3 - - - - - - - - - - - HGMD ID - - Identifier of a HGMD database entry. - hgmd ID - hgmd identifier - beta12orEarlier - hgmd id - HGMD identifier - - - - - - - - - - - Sequence assembly ID - - Sequence assembly version - Unique identifier of sequence assembly. - 1.3 - - - - - - - - - - - Sequence feature type - - true - A label (text token) describing a type of sequence feature such as gene, transcript, cds, exon, repeat, simple, misc, variation, somatic variation, structural variation, somatic structural variation, constrained or regulatory. - 1.3 - 1.5 - - - - - - - - - - Gene homology (report) - - beta12orEarlier - true - An informative report on gene homologues between species.
- 1.5 - - - - - - - - - - Ensembl gene tree ID - - - ENSGT00390000003602 - Ensembl ID (gene tree) - Unique identifier for a gene tree from the Ensembl database. - 1.3 - - - - - - - - - - - Gene tree - - 1.3 - A phylogenetic tree that is an estimate of the character's phylogeny. - - - - - - - - - - Species tree - - A phylogenetic tree that reflects phylogeny of the taxa from which the characters (used in calculating the tree) were sampled. - 1.3 - - - - - - - - - - Sample ID - - - - - - - - - 1.3 - Sample accession - Name or other identifier of an entry from a biosample database. - - - - - - - - - - - MGI accession - - - Identifier of an object from the MGI database. - 1.3 - - - - - - - - - - - Phenotype name - - - 1.3 - Name of a phenotype. - Phenotypes - Phenotype - - - - - - - - - - - Transition matrix - - An HMM transition matrix contains the probabilities of switching from one HMM state to another. - Consider for example an HMM with two states (AT-rich and GC-rich). The transition matrix will hold the probabilities of switching from the AT-rich to the GC-rich state, and vice versa. - HMM transition matrix - 1.4 - - - - - - - - - Emission matrix - - An HMM emission matrix holds the probabilities of choosing the four nucleotides (A, C, G and T) in each of the states of an HMM. - 1.4 - Consider for example an HMM with two states (AT-rich and GC-rich). The emission matrix holds the probabilities of choosing each of the four nucleotides (A, C, G and T) in the AT-rich state and in the GC-rich state. - HMM emission matrix - - - - - - - - - Hidden Markov model - - A statistical Markov model of a system which is assumed to be a Markov process with unobserved (hidden) states. - 1.4 - - - - - - - - - Format identifier - - An identifier of a data format. - 1.4 - - - - - - - - - Raw image - - 1.5 - Amino acid data - http://semanticscience.org/resource/SIO_000081 - beta12orEarlier - Image data - Raw biological or biomedical image generated by some experimental technique.
- - - - - - - - - - Carbohydrate property - - Carbohydrate data - Data concerning the intrinsic physical (e.g. structural) or chemical properties of one, more or all carbohydrates. - 1.5 - - - - - - - - - - Proteomics experiment report - - true - 1.8 - Report concerning proteomics experiments. - 1.5 - - - - - - - - - - RNAi report - - 1.5 - RNAi experiments. - true - 1.8 - - - - - - - - - - Simulation experiment report - - 1.5 - biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - true - 1.8 - - - - - - - - - - MRI image - - - - - - - - MRT image - 1.7 - Magnetic resonance tomography image - Nuclear magnetic resonance imaging image - - Magnetic resonance imaging image - - NMRI image - An imaging technique that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - - - - - - - - - - Cell migration track image - - - - - - - - 1.7 - An image from a cell migration track assay. - - - - - - - - - - Rate of association - - kon - 1.7 - Rate of association of a protein with another protein or some other molecule. - - - - - - - - - - Gene order - - Such data are often used for genome rearrangement tools and phylogenetic tree labeling. - Multiple gene identifiers in a specific order. - 1.7 - - - - - - - - - - Spectrum - - 1.7 - The spectrum of frequencies of electromagnetic radiation emitted from a molecule as a result of some spectroscopy experiment. - Spectra - - - - - - - - - - NMR spectrum - - - - - - - - Spectral information for a molecule from a nuclear magnetic resonance experiment. - 1.7 - NMR spectra - - - - - - - - - - Chemical structure sketch - - Chemical structure sketches are used for presentational purposes but also as inputs to various analysis software. - 1.8 - Small molecule sketch - A sketch of a small molecule made with some specialised drawing package. 
- - - - - - - - - - Nucleic acid signature - - 1.8 - An informative report about a specific or conserved nucleic acid sequence pattern. - - - - - - - - - - DNA sequence - - DNA sequences - 1.8 - A DNA sequence. - - - - - - - - - - RNA sequence - - An RNA sequence. - DNA sequences - RNA sequences - 1.8 - - - - - - - - - - RNA sequence (raw) - - - Raw sequence (RNA) - 1.8 - A raw RNA sequence. - RNA raw sequence - - - - - - - - - - DNA sequence (raw) - - - Raw sequence (DNA) - A raw DNA sequence. - 1.8 - DNA raw sequence - - - - - - - - - - Sequence variations - - - - - - - - 1.8 - Data on gene sequence variations resulting from large-scale genotyping and DNA sequencing projects. - Gene sequence variations - Variations are stored along with a reference genome. - - - - - - - - - - Bibliography - - 1.8 - A list of publications such as scientific papers or books. - - - - - - - - - - Ontology mapping - - A mapping of supplied textual terms or phrases to ontology concepts (URIs). - beta12orEarlier - - - - - - - - - - Image metadata - - Image-associated data - This can include basic provenance and technical information about the image, scientific annotation and so on. - Any data concerning a specific biological or biomedical image. - 1.9 - Image data - Image-related data - - - - - - - - - - Clinical trial report - - Clinical trial information - A report concerning a clinical trial. - 1.9 - - - - - - - - - - Reference sample report - - 1.10 - A report about a biosample. - Biosample report - - - - - - - - - - Gene Expression Atlas Experiment ID - - Accession number of an entry from the Gene Expression Atlas. - 1.10 - - - - - - - - - - - Disease identifier - - - - - - - - - beta12orEarlier - Identifier of an entry from a database of disease. - - - - - - - - - - - Disease name - - - The name of some disease. - 1.12 - - - - - - - - - - - Training material - - Open educational resource - Some material that is used for educational (training) purposes.
- OER - 1.12 - - - - - - - - - - Online course - - MOOC - A training course available for use on the Web. - On-line course - 1.12 - Massive open online course - - - - - - - - - - Text - - - Any free or plain text, as often specified as some search query. - Plain text - Free text - 1.12 - - - - - - - - - - Biodiversity report - - Biodiversity information - 1.9 - A report about biodiversity data. - - - - - - - - - - Biosafety report - - A report about biosafety data. - Biosafety information - 1.14 - - - - - - - - - - Isolation report - - Geographic location - Isolation source - 1.14 - A report about any kind of isolation of biological material. - - - - - - - - - - Pathogenicity report - - 1.14 - Information about the ability of an organism to cause disease in a corresponding host. - Pathogenicity - - - - - - - - - - Biosafety classification - - Information about the biosafety classification of an organism according to corresponding law. - Biosafety level - 1.14 - - - - - - - - - - Geographic location - - A report about localisation of the isolation of biological material e.g. country or coordinates. - 1.14 - - - - - - - - - - Isolation source - - A report about any kind of isolation source of biological material e.g. blood, water, soil. - 1.14 - - - - - - - - - - Physiology parameter - - Experimentally determined parameter of the physiology of an organism, e.g. substrate spectrum. - 1.14 - - - - - - - - - - Morphology parameter - - Experimentally determined parameter of the morphology of an organism, e.g. size & shape. - 1.14 - - - - - - - - - - Cultivation parameter - - Salinity - Carbon source - Experimentally determined parameter for the cultivation of an organism. - Cultivation conditions - Temperature - 1.14 - Culture media composition - pH value - Nitrogen source - - - - - - - - - - SMILES - - - Chemical structure specified in Simplified Molecular Input Line Entry System (SMILES) line notation.
- beta12orEarlier - - - - - - - - - - - - - - InChI - - - Chemical structure specified in IUPAC International Chemical Identifier (InChI) line notation. - beta12orEarlier - - - - - - - - - - mf - - - Chemical structure specified by Molecular Formula (MF), including a count of each element in a compound. - beta12orEarlier - The general MF query format consists of a series of valid atomic symbols, with an optional number or range. - - - - - - - - - - InChIKey - - - An InChIKey identifier is not human- nor machine-readable but is more suitable for web searches than an InChI chemical structure specification. - The InChIKey (hashed InChI) is a fixed length (25 character) condensed digital representation of an InChI chemical structure specification. It uniquely identifies a chemical compound. - beta12orEarlier - - - - - - - - - - smarts - - SMILES ARbitrary Target Specification (SMARTS) format for chemical structure specification, which is a subset of the SMILES line notation. - beta12orEarlier - - - - - - - - - - unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - nucleotide - - - Non-sequence characters may be used for example for gaps. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Nucleotide_sequence - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - protein - - - Alphabet for a protein sequence with possible ambiguity, unknown positions and non-sequence characters. - beta12orEarlier - Non-sequence characters may be used for gaps and translation stop. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Amino_acid_sequence - - - - - - - - - - consensus - - - beta12orEarlier - Alphabet for the consensus of two or more molecular sequences. 
- - - - - - - - - - pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - unambiguous pure nucleotide - - - beta12orEarlier - Alphabet for a nucleotide sequence (characters ACGTU only) with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - dna - - beta12orEarlier - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#DNA_sequence - Alphabet for a DNA sequence with possible ambiguity, unknown positions and non-sequence characters. - - - - - - - - - - rna - - Alphabet for an RNA sequence with possible ambiguity, unknown positions and non-sequence characters. - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#RNA_sequence - beta12orEarlier - - - - - - - - - - unambiguous pure dna - - - Alphabet for a DNA sequence (characters ACGT only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure dna - - - Alphabet for a DNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) with possible unknown positions but without ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - pure rna - - - Alphabet for an RNA sequence with possible ambiguity and unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible unknown positions but without ambiguity or non-sequence characters. - - - - - - - - - - pure protein - - - beta12orEarlier - Alphabet for any protein sequence with possible ambiguity and unknown positions but without non-sequence characters. - - - - - - - - - - UniGene entry format - - beta12orEarlier - Format of an entry from UniGene.
- A UniGene entry includes a set of transcript sequences assigned to the same transcription locus (gene or expressed pseudogene), with information on protein similarities, gene expression, cDNA clone reagents, and genomic location. - beta12orEarlier - true - - - - - - - - - - COG sequence cluster format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the COG database of clusters of (related) protein sequences. - - - - - - - - - - EMBL feature location - - - beta12orEarlier - Feature location - Format for sequence positions (feature location) as used in DDBJ/EMBL/GenBank database. - - - - - - - - - - quicktandem - - - Report format for tandem repeats in a nucleotide sequence (format generated by the Sanger Centre quicktandem program). - beta12orEarlier - - - - - - - - - - Sanger inverted repeats - - - beta12orEarlier - Report format for inverted repeats in a nucleotide sequence (format generated by the Sanger Centre inverted program). - - - - - - - - - - EMBOSS repeat - - - Report format for tandem repeats in a sequence (an EMBOSS report format). - beta12orEarlier - - - - - - - - - - est2genome format - - - beta12orEarlier - Format of a report on exon-intron structure generated by EMBOSS est2genome. - - - - - - - - - - restrict format - - - Report format for restriction enzyme recognition sites used by EMBOSS restrict program. - beta12orEarlier - - - - - - - - - - restover format - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by EMBOSS restover program. - - - - - - - - - - REBASE restriction sites - - - beta12orEarlier - Report format for restriction enzyme recognition sites used by REBASE database. - - - - - - - - - - FASTA search results format - - - Format of results of a sequence database search using FASTA. - beta12orEarlier - This includes (typically) score data, alignment data and a histogram (of observed and expected distribution of E values.) 
- - - - - - - - - - BLAST results - - - Format of results of a sequence database search using some variant of BLAST. - beta12orEarlier - This includes score data, alignment data and summary table. - - - - - - - - - - mspcrunch - - - beta12orEarlier - Format of results of a sequence database search using some variant of MSPCrunch. - - - - - - - - - - Smith-Waterman format - - - beta12orEarlier - Format of results of a sequence database search using some variant of Smith Waterman. - - - - - - - - - - dhf - - - The hits are relatives to a SCOP or CATH family and are found from a search of a sequence database. - beta12orEarlier - Format of EMBASSY domain hits file (DHF) of hits (sequences) with domain classification information. - - - - - - - - - - lhf - - - beta12orEarlier - Format of EMBASSY ligand hits file (LHF) of database hits (sequences) with ligand classification information. - The hits are putative ligand-binding sequences and are found from a search of a sequence database. - - - - - - - - - - InterPro hits format - - - Results format for searches of the InterPro database. - beta12orEarlier - - - - - - - - - - InterPro protein view report format - - Format of results of a search of the InterPro database showing matches of query protein sequence(s) to InterPro entries. - The report includes a classification of regions in a query protein sequence which are assigned to a known InterPro protein family or group. - beta12orEarlier - - - - - - - - - - InterPro match table format - - Format of results of a search of the InterPro database showing matches between protein sequence(s) and signatures for an InterPro entry. - beta12orEarlier - The table presents matches between query proteins (rows) and signature methods (columns) for this entry. Alternatively the sequence(s) might be from the InterPro entry itself. The match position in the protein sequence and match status (true positive, false positive etc) are indicated.
- - - - - - - - - - HMMER Dirichlet prior - - - beta12orEarlier - Dirichlet distribution HMMER format. - - - - - - - - - - MEME Dirichlet prior - - - beta12orEarlier - Dirichlet distribution MEME format. - - - - - - - - - - HMMER emission and transition - - - Format of a report from the HMMER package on the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - prosite-pattern - - - Format of a regular expression pattern from the Prosite database. - beta12orEarlier - - - - - - - - - - EMBOSS sequence pattern - - - Format of an EMBOSS sequence pattern. - beta12orEarlier - - - - - - - - - - meme-motif - - - A motif in the format generated by the MEME program. - beta12orEarlier - - - - - - - - - - prosite-profile - - - Sequence profile (sequence classifier) format used in the PROSITE database. - beta12orEarlier - - - - - - - - - - JASPAR format - - - beta12orEarlier - A profile (sequence classifier) in the format used in the JASPAR database. - - - - - - - - - - MEME background Markov model - - - Format of the model of random sequences used by MEME. - beta12orEarlier - - - - - - - - - - HMMER format - - - Format of a hidden Markov model representation used by the HMMER package. - beta12orEarlier - - - - - - - - - - HMMER-aln - - - - beta12orEarlier - FASTA-style format for multiple sequences aligned by HMMER package to an HMM. - - - - - - - - - - DIALIGN format - - - Format of multiple sequences aligned by DIALIGN package. - beta12orEarlier - - - - - - - - - - daf - - - The format is clustal-like and includes annotation of domain family classification information. - EMBASSY 'domain alignment file' (DAF) format, containing a sequence alignment of protein domains belonging to the same SCOP or CATH family. 
- beta12orEarlier - - - - - - - - - - Sequence-MEME profile alignment - - - beta12orEarlier - Format for alignment of molecular sequences to MEME profiles (position-dependent scoring matrices) as generated by the MAST tool from the MEME package. - - - - - - - - - - HMMER profile alignment (sequences versus HMMs) - - - Format used by the HMMER package for an alignment of a sequence against a hidden Markov model database. - beta12orEarlier - - - - - - - - - - HMMER profile alignment (HMM versus sequences) - - - Format used by the HMMER package for an alignment of a hidden Markov model against a sequence database. - beta12orEarlier - - - - - - - - - - Phylip distance matrix - - - Data Type must include the distance matrix, probably as pairs of sequence identifiers with a distance (integer or float). - beta12orEarlier - Format of PHYLIP phylogenetic distance matrix data. - - - - - - - - - - ClustalW dendrogram - - - beta12orEarlier - Dendrogram (tree file) format generated by ClustalW. - - - - - - - - - - Phylip tree raw - - - Raw data file format used by Phylip from which a phylogenetic tree is directly generated or plotted. - beta12orEarlier - - - - - - - - - - Phylip continuous quantitative characters - - - beta12orEarlier - PHYLIP file format for continuous quantitative character data. - - - - - - - - - - Phylogenetic property values format - - Format of phylogenetic property data. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Phylip character frequencies format - - - beta12orEarlier - PHYLIP file format for phylogenetics character frequency data. - - - - - - - - - - Phylip discrete states format - - - Format of PHYLIP discrete states data. - beta12orEarlier - - - - - - - - - - Phylip cliques format - - - beta12orEarlier - Format of PHYLIP cliques data. - - - - - - - - - - Phylip tree format - - - Phylogenetic tree data format used by the PHYLIP program.
- beta12orEarlier - - - - - - - - - - TreeBASE format - - - beta12orEarlier - The format of an entry from the TreeBASE database of phylogenetic data. - - - - - - - - - - TreeFam format - - - beta12orEarlier - The format of an entry from the TreeFam database of phylogenetic data. - - - - - - - - - - Phylip tree distance format - - - Format for distances, such as Branch Score distance, between two or more phylogenetic trees as used by the Phylip package. - beta12orEarlier - - - - - - - - - - dssp - - - beta12orEarlier - The DSSP database is built using the DSSP application which defines secondary structure, geometrical features and solvent exposure of proteins, given atomic coordinates in PDB format. - Format of an entry from the DSSP database (Dictionary of Secondary Structure in Proteins). - - - - - - - - - - hssp - - - Entry format of the HSSP database (Homology-derived Secondary Structure in Proteins). - beta12orEarlier - - - - - - - - - - Dot-bracket format - - - beta12orEarlier - Format of RNA secondary structure in dot-bracket notation, originally generated by the Vienna RNA package/server. - Vienna RNA secondary structure format - Vienna RNA format - - - - - - - - - - Vienna local RNA secondary structure format - - - Format of local RNA secondary structure components with free energy values, generated by the Vienna RNA package/server. - beta12orEarlier - - - - - - - - - - PDB database entry format - - - - - - - - beta12orEarlier - PDB entry format - Format of an entry (or part of an entry) from the PDB database. - - - - - - - - - - PDB - - - PDB format - beta12orEarlier - Entry format of PDB database in PDB format. - - - - - - - - - - mmCIF - - - Chemical MIME (http://www.ch.ic.ac.uk/chemime): chemical/x-mmcif - Entry format of PDB database in mmCIF format. - beta12orEarlier - mmcif - - - - - - - - - - PDBML - - - Entry format of PDB database in PDBML (XML) format. 
- beta12orEarlier - - - - - - - - - - Domainatrix 3D-1D scoring matrix format - - beta12orEarlier - true - beta12orEarlier - Format of a matrix of 3D-1D scores used by the EMBOSS Domainatrix applications. - - - - - - - - - - aaindex - - - Amino acid index format used by the AAindex database. - beta12orEarlier - - - - - - - - - - IntEnz enzyme report format - - beta12orEarlier - beta12orEarlier - Format of an entry from IntEnz (The Integrated Relational Enzyme Database). - IntEnz is the master copy of the Enzyme Nomenclature, the recommendations of the NC-IUBMB on the Nomenclature and Classification of Enzyme-Catalysed Reactions. - true - - - - - - - - - - BRENDA enzyme report format - - true - Format of an entry from the BRENDA enzyme database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - KEGG REACTION enzyme report format - - true - beta12orEarlier - Format of an entry from the KEGG REACTION database of biochemical reactions. - beta12orEarlier - - - - - - - - - - KEGG ENZYME enzyme report format - - beta12orEarlier - true - Format of an entry from the KEGG ENZYME database. - beta12orEarlier - - - - - - - - - - REBASE proto enzyme report format - - Format of an entry from the proto section of the REBASE enzyme database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - REBASE withrefm enzyme report format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the withrefm section of the REBASE enzyme database. - - - - - - - - - - Pcons report format - - - Format of output of the Pcons Model Quality Assessment Program (MQAP). - beta12orEarlier - Pcons ranks protein models by assessing their quality based on the occurrence of recurring common three-dimensional structural patterns. Pcons returns a score reflecting the overall global quality and a score for each individual residue in the protein reflecting the local residue quality. 
- - - - - - - - - - ProQ report format - - - beta12orEarlier - ProQ is a neural network-based predictor that predicts the quality of a protein model based on the number of structural features. - Format of output of the ProQ protein model quality predictor. - - - - - - - - - - SMART domain assignment report format - - beta12orEarlier - true - Format of SMART domain assignment data. - The SMART output file includes data on genetically mobile domains / analysis of domain architectures, including phyletic distributions, functional class, tertiary structures and functionally important residues. - beta12orEarlier - - - - - - - - - - BIND entry format - - Entry format for the BIND database of protein interaction. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - IntAct entry format - - beta12orEarlier - beta12orEarlier - Entry format for the IntAct database of protein interaction. - true - - - - - - - - - - InterPro entry format - - Entry format for the InterPro database of protein signatures (sequence classifiers) and classified sequences. - true - beta12orEarlier - This includes signature metadata, sequence references and a reference to the signature itself. There is normally a header (entry accession numbers and name), abstract, taxonomy information, example proteins etc. Each entry also includes a match list which give a number of different views of the signature matches for the sequences in each InterPro entry. - beta12orEarlier - - - - - - - - - - InterPro entry abstract format - - true - beta12orEarlier - References are included and a functional inference is made where possible. - beta12orEarlier - Entry format for the textual abstract of signatures in an InterPro entry and its protein matches. - - - - - - - - - - Gene3D entry format - - Entry format for the Gene3D protein secondary database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PIRSF entry format - - beta12orEarlier - Entry format for the PIRSF protein secondary database. 
- true - beta12orEarlier - - - - - - - - - - PRINTS entry format - - beta12orEarlier - beta12orEarlier - true - Entry format for the PRINTS protein secondary database. - - - - - - - - - - Panther Families and HMMs entry format - - beta12orEarlier - beta12orEarlier - Entry format for the Panther library of protein families and subfamilies. - true - - - - - - - - - - Pfam entry format - - Entry format for the Pfam protein secondary database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - SMART entry format - - true - beta12orEarlier - Entry format for the SMART protein secondary database. - beta12orEarlier - - - - - - - - - - Superfamily entry format - - Entry format for the Superfamily protein secondary database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - TIGRFam entry format - - beta12orEarlier - true - Entry format for the TIGRFam protein secondary database. - beta12orEarlier - - - - - - - - - - ProDom entry format - - Entry format for the ProDom protein domain classification database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FSSP entry format - - Entry format for the FSSP database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - findkm - - - beta12orEarlier - A report format for the kinetics of enzyme-catalysed reaction(s) in a format generated by EMBOSS findkm. This includes Michaelis Menten plot, Hanes Woolf plot, Michaelis Menten constant (Km) and maximum velocity (Vmax). - - - - - - - - - - Ensembl gene report format - - beta12orEarlier - Entry format of Ensembl genome database. - beta12orEarlier - true - - - - - - - - - - DictyBase gene report format - - true - beta12orEarlier - Entry format of DictyBase genome database. - beta12orEarlier - - - - - - - - - - CGD gene report format - - beta12orEarlier - true - beta12orEarlier - Entry format of Candida Genome database. - - - - - - - - - - DragonDB gene report format - - beta12orEarlier - Entry format of DragonDB genome database. 
- beta12orEarlier - true - - - - - - - - - - EcoCyc gene report format - - Entry format of EcoCyc genome database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - FlyBase gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of FlyBase genome database. - - - - - - - - - - Gramene gene report format - - beta12orEarlier - beta12orEarlier - Entry format of Gramene genome database. - true - - - - - - - - - - KEGG GENES gene report format - - true - beta12orEarlier - Entry format of KEGG GENES genome database. - beta12orEarlier - - - - - - - - - - MaizeGDB gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Maize genetics and genomics database (MaizeGDB). - - - - - - - - - - MGD gene report format - - Entry format of the Mouse Genome Database (MGD). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - RGD gene report format - - true - beta12orEarlier - Entry format of the Rat Genome Database (RGD). - beta12orEarlier - - - - - - - - - - SGD gene report format - - true - beta12orEarlier - beta12orEarlier - Entry format of the Saccharomyces Genome Database (SGD). - - - - - - - - - - GeneDB gene report format - - Entry format of the Sanger GeneDB genome database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - TAIR gene report format - - beta12orEarlier - beta12orEarlier - Entry format of The Arabidopsis Information Resource (TAIR) genome database. - true - - - - - - - - - - WormBase gene report format - - Entry format of the WormBase genomes database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - ZFIN gene report format - - beta12orEarlier - beta12orEarlier - true - Entry format of the Zebrafish Information Network (ZFIN) genome database. - - - - - - - - - - TIGR gene report format - - true - Entry format of the TIGR genome database. 
- beta12orEarlier - beta12orEarlier - - - - - - - - - - dbSNP polymorphism report format - - beta12orEarlier - Entry format for the dbSNP database. - true - beta12orEarlier - - - - - - - - - - OMIM entry format - - beta12orEarlier - true - beta12orEarlier - Format of an entry from the OMIM database of genotypes and phenotypes. - - - - - - - - - - HGVbase entry format - - true - Format of a record from the HGVbase database of genotypes and phenotypes. - beta12orEarlier - beta12orEarlier - - - - - - - - - - HIVDB entry format - - beta12orEarlier - beta12orEarlier - true - Format of a record from the HIVDB database of genotypes and phenotypes. - - - - - - - - - - KEGG DISEASE entry format - - beta12orEarlier - Format of an entry from the KEGG DISEASE database. - true - beta12orEarlier - - - - - - - - - - Primer3 primer - - - Report format on PCR primers and hybridization oligos as generated by Whitehead primer3 program. - beta12orEarlier - - - - - - - - - - ABI - - - A format of raw sequence read data from an Applied Biosystems sequencing machine. - beta12orEarlier - - - - - - - - - - mira - - - Format of MIRA sequence trace information file. - beta12orEarlier - - - - - - - - - - CAF - - - Common Assembly Format (CAF). A sequence assembly format including contigs, base-call qualities, and other metadata. - beta12orEarlier - - - - - - - - - - - - exp - - - Sequence assembly project file EXP format. - beta12orEarlier - - - - - - - - - - SCF - - - Staden Chromatogram Files format (SCF) of base-called sequence reads, qualities, and other metadata. - beta12orEarlier - - - - - - - - - - - - PHD - - - beta12orEarlier - PHD sequence trace format to store serialised chromatogram data (reads). - - - - - - - - - - - - dat - - - - - - - - - beta12orEarlier - Format of Affymetrix data file of raw image data. 
- Affymetrix image data file format - - - - - - - - - - cel - - - - - - - - - beta12orEarlier - Affymetrix probe raw data format - Format of Affymetrix data file of information about (raw) expression levels of the individual probes. - - - - - - - - - - affymetrix - - - Format of affymetrix gene cluster files (hc-genes.txt, hc-chips.txt) from hierarchical clustering. - beta12orEarlier - - - - - - - - - - ArrayExpress entry format - - beta12orEarlier - true - Entry format for the ArrayExpress microarrays database. - beta12orEarlier - - - - - - - - - - affymetrix-exp - - - Affymetrix data file format for information about experimental conditions and protocols. - Affymetrix experimental conditions data file format - beta12orEarlier - - - - - - - - - - CHP - - - - - - - - - Affymetrix probe normalised data format - beta12orEarlier - Format of Affymetrix data file of information about (normalised) expression levels of the individual probes. - - - - - - - - - - EMDB entry format - - beta12orEarlier - Format of an entry from the Electron Microscopy DataBase (EMDB). - true - beta12orEarlier - - - - - - - - - - KEGG PATHWAY entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PATHWAY database of pathway maps for molecular interactions and reaction networks. - true - - - - - - - - - - MetaCyc entry format - - true - beta12orEarlier - The format of an entry from the MetaCyc metabolic pathways database. - beta12orEarlier - - - - - - - - - - HumanCyc entry format - - The format of a report from the HumanCyc metabolic pathways database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - INOH entry format - - beta12orEarlier - true - The format of an entry from the INOH signal transduction pathways database. - beta12orEarlier - - - - - - - - - - PATIKA entry format - - beta12orEarlier - The format of an entry from the PATIKA biological pathways database. 
- beta12orEarlier - true - - - - - - - - - - Reactome entry format - - beta12orEarlier - The format of an entry from the reactome biological pathways database. - true - beta12orEarlier - - - - - - - - - - aMAZE entry format - - beta12orEarlier - true - The format of an entry from the aMAZE biological pathways and molecular interactions database. - beta12orEarlier - - - - - - - - - - CPDB entry format - - The format of an entry from the CPDB database. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Panther Pathways entry format - - beta12orEarlier - true - beta12orEarlier - The format of an entry from the Panther Pathways database. - - - - - - - - - - Taverna workflow format - - - Format of Taverna workflows. - beta12orEarlier - - - - - - - - - - BioModel mathematical model format - - beta12orEarlier - beta12orEarlier - Format of mathematical models from the BioModel database. - true - Models are annotated and linked to relevant data resources, such as publications, databases of compounds and pathways, controlled vocabularies, etc. - - - - - - - - - - KEGG LIGAND entry format - - The format of an entry from the KEGG LIGAND chemical database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - KEGG COMPOUND entry format - - beta12orEarlier - The format of an entry from the KEGG COMPOUND database. - true - beta12orEarlier - - - - - - - - - - KEGG PLANT entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from the KEGG PLANT database. - true - - - - - - - - - - KEGG GLYCAN entry format - - true - beta12orEarlier - The format of an entry from the KEGG GLYCAN database. - beta12orEarlier - - - - - - - - - - PubChem entry format - - beta12orEarlier - The format of an entry from PubChem. - true - beta12orEarlier - - - - - - - - - - ChemSpider entry format - - beta12orEarlier - The format of an entry from a database of chemical structures and property predictions. 
- beta12orEarlier - true - - - - - - - - - - ChEBI entry format - - beta12orEarlier - beta12orEarlier - The format of an entry from Chemical Entities of Biological Interest (ChEBI). - true - ChEBI includes an ontological classification defining relations between entities or classes of entities. - - - - - - - - - - MSDchem ligand dictionary entry format - - The format of an entry from the MSDchem ligand dictionary. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - HET group dictionary entry format - - - The format of an entry from the HET group dictionary (HET groups from PDB files). - beta12orEarlier - - - - - - - - - - KEGG DRUG entry format - - The format of an entry from the KEGG DRUG database. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - PubMed citation - - - beta12orEarlier - Format of bibliographic reference as used by the PubMed database. - - - - - - - - - - Medline Display Format - - - beta12orEarlier - Format for abstracts of scientific articles from the Medline database. - Bibliographic reference information including citation information is included - - - - - - - - - - CiteXplore-core - - - beta12orEarlier - CiteXplore 'core' citation format including title, journal, authors and abstract. - - - - - - - - - - CiteXplore-all - - - CiteXplore 'all' citation format includes all known details such as Mesh terms and cross-references. - beta12orEarlier - - - - - - - - - - pmc - - - beta12orEarlier - Article format of the PubMed Central database. - - - - - - - - - - iHOP text mining abstract format - - - beta12orEarlier - iHOP abstract format. - - - - - - - - - - Oscar3 - - - Oscar 3 performs chemistry-specific parsing of chemical documents. It attempts to identify chemical names, ontology concepts and chemical data from a document. - Text mining abstract format from the Oscar 3 application. 
- beta12orEarlier - - - - - - - - - - PDB atom record format - - true - beta13 - beta12orEarlier - Format of an ATOM record (describing data for an individual atom) from a PDB file. - - - - - - - - - - CATH chain report format - - The report (for example http://www.cathdb.info/chain/1cukA) includes chain identifiers, domain identifiers and CATH codes for domains in a given protein chain. - beta12orEarlier - Format of CATH domain classification information for a polypeptide chain. - beta12orEarlier - true - - - - - - - - - - CATH PDB report format - - beta12orEarlier - beta12orEarlier - true - Format of CATH domain classification information for a protein PDB file. - The report (for example http://www.cathdb.info/pdb/1cuk) includes chain identifiers, domain identifiers and CATH codes for domains in a given PDB file. - - - - - - - - - - NCBI gene report format - - true - Entry (gene) format of the NCBI database. - beta12orEarlier - beta12orEarlier - - - - - - - - - - GeneIlluminator gene report format - - Report format for biological functions associated with a gene name and its alternative names (synonyms, homonyms), as generated by the GeneIlluminator service. - This includes a gene name and abbreviation of the name which may be in a name space indicating the gene status and relevant organisation. - beta12orEarlier - beta12orEarlier - Moby:GI_Gene - true - - - - - - - - - - BacMap gene card format - - Format of a report on the DNA and protein sequences for a given gene label from a bacterial chromosome maps from the BacMap database. - true - beta12orEarlier - beta12orEarlier - Moby:BacMapGeneCard - - - - - - - - - - ColiCard report format - - Format of a report on Escherichia coli genes, proteins and molecules from the CyberCell Database (CCDB). - true - beta12orEarlier - Moby:ColiCard - beta12orEarlier - - - - - - - - - - PlasMapper TextMap - - - beta12orEarlier - Map of a plasmid (circular DNA) in PlasMapper TextMap format. 
- - - - - - - - - - newick - - - nh - beta12orEarlier - Phylogenetic tree Newick (text) format. - - - - - - - - - - TreeCon format - - - beta12orEarlier - Phylogenetic tree TreeCon (text) format. - - - - - - - - - - Nexus format - - - Phylogenetic tree Nexus (text) format. - beta12orEarlier - - - - - - - - - - Format - - - - http://en.wikipedia.org/wiki/File_format - http://purl.org/biotop/biotop.owl#MachineLanguage - File format - Data model - http://www.onto-med.de/ontologies/gfo.owl#Symbol_structure - Exchange format - "http://purl.obolibrary.org/obo/IAO_0000098" - http://semanticscience.org/resource/SIO_000612 - http://semanticscience.org/resource/SIO_000618 - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - "http://purl.org/dc/elements/1.1/format" - http://wsio.org/compression_004 - A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere. - http://en.wikipedia.org/wiki/List_of_file_formats - http://www.ifomis.org/bfo/1.1/snap#Quality - Data format - http://purl.org/biotop/biotop.owl#Quality - The main focus in EDAM lies on formats as means of structuring data exchanged between different tools or resources. The serialisation, compression, or encoding of concrete data formats/models is not in scope of EDAM. Format 'is format of' Data. - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - - - - - Data model - A defined data format has its implicit or explicit data model, and EDAM does not distinguish the two. Some data models however do not have any standard way of serialisation into an exchange format, and those are thus not considered formats in EDAM. (Remark: even broader - or closely related - term to 'Data model' would be an 'Information model'.) - - - - - File format - File format denotes only formats of a computer file, but the same formats apply also to data blobs or exchanged messages. 
- - - - - - - - - - Atomic data format - - beta12orEarlier - beta13 - Data format for an individual atom. - true - - - - - - - - - - Sequence record format - - - - - - - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence feature annotation format - - - - - - - - beta12orEarlier - Data format for molecular sequence feature information. - - - - - - - - - - Alignment format - - - - - - - - Data format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - acedb - - beta12orEarlier - ACEDB sequence format. - - - - - - - - - - clustal sequence format - - true - beta12orEarlier - Clustalw output format. - beta12orEarlier - - - - - - - - - - codata - - - Codata entry format. - beta12orEarlier - - - - - - - - - - dbid - - beta12orEarlier - Fasta format variant with database name before ID. - - - - - - - - - - EMBL format - - - EMBL entry format. - EMBL sequence format - EMBL - beta12orEarlier - - - - - - - - - - Staden experiment format - - - Staden experiment file format. - beta12orEarlier - - - - - - - - - - FASTA - - - beta12orEarlier - FASTA format - FASTA sequence format - FASTA format including NCBI-style IDs. - - - - - - - - - - FASTQ - - FASTQ short read format ignoring quality scores. - beta12orEarlier - FASTAQ - fq - - - - - - - - - - FASTQ-illumina - - FASTQ Illumina 1.3 short read format. - beta12orEarlier - - - - - - - - - - FASTQ-sanger - - FASTQ short read format with phred quality. - beta12orEarlier - - - - - - - - - - FASTQ-solexa - - FASTQ Solexa/Illumina 1.0 short read format. - beta12orEarlier - - - - - - - - - - fitch program - - - Fitch program format. - beta12orEarlier - - - - - - - - - - GCG - - - GCG SSF - beta12orEarlier - GCG SSF (single sequence file) file format. - GCG sequence file format. - - - - - - - - - - GenBank format - - - beta12orEarlier - Genbank entry format. - GenBank - - - - - - - - - - genpept - - beta12orEarlier - Genpept protein entry format. 
- Currently identical to refseqp format - - - - - - - - - - GFF2-seq - - - GFF feature file format with sequence in the header. - beta12orEarlier - - - - - - - - - - GFF3-seq - - - GFF3 feature file format with sequence. - beta12orEarlier - - - - - - - - - - giFASTA format - - FASTA sequence format including NCBI-style GIs. - beta12orEarlier - - - - - - - - - - hennig86 - - - beta12orEarlier - Hennig86 output sequence format. - - - - - - - - - - ig - - - Intelligenetics sequence format. - beta12orEarlier - - - - - - - - - - igstrict - - - beta12orEarlier - Intelligenetics sequence format (strict version). - - - - - - - - - - jackknifer - - - Jackknifer interleaved and non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - mase format - - - beta12orEarlier - Mase program sequence format. - - - - - - - - - - mega-seq - - - beta12orEarlier - Mega interleaved and non-interleaved sequence format. - - - - - - - - - - GCG MSF - - beta12orEarlier - GCG MSF (multiple sequence file) file format. - MSF - - - - - - - - - - nbrf/pir - - NBRF/PIR entry sequence format. - nbrf - beta12orEarlier - pir - - - - - - - - - - nexus-seq - - - - beta12orEarlier - Nexus/paup interleaved sequence format. - - - - - - - - - - pdbatom - - - - pdb format in EMBOSS. - beta12orEarlier - PDB sequence format (ATOM lines). - - - - - - - - - - pdbatomnuc - - - - beta12orEarlier - pdbnuc format in EMBOSS. - PDB nucleotide sequence format (ATOM lines). - - - - - - - - - - pdbseqresnuc - - - - pdbnucseq format in EMBOSS. - PDB nucleotide sequence format (SEQRES lines). - beta12orEarlier - - - - - - - - - - pdbseqres - - - - PDB sequence format (SEQRES lines). - beta12orEarlier - pdbseq format in EMBOSS. - - - - - - - - - - Pearson format - - beta12orEarlier - Plain old FASTA sequence format (unspecified format for IDs). - - - - - - - - - - phylip sequence format - - beta12orEarlier - Phylip interleaved sequence format. 
- true - beta12orEarlier - - - - - - - - - - phylipnon sequence format - - true - Phylip non-interleaved sequence format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - raw - - - beta12orEarlier - Raw sequence format with no non-sequence characters. - - - - - - - - - - refseqp - - - beta12orEarlier - Refseq protein entry sequence format. - Currently identical to genpept format - - - - - - - - - - selex sequence format - - beta12orEarlier - true - beta12orEarlier - Selex sequence format. - - - - - - - - - - Staden format - - - beta12orEarlier - Staden suite sequence format. - - - - - - - - - - - - - - Stockholm format - - - Stockholm multiple sequence alignment format (used by Pfam and Rfam). - beta12orEarlier - - - - - - - - - - - - strider format - - - DNA strider output sequence format. - beta12orEarlier - - - - - - - - - - UniProtKB format - - UniProt format - SwissProt format - beta12orEarlier - UniProtKB entry sequence format. - - - - - - - - - - plain text format (unformatted) - - beta12orEarlier - Plain text sequence format (essentially unformatted). - - - - - - - - - - treecon sequence format - - true - beta12orEarlier - beta12orEarlier - Treecon output sequence format. - - - - - - - - - - ASN.1 sequence format - - - NCBI ASN.1-based sequence format. - beta12orEarlier - - - - - - - - - - DAS format - - - das sequence format - DAS sequence (XML) format (any type). - beta12orEarlier - - - - - - - - - - dasdna - - - beta12orEarlier - DAS sequence (XML) format (nucleotide-only). - The use of this format is deprecated. - - - - - - - - - - debug-seq - - - EMBOSS debugging trace sequence format of full internal data content. - beta12orEarlier - - - - - - - - - - jackknifernon - - - beta12orEarlier - Jackknifer output sequence non-interleaved format. - - - - - - - - - - meganon sequence format - - beta12orEarlier - beta12orEarlier - Mega non-interleaved output sequence format. 
- true - - - - - - - - - - NCBI format - - NCBI FASTA sequence format with NCBI-style IDs. - beta12orEarlier - There are several variants of this. - - - - - - - - - - nexusnon - - - - Nexus/paup non-interleaved sequence format. - beta12orEarlier - - - - - - - - - - GFF2 - - beta12orEarlier - General Feature Format (GFF) of sequence features. - - - - - - - - - - - - GFF3 - - beta12orEarlier - Generic Feature Format version 3 (GFF3) of sequence features. - - - - - - - - - - - - pir - - true - 1.7 - PIR feature format. - beta12orEarlier - - - - - - - - - - swiss feature - - true - Swiss-Prot feature format. - beta12orEarlier - beta12orEarlier - - - - - - - - - - DASGFF - - - DAS GFF (XML) feature format. - das feature - DASGFF feature - beta12orEarlier - - - - - - - - - - debug-feat - - - EMBOSS debugging trace feature format of full internal data content. - beta12orEarlier - - - - - - - - - - EMBL feature - - beta12orEarlier - EMBL feature format. - true - beta12orEarlier - - - - - - - - - - GenBank feature - - beta12orEarlier - Genbank feature format. - beta12orEarlier - true - - - - - - - - - - ClustalW format - - - clustal - beta12orEarlier - ClustalW format for (aligned) sequences. - - - - - - - - - - debug - - - EMBOSS alignment format for debugging trace of full internal data content. - beta12orEarlier - - - - - - - - - - FASTA-aln - - - beta12orEarlier - Fasta format for (aligned) sequences. - - - - - - - - - - markx0 - - beta12orEarlier - Pearson MARKX0 alignment format. - - - - - - - - - - markx1 - - Pearson MARKX1 alignment format. - beta12orEarlier - - - - - - - - - - markx10 - - beta12orEarlier - Pearson MARKX10 alignment format. - - - - - - - - - - markx2 - - beta12orEarlier - Pearson MARKX2 alignment format. - - - - - - - - - - markx3 - - beta12orEarlier - Pearson MARKX3 alignment format. - - - - - - - - - - match - - - Alignment format for start and end of matches between sequence pairs. 
- beta12orEarlier - - - - - - - - - - mega - - Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - meganon - - Mega non-interleaved format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - msf alignment format - - true - beta12orEarlier - beta12orEarlier - MSF format for (aligned) sequences. - - - - - - - - - - nexus alignment format - - Nexus/paup format for (aligned) sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - nexusnon alignment format - - beta12orEarlier - true - Nexus/paup non-interleaved format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - pair - - EMBOSS simple sequence pair alignment format. - beta12orEarlier - - - - - - - - - - PHYLIP format - - phy - beta12orEarlier - ph - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - PHYLIP interleaved format - Phylip format for (aligned) sequences. - - - - - - - - - - phylipnon - - http://www.bioperl.org/wiki/PHYLIP_multiple_alignment_format - beta12orEarlier - PHYLIP sequential format - Phylip non-interleaved format for (aligned) sequences. - - - - - - - - - - scores format - - - Alignment format for score values for pairs of sequences. - beta12orEarlier - - - - - - - - - - selex - - - - beta12orEarlier - SELEX format for (aligned) sequences. - - - - - - - - - - EMBOSS simple format - - - EMBOSS simple multiple alignment format. - beta12orEarlier - - - - - - - - - - srs format - - - beta12orEarlier - Simple multiple sequence (alignment) format for SRS. - - - - - - - - - - srspair - - - beta12orEarlier - Simple sequence pair (alignment) format for SRS. - - - - - - - - - - T-Coffee format - - - T-Coffee program alignment format. - beta12orEarlier - - - - - - - - - - TreeCon-seq - - - - Treecon format for (aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format - - - - - - - - Data format for a phylogenetic tree. 
- beta12orEarlier - - - - - - - - - - Biological pathway or network format - - - - - - - - beta12orEarlier - Data format for a biological pathway or network. - - - - - - - - - - Sequence-profile alignment format - - - - - - - - beta12orEarlier - Data format for a sequence-profile alignment. - - - - - - - - - - Sequence-profile alignment (HMM) format - - beta12orEarlier - beta12orEarlier - true - Data format for a sequence-HMM profile alignment. - - - - - - - - - - Amino acid index format - - - - - - - - Data format for an amino acid index. - beta12orEarlier - - - - - - - - - - Article format - - - - - - - - beta12orEarlier - Literature format - Data format for a full-text scientific article. - - - - - - - - - - Text mining report format - - - - - - - - beta12orEarlier - Data format for an abstract (report) from text mining. - - - - - - - - - - Enzyme kinetics report format - - - - - - - - Data format for reports on enzyme kinetics. - beta12orEarlier - - - - - - - - - - Small molecule report format - - - - - - - - beta12orEarlier - Chemical compound annotation format - Format of a report on a chemical compound. - - - - - - - - - - Gene annotation format - - - - - - - - Format of a report on a particular locus, gene, gene system or groups of genes. - beta12orEarlier - Gene features format - - - - - - - - - - Workflow format - - beta12orEarlier - Format of a workflow. - - - - - - - - - - Tertiary structure format - - beta12orEarlier - Data format for a molecular tertiary structure. - - - - - - - - - - Biological model format - - Data format for a biological model. - beta12orEarlier - 1.2 - true - - - - - - - - - - Chemical formula format - - - - - - - - beta12orEarlier - Text format of a chemical formula. - - - - - - - - - - Phylogenetic character data format - - - - - - - - beta12orEarlier - Format of raw (unplotted) phylogenetic data. 
- - - - - - - - - - Phylogenetic continuous quantitative character format - - - - - - - - Format of phylogenetic continuous quantitative character data. - beta12orEarlier - - - - - - - - - - Phylogenetic discrete states format - - - - - - - - Format of phylogenetic discrete states data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (cliques) format - - - - - - - - Format of phylogenetic cliques data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree report (invariants) format - - - - - - - - beta12orEarlier - Format of phylogenetic invariants data. - - - - - - - - - - Electron microscopy model format - - beta12orEarlier - true - beta12orEarlier - Annotation format for electron microscopy models. - - - - - - - - - - Phylogenetic tree report (tree distances) format - - - - - - - - Format for phylogenetic tree distance data. - beta12orEarlier - - - - - - - - - - Polymorphism report format - - beta12orEarlier - true - 1.0 - Format for sequence polymorphism data. - - - - - - - - - - Protein family report format - - - - - - - - beta12orEarlier - Format for reports on a protein family. - - - - - - - - - - Protein interaction format - - - - - - - - beta12orEarlier - Format for molecular interaction data. - Molecular interaction format - - - - - - - - - - Sequence assembly format - - - - - - - - beta12orEarlier - Format for sequence assembly data. - - - - - - - - - - Microarray experiment data format - - Format for information about a microarray experimental per se (not the data generated from that experiment). - beta12orEarlier - - - - - - - - - - Sequence trace format - - - - - - - - Format for sequence trace data (i.e. including base call information). - beta12orEarlier - - - - - - - - - - Gene expression report format - - - - - - - - Gene expression data format - Format of a file of gene expression data, e.g. a gene expression matrix or profile. 
- beta12orEarlier - - - - - - - - - - Genotype and phenotype annotation format - - beta12orEarlier - true - Format of a report on genotype / phenotype information. - beta12orEarlier - - - - - - - - - - Map format - - - - - - - - Format of a map of (typically one) molecular sequence annotated with features. - beta12orEarlier - - - - - - - - - - Nucleic acid features (primers) format - - beta12orEarlier - Format of a report on PCR primers or hybridization oligos in a nucleic acid sequence. - - - - - - - - - - Protein report format - - - - - - - - Format of a report of general information about a specific protein. - beta12orEarlier - - - - - - - - - - Protein report (enzyme) format - - Format of a report of general information about a specific enzyme. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix format - - - - - - - - beta12orEarlier - Format of a matrix of 3D-1D scores (amino acid environment probabilities). - - - - - - - - - - Protein structure report (quality evaluation) format - - - - - - - - Format of a report on the quality of a protein three-dimensional model. - beta12orEarlier - - - - - - - - - - Database hits (sequence) format - - - - - - - - Format of a report on sequence hits and associated data from searching a sequence database. - beta12orEarlier - - - - - - - - - - Sequence distance matrix format - - - - - - - - beta12orEarlier - Format of a matrix of genetic distances between molecular sequences. - - - - - - - - - - Sequence motif format - - - - - - - - Format of a sequence motif. - beta12orEarlier - - - - - - - - - - Sequence profile format - - - - - - - - Format of a sequence profile. - beta12orEarlier - - - - - - - - - - Hidden Markov model format - - - - - - - - beta12orEarlier - Format of a hidden Markov model. - - - - - - - - - - Dirichlet distribution format - - - - - - - - Data format of a dirichlet distribution. 
- beta12orEarlier - - - - - - - - - - HMM emission and transition counts format - - - - - - - - - - - - - - Data format for the emission and transition counts of a hidden Markov model. - beta12orEarlier - - - - - - - - - - RNA secondary structure format - - - - - - - - beta12orEarlier - Format for secondary structure (predicted or real) of an RNA molecule. - - - - - - - - - - Protein secondary structure format - - Format for secondary structure (predicted or real) of a protein molecule. - beta12orEarlier - - - - - - - - - - Sequence range format - - - - - - - - beta12orEarlier - Format used to specify range(s) of sequence positions. - - - - - - - - - - pure - - - Alphabet for molecular sequence with possible unknown positions but without non-sequence characters. - beta12orEarlier - - - - - - - - - - unpure - - - Alphabet for a molecular sequence with possible unknown positions but possibly with non-sequence characters. - beta12orEarlier - - - - - - - - - - unambiguous sequence - - - Alphabet for a molecular sequence with possible unknown positions but without ambiguity characters. - beta12orEarlier - - - - - - - - - - ambiguous - - - beta12orEarlier - Alphabet for a molecular sequence with possible unknown positions and possible ambiguity characters. - - - - - - - - - - Sequence features (repeats) format - - beta12orEarlier - Format used for map of repeats in molecular (typically nucleotide) sequences. - - - - - - - - - - Nucleic acid features (restriction sites) format - - beta12orEarlier - Format used for report on restriction enzyme recognition sites in nucleotide sequences. - - - - - - - - - - Gene features (coding region) format - - beta12orEarlier - Format used for report on coding regions in nucleotide sequences. - true - 1.10 - - - - - - - - - - Sequence cluster format - - - - - - - - beta12orEarlier - Format used for clusters of molecular sequences. - - - - - - - - - - Sequence cluster format (protein) - - Format used for clusters of protein sequences. 
- beta12orEarlier - - - - - - - - - - Sequence cluster format (nucleic acid) - - Format used for clusters of nucleotide sequences. - beta12orEarlier - - - - - - - - - - Gene cluster format - - true - beta13 - beta12orEarlier - Format used for clusters of genes. - - - - - - - - - - EMBL-like (text) - - - This concept may be used for the many non-standard EMBL-like text formats. - beta12orEarlier - A text format resembling EMBL entry format. - - - - - - - - - - FASTQ-like format (text) - - - A text format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - EMBLXML - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - cdsxml - - XML format for EMBL entries. - beta12orEarlier - - - - - - - - - - insdxml - - beta12orEarlier - XML format for EMBL entries. - - - - - - - - - - geneseq - - Geneseq sequence format. - beta12orEarlier - - - - - - - - - - UniProt-like (text) - - - A text sequence format resembling uniprotkb entry format. - beta12orEarlier - - - - - - - - - - UniProt format - - beta12orEarlier - true - UniProt entry sequence format. - 1.8 - - - - - - - - - - ipi - - 1.8 - beta12orEarlier - ipi sequence format. - true - - - - - - - - - - medline - - - Abstract format used by MedLine database. - beta12orEarlier - - - - - - - - - - Ontology format - - - - - - - - Format used for ontologies. - beta12orEarlier - - - - - - - - - - OBO format - - beta12orEarlier - A serialisation format conforming to the Open Biomedical Ontologies (OBO) model. - - - - - - - - - - OWL format - - A serialisation format conforming to the Web Ontology Language (OWL) model. - beta12orEarlier - - - - - - - - - - FASTA-like (text) - - - This concept may also be used for the many non-standard FASTA-like formats. - http://filext.com/file-extension/FASTA - beta12orEarlier - A text format resembling FASTA format. 
- - - - - - - - - - Sequence record full format - - 1.8 - beta12orEarlier - Data format for a molecular sequence record, typically corresponding to a full entry from a molecular sequence database. - true - - - - - - - - - - Sequence record lite format - - true - 1.8 - beta12orEarlier - Data format for a molecular sequence record 'lite', typically molecular sequence and minimal metadata, such as an identifier of the sequence and/or a comment. - - - - - - - - - - EMBL format (XML) - - beta12orEarlier - An XML format for EMBL entries. - This is a placeholder for other more specific concepts. It should not normally be used for annotation. - - - - - - - - - - GenBank-like format (text) - - - A text format resembling GenBank entry (plain text) format. - This concept may be used for the non-standard GenBank-like text formats. - beta12orEarlier - - - - - - - - - - Sequence feature table format (text) - - Text format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Strain data format - - Format of a report on organism strain data / cell line. - beta12orEarlier - true - 1.0 - - - - - - - - - - CIP strain data format - - Format for a report of strain data as used for CIP database entries. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - phylip property values - - true - PHYLIP file format for phylogenetic property data. - beta12orEarlier - beta12orEarlier - - - - - - - - - - STRING entry format (HTML) - - beta12orEarlier - true - beta12orEarlier - Entry format (HTML) for the STRING database of protein interaction. - - - - - - - - - - STRING entry format (XML) - - - Entry format (XML) for the STRING database of protein interaction. - beta12orEarlier - - - - - - - - - - GFF - - - GFF feature format (of indeterminate version). - beta12orEarlier - - - - - - - - - - GTF - - Gene Transfer Format (GTF), a restricted version of GFF. - beta12orEarlier - - - - - - - - - - - - - FASTA-HTML - - - FASTA format wrapped in HTML elements. 
- beta12orEarlier - - - - - - - - - - EMBL-HTML - - - EMBL entry format wrapped in HTML elements. - beta12orEarlier - - - - - - - - - - BioCyc enzyme report format - - true - beta12orEarlier - beta12orEarlier - Format of an entry from the BioCyc enzyme database. - - - - - - - - - - ENZYME enzyme report format - - Format of an entry from the Enzyme nomenclature database (ENZYME). - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - PseudoCAP gene report format - - true - beta12orEarlier - beta12orEarlier - Format of a report on a gene from the PseudoCAP database. - - - - - - - - - - GeneCards gene report format - - Format of a report on a gene from the GeneCards database. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Textual format - - http://filext.com/file-extension/TSV - http://www.iana.org/assignments/media-types/text/plain - Textual format. - Data in text format can be compressed into binary format, or can be a value of an XML element or attribute. Markup formats are not considered textual (or more precisely, not plain-textual). - txt - http://filext.com/file-extension/TXT - Plain text - http://www.iana.org/assignments/media-types/media-types.xhtml#text - beta12orEarlier - - - - - - - - - - HTML - - - - - - - - HTML format. - beta12orEarlier - http://filext.com/file-extension/HTML - Hypertext Markup Language - - - - - - - - - - XML - - Data in XML format can be serialised into text, or binary format. - beta12orEarlier - eXtensible Markup Language (XML) format. - xml - - Extensible Markup Language - - - - - - - - - - - - - Binary format - - Only specific native binary formats are listed under 'Binary format' in EDAM. Generic binary formats - such as any data being zipped, or any XML data being serialised into the Efficient XML Interchange (EXI) format - are not modelled in EDAM. Refer to http://wsio.org/compression_004. - beta12orEarlier - Binary format. 
- - - - - - - - - - URI format - - beta13 - true - Typical textual representation of a URI. - beta12orEarlier - - - - - - - - - - NCI-Nature pathway entry format - - beta12orEarlier - true - The format of an entry from the NCI-Nature pathways database. - beta12orEarlier - - - - - - - - - - Format (typed) - - This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. The concept branch under 'Format (typed)' provides an alternative organisation of the concepts nested under the other top-level branches ('Binary', 'HTML', 'RDF', 'Text' and 'XML'. All concepts under here are already included under those branches. - beta12orEarlier - A broad class of format distinguished by the scientific nature of the data that is identified. - - - - - - - - - - BioXSD - - - - - - - - - - - - - - - - - - - - - - - - BioXSD XML format - beta12orEarlier - BioXSD XML format of basic bioinformatics types of data (sequence records, alignments, feature records, references to resources, and more). - - - - - - - - - - - - RDF format - - - beta12orEarlier - A serialisation format conforming to the Resource Description Framework (RDF) model. - - - - - - - - - - GenBank-HTML - - - beta12orEarlier - Genbank entry format wrapped in HTML elements. - - - - - - - - - - Protein features (domains) format - - beta12orEarlier - true - beta12orEarlier - Format of a report on protein features (domain composition). - - - - - - - - - - EMBL-like format - - beta12orEarlier - A format resembling EMBL entry (plain text) format. - This concept may be used for the many non-standard EMBL-like formats. - - - - - - - - - - FASTQ-like format - - A format resembling FASTQ short read format. - This concept may be used for non-standard FASTQ short read-like formats. - beta12orEarlier - - - - - - - - - - FASTA-like - - This concept may be used for the many non-standard FASTA-like formats. - beta12orEarlier - A format resembling FASTA format. 
- - - - - - - - - - uniprotkb-like format - - - beta12orEarlier - A sequence format resembling uniprotkb entry format. - - - - - - - - - - Sequence feature table format - - - - - - - - Format for a sequence feature table. - beta12orEarlier - - - - - - - - - - OBO - - - beta12orEarlier - OBO ontology text format. - - - - - - - - - - OBO-XML - - - beta12orEarlier - OBO ontology XML format. - - - - - - - - - - Sequence record format (text) - - Data format for a molecular sequence record. - beta12orEarlier - - - - - - - - - - Sequence record format (XML) - - beta12orEarlier - Data format for a molecular sequence record. - - - - - - - - - - Sequence feature table format (XML) - - XML format for a sequence feature table. - beta12orEarlier - - - - - - - - - - Alignment format (text) - - Text format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Alignment format (XML) - - XML format for molecular sequence alignment information. - beta12orEarlier - - - - - - - - - - Phylogenetic tree format (text) - - beta12orEarlier - Text format for a phylogenetic tree. - - - - - - - - - - Phylogenetic tree format (XML) - - beta12orEarlier - XML format for a phylogenetic tree. - - - - - - - - - - EMBL-like (XML) - - - An XML format resembling EMBL entry format. - This concept may be used for the any non-standard EMBL-like XML formats. - beta12orEarlier - - - - - - - - - - GenBank-like format - - A format resembling GenBank entry (plain text) format. - beta12orEarlier - This concept may be used for the non-standard GenBank-like formats. - - - - - - - - - - STRING entry format - - beta12orEarlier - Entry format for the STRING database of protein interaction. - beta12orEarlier - true - - - - - - - - - - Sequence assembly format (text) - - beta12orEarlier - Text format for sequence assembly data. - - - - - - - - - - Amino acid identifier format - - beta13 - Text format (representation) of amino acid residues. 
- true - beta12orEarlier - - - - - - - - - - completely unambiguous - - - beta12orEarlier - Alphabet for a molecular sequence without any unknown positions or ambiguity characters. - - - - - - - - - - completely unambiguous pure - - - beta12orEarlier - Alphabet for a molecular sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure nucleotide - - - Alphabet for a nucleotide sequence (characters ACGTU only) without unknown positions, ambiguity or non-sequence characters . - beta12orEarlier - - - - - - - - - - completely unambiguous pure dna - - - beta12orEarlier - Alphabet for a DNA sequence (characters ACGT only) without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - completely unambiguous pure rna sequence - - - Alphabet for an RNA sequence (characters ACGU only) without unknown positions, ambiguity or non-sequence characters. - beta12orEarlier - - - - - - - - - - Raw sequence format - - - - - - - - http://www.onto-med.de/ontologies/gfo.owl#Symbol_sequence - beta12orEarlier - Format of a raw molecular sequence (i.e. the alphabet used). - - - - - - - - - - BAM - - - - beta12orEarlier - BAM format, the binary, BGZF-formatted compressed version of SAM format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). May contain base-call and alignment qualities and other data. - - - - - - - - - - - - SAM - - - - The format supports short and long reads (up to 128Mbp) produced by different sequencing platforms and is used to hold mapped data within the GATK and across the Broad Institute, the Sanger Centre, and throughout the 1000 Genomes project. - beta12orEarlier - Sequence Alignment/Map (SAM) format for alignment of nucleotide sequences (e.g. sequencing reads) to (a) reference sequence(s). May contain base-call and alignment qualities and other data. 
- - - - - - - - - - - - SBML - - - Systems Biology Markup Language (SBML), the standard XML format for models of biological processes such as for example metabolism, cell signaling, and gene regulation. - beta12orEarlier - - - - - - - - - - - - completely unambiguous pure protein - - - beta12orEarlier - Alphabet for any protein sequence without unknown positions, ambiguity or non-sequence characters. - - - - - - - - - - Bibliographic reference format - - - - - - - - - - - - - - Format of a bibliographic reference. - beta12orEarlier - - - - - - - - - - Sequence annotation track format - - - - - - - - Format of a sequence annotation track. - beta12orEarlier - - - - - - - - - - Alignment format (pair only) - - - - - - - - beta12orEarlier - Data format for molecular sequence alignment information that can hold sequence alignment(s) of only 2 sequences. - - - - - - - - - - Sequence variation annotation format - - - - - - - - Format of sequence variation annotation. - beta12orEarlier - - - - - - - - - - markx0 variant - - - Some variant of Pearson MARKX alignment format. - beta12orEarlier - - - - - - - - - - mega variant - - - - Some variant of Mega format for (typically aligned) sequences. - beta12orEarlier - - - - - - - - - - Phylip format variant - - - - beta12orEarlier - Some variant of Phylip format for (aligned) sequences. - - - - - - - - - - AB1 - - - beta12orEarlier - AB1 binary format of raw DNA sequence reads (output of Applied Biosystems' sequencing analysis software). Contains an electropherogram and the DNA base sequence. - AB1 uses the generic binary Applied Biosystems, Inc. Format (ABIF). - - - - - - - - - - ACE - - - ACE sequence assembly format including contigs, base-call qualities, and other metadata (version Aug 1998 and onwards). 
- beta12orEarlier - - - - - - - - - - - - BED - - - beta12orEarlier - BED detail format includes 2 additional columns (http://genome.ucsc.edu/FAQ/FAQformat#format1.7) and BED 15 includes 3 additional columns for experiment scores (http://genomewiki.ucsc.edu/index.php/Microarray_track). - Browser Extensible Data (BED) format of sequence annotation track, typically to be displayed in a genome browser. - - - - - - - - - - - - bigBed - - - beta12orEarlier - bigBed format for large sequence annotation tracks, similar to textual BED format. - - - - - - - - - - - - WIG - - - Wiggle format (WIG) of a sequence annotation track that consists of a value for each sequence position. Typically to be displayed in a genome browser. - beta12orEarlier - - - - - - - - - - - - bigWig - - - beta12orEarlier - bigWig format for large sequence annotation tracks that consist of a value for each sequence position. Similar to textual WIG format. - - - - - - - - - - - - PSL - - - - PSL format of alignments, typically generated by BLAT or psLayout. Can be displayed in a genome browser like a sequence annotation track. - beta12orEarlier - - - - - - - - - - - - MAF - - - - Multiple Alignment Format (MAF) supporting alignments of whole genomes with rearrangements, directions, multiple pieces to the alignment, and so forth. - Typically generated by Multiz and TBA aligners; can be displayed in a genome browser like a sequence annotation track. This should not be confused with MIRA Assembly Format or Mutation Annotation Format. - beta12orEarlier - - - - - - - - - - - - 2bit - - - beta12orEarlier - 2bit binary format of nucleotide sequences using 2 bits per nucleotide. In addition encodes unknown nucleotides and lower-case 'masking'. - - - - - - - - - - - - - .nib - - - beta12orEarlier - .nib (nibble) binary format of a nucleotide sequence using 4 bits per nucleotide (including unknown) and its lower-case 'masking'. 
- - - - - - - - - - - - genePred - - - genePred table format for gene prediction tracks. - genePred format has 3 main variations (http://genome.ucsc.edu/FAQ/FAQformat#format9 http://www.broadinstitute.org/software/igv/genePred). They reflect UCSC Browser DB tables. - beta12orEarlier - - - - - - - - - - - - pgSnp - - - Personal Genome SNP (pgSnp) format for sequence variation tracks (indels and polymorphisms), supported by the UCSC Genome Browser. - beta12orEarlier - - - - - - - - - - - - axt - - - beta12orEarlier - axt format of alignments, typically produced from BLASTZ. - - - - - - - - - - - - LAV - - - beta12orEarlier - LAV format of alignments generated by BLASTZ and LASTZ. - - - - - - - - - - - - Pileup - - - beta12orEarlier - Pileup format of alignment of sequences (e.g. sequencing reads) to (a) reference sequence(s). Contains aligned bases per base of the reference sequence(s). - - - - - - - - - - - - VCF - - - beta12orEarlier - Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - - SRF - - - Sequence Read Format (SRF) of sequence trace data. Supports submission to the NCBI Short Read Archive. - beta12orEarlier - - - - - - - - - - - - ZTR - - - ZTR format for storing chromatogram data from DNA sequencing instruments. - beta12orEarlier - - - - - - - - - - - - GVF - - - Genome Variation Format (GVF). A GFF3-compatible format with defined header and attribute tags for sequence variation. - beta12orEarlier - - - - - - - - - - - - BCF - - - beta12orEarlier - BCF, the binary version of Variant Call Format (VCF) for sequence variation (indels, polymorphisms, structural variation). - - - - - - - - - - - Matrix format - - - - - - - - Format of a matrix (array) of numerical values. - beta13 - - - - - - - - - - Protein domain classification format - - - - - - - - Format of data concerning the classification of the sequences and/or structures of protein structural domain(s). 
- beta13 - - - - - - - - - - Raw SCOP domain classification format - - Format of raw SCOP domain classification data files. - These are the parsable data files provided by SCOP. - beta13 - - - - - - - - - - Raw CATH domain classification format - - These are the parsable data files provided by CATH. - beta13 - Format of raw CATH domain classification data files. - - - - - - - - - - CATH domain report format - - Format of summary of domain classification information for a CATH domain. - beta13 - The report (for example http://www.cathdb.info/domain/1cukA01) includes CATH codes for levels in the hierarchy for the domain, level descriptions and relevant data and links. - - - - - - - - - - SBRML - - - 1.0 - Systems Biology Result Markup Language (SBRML), the standard XML format for simulated or calculated results (e.g. trajectories) of systems biology models. - - - - - - - - - - - - BioPAX - - BioPAX is an exchange format for pathway data, with its data model defined in OWL. - 1.0 - - - - - - - - - - - - EBI Application Result XML - - - - EBI Application Result XML is a format returned by sequence similarity search Web services at EBI. - 1.0 - - - - - - - - - - - - PSI MI XML (MIF) - - - 1.0 - XML Molecular Interaction Format (MIF), standardised by HUPO PSI MI. - MIF - - - - - - - - - - - - phyloXML - - - phyloXML is a standardised XML format for phylogenetic trees, networks, and associated data. - 1.0 - - - - - - - - - - - - NeXML - - - 1.0 - NeXML is a standardised XML format for rich phyloinformatic data. - - - - - - - - - - - - MAGE-ML - - - - - - - - - 1.0 - MAGE-ML XML format for microarray expression data, standardised by MGED (now FGED). - - - - - - - - - - - - MAGE-TAB - - - - - - - - - MAGE-TAB textual format for microarray expression data, standardised by MGED (now FGED). 
- 1.0 - - - - - - - - - - - - GCDML - - - GCDML XML format for genome and metagenome metadata according to MIGS/MIMS/MIMARKS information standards, standardised by the Genomic Standards Consortium (GSC). - 1.0 - - - - - - - - - - - - GTrack - - - 1.0 - GTrack is an optimised tabular format for genome/sequence feature tracks unifying the power of other tabular formats (e.g. GFF3, BED, WIG). - - - - - - - - - - - - Biological pathway or network report format - - - - - - - - Data format for a report of information derived from a biological pathway or network. - beta12orEarlier - - - - - - - - - - Experiment annotation format - - - - - - - - beta12orEarlier - Data format for annotation on a laboratory experiment. - - - - - - - - - - Cytoband format - - - - - - - - - 1.2 - Cytoband format for chromosome cytobands. - Reflects a UCSC Browser DB table. - - - - - - - - - - - - CopasiML - - - - 1.2 - CopasiML, the native format of COPASI. - - - - - - - - - - - - CellML - - - CellML, the format for mathematical models of biological and other networks. - 1.2 - - - - - - - - - - - - - - PSI MI TAB (MITAB) - - - 1.2 - Tabular Molecular Interaction format (MITAB), standardised by HUPO PSI MI. - - - - - - - - - - - - PSI-PAR - - Protein affinity format (PSI-PAR), standardised by HUPO PSI MI. It is compatible with PSI MI XML (MIF) and uses the same XML Schema. - 1.2 - - - - - - - - - - - - mzML - - - mzML is the successor and unifier of the mzData format developed by PSI and mzXML developed at the Seattle Proteome Center. - 1.2 - mzML format for raw spectrometer output data, standardised by HUPO PSI MSS. - - - - - - - - - - - - Mass spectrometry data format - - - - - - - - Format for mass pectra and derived data, include peptide sequences etc. - 1.2 - - - - - - - - - - TraML - - - TraML (Transition Markup Language) is the format for mass spectrometry transitions, standardised by HUPO PSI MSS. 
- 1.2 - - - - - - - - - - - - mzIdentML - - - mzIdentML is the exchange format for peptides and proteins identified from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of proteomics search engines. - 1.2 - - - - - - - - - - - - mzQuantML - - - mzQuantML is the format for quantitation values associated with peptides, proteins and small molecules from mass spectra, standardised by HUPO PSI PI. It can be used for outputs of quantitation software for proteomics. - 1.2 - - - - - - - - - - - - GelML - - - 1.2 - GelML is the format for describing the process of gel electrophoresis, standardised by HUPO PSI PS. - - - - - - - - - - - - spML - - - 1.2 - spML is the format for describing proteomics sample processing, other than using gels, prior to mass spectrometric protein identification, standardised by HUPO PSI PS. It may also be applicable for metabolomics. - - - - - - - - - - - - OWL Functional Syntax - - - A human-readable encoding for the Web Ontology Language (OWL). - 1.2 - - - - - - - - - - Manchester OWL Syntax - - - A syntax for writing OWL class expressions. - 1.2 - This format was influenced by the OWL Abstract Syntax and the DL style syntax. - - - - - - - - - - KRSS2 Syntax - - - This format is used in Protege 4. - A superset of the "Description-Logic Knowledge Representation System Specification from the KRSS Group of the ARPA Knowledge Sharing Effort". - 1.2 - - - - - - - - - - Turtle - - - The SPARQL Query Language incorporates a very similar syntax. - 1.2 - The Terse RDF Triple Language (Turtle) is a human-friendly serialization format for RDF (Resource Description Framework) graphs. - - - - - - - - - - N-Triples - - - N-Triples should not be confused with Notation 3 which is a superset of Turtle. - 1.2 - A plain text serialisation format for RDF (Resource Description Framework) graphs, and a subset of the Turtle (Terse RDF Triple Language) format. 
- - - - - - - - - - Notation3 - - - N3 - A shorthand non-XML serialization of Resource Description Framework model, designed with human-readability in mind. - - - - - - - - - - RDF/XML - - - - RDF - Resource Description Framework (RDF) XML format. - 1.2 - http://www.ebi.ac.uk/SWO/data/SWO_3000006 - RDF/XML is a serialization syntax for OWL DL, but not for OWL Full. - - - - - - - - - - OWL/XML - - - OWL ontology XML serialisation format. - 1.2 - OWL - - - - - - - - - - A2M - - - The A2M format is used as the primary format for multiple alignments of protein or nucleic-acid sequences in the SAM suite of tools. It is a small modification of FASTA format for sequences and is compatible with most tools that read FASTA. - 1.3 - - - - - - - - - - - - SFF - - - Standard flowgram format - Standard flowgram format (SFF) is a binary file format used to encode results of pyrosequencing from the 454 Life Sciences platform for high-throughput sequencing. - 1.3 - - - - - - - - - - - - MAP - - The MAP file describes SNPs and is used by the Plink package. - 1.3 - Plink MAP - - - - - - - - - - - PED - - Plink PED - 1.3 - The PED file describes individuals and genetic data and is used by the Plink package. - - - - - - - - - - - Individual genetic data format - - Data format for a metadata on an individual and their genetic data. - 1.3 - - - - - - - - - - PED/MAP - - - The PED/MAP file describes data used by the Plink package. - Plink PED/MAP - 1.3 - - - - - - - - - - - CT - - - File format of a CT (Connectivity Table) file from the RNAstructure package. - beta12orEarlier - Connect format - Connectivity Table file format - - - - - - - - - - - - SS - - - beta12orEarlier - XRNA old input style format. - - - - - - - - - - - RNAML - - - - RNA Markup Language. - beta12orEarlier - - - - - - - - - - - GDE - - - Format for the Genetic Data Environment (GDE). 
- beta12orEarlier - - - - - - - - - - - BLC - - 1.3 - Block file format - A multiple alignment in vertical format, as used in the AMPS (Alignment of Multiple Protein Sequences) pacakge. - - - - - - - - - - - Data index format - - - - - - - - - 1.3 - - - - - - - - - - BAI - - - - - - - - 1.3 - BAM indexing format - - - - - - - - - - - HMMER2 - - HMMER profile HMM file for HMMER versions 2.x - 1.3 - - - - - - - - - - - HMMER3 - - 1.3 - HMMER profile HMM file for HMMER versions 3.x - - - - - - - - - - - PO - - EMBOSS simple sequence pair alignment format. - 1.3 - - - - - - - - - - - BLAST XML results format - - - XML format as produced by the NCBI Blast package - 1.3 - - - - - - - - - - CRAM - - - Reference-based compression of alignment format - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - http://www.ebi.ac.uk/ena/software/cram-usage#format_specification http://samtools.github.io/hts-specs/CRAMv2.1.pdf - 1.7 - - - - - - - - - - JSON - - 1.7 - Javascript Object Notation format; a lightweight, text-based format to represent structured data using key-value pairs. - - - - - - - - - - EPS - - Encapsulated PostScript format - 1.7 - - - - - - - - - - GIF - - 1.7 - Graphics Interchange Format. - - - - - - - - - - xls - - - Microsoft Excel spreadsheet format. - Microsoft Excel format - 1.7 - - - - - - - - - - TSV - - Tabular format - http://filext.com/file-extension/CSV - http://www.iana.org/assignments/media-types/text/csv - Tabular data represented as tab-separated values in a text file. - 1.7 - http://filext.com/file-extension/TSV - CSV - - - - - - - - - - Gene expression data format - - true - 1.10 - 1.7 - Format of a file of gene expression data, e.g. a gene expression matrix or profile. - - - - - - - - - - Cytoscape input file format - - - Format of the cytoscape input file of gene expression ratios or values are specified over one or more experiments. 
- 1.7 - - - - - - - - - - ebwt - - - - - - - - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie index format - 1.7 - Bowtie format for indexed reference genome for "small" genomes. - - - - - - - - - - RSF - - http://www.molbiol.ox.ac.uk/tutorials/Seqlab_GCG.pdf - RSF-format files contain one or more sequences that may or may not be related. In addition to the sequence data, each sequence can be annotated with descriptive sequence information (from the GCG manual). - Rich sequence format. - 1.7 - GCG RSF - - - - - - - - - - GCG format variant - - - - 1.7 - Some format based on the GCG format. - - - - - - - - - - BSML - - - http://rothlab.ucdavis.edu/genhelp/chapter_2_using_sequences.html#_Creating_and_Editing_Single_Sequenc - Bioinformatics Sequence Markup Language format. - 1.7 - - - - - - - - - - ebwtl - - - - - - - - 1.7 - https://github.com/BenLangmead/bowtie/blob/master/MANUAL - Bowtie long index format - Bowtie format for indexed reference genome for "large" genomes. - - - - - - - - - - Ensembl variation file format - - - Ensembl standard format for variation data. - 1.8 - - - - - - - - - - - docx - - - 1.8 - Microsoft Word format - doc - Microsoft Word format. - - - - - - - - - - Document format - - Format of documents including word processor, spreadsheet and presentation. - 1.8 - - - - - - - - - - PDF - - - 1.8 - Portable Document Format - - - - - - - - - - Image format - - - - - - - - Format used for images and image metadata. - 1.9 - - - - - - - - - - DICOM format - - - 1.9 - Medical image format corresponding to the Digital Imaging and Communications in Medicine (DICOM) standard. - - - - - - - - - - - - - nii - - - Medical image and metadata format of the Neuroimaging Informatics Technology Initiative. - - - NIfTI-1 format - 1.9 - - - - - - - - - - - mhd - - - Metalmage format - 1.9 - Text-based tagged file format for medical images generated using the MetaImage software package. 
- - - - - - - - - - - nrrd - - - 1.9 - Nearly Raw Rasta Data format designed to support scientific visualization and image processing involving N-dimensional raster data. - - - - - - - - - - - R file format - - File format used for scripts written in the R programming language for execution within the R software environment, typically for statistical computation and graphics. - - 1.9 - - - - - - - - - - SPSS - - 1.9 - File format used for scripts for the Statistical Package for the Social Sciences. - - - - - - - - - - - MHT - MIME HTML format for Web pages, which can include external resources, including images, Flash animations and so on. - - EMBL entry format wrapped in HTML elements. - 1.9 - MHTML - - - - - - - - - - IDAT - - - - - - - - - Proprietary file format for (raw) BeadArray data used by genomewide profiling platforms from Illumina Inc. This format is output directly from the scanner and stores summary intensities for each probe-type on an array. - 1.10 - - - - - - - - - - JPG - - - 1.10 - Joint Picture Group file format for lossy graphics file. - - Sequence of segments with markers. Begins with byte of 0xFF and follows by marker type. - - - - - - - - - - - rcc - - - 1.10 - Reporter Code Count-A data file (.csv) output by the Nanostring nCounter Digital Analyzer, which contains gene sample information, probe information and probe counts. - - - - - - - - - - arff - - ARFF (Attribute-Relation File Format) is an ASCII text file format that describes a list of instances sharing a set of attributes. - 1.11 - This file format is for machine learning. - - - - - - - - - - - - afg - - - 1.11 - AFG is a single text-based file assembly format that holds read and consensus information together - - - - - - - - - - - - bedgraph - - - Holds a tab-delimited chromosome /start /end / datavalue dataset. - 1.11 - The bedGraph format allows display of continuous-valued data in track format. 
This display type is useful for probability scores and transcriptome data - - - - - - - - - - - - bedstrict - - Browser Extensible Data (BED) format of sequence annotation track that strictly does not contain non-standard fields beyond the first 3 columns. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - 1.11 - - - - - - - - - - - - bed6 - - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 6 - BED file format where each feature is described by chromosome, start, end, name, score, and strand. - 1.11 - - - - - - - - - - - - bed12 - - 1.11 - Tab delimited data in strict BED format - no non-standard columns allowed; column count forced to 12 - A BED file where each feature is described by all twelve columns. - - - - - - - - - - - - chrominfo - - - 1.11 - Tabular format of chromosome names and sizes used by Galaxy. - Galaxy allows BED files to contain non-standard fields beyond the first 3 columns, some other implementations do not. - - - - - - - - - - - - customtrack - - - 1.11 - Custom Sequence annotation track format used by Galaxy. - Used for tracks/track views within galaxy. - - - - - - - - - - - - csfasta - - - Color space FASTA format sequence variant. - 1.3 - FASTA format extended for color space information. - - - - - - - - - - - - hdf5 - - An HDF5 file appears to the user as a directed graph. The nodes of this graph are the higher-level HDF5 objects that are exposed by the HDF5 APIs: Groups, Datasets, Named datatypes. H5py uses straightforward NumPy and Python metaphors, like dictionary and NumPy array syntax. - 1.11 - h5 - Binary format used by Galaxy for hierarchical data. - - - - - - - - - - - - tiff - - - The TIFF format is perhaps the most versatile and diverse bitmap format in existence. 
Its extensible nature and support for numerous data compression schemes allow developers to customize the TIFF format to fit any peculiar data storage needs. - - A versatile bitmap format. - 1.11 - - - - - - - - - - - bmp - - - Standard bitmap storage format in the Microsoft Windows environment. - 1.11 - Although it is based on Windows internal bitmap data structures, it is supported by many non-Windows and non-PC applications. - - - - - - - - - - - im - - - IM is a format used by LabEye and other applications based on the IFUNC image processing library. - IFUNC library reads and writes most uncompressed interchange versions of this format. - - 1.11 - - - - - - - - - - - pcd - - - PCD was developed by Kodak. A PCD file contains five different resolution (ranging from low to high) of a slide or film negative. Due to it PCD is often used by many photographers and graphics professionals for high-end printed applications. - 1.11 - Photo CD format, which is the highest resolution format for images on a CD. - - - - - - - - - - - pcx - - - 1.11 - PCX is an image file format that uses a simple form of run-length encoding. It is lossless. - - - - - - - - - - - - ppm - - - The PPM format is a lowest common denominator color image file format. - - 1.11 - - - - - - - - - - - psd - - - 1.11 - PSD (Photoshop Document) is a proprietary file that allows the user to work with the images’ individual layers even after the file has been saved. - - - - - - - - - - - xbm - - - The XBM format was replaced by XPM for X11 in 1989. - 1.11 - X BitMap is a plain text binary image format used by the X Window System used for storing cursor and icon bitmaps used in the X GUI. - - - - - - - - - - - xpm - - - X PixMap (XPM) is an image file format used by the X Window System, it is intended primarily for creating icon pixmaps, and supports transparent pixels. - - 1.11 - Sequence of segments with markers. Begins with byte of 0xFF and follows by marker type. 
- - - - - - - - - - - rgb - - - RGB file format is the native raster graphics file format for Silicon Graphics workstations. - - 1.11 - - - - - - - - - - - pbm - - - The PBM format is a lowest common denominator monochrome file format. It serves as the common language of a large family of bitmap image conversion filters. - - 1.11 - - - - - - - - - - - pgm - - - It is designed to be extremely easy to learn and write programs for. - The PGM format is a lowest common denominator grayscale file format. - - 1.11 - - - - - - - - - - - PNG - - - 1.11 - png - PNG is a file format for image compression. - - It iis expected to replace the Graphics Interchange Format (GIF). - - - - - - - - - - - SVG - - - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - Scalable Vector Graphics (SVG) is an XML-based vector image format for two-dimensional graphics with support for interactivity and animation. - svg - Scalable Vector Graphics - 1.11 - - - - - - - - - - - rast - - - Sun Raster is a raster graphics file format used on SunOS by Sun Microsystems - 1.11 - The SVG specification is an open standard developed by the World Wide Web Consortium (W3C) since 1999. - - - - - - - - - - - Sequence quality report format (text) - - - - - - - - - Textual report format for sequence quality for reports from sequencing machines. - 1.11 - - - - - - - - - - qual - - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - Phred quality scores are defined as a property which is logarithmically related to the base-calling error probabilities. - FASTQ format subset for Phred sequencing quality score data only (no sequences). 
- - - - - - - - - - qualsolexa - - - Solexa/Illumina 1.0 format can encode a Solexa/Illumina quality score from -5 to 62 using ASCII 59 to 126 (although in raw read data Solexa scores from -5 to 40 only are expected) - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) for Solexa/Illumina 1.0 format. - - - - - - - - - - qualillumina - - - Starting in Illumina 1.5 and before Illumina 1.8, the Phred scores 0 to 2 have a slightly different meaning. The values 0 and 1 are no longer used and the value 2, encoded by ASCII 66 "B", is used also at the end of reads as a Read Segment Quality Control Indicator. - FASTQ format subset for Phred sequencing quality score data only (no sequences) from Illumina 1.5 and before Illumina 1.8. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qualsolid - - For SOLiD data, the sequence is in color space, except the first position. The quality values are those of the Sanger format. - FASTQ format subset for Phred sequencing quality score data only (no sequences) for SOLiD data. - 1.11 - http://en.wikipedia.org/wiki/Phred_quality_score - - - - - - - - - - qual454 - - http://en.wikipedia.org/wiki/Phred_quality_score - 1.11 - FASTQ format subset for Phred sequencing quality score data only (no sequences) from 454 sequencers. - - - - - - - - - - ENCODE peak format - - 1.11 - Human ENCODE peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE narrow peak format - - 1.11 - Human ENCODE narrow peak format. - Format that covers both the broad peak format and narrow peak format from ENCODE. - - - - - - - - - - - - ENCODE broad peak format - - 1.11 - Human ENCODE broad peak format. - - - - - - - - - - - - bgzip - - - BAM files are compressed using a variant of GZIP (GNU ZIP), into a format called BGZF (Blocked GNU Zip Format). - Blocked GNU Zip format. 
- 1.11 - - - - - - - - - - - tabix - - - TAB-delimited genome position file index format. - 1.11 - - - - - - - - - - - - Graph format - - Data format for graph data. - 1.11 - - - - - - - - - - xgmml - - XML-based format used to store graph descriptions within Galaxy. - 1.11 - - - - - - - - - - - sif - - 1.11 - SIF (simple interaction file) Format - a network/pathway format used for instance in cytoscape. - - - - - - - - - - - xlsx - - - 1.11 - MS Excel spreadsheet format consisting of a set of XML documents stored in a ZIP-compressed file. - - - - - - - - - - SQLite - - https://www.sqlite.org/fileformat2.html - Data format used by the SQLite database. - 1.11 - - - - - - - - - - GeminiSQLite - - https://gemini.readthedocs.org/en/latest/content/quick_start.html - 1.11 - Data format used by the SQLite database conformant to the Gemini schema. - - - - - - - - - - Index format - - - - - - - - - Format of a data index of some type. - 1.11 - - - - - - - - - - snpeffdb - - An index of a genome database, indexed for use by the snpeff tool. - 1.11 - - - - - - - - - - MAT - - - - - - - - MATLAB file format - Binary format used by MATLAB files to store workspace variables. - 1.12 - MAT file format - .mat file format - - - - - - - - - - - netCDF - - 1.12 - ANDI-MS - Format used by netCDF software library for writing and reading chromatography-MS data files. - - - - - - - - - - - MGF - - Files includes *m*/*z*, intensity pairs separated by headers; headers can contain a bit more information, including search engine instructions. - Mascot Generic Format. Encodes multiple MS/MS spectra in a single file. - 1.12 - - - - - - - - - - dta - - Each file contains one header line for the known or assumed charge and the mass of the precursor peptide ion, calculated from the measured *m*/*z* and the charge. This one line was then followed by all the *m*/*z*, intensity pairs that represent the spectrum. - 1.12 - Spectral data format file where each spectrum is written to a separate file. 
- - - - - - - - - - pkl - - Spectral data file similar to dta. - Differ from .dta only in subtleties of the header line format and content and support the added feature of being able to. - 1.12 - - - - - - - - - - mzXML - - 1.12 - https://dx.doi.org/10.1038%2Fnbt1031 - Common file format for proteomics mass spectrometric data developed at the Seattle Proteome Center/Institute for Systems Biology. - - - - - - - - - - pepXML - - http://sashimi.sourceforge.net/schema_revision/pepXML/pepXML_v118.xsd - Open data format for the storage, exchange, and processing of peptide sequence assignments of MS/MS scans, intended to provide a common data output format for many different MS/MS search engines and subsequent peptide-level analyses. - 1.12 - - - - - - - - - - GPML - - - 1.12 - Graphical Pathway Markup Language (GPML) is an XML format used - for exchanging biological pathways. - - - - - - - - - - - K-mer countgraph - - - 1.12 - oxlicg - http://www.iana.org/assignments/media-types/application/vnd.oxli.countgraph - A list of k-mers and their occurrences in a dataset. Can also be used as an implicit De Bruijn graph. - - - - - - - - - - - mzTab - - - 1.13 - mzTab is a tab-delimited format for mass spectrometry-based proteomics and metabolomics results. - - - - - - - - - - - - - imzML - - - - imzML is a data format for mass spectrometry imaging data. NB.: See comment. - 1.13 - imzML|ibd - Data is recorded in 2 files: '.imzML' is a metadata XML file based on mzML by HUPO-PSI, and '.ibd' is a binary file containing the mass spectra. - - - - - - - - - - - - - qcML - - - - The focus of qcML is towards mass spectrometry based proteomics, but the format is suitable for metabolomics and sequencing as well. - qcML is an XML format for quality-related data of mass spectrometry and other high-throughput measurements. 
- 1.13 - - - - - - - - - - - - PRIDE XML - - - - 1.13 - PRIDE XML is an XML format for mass spectra, peptide and protein identifications, and metadata about a corresponding measurement, sample, experiment. - - - - - - - - - - - - SED-ML - - - Simulation Experiment Description Markup Language (SED-ML) is an XML format for encoding simulation setups, according to the MIASE (Minimum Information About a Simulation Experiment) requirements. - 1.13 - - - - - - - - - - - - - - COMBINE OMEX - - - - 1.13 - An OMEX file is a ZIP container that includes a manifest file, listing the content of the archive, an optional metadata file adding information about the archive and its content, and the files describing the model. OMEX is one of the standardised formats within COMBINE (Computational Modeling in Biology Network). - Open Modeling EXchange format (OMEX) is a ZIPped format for encapsulating all information necessary for a modeling and simulation project in systems biology. - - - - - - - - - - - - - ISA-TAB - - - - ISA-TAB is based on MAGE-TAB. Other than tabular, the ISA model can also be represented in RDF, and in JSON (compliable with a set of defined JSON Schemata). - The Investigation / Study / Assay (ISA) tab-delimited (TAB) format incorporates metadata from -experiments employing a combination of technologies. - 1.13 - ISA-Tab - - - - - - - - - - - - SBtab - - - 1.13 - SBtab is a tabular format for biochemical network models. - - - - - - - - - - - - - BCML - - - 1.13 - Biological Connection Markup Language (BCML) is an XML format for biological pathways. - - - - - - - - - - - - BDML - - Biological Dynamics Markup Language (BDML) is an XML format for quantitative data describing biological dynamics. - 1.13 - - - - - - - - - - - - - BEL - - 1.13 - Biological Expression Language (BEL) is a textual format for representing scientific findings in life sciences in a computable form. 
- - - - - - - - - - - - SBGN-ML - - - SBGN-ML is an XML format for Systems Biology Graphical Notation (SBGN) diagrams of biological pathways or networks. - 1.13 - - - - - - - - - - - - AGP - - - 1.13 - AGP is a tabular format for a sequence assembly (a contig, a scaffold/supercontig, or a chromosome). - - - - - - - - - - - - PS - - PostScript - PostScript format - 1.13 - - - - - - - - - - SRA format - - SRA archive format (SRA) is the archive format used for input to the NCBI Sequence Read Archive. - SRA archive format - 1.13 - SRA - - - - - - - - - - - VDB - - VDB ('vertical database') is the native format used for export from the NCBI Sequence Read Archive (SRA). - SRA native format - 1.13 - SRA - - - - - - - - - - - Tabix index file format - - - - - - - - 1.3 - Index file format used by the samtools package to index TAB-delimited genome position files. - - - - - - - - - - - sequin - - A five-column, tab-delimited table of feature locations and qualifiers for importing annotation into an existing Sequin submission (an NCBI tool for submitting and updating GenBank entries). - 1.13 - - - - - - - - - - MSF - - Magellan storage file format - This format corresponds to an SQLite database, and you can look into the files with e.g. SQLiteStudio3. There are also some readers (http://pubs.acs.org/doi/abs/10.1021/pr2005154) and converters (http://www.sciencedirect.com/science/article/pii/S1874391915300531) for this format available, which re-engineered the database schema, but there is no official DB schema specification of Thermo Scientific for the format. - Proprietary mass-spectrometry format of Thermo Scientific's ProteomeDiscoverer software. - 1.14 - - - - - - - - - - Biodiversity data format - - - - - - - - Data format for biodiversity data. 
- 1.14 - - - - - - - - - - ABCD format - - - - - - - - ABCD - Exchange format of the Access to Biological Collections Data (ABCD) Schema; a standard for the access to and exchange of data about specimens and observations (primary biodiversity data). - 1.14 - - - - - - - - - - - GCT/Res format - - - Res format - Tab-delimited text files of GenePattern that contain a column for each sample, a row for each gene, and an expression value for each gene in each sample. - GCT format - 1.14 - - - - - - - - - - WIFF format - - - wiff - wiff - 1.14 - Mass spectrum file format from QSTAR and QTRAP instruments (ABI/Sciex). - - - - - - - - - - X!Tandem XML - - - - Output format used by X! series search engines that is based on the XML language BIOML. - 1.14 - - - - - - - - - - - Thermo RAW - - - Proprietary format for which documentation is not available. - Proprietary file format for mass spectrometry data from Thermo Scientific. - 1.14 - - - - - - - - - - Mascot .dat file - - - "Raw" result file from Mascot database search. - 1.14 - - - - - - - - - - - MaxQuant APL peaklist format - - - 1.14 - MaxQuant APL - Format of peak list files from Andromeda search engine (MaxQuant) that consist of arbitrarily many spectra. - - - - - - - - - - - SBOL - - 1.14 - SBOL introduces a standardized format for the electronic exchange of information on the structural and functional aspects of biological designs. - Synthetic Biology Open Language (SBOL) is an XML format for the specification and exchange of biological design information in synthetic biology. - - - - - - - - - - - PMML - - One or more mining models can be contained in a PMML document. - 1.14 - PMML uses XML to represent mining models. The structure of the models is described by an XML Schema. - - - - - - - - - - - OME-TIFF - - - Image file format used by the Open Microscopy Environment (OME). - - 1.14 - OME develops open-source software and data format standards for the storage and manipulation of biological microscopy data. 
It is a joint project between universities, research establishments, industry and the software development community. - An OME-TIFF dataset consists of one or more files in standard TIFF or BigTIFF format, with the file extension .ome.tif or .ome.tiff, and an identical (or in the case of multiple files, nearly identical) string of OME-XML metadata embedded in the ImageDescription tag of each file’s first IFD (Image File Directory). BigTIFF file extensions are also permitted, with the file extension .ome.tf2, .ome.tf8 or .ome.btf, but note these file extensions are an addition to the original specification, and software using an older version of the specification may not be able to handle these file extensions. - - - - - - - - - - - LocARNA PP - - 1.14 - Format for multiple aligned or single sequences together with the probabilistic description of the (consensus) RNA secondary structure ensemble by probabilities of base pairs, base pair stackings, and base pairs and unpaired bases in the loop of base pairs. - The LocARNA PP format combines sequence or alignment information and (respectively, single or consensus) ensemble probabilities into a PP 2.0 record. - - - - - - - - - - - dbGaP format - - Input format used by the Database of Genotypes and Phenotypes (dbGaP). - The Database of Genotypes and Phenotypes (dbGaP) is a National Institutes of Health (NIH) sponsored repository charged to archive, curate and distribute information produced by studies investigating the interaction of genotype and phenotype. - 1.14 - - - - - - - - - - - Operation - - - A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs). 
- http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - Computational tool - Function - http://purl.org/biotop/biotop.owl#Function - http://www.ifomis.org/bfo/1.1/snap#Function - http://en.wikipedia.org/wiki/Function_(mathematics) - Computational method - http://semanticscience.org/resource/SIO_000017 - http://www.ebi.ac.uk/swo/SWO_0000003 - Mathematical operation - sumo:Function - beta12orEarlier - Process - Computational operation - Computational subroutine - http://semanticscience.org/resource/SIO_000649 - Special cases are: a) An operation that consumes no input (has no input arguments). Such operation is either a constant function, or an operation depending only on the underlying state. b) An operation that may modify the underlying state but has no output. c) The singular-case operation with no input or output, that still may modify the underlying state. - http://www.ifomis.org/bfo/1.1/span#Process - http://www.ifomis.org/bfo/1.1/snap#Continuant - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - Computational procedure - Mathematical function - Lambda abstraction - Function (programming) - http://www.onto-med.de/ontologies/gfo.owl#Process - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - http://wsio.org/operation_001 - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#process - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Function - http://en.wikipedia.org/wiki/Function_(computer_science) - http://en.wikipedia.org/wiki/Subroutine - - - - - Process can have a function (as its quality/attribute), and can also perform an operation with inputs and outputs. - Process - - - - - Computational tool - Computational tool provides one or more operations. - - - - - Operation is a function that is computational. It typically has input(s) and output(s), which are always data. 
- Function - - - - - - - - - - Query and retrieval - - - - - - - - - - - - - - beta12orEarlier - Query - Search or query a data resource and retrieve entries and / or annotation. - Database retrieval - - - - - - - - - - Data retrieval (database cross-reference) - - beta12orEarlier - Search database to retrieve all relevant references to a particular entity or entry. - true - beta13 - - - - - - - - - - Annotation - - - - - - - - - - - - - - Annotate an entity (typically a biological or biomedical database entity) with terms from a controlled vocabulary. - beta12orEarlier - This is a broad concept and is used as a placeholder for other, more specific concepts. - - - - - - - - - - Indexing - - - - - - - - Data indexing - beta12orEarlier - Generate an index of (typically a file of) biological data. - Database indexing - - - - - - - - - - Data index analysis - - Database index analysis - Analyse an index of biological data. - beta12orEarlier - true - 1.6 - - - - - - - - - - Annotation retrieval (sequence) - - true - beta12orEarlier - Retrieve basic information about a molecular sequence. - beta12orEarlier - - - - - - - - - - Sequence generation - - - beta12orEarlier - Generate a molecular sequence by some means. - - - - - - - - - - Sequence editing - - - Edit or change a molecular sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence merging - - beta12orEarlier - Merge two or more (typically overlapping) molecular sequences. - Sequence splicing - - - - - - - - - - Sequence conversion - - - Convert a molecular sequence from one type to another. - beta12orEarlier - - - - - - - - - - Sequence complexity calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate sequence complexity, for example to find low-complexity regions in sequences. 
- - - - - - - - - - Sequence ambiguity calculation - - - - - - - - - - - - - - Calculate sequence ambiguity, for example identify regions in protein or nucleotide sequences with many ambiguity codes. - beta12orEarlier - - - - - - - - - - Sequence composition calculation - - - - - - - - - - - - - - - beta12orEarlier - Calculate character or word composition or frequency of a molecular sequence. - - - - - - - - - - Repeat sequence analysis - - - - - - - - Find and/or analyse repeat sequences in (typically nucleotide) sequences. - beta12orEarlier - Repeat sequences include tandem repeats, inverted or palindromic repeats, DNA microsatellites (Simple Sequence Repeats or SSRs), interspersed repeats, maximal duplications and reverse, complemented and reverse complemented repeats etc. Repeat units can be exact or imperfect, in tandem or dispersed, of specified or unspecified length. - - - - - - - - - - Sequence motif discovery - - - - - - - - - - - - - - - Motifs and patterns might be conserved or over-represented (occur with improbable frequency). - beta12orEarlier - Discover new motifs or conserved patterns in sequences or sequence alignments (de-novo discovery). - Motif discovery - - - - - - - - - - Sequence motif recognition - - - - - - - - - - - - - - - beta12orEarlier - Sequence signature recognition - Motif scanning - Motif search - Sequence motif search - Protein secondary database search - Motif detection - Sequence signature detection - Sequence profile search - Find (scan for) known motifs, patterns and regular expressions in molecular sequence(s). - Sequence motif detection - Motif recognition - - - - - - - - - - Sequence motif comparison - - - - - - - - - - - - - - - beta12orEarlier - Find motifs shared by molecular sequences. - - - - - - - - - - Transcription regulatory sequence analysis - - beta12orEarlier - beta13 - Analyse the sequence, conformational or physicochemical properties of transcription regulatory elements in DNA sequences. 
- For example transcription factor binding sites (TFBS) analysis to predict accessibility of DNA to binding factors. - true - - - - - - - - - - Conserved transcription regulatory sequence identification - - - For example cross-species comparison of transcription factor binding sites (TFBS). Methods might analyse co-regulated or co-expressed genes, or sets of oppositely expressed genes. - beta12orEarlier - Identify common, conserved (homologous) or synonymous transcriptional regulatory motifs (transcription factor binding sites). - - - - - - - - - - Protein property calculation (from structure) - - - - - - - - - - - - - - - This might be a residue-level search for properties such as solvent accessibility, hydropathy, secondary structure, ligand-binding etc. - Extract, calculate or predict non-positional (physical or chemical) properties of a protein from processing a protein (3D) structure. - beta12orEarlier - Protein structural property calculation - - - - - - - - - - Protein flexibility and motion analysis - - - beta12orEarlier - Analyse flexibility and motion in protein structure. - Use this concept for analysis of flexible and rigid residues, local chain deformability, regions undergoing conformational change, molecular vibrations or fluctuational dynamics, domain motions or other large-scale structural transitions in a protein structure. - - - - - - - - - - Protein structural motif recognition - - - - - - - - - Identify or screen for 3D structural motifs in protein structure(s). - This includes conserved substructures and conserved geometry, such as spatial arrangement of secondary structure or protein backbone. Methods might use structure alignment, structural templates, searches for similar electrostatic potential and molecular surface shape, surface-mapping of phylogenetic information etc. 
- beta12orEarlier - Protein structural feature identification - - - - - - - - - - Protein domain recognition - - - - - - - - - beta12orEarlier - Identify structural domains in a protein structure from first principles (for example calculations on structural compactness). - - - - - - - - - - Protein architecture analysis - - beta12orEarlier - Analyse the architecture (spatial arrangement of secondary structure) of protein structure(s). - - - - - - - - - - Residue interaction calculation - - - - - - - - WHATIF: SymShellTenXML - WHATIF:ListContactsRelaxed - WHATIF: SymShellTwoXML - WHATIF:ListSideChainContactsRelaxed - beta12orEarlier - WHATIF:ListSideChainContactsNormal - WHATIF:ListContactsNormal - Calculate or extract inter-atomic, inter-residue or residue-atom contacts, distances and interactions in protein structure(s). - WHATIF: SymShellFiveXML - WHATIF: SymShellOneXML - - - - - - - - - - Protein geometry calculation - - - - - - - - WHATIF:ResidueTorsions - beta12orEarlier - Backbone torsion angle calculation - WHATIF:CysteineTorsions - Calculate, visualise or analyse phi/psi angles of a protein structure. - WHATIF:ResidueTorsionsBB - WHATIF:ShowTauAngle - Torsion angle calculation - Tau angle calculation - Cysteine torsion angle calculation - - - - - - - - - - Protein property calculation - - - - This includes methods to render and visualise the properties of a protein sequence. - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - beta12orEarlier - Protein property rendering - - - - - - - - - - Peptide immunogenicity prediction - - - - - - - - - - - - - - - Immunogenicity prediction - beta12orEarlier - This is usually done in the development of peptide-specific antibodies or multi-epitope vaccines. Methods might use sequence data (for example motifs) and / or structural data. 
- This includes methods that generate a graphical rendering of antigenicity of a protein, such as a Hopp and Woods plot. - Hopp and Woods plotting - Predict antigenicity, allergenicity / immunogenicity, allergic cross-reactivity etc of peptides and proteins. - MHC peptide immunogenicity prediction - - - - - - - - - - Sequence feature detection - - - - - - - - - - - - - - - Sequence feature prediction - Predict, recognise and identify positional features in molecular sequences such as key functional sites or regions. - Sequence feature recognition - beta12orEarlier - Motif database search - SO:0000110 - - - - - - - - - - Data retrieval (feature table) - - beta13 - Extract a sequence feature table from a sequence database entry. - true - beta12orEarlier - - - - - - - - - - Feature table query - - 1.6 - beta12orEarlier - true - Query the features (in a feature table) of molecular sequence(s). - - - - - - - - - - Sequence feature comparison - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Compare the feature tables of two or more molecular sequences. - Feature comparison - Feature table comparison - - - - - - - - - - Data retrieval (sequence alignment) - - beta12orEarlier - true - beta13 - Display basic information about a sequence alignment. - - - - - - - - - - Sequence alignment analysis - - - - - - - - Analyse a molecular sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence alignment comparison - - - Compare (typically by aligning) two molecular sequence alignments. - beta12orEarlier - See also 'Sequence profile alignment'. - - - - - - - - - - Sequence alignment conversion - - - beta12orEarlier - Convert a molecular sequence alignment from one type to another (for example amino acid to coding nucleotide sequence). - - - - - - - - - - Nucleic acid property processing - - beta12orEarlier - true - Process (read and / or write) physicochemical property data of nucleic acids. 
- beta13 - - - - - - - - - - Nucleic acid property calculation - - - - - - - - - beta12orEarlier - Calculate or predict physical or chemical properties of nucleic acid molecules, including any non-positional properties of the molecular sequence. - - - - - - - - - - Splice transcript prediction - - - - - - - - beta12orEarlier - Predict splicing alternatives or transcript isoforms from analysis of sequence data. - - - - - - - - - - Frameshift detection - - - - - - - - - Detect frameshifts in DNA sequences, including frameshift sites and signals, and frameshift errors from sequencing projects. - Frameshift error detection - beta12orEarlier - Methods include sequence alignment (if related sequences are available) and word-based sequence comparison. - - - - - - - - - - Vector sequence detection - - - beta12orEarlier - Detect vector sequences in nucleotide sequence, typically by comparison to a set of known vector sequences. - - - - - - - - - - Protein secondary structure prediction - - - - Methods might use amino acid composition, local sequence information, multiple sequence alignments, physicochemical features, estimated energy content, statistical algorithms, hidden Markov models, support vector machines, kernel machines, neural networks etc. - Predict secondary structure of protein sequences. - Secondary structure prediction (protein) - beta12orEarlier - - - - - - - - - - Protein super-secondary structure prediction - - - - - - - - beta12orEarlier - Predict super-secondary structure of protein sequence(s). - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - - - - - - - - - - Transmembrane protein prediction - - - Predict and/or classify transmembrane proteins or transmembrane (helical) domains or regions in protein sequences. 
- beta12orEarlier - - - - - - - - - - Transmembrane protein analysis - - - - - - - - beta12orEarlier - Analyse transmembrane protein(s), typically by processing sequence and / or structural data, and write an informative report for example about the protein and its transmembrane domains / regions. - Use this (or child) concept for analysis of transmembrane domains (buried and exposed faces), transmembrane helices, helix topology, orientation, inter-helical contacts, membrane dipping (re-entrant) loops and other secondary structure etc. Methods might use pattern discovery, hidden Markov models, sequence alignment, structural profiles, amino acid property analysis, comparison to known domains or some combination (hybrid methods). - - - - - - - - - - Structure prediction - - - - - - - - - - - - - - - Predict tertiary structure of a molecular (biopolymer) sequence. - beta12orEarlier - - - - - - - - - - Residue interaction prediction - - - - - - - - - Methods usually involve multiple sequence alignment analysis. - Predict contacts, non-covalent interactions and distance (constraints) between amino acids in protein sequences. - beta12orEarlier - - - - - - - - - - Protein interaction raw data analysis - - - - - - - - - - - - - - Analyse experimental protein-protein interaction data from for example yeast two-hybrid analysis, protein microarrays, immunoaffinity chromatography followed by mass spectrometry, phage display etc. - beta12orEarlier - - - - - - - - - - Protein-protein interaction prediction (from protein sequence) - - beta12orEarlier - 1.12 - true - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein sequences. - - - - - - - - - - Protein-protein interaction prediction (from protein structure) - - true - 1.12 - beta12orEarlier - Identify or predict protein-protein interactions, interfaces, binding sites etc in protein structures. 
- - - - - - - - - - Protein interaction network analysis - - - - - - - - - - - - - - - beta12orEarlier - Analyse a network of protein interactions. - - - - - - - - - - Protein interaction network comparison - - - beta12orEarlier - Compare two or more networks of protein interactions. - - - - - - - - - - RNA secondary structure prediction - - - - - - - - - - Predict RNA secondary structure (for example knots, pseudoknots, alternative structures etc). - beta12orEarlier - Methods might use RNA motifs, predicted intermolecular contacts, or RNA sequence-structure compatibility (inverse RNA folding). - - - - - - - - - - Nucleic acid folding analysis - - - - - - - - - - beta12orEarlier - Analyse some aspect of RNA/DNA folding, typically by processing sequence and/or structural data. - Nucleic acid folding modelling - Nucleic acid folding prediction - Nucleic acid folding - - - - - - - - - - Data retrieval (restriction enzyme annotation) - - beta13 - Restriction enzyme information retrieval - true - Retrieve information on restriction enzymes or restriction enzyme sites. - beta12orEarlier - - - - - - - - - - Genetic marker identification - - true - beta12orEarlier - beta13 - Identify genetic markers in DNA sequences. - A genetic marker is any DNA sequence of known chromosomal location that is associated with and specific to a particular gene or trait. This includes short sequences surrounding a SNP, Sequence-Tagged Sites (STS) which are well suited for PCR amplification, a longer minisatellites sequence etc. - - - - - - - - - - Genetic mapping - - - - - - - - - beta12orEarlier - QTL mapping - This includes mapping of the genetic architecture of dynamic complex traits (functional mapping), e.g. by characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Linkage mapping - Genetic map generation - Mapping involves ordering genetic loci along a chromosome and estimating the physical distance between loci. 
A genetic map shows the relative (not physical) position of known genes and genetic markers. - Generate a genetic (linkage) map of a DNA sequence (typically a chromosome) showing the relative positions of genetic markers based on estimation of non-physical distances. - Genetic map construction - Functional mapping - - - - - - - - - - Linkage analysis - - - - - - - - - - - - - - beta12orEarlier - For example, estimate how close two genes are on a chromosome by calculating how often they are transmitted together to an offspring, ascertain whether two genes are linked and parental linkage, calculate linkage map distance etc. - Analyse genetic linkage. - - - - - - - - - - Codon usage table generation - - - - - - - - - Calculate codon usage statistics and create a codon usage table. - beta12orEarlier - Codon usage table construction - - - - - - - - - - Codon usage table comparison - - - beta12orEarlier - Compare two or more codon usage tables. - - - - - - - - - - Codon usage analysis - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - beta12orEarlier - synon: Codon usage data analysis - Process (read and / or write) codon usage data, e.g. analyse codon usage tables or codon usage in molecular sequences. - synon: Codon usage table analysis - - - - - - - - - - Base position variability plotting - - - - - - - - - - - - - - - Identify and plot third base position variability in a nucleotide sequence. - beta12orEarlier - - - - - - - - - - Sequence word comparison - - Find exact character or word matches between molecular sequences without full sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence distance matrix generation - - - - - - - - - - - - - - - Sequence distance matrix construction - Phylogenetic distance matrix generation - beta12orEarlier - Calculate a sequence distance matrix or otherwise estimate genetic distances between molecular sequences. 
- - - - - - - - - - Sequence redundancy removal - - - - - - - - beta12orEarlier - Compare two or more molecular sequences, identify and remove redundant sequences based on some criteria. - - - - - - - - - - Sequence clustering - - - - - - - - - - The clusters may be output or used internally for some other purpose. - Sequence cluster construction - beta12orEarlier - Build clusters of similar sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - Sequence cluster generation - - - - - - - - - - Sequence alignment - - - - - - - - - - Sequence alignment construction - beta12orEarlier - Align (identify equivalent sites within) molecular sequences. - Sequence alignment generation - Sequence alignment computation - - - - - - - - - - Hybrid sequence alignment construction - - Hybrid sequence alignment - true - beta13 - beta12orEarlier - Align two or more molecular sequences of different types (for example genomic DNA to EST, cDNA or mRNA). - Hybrid sequence alignment generation - - - - - - - - - - Structure-based sequence alignment - - Sequence alignment generation (structure-based) - Structure-based sequence alignment construction - beta12orEarlier - Sequence alignment (structure-based) - Structure-based sequence alignment generation - Align molecular sequences using sequence and structural information. - - - - - - - - - - Structure alignment - - - - - - - - - - Align (superimpose) molecular tertiary structures. - Structure alignment generation - Structure alignment construction - beta12orEarlier - Multiple structure alignment construction - Multiple structure alignment generation - - - - - - - - - - Sequence profile generation - - - - - - - - - - - - - - - - - - - - - Sequence profile construction - beta12orEarlier - Generate some type of sequence profile (for example a hidden Markov model) from a sequence alignment. 
- - - - - - - - - - 3D profile generation - - - - - - - - - - - - - - - - - - - - - Structural profile generation - Generate some type of structural (3D) profile or template from a structure or structure alignment. - Structural profile construction - beta12orEarlier - - - - - - - - - - Profile-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence profile alignment - beta12orEarlier - See also 'Sequence alignment comparison'. - Sequence profile alignment construction - Align sequence profiles (representing sequence alignments). - Sequence profile alignment generation - - - - - - - - - - 3D profile-to-3D profile alignment - - - - - - - - - - - - - - beta12orEarlier - 3D profile alignment (multiple) - 3D profile alignment - Multiple 3D profile alignment construction - Structural profile alignment construction (multiple) - Structural profile alignment - Structural profile alignment generation - Structural profile alignment construction - Align structural (3D) profiles or templates (representing structures or structure alignments). - - - - - - - - - - Sequence-to-profile alignment - - - - - - - - - - - - - - - - - - - - Sequence-profile alignment construction - Sequence-profile alignment generation - beta12orEarlier - Align molecular sequence(s) to sequence profile(s). - Sequence-profile alignment - A sequence profile typically represents a sequence alignment. Methods might perform one-to-one, one-to-many or many-to-many comparisons. - - - - - - - - - - Sequence-to-3D-profile alignment - - - - - - - - - - - - - - - beta12orEarlier - Sequence-3D profile alignment construction - Align molecular sequence(s) to structural (3D) profile(s) or template(s) (representing a structure or structure alignment). - Sequence-3D profile alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Sequence-3D profile alignment - - - - - - - - - - Protein threading - - - - - - - - - - - - - - - beta12orEarlier - Align molecular sequence to structure in 3D space (threading). - Use this concept for methods that evaluate sequence-structure compatibility by assessing residue interactions in 3D. Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Sequence-structure alignment - - - - - - - - - - Protein fold recognition - - - - - beta12orEarlier - Protein domain prediction - Methods use some type of mapping between sequence and fold, for example secondary structure prediction and alignment, profile comparison, sequence properties, homologous sequence search, kernel machines etc. Domains and folds might be taken from SCOP or CATH. - Recognize (predict and identify) known protein structural domains or folds in protein sequence(s). - Protein fold prediction - - - - - - - - - - Metadata retrieval - - - - - - - - Data retrieval (documentation) - Search for and retrieve data concerning or describing some core data, as distinct from the primary data that is being described. - Data retrieval (metadata) - beta12orEarlier - This includes documentation, general information and other metadata on entities such as databases, database entries and tools. - - - - - - - - - - Literature search - - - - - - - - - - - - - - beta12orEarlier - Query the biomedical and informatics literature. - - - - - - - - - - Text mining - - - - - - - - - - - - - - - - - - - - Text data mining - beta12orEarlier - Process and analyse text (typically the biomedical and informatics literature) to extract information from it. - - - - - - - - - - Virtual PCR - - - - - - - - beta12orEarlier - Perform in-silico (virtual) PCR. 
- - - - - - - - - - PCR primer design - - - - - - - - - - - - - - - - - - - - This includes predicting primers based on gene structure, promoters, exon-exon junctions, predicting primers that are conserved across multiple genomes or species, primers for gene transcription profiling, for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs), for large scale sequencing, or for methylation PCRs. - PCR primer design (based on gene structure) - PCR primer design (for methylation PCRs) - beta12orEarlier - PCR primer design (for large scale sequencing) - PCR primer prediction - Primer design involves predicting or selecting primers that are specific to a provided PCR template. Primers can be designed with certain properties such as size of product desired, primer size etc. The output might be a minimal or overlapping primer set. - PCR primer design (for conserved primers) - Design or predict oligonucleotide primers for PCR and DNA amplification etc. - PCR primer design (for gene transcription profiling) - PCR primer design (for genotyping polymorphisms) - - - - - - - - - - Microarray probe design - - - - - - - - - - - - - - - - - - - - - - - - - - - Predict and/or optimize oligonucleotide probes for DNA microarrays, for example for transcription profiling of genes, or for genomes and gene families. - beta12orEarlier - Microarray probe prediction - - - - - - - - - - Sequence assembly - - - - - - - - - - - - - - - beta12orEarlier - For example, assemble overlapping reads from paired-end sequencers into contigs (a contiguous sequence corresponding to read overlaps). Or assemble contigs, for example ESTs and genomic DNA fragments, depending on the detected fragment overlaps. - Combine (align and merge) overlapping fragments of a DNA sequence to reconstruct the original sequence. - - - - - - - - - - Microarray data standardization and normalization - - - - - - - - - - - - - - - beta12orEarlier - Standardize or normalize microarray data.
- This includes statistical analysis, for example of variability amongst microarrays experiments, comparison of heterogeneous microarray platforms etc. - - - - - - - - - - Sequencing-based expression profile data processing - - Process (read and / or write) SAGE, MPSS or SBS experimental data. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Gene expression profile clustering - - - - - - - - - beta12orEarlier - Perform cluster analysis of gene expression (microarray) data, for example clustering of similar gene expression profiles. - - - - - - - - - - Gene expression profiling - - - - - - - - - Expression profiling - Gene expression profile construction - Functional profiling - Generate a gene expression profile or pattern, for example from microarray data. - beta12orEarlier - Gene expression profile generation - - - - - - - - - - Gene expression profile comparison - - - - - - - - - beta12orEarlier - Compare gene expression profiles or patterns. - - - - - - - - - - Functional profiling - - true - beta12orEarlier - Interpret (in functional terms) and annotate gene expression data. - beta12orEarlier - - - - - - - - - - EST and cDNA sequence analysis - - Analyse EST or cDNA sequences. - For example, identify full-length cDNAs from EST sequences or detect potential EST antisense transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Structural genomics target selection - - beta12orEarlier - Identify and select targets for protein structural determination. - beta12orEarlier - Methods will typically navigate a graph of protein families of known structure. - true - - - - - - - - - - Protein secondary structure assignment - - - - - - - - - - - - - - beta12orEarlier - Assign secondary structure from protein coordinate or experimental data. - - - - - - - - - - Protein structure assignment - - - - - - - - - - - - - - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw experimental data. 
- - - - - - - - - - Protein model validation - - - - - - - - - - - - - - - WHATIF: UseResidueDB - Evaluate the quality or correctness of a protein three-dimensional model. - This includes methods that calculate poor quality residues. The scoring function to identify poor quality residues may consider residues with bad atoms or atoms with high B-factor, residues in the N- or C-terminal position, adjacent to an unstructured residue, non-canonical residues, glycine and proline (or residues adjacent to these). - Model validation might involve checks for atomic packing, steric clashes (bumps), volume irregularities, agreement with electron density maps, number of amino acid residues, percentage of residues with missing or bad atoms, irregular Ramachandran Z-scores, irregular Chi-1 / Chi-2 normality scores, RMS-Z score on bonds and angles etc. - Residue validation - WHATIF: CorrectedPDBasXML - Protein structure validation - WHATIF: UseFileDB - The PDB file format has had difficulties, inconsistencies and errors. Corrections can include identifying a meaningful sequence, removal of alternate atoms, correction of nomenclature problems, removal of incomplete residues and spurious waters, addition or removal of water, modelling of missing side chains, optimisation of cysteine bonds, regularisation of bond lengths, bond angles and planarities etc. - beta12orEarlier - - - - - - - - - - Molecular model refinement - - - Protein model refinement - WHATIF: CorrectedPDBasXML - beta12orEarlier - Refine (after evaluation) a model of a molecular structure (typically a protein structure) to reduce steric clashes, volume irregularities etc. - - - - - - - - - - Phylogenetic tree generation - - - - - - - - - - - - - - - Phylogenetic trees are usually constructed from a set of sequences from which an alignment (or data matrix) is calculated. - Phylogenetic tree construction - Construct a phylogenetic tree.
- beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis - - - - - - - - beta12orEarlier - Analyse an existing phylogenetic tree or trees, typically to detect features or make predictions. - - - - - - - - - - Phylogenetic tree comparison - - - beta12orEarlier - Compare two or more phylogenetic trees. - For example, to produce a consensus tree, subtrees, supertrees, calculate distances between trees or test topological similarity between trees (e.g. a congruence index) etc. - - - - - - - - - - Phylogenetic tree editing - - - - - - - - - - - - - - - Edit a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic footprinting / shadowing - - - - - - - - A phylogenetic 'shadow' represents the additive differences between individual sequences. By masking or 'shadowing' variable positions a conserved sequence is produced with few or none of the variations, which is then compared to the sequences of interest to identify significant regions of conservation. - beta12orEarlier - Infer a phylogenetic tree by comparing orthologous sequences in different species, particularly many closely related species (phylogenetic shadowing). - - - - - - - - - - Protein folding simulation - - beta12orEarlier - Simulate the folding of a protein. - - - - - - - - - - Protein folding pathway prediction - - - Predict the folding pathway(s) or non-native structural intermediates of a protein. - beta12orEarlier - - - - - - - - - - Protein SNP mapping - - true - beta12orEarlier - Map and model the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). - 1.12 - - - - - - - - - - Protein modelling (mutation) - - - - - - - - - - - - - - - Protein SNP mapping - Protein mutation modelling - Predict the effect of point mutation on a protein structure, in terms of structural effects and protein folding, stability and function.
- Rotamer likelihood prediction - beta12orEarlier - This includes 1) rotamer likelihood prediction: the prediction of rotamer likelihoods for all 20 amino acid types at each position in a protein structure, where output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods. 2) Protein SNP mapping, which maps and models the effects of single nucleotide polymorphisms (SNPs) on protein structure(s). Methods might predict silent or pathological mutations. - - - - - - - - - - Immunogen design - - true - Design molecules that elicit an immune response (immunogens). - beta12orEarlier - beta12orEarlier - - - - - - - - - - Zinc finger prediction - - - - - - - - - - - - - - Predict and optimise zinc finger protein domains for DNA/RNA binding (for example for transcription factors and nucleases). - beta12orEarlier - - - - - - - - - - Enzyme kinetics calculation - - - - - - - - - - - - - - beta12orEarlier - Calculate Km, Vmax and derived data for an enzyme reaction. - - - - - - - - - - Formatting - - beta12orEarlier - Reformat a file of data (or equivalent entity in memory). - Format conversion - File formatting - Reformatting - File reformatting - File format conversion - - - - - - - - - - Format validation - - Test and validate the format and content of a data file. - File format validation - beta12orEarlier - - - - - - - - - - Visualisation - - - - - - - - - - - - - - - - - - - - Visualization - beta12orEarlier - Visualise, plot or render (graphically) biomolecular data such as molecular sequences or structures. - Rendering - - - - - - - - - - Sequence database search - - - - - - - - - Search a sequence database by sequence comparison and retrieve similar sequences. - -sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - This excludes direct retrieval methods (e.g. the dbfetch program).
- - - - - - - - - - Structure database search - - - - - - - - beta12orEarlier - Search a tertiary structure database, typically by sequence and/or structure comparison, or some other means, and retrieve structures and associated data. - - - - - - - - - - Protein secondary database search - - 1.8 - beta12orEarlier - true - Search a secondary protein database (of classification information) to assign a protein sequence(s) to a known protein family or group. - - - - - - - - - - Motif database search - - beta12orEarlier - Screen a sequence against a motif or pattern database. - true - 1.8 - - - - - - - - - - Sequence profile database search - - true - beta12orEarlier - Search a database of sequence profiles with a query sequence. - 1.4 - - - - - - - - - - Transmembrane protein database search - - true - beta12orEarlier - Search a database of transmembrane proteins, for example for sequence or structural similarities. - beta12orEarlier - - - - - - - - - - Sequence retrieval (by code) - - Query a database and retrieve sequences with a given entry code or accession number. - true - 1.6 - beta12orEarlier - - - - - - - - - - Sequence retrieval (by keyword) - - true - Query a database and retrieve sequences containing a given keyword. - beta12orEarlier - 1.6 - - - - - - - - - - Sequence similarity search - - - Structure database search (by sequence) - Sequence database search (by sequence) - beta12orEarlier - Search a sequence database and retrieve sequences that are similar to a query sequence. - - - - - - - - - - Sequence database search (by motif or pattern) - - 1.8 - Search a sequence database and retrieve sequences matching a given sequence motif or pattern, such as a Prosite pattern or regular expression. - beta12orEarlier - true - - - - - - - - - - Sequence database search (by amino acid composition) - - true - Search a sequence database and retrieve sequences of a given amino acid composition. 
- 1.6 - beta12orEarlier - - - - - - - - - - Sequence database search (by property) - - Search a sequence database and retrieve sequences with a specified property, typically a physicochemical or compositional property. - beta12orEarlier - - - - - - - - - - Sequence database search (by sequence using word-based methods) - - beta12orEarlier - Word-based methods (for example BLAST, gapped BLAST, MEGABLAST, WU-BLAST etc.) are usually quicker than alignment-based methods. They may or may not handle gaps. - 1.6 - true - Sequence similarity search (word-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a word-based method. - - - - - - - - - - Sequence database search (by sequence using profile-based methods) - - true - Sequence similarity search (profile-based methods) - Search a sequence database and retrieve sequences that are similar to a query sequence using a sequence profile-based method, or with a supplied profile as query. - beta12orEarlier - This includes tools based on PSI-BLAST. - 1.6 - - - - - - - - - - Sequence database search (by sequence using local alignment-based methods) - - Search a sequence database for sequences that are similar to a query sequence using a local alignment-based method. - 1.6 - beta12orEarlier - true - Sequence similarity search (local alignment-based methods) - This includes tools based on the Smith-Waterman algorithm or FASTA. - - - - - - - - - - Sequence database search (by sequence using global alignment-based methods) - - This includes tools based on the Needleman and Wunsch algorithm. - Search sequence(s) or a sequence database for sequences that are similar to a query sequence using a global alignment-based method. 
- 1.6 - Sequence similarity search (global alignment-based methods) - beta12orEarlier - true - - - - - - - - - - Sequence database search (by sequence for primer sequences) - - true - beta12orEarlier - Search a DNA database (for example a database of conserved sequence tags) for matches to Sequence-Tagged Site (STS) primer sequences. - 1.6 - STSs are genetic markers that are easily detected by the polymerase chain reaction (PCR) using specific primers. - Sequence similarity search (primer sequences) - - - - - - - - - - Sequence database search (by molecular weight) - - Search sequence(s) or a sequence database for sequences which match a set of peptide masses, for example a peptide mass fingerprint from mass spectrometry. - 1.6 - true - beta12orEarlier - - - - - - - - - - Sequence database search (by isoelectric point) - - 1.6 - beta12orEarlier - Search sequence(s) or a sequence database for sequences of a given isoelectric point. - true - - - - - - - - - - Structure retrieval (by code) - - Query a tertiary structure database and retrieve entries with a given entry code or accession number. - 1.6 - beta12orEarlier - true - - - - - - - - - - Structure retrieval (by keyword) - - true - 1.6 - Query a tertiary structure database and retrieve entries containing a given keyword. - beta12orEarlier - - - - - - - - - - Structure database search (by sequence) - - beta12orEarlier - true - Search a tertiary structure database and retrieve structures with a sequence similar to a query sequence. - 1.8 - - - - - - - - - - Structural similarity search - - - beta12orEarlier - Search a database of molecular structure and retrieve structures that are similar to a query structure. - Structure database search (by structure) - Structure retrieval by structure - - - - - - - - - - Sequence annotation - - - - - - - - - - - - - - beta12orEarlier - Annotate a molecular sequence record with terms from a controlled vocabulary. 
- - - - - - - - - - Genome annotation - - beta12orEarlier - Metagenome annotation - Annotate a genome sequence with terms from a controlled vocabulary. - - - - - - - - - - Nucleic acid sequence reverse and complement - - beta12orEarlier - Generate the reverse and / or complement of a nucleotide sequence. - - - - - - - - - - Random sequence generation - - Generate a random sequence, for example, with a specific character composition. - beta12orEarlier - - - - - - - - - - Nucleic acid restriction digest - - - - - - - - - beta12orEarlier - Generate digest fragments for a nucleotide sequence containing restriction sites. - - - - - - - - - - Protein sequence cleavage - - - - - - - - - - - - - - - beta12orEarlier - Cleave a protein sequence into peptide fragments (by enzymatic or chemical cleavage) and calculate the fragment masses. - - - - - - - - - - Sequence mutation and randomization - - beta12orEarlier - Mutate a molecular sequence a specified amount or shuffle it to produce a randomized sequence with the same overall composition. - - - - - - - - - - Sequence masking - - Mask characters in a molecular sequence (replacing those characters with a mask character). - For example, SNPs or repeats in a DNA sequence might be masked. - beta12orEarlier - - - - - - - - - - Sequence cutting - - Cut (remove) characters or a region from a molecular sequence. - beta12orEarlier - - - - - - - - - - Restriction site creation - - Create (or remove) restriction sites in sequences, for example using silent mutations. - beta12orEarlier - - - - - - - - - - DNA translation - - - - - - - - beta12orEarlier - Translate a DNA sequence into protein. - - - - - - - - - - DNA transcription - - - - - - - - beta12orEarlier - Transcribe a nucleotide sequence into mRNA sequence(s). - - - - - - - - - - Sequence composition calculation (nucleic acid) - - true - Calculate base frequency or word composition of a nucleotide sequence. 
- 1.8 - beta12orEarlier - - - - - - - - - - Sequence composition calculation (protein) - - 1.8 - Calculate amino acid frequency or word composition of a protein sequence. - beta12orEarlier - true - - - - - - - - - - Repeat sequence detection - - - beta12orEarlier - Find (and possibly render) short repetitive subsequences (repeat sequences) in (typically nucleotide) sequences. - - - - - - - - - - Repeat sequence organisation analysis - - - beta12orEarlier - Analyse repeat sequence organization such as periodicity. - - - - - - - - - - Protein hydropathy calculation (from structure) - - true - Analyse the hydrophobic, hydrophilic or charge properties of a protein structure. - 1.12 - beta12orEarlier - - - - - - - - - - Accessible surface calculation - - - - - - - - beta12orEarlier - WHATIF:AtomAccessibilitySolventPlus - Protein solvent accessibility calculation - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate solvent accessible or buried surface areas in protein or other molecular structures. - WHATIF:AtomAccessibilitySolvent - - - - - - - - - - Protein hydropathy cluster calculation - - true - 1.12 - beta12orEarlier - Identify clusters of hydrophobic or charged residues in a protein structure. - - - - - - - - - - Protein dipole moment calculation - - - - - - - - beta12orEarlier - Calculate whether a protein structure has an unusually large net charge (dipole moment). - - - - - - - - - - Molecular surface calculation - - WHATIF:ResidueAccessibilityMolecular - Protein surface calculation - Protein surface and interior calculation - WHATIF:AtomAccessibilityMolecularPlus - WHATIF:TotAccessibilityMolecular - Protein atom surface calculation - Calculate the molecular surface area in proteins and other macromolecules. 
- Protein residue surface calculation - WHATIF:ResidueAccessibilityVacuum - beta12orEarlier - WHATIF:TotAccessibilitySolvent - WHATIF:ResidueAccessibilitySolvent - WHATIF:ResidueAccessibilityVacuumMolecular - WHATIF:AtomAccessibilityMolecular - - - - - - - - - - Protein binding site prediction (from structure) - - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein structures. - beta12orEarlier - 1.12 - true - - - - - - - - - - Protein-nucleic acid binding site analysis - - - - - - - - Analyse RNA or DNA-binding sites in protein structure. - beta12orEarlier - - - - - - - - - - Protein peeling - - beta12orEarlier - Decompose a structure into compact or globular fragments (protein peeling). - - - - - - - - - - Protein distance matrix calculation - - - - - - - - beta12orEarlier - Calculate a matrix of distance between residues (for example the C-alpha atoms) in a protein structure. - - - - - - - - - - Protein contact map calculation - - - - - - - - beta12orEarlier - Calculate a residue contact map (typically all-versus-all inter-residue contacts) for a protein structure. - - - - - - - - - - Residue cluster calculation - - - - - - - - Calculate clusters of contacting residues in protein structures. - This includes for example clusters of hydrophobic or charged residues, or clusters of contacting residues which have a key structural or functional role. - beta12orEarlier - - - - - - - - - - Hydrogen bond calculation - - - - - - - - WHATIF:ShowHydrogenBonds - WHATIF:HasHydrogenBonds - The output might include the atoms involved in the bond, bond geometric parameters and bond enthalpy. - beta12orEarlier - WHATIF:ShowHydrogenBondsM - Identify potential hydrogen bonds between amino acids and other groups. - - - - - - - - - - Residue non-canonical interaction detection - - beta12orEarlier - 1.12 - Calculate non-canonical atomic interactions in protein structures. 
- true - - - - - - - - - - Ramachandran plot calculation - - - - - - - - Calculate a Ramachandran plot of a protein structure. - beta12orEarlier - - - - - - - - - - Ramachandran plot validation - - - - - - - - - - - - - - beta12orEarlier - Validate a Ramachandran plot of a protein structure. - - - - - - - - - - Protein molecular weight calculation - - - - - - - - - - - - - - Calculate the molecular weight of a protein sequence or fragments. - beta12orEarlier - - - - - - - - - - Protein extinction coefficient calculation - - - - - - - - beta12orEarlier - Predict extinction coefficients or optical density of a protein sequence. - - - - - - - - - - Protein pH-dependent property calculation - - - - - - - - - - - - - - Calculate pH-dependent properties from pKa calculations of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydropathy calculation (from sequence) - - 1.12 - Hydropathy calculation on a protein sequence. - beta12orEarlier - true - - - - - - - - - - Protein titration curve plotting - - - - - - - - - beta12orEarlier - Plot a protein titration curve. - - - - - - - - - - Protein isoelectric point calculation - - - - - - - - beta12orEarlier - Calculate isoelectric point of a protein sequence. - - - - - - - - - - Protein hydrogen exchange rate calculation - - - - - - - - Estimate hydrogen exchange rate of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein hydrophobic region calculation - - Calculate hydrophobic or hydrophilic / charged regions of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein aliphatic index calculation - - - - - - - - beta12orEarlier - Calculate aliphatic index (relative volume occupied by aliphatic side chains) of a protein. - - - - - - - - - - Protein hydrophobic moment plotting - - - - - - - - - beta12orEarlier - Hydrophobic moment is a peptide's hydrophobicity measured for different angles of rotation. - Calculate the hydrophobic moment of a peptide sequence and recognize amphiphilicity.
- - - - - - - - - - Protein globularity prediction - - - - - - - - Predict the stability or globularity of a protein sequence, whether it is intrinsically unfolded etc. - beta12orEarlier - - - - - - - - - - Protein solubility prediction - - - - - - - - Predict the solubility or atomic solvation energy of a protein sequence. - beta12orEarlier - - - - - - - - - - Protein crystallizability prediction - - - - - - - - beta12orEarlier - Predict crystallizability of a protein sequence. - - - - - - - - - - Protein signal peptide detection (eukaryotes) - - beta12orEarlier - Detect or predict signal peptides (and typically predict subcellular localization) of eukaryotic proteins. - - - - - - - - - - Protein signal peptide detection (bacteria) - - Detect or predict signal peptides (and typically predict subcellular localization) of bacterial proteins. - beta12orEarlier - - - - - - - - - - MHC peptide immunogenicity prediction - - true - - Predict MHC class I or class II binding peptides, promiscuous binding peptides, immunogenicity etc. - beta12orEarlier - 1.12 - - - - - - - - - - Protein feature prediction (from sequence) - - Methods typically involve scanning for known motifs, patterns and regular expressions. - beta12orEarlier - true - Sequence feature detection (protein) - 1.6 - Predict, recognise and identify positional features in protein sequences such as functional sites or regions and secondary structure. - - - - - - - - - - Nucleic acid feature detection - - - - - - - - - - - - - - - Sequence feature detection (nucleic acid) - Predict, recognise and identify features in nucleotide sequences such as functional sites or regions, typically by scanning for known motifs, patterns and regular expressions. - Methods typically involve scanning for known motifs, patterns and regular expressions. 
- beta12orEarlier - Nucleic acid feature recognition - Nucleic acid feature prediction - - - - - - - - - - Epitope mapping - - - - - - - - - beta12orEarlier - Predict antigenic determinant sites (epitopes) in protein sequences. - Epitope mapping is commonly done during vaccine design. - - - - - - - - - - Protein post-translation modification site prediction - - - - - - - - Predict post-translation modification sites in protein sequences. - beta12orEarlier - Methods might predict sites of methylation, N-terminal myristoylation, N-terminal acetylation, sumoylation, palmitoylation, phosphorylation, sulfation, glycosylation, glycosylphosphatidylinositol (GPI) modification sites (GPI lipid anchor signals) etc. - - - - - - - - - - Protein signal peptide detection - - - - - - - - - beta12orEarlier - Methods might use sequence motifs and features, amino acid composition, profiles, machine-learned classifiers, etc. - Detect or predict signal peptides and signal peptide cleavage sites in protein sequences. - - - - - - - - - - Protein binding site prediction (from sequence) - - 1.12 - Predict catalytic residues, active sites or other ligand-binding sites in protein sequences. - true - beta12orEarlier - - - - - - - - - - Protein-nucleic acid binding prediction - - beta12orEarlier - Predict RNA and DNA-binding sites in protein sequences. - - - - - - - - - - Protein folding site prediction - - - Predict protein sites that are key to protein folding, such as possible sites of nucleation or stabilization. - beta12orEarlier - - - - - - - - - - Protein cleavage site prediction - - - - - - - - beta12orEarlier - Detect or predict cleavage sites (enzymatic or chemical) in protein sequences. - - - - - - - - - - Epitope mapping (MHC Class I) - - 1.8 - true - beta12orEarlier - Predict epitopes that bind to MHC class I molecules. - - - - - - - - - - Epitope mapping (MHC Class II) - - Predict epitopes that bind to MHC class II molecules.
- 1.8 - true - beta12orEarlier - - - - - - - - - - - Whole gene prediction - - beta12orEarlier - 1.12 - true - Detect, predict and identify whole gene structure in DNA sequences. This includes protein coding regions, exon-intron structure, regulatory regions etc. - - - - - - - - - - Gene component prediction - - true - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - beta12orEarlier - Detect, predict and identify genetic elements such as promoters, coding regions, splice sites, etc in DNA sequences. - 1.12 - - - - - - - - - - Transposon prediction - - beta12orEarlier - Detect or predict transposons, retrotransposons / retrotransposition signatures etc. - - - - - - - - - - PolyA signal detection - - Detect polyA signals in nucleotide sequences. - beta12orEarlier - - - - - - - - - - Quadruplex formation site detection - - - - - - - - beta12orEarlier - Quadruplex structure prediction - Detect quadruplex-forming motifs in nucleotide sequences. - Quadruplex (4-stranded) structures are formed by guanine-rich regions and are implicated in various important biological processes and as therapeutic targets. - - - - - - - - - - CpG island and isochore detection - - - - - - - - An isochore is a long region (> 3 KB) of DNA with very uniform GC content, in contrast to the rest of the genome. Isochores tend to have more genes, higher local melting or denaturation temperatures, and different flexibility. Methods might calculate fractional GC content or variation of GC content, predict methylation status of CpG islands etc. This includes methods that visualise CpG rich regions in a nucleotide sequence, for example plot isochores in a genome sequence. - beta12orEarlier - Find CpG rich regions in a nucleotide sequence or isochores in genome sequences.
- CpG island and isochores rendering - CpG island and isochores detection - - - - - - - - - - Restriction site recognition - - - - - - - - beta12orEarlier - Find and identify restriction enzyme cleavage sites (restriction sites) in (typically) DNA sequences, for example to generate a restriction map. - - - - - - - - - - Nucleosome formation or exclusion sequence prediction - - beta12orEarlier - Identify or predict nucleosome exclusion sequences (nucleosome free regions) in DNA. - - - - - - - - - - Splice site prediction - - - - - - - - beta12orEarlier - Identify, predict or analyse splice sites in nucleotide sequences. - Methods might require a pre-mRNA or genomic DNA sequence. - - - - - - - - - - Integrated gene prediction - - Predict whole gene structure using a combination of multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Operon prediction - - Find operons (operators, promoters and genes) in bacteria genes. - beta12orEarlier - - - - - - - - - - Coding region prediction - - Predict protein-coding regions (CDS or exon) or open reading frames in nucleotide sequences. - ORF prediction - ORF finding - beta12orEarlier - - - - - - - - - - Selenocysteine insertion sequence (SECIS) prediction - - - - - - - - Predict selenocysteine insertion sequence (SECIS) in a DNA sequence. - SECIS elements are around 60 nucleotides in length with a stem-loop structure that directs the cell to translate UGA codons as selenocysteines. - beta12orEarlier - - - - - - - - - - Regulatory element prediction - - - - - - - - Identify or predict transcription regulatory motifs, patterns, elements or regions in DNA sequences. - Translational regulatory element prediction - Transcription regulatory element prediction - This includes promoters, enhancers, silencers and boundary elements / insulators, regulatory protein or transcription factor binding sites etc.
Methods might be specific to a particular genome and use motifs, word-based / grammatical methods, position-specific frequency matrices, discriminative pattern analysis etc. - beta12orEarlier - - - - - - - - - - Translation initiation site prediction - - - - - - - - Predict translation initiation sites, possibly by searching a database of sites. - beta12orEarlier - - - - - - - - - - Promoter prediction - - Identify or predict whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in DNA sequences. - Methods might recognize CG content, CpG islands, splice sites, polyA signals etc. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (DNA-cis) - - beta12orEarlier - Cis-regulatory elements (cis-elements) regulate the expression of genes located on the same strand. Cis-elements are found in the 5' promoter region of the gene, in an intron, or in the 3' untranslated region. Cis-elements are often binding sites of one or more trans-acting factors. - Identify, predict or analyse cis-regulatory elements (TATA box, Pribnow box, SOS box, CAAT box, CCAAT box, operator etc.) in DNA sequences. - - - - - - - - - - Transcription regulatory element prediction (RNA-cis) - - Cis-regulatory elements (cis-elements) regulate genes located on the same strand from which the element was transcribed. A riboswitch is a region of an mRNA molecule that binds a small target molecule that regulates the gene's activity. - Identify, predict or analyse cis-regulatory elements (for example riboswitches) in RNA sequences. - beta12orEarlier - - - - - - - - - - Transcription regulatory element prediction (trans) - - - - - - - - beta12orEarlier - Trans-regulatory elements regulate genes distant from the gene from which they were transcribed. - Identify or predict functional RNA sequences with a gene regulatory role (trans-regulatory elements) or targets.
- Functional RNA identification - - - - - - - - - - Matrix/scaffold attachment site prediction - - MAR/SAR sites often flank a gene or gene cluster and are found nearby cis-regulatory sequences. They might contribute to transcription regulation. - Identify matrix/scaffold attachment regions (MARs/SARs) in DNA sequences. - beta12orEarlier - - - - - - - - - - Transcription factor binding site prediction - - beta12orEarlier - Identify or predict transcription factor binding sites in DNA sequences. - - - - - - - - - - Exonic splicing enhancer prediction - - - - - - - - An exonic splicing enhancer (ESE) is a 6-base DNA sequence motif in an exon that enhances or directs splicing of pre-mRNA or hetero-nuclear RNA (hnRNA) into mRNA. - Identify or predict exonic splicing enhancers (ESE) in exons. - beta12orEarlier - - - - - - - - - - Sequence alignment validation - - - Evaluation might be purely sequence-based or use structural information. - Sequence alignment quality evaluation - Evaluate molecular sequence alignment accuracy. - beta12orEarlier - - - - - - - - - - Sequence alignment analysis (conservation) - - beta12orEarlier - Analyse character conservation in a molecular sequence alignment, for example to derive a consensus sequence. - Residue conservation analysis - Use this concept for methods that calculate substitution rates, estimate relative site variability, identify sites with biased properties, derive a consensus sequence, or identify highly conserved or very poorly conserved sites, regions, blocks etc. - - - - - - - - - - Sequence alignment analysis (site correlation) - - - Analyse correlations between sites in a molecular sequence alignment. - This is typically done to identify possible covarying positions and predict contacts or structural constraints in protein structures. - beta12orEarlier - - - - - - - - - - Chimeric sequence detection - - beta12orEarlier - A chimera includes regions from two or more phylogenetically distinct sequences.
They are usually artifacts of PCR and are thought to occur when a prematurely terminated amplicon reanneals to another DNA strand and is subsequently copied to completion in later PCR cycles. - Detects chimeric sequences (chimeras) from a sequence alignment. - Sequence alignment analysis (chimeric sequence detection) - - - - - - - - - - Recombination detection - - Sequence alignment analysis (recombination detection) - beta12orEarlier - Detect recombination (hotspots and coldspots) and identify recombination breakpoints in a sequence alignment. - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - - - - - - - - - - Indel detection - - - beta12orEarlier - Sequence alignment analysis (indel detection) - Indel discovery - Tools might use a genetic algorithm, quartet-mapping, bootscanning, graphical methods, random forest model and so on. - Identify insertion, deletion and duplication events from a sequence alignment. - - - - - - - - - - Nucleosome formation potential prediction - - true - beta12orEarlier - Predict nucleosome formation potential of DNA sequences. - beta12orEarlier - - - - - - - - - - Nucleic acid thermodynamic property calculation - - - - - - - - Calculate a thermodynamic property of DNA or DNA/RNA, such as melting temperature, enthalpy and entropy. - beta12orEarlier - - - - - - - - - - Nucleic acid melting profile plotting - - - - - - - - - Calculate and plot a DNA or DNA/RNA melting profile. - A melting profile is used to visualise and analyse partly melted DNA conformations. - beta12orEarlier - - - - - - - - - - Nucleic acid stitch profile plotting - - - - - - - - A stitch profile represents the alternative conformations that partly melted DNA can adopt in a temperature range. - beta12orEarlier - Calculate and plot a DNA or DNA/RNA stitch profile. - - - - - - - - - - Nucleic acid melting curve plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA melting curve. 
- beta12orEarlier - - - - - - - - - - Nucleic acid probability profile plotting - - - - - - - - beta12orEarlier - Calculate and plot a DNA or DNA/RNA probability profile. - - - - - - - - - - Nucleic acid temperature profile plotting - - - - - - - - Calculate and plot a DNA or DNA/RNA temperature profile. - beta12orEarlier - - - - - - - - - - Nucleic acid curvature calculation - - - - - - - - Calculate curvature and flexibility / stiffness of a nucleotide sequence. - beta12orEarlier - This includes properties such as. - - - - - - - - - - microRNA detection - - Identify or predict microRNA sequences (miRNA) and precursors or microRNA targets / binding sites in a DNA sequence. - beta12orEarlier - - - - - - - - - - tRNA gene prediction - - - - - - - - Identify or predict tRNA genes in genomic sequences (tRNA). - beta12orEarlier - - - - - - - - - - siRNA binding specificity prediction - - - - - - - - beta12orEarlier - Assess binding specificity of putative siRNA sequence(s), for example for a functional assay, typically with respect to designing specific siRNA sequences. - - - - - - - - - - Protein secondary structure prediction (integrated) - - Predict secondary structure of protein sequence(s) using multiple methods to achieve better predictions. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (helices) - - beta12orEarlier - Predict helical secondary structure of protein sequences. - - - - - - - - - - Protein secondary structure prediction (turns) - - Predict turn structure (for example beta hairpin turns) of protein sequences. - beta12orEarlier - - - - - - - - - - Protein secondary structure prediction (coils) - - beta12orEarlier - Predict open coils, non-regular secondary structure and intrinsically disordered / unstructured regions of protein sequences. - - - - - - - - - - Protein secondary structure prediction (disulfide bonds) - - beta12orEarlier - Predict cysteine bonding state and disulfide bond partners in protein sequences. 
- - - - - - - - - - GPCR prediction - - - beta12orEarlier - G protein-coupled receptor (GPCR) prediction - Predict G protein-coupled receptors (GPCR). - - - - - - - - - - GPCR analysis - - - - - - - - Analyse G-protein coupled receptor proteins (GPCRs). - beta12orEarlier - G protein-coupled receptor (GPCR) analysis - - - - - - - - - - Protein structure prediction - - - - - - - - - - - beta12orEarlier - Predict tertiary structure (backbone and side-chain conformation) of protein sequences. - - - - - - - - - - Nucleic acid structure prediction - - - - - - - - - - beta12orEarlier - Methods might identify thermodynamically stable or evolutionarily conserved structures. - Predict tertiary structure of DNA or RNA. - - - - - - - - - - Ab initio structure prediction - - Predict tertiary structure of protein sequence(s) without homologs of known structure. - de novo structure prediction - beta12orEarlier - - - - - - - - - - Protein modelling - - - - - - - - - - Comparative modelling - beta12orEarlier - Build a three-dimensional protein model based on known (for example homologs) structures. - The model might be of a whole, part or aspect of protein structure. Molecular modelling methods might use sequence-structure alignment, structural templates, molecular dynamics, energy minimization etc. - Homology modelling - Homology structure modelling - Protein structure comparative modelling - - - - - - - - - - Molecular docking - - - - - - - - - - - - - - - Model the structure of a protein in complex with a small molecule or another macromolecule. - beta12orEarlier - This includes protein-protein interactions, protein-nucleic acid, protein-ligand binding etc. Methods might predict whether the molecules are likely to bind in vivo, their conformation when bound, the strength of the interaction, possible mutations to achieve bonding and so on. - Docking simulation - Protein docking - - - - - - - - - - Protein modelling (backbone) - - Model protein backbone conformation. 
- Methods might require a preliminary C(alpha) trace. - beta12orEarlier - - - - - - - - - - Protein modelling (side chains) - - beta12orEarlier - Methods might use a residue rotamer library. - Model, analyse or edit amino acid side chain conformation in protein structure, optimize side-chain packing, hydrogen bonding etc. - - - - - - - - - - Protein modelling (loops) - - beta12orEarlier - Model loop conformation in protein structures. - - - - - - - - - - Protein-ligand docking - - - - - - - - - - - - - - beta12orEarlier - Methods aim to predict the position and orientation of a ligand bound to a protein receptor or enzyme. - Ligand-binding simulation - Model protein-ligand (for example protein-peptide) binding using comparative modelling or other techniques. - Virtual ligand screening - - - - - - - - - - Structured RNA prediction and optimisation - - - - - - - - Nucleic acid folding family identification - RNA inverse folding - beta12orEarlier - Predict or optimise RNA sequences (sequence pools) with likely secondary and tertiary structure for in vitro selection. - - - - - - - - - - SNP detection - - - - Find single nucleotide polymorphisms (SNPs) between sequences. - Single nucleotide polymorphism detection - beta12orEarlier - This includes functional SNPs for large-scale genotyping purposes, disease-associated non-synonymous SNPs etc. - SNP discovery - - - - - - - - - - Radiation Hybrid Mapping - - - - - - - - Generate a physical (radiation hybrid) map of genetic markers in a DNA sequence using provided radiation hybrid (RH) scores for one or more markers. - beta12orEarlier - - - - - - - - - - Functional mapping - - beta12orEarlier - true - This can involve characterization of the underlying quantitative trait loci (QTLs) or nucleotides (QTNs). - Map the genetic architecture of dynamic complex traits. 
- beta12orEarlier - - - - - - - - - - Haplotype mapping - - - - - - - - - Haplotype map generation - Haplotype inference - Infer haplotypes, either alleles at multiple loci that are transmitted together on the same chromosome, or a set of single nucleotide polymorphisms (SNPs) on a single chromatid that are statistically associated. - beta12orEarlier - Haplotype inference can help in population genetic studies and the identification of complex disease genes, and is typically based on aligned single nucleotide polymorphism (SNP) fragments. Haplotype comparison is a useful way to characterize the genetic variation between individuals. An individual's haplotype describes which nucleotide base occurs at each position for a set of common SNPs. Tools might use combinatorial functions (for example parsimony) or a likelihood function or model with optimization such as minimum error correction (MEC) model, expectation-maximization algorithm (EM), genetic algorithm or Markov chain Monte Carlo (MCMC). - Haplotype reconstruction - - - - - - - - - - Linkage disequilibrium calculation - - - - - - - - beta12orEarlier - Linkage disequilibrium is identified where a combination of alleles (or genetic markers) occurs more or less frequently in a population than expected by chance formation of haplotypes. - Calculate linkage disequilibrium; the non-random association of alleles or polymorphisms at two or more loci (not necessarily on the same chromosome). - - - - - - - - - - Genetic code prediction - - - - - - - - - beta12orEarlier - Predict genetic code from analysis of codon usage data. - - - - - - - - - - Dotplot plotting - - - - - - - - - - beta12orEarlier - Draw a dotplot of sequence similarities identified from word-matching or character comparison. - - - - - - - - - - Pairwise sequence alignment - - - - - - - - Pairwise sequence alignment generation - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Align exactly two molecular sequences.
- Pairwise sequence alignment construction - beta12orEarlier - - - - - - - - - - Multiple sequence alignment - - Multiple sequence alignment construction - Align two or more molecular sequences. - This includes methods that use an existing alignment, for example to incorporate sequences into an alignment, or combine several multiple alignments into a single, improved alignment. - beta12orEarlier - Multiple sequence alignment generation - - - - - - - - - - Pairwise sequence alignment generation (local) - - beta12orEarlier - Local pairwise sequence alignment construction - Locally align exactly two molecular sequences. - Pairwise sequence alignment (local) - true - Local alignment methods identify regions of local similarity. - 1.6 - Pairwise sequence alignment construction (local) - - - - - - - - - - - Pairwise sequence alignment generation (global) - - Pairwise sequence alignment construction (global) - Global pairwise sequence alignment construction - 1.6 - true - Globally align exactly two molecular sequences. - beta12orEarlier - Global alignment methods identify similarity across the entire length of the sequences. - Pairwise sequence alignment (global) - - - - - - - - - - - Local sequence alignment - - Multiple sequence alignment (local) - Local multiple sequence alignment construction - beta12orEarlier - Local alignment methods identify regions of local similarity. - Multiple sequence alignment construction (local) - Sequence alignment generation (local) - Sequence alignment (local) - Locally align two or more molecular sequences. - Smith-Waterman - - - - - - - - - - Global sequence alignment - - Global multiple sequence alignment construction - Multiple sequence alignment (global) - beta12orEarlier - Sequence alignment (global) - Multiple sequence alignment construction (global) - Globally align two or more molecular sequences. - Sequence alignment generation (global) - Global alignment methods identify similarity across the entire length of the sequences. 
- - - - - - - - - - Constrained sequence alignment - - beta12orEarlier - Align two or more molecular sequences with user-defined constraints. - Multiple sequence alignment construction (constrained) - Sequence alignment generation (constrained) - Multiple sequence alignment (constrained) - Sequence alignment (constrained) - Constrained multiple sequence alignment construction - - - - - - - - - - Consensus-based sequence alignment - - Consensus multiple sequence alignment construction - Sequence alignment (consensus) - beta12orEarlier - Align two or more molecular sequences using multiple methods to achieve higher quality. - Sequence alignment generation (consensus) - Multiple sequence alignment construction (consensus) - Multiple sequence alignment (consensus) - - - - - - - - - - Tree-based sequence alignment - - - - - - - - Sequence alignment generation (phylogenetic tree-based) - This is supposed to give a more biologically meaningful alignment than standard alignments. - beta12orEarlier - Phylogenetic tree-based multiple sequence alignment construction - Align multiple sequences using relative gap costs calculated from neighbors in a supplied phylogenetic tree. - Sequence alignment (phylogenetic tree-based) - Multiple sequence alignment construction (phylogenetic tree-based) - Multiple sequence alignment (phylogenetic tree-based) - - - - - - - - - - Secondary structure alignment generation - - beta12orEarlier - 1.6 - Secondary structure alignment construction - Secondary structure alignment - true - Align molecular secondary structure (represented as a 1D string). - - - - - - - - - - Protein secondary structure alignment generation - - - - - - - - - Protein secondary structure alignment construction - Align protein secondary structures. 
- beta12orEarlier - Secondary structure alignment (protein) - Protein secondary structure alignment - - - - - - - - - - RNA secondary structure alignment - - - - - - - - - - - - - - - RNA secondary structure alignment generation - Align RNA secondary structures. - RNA secondary structure alignment construction - Secondary structure alignment (RNA) - beta12orEarlier - - - - - - - - - - Pairwise structure alignment - - beta12orEarlier - Pairwise structure alignment generation - Pairwise structure alignment construction - Align (superimpose) exactly two molecular tertiary structures. - - - - - - - - - - Multiple structure alignment construction - - Align (superimpose) two or more molecular tertiary structures. - This includes methods that use an existing alignment. - 1.6 - true - Multiple structure alignment - beta12orEarlier - - - - - - - - - - Structure alignment (protein) - - beta13 - true - beta12orEarlier - Align protein tertiary structures. - - - - - - - - - - Structure alignment (RNA) - - beta13 - true - Align RNA tertiary structures. - beta12orEarlier - - - - - - - - - - Pairwise structure alignment generation (local) - - Locally align (superimpose) exactly two molecular tertiary structures. - Pairwise structure alignment (local) - Local alignment methods identify regions of local similarity, common substructures etc. - Pairwise structure alignment construction (local) - 1.6 - true - Local pairwise structure alignment construction - beta12orEarlier - - - - - - - - - - - Pairwise structure alignment generation (global) - - Global pairwise structure alignment construction - Global alignment methods identify similarity across the entire structures. - true - beta12orEarlier - 1.6 - Pairwise structure alignment construction (global) - Globally align (superimpose) exactly two molecular tertiary structures. 
- Pairwise structure alignment (global) - - - - - - - - - - - Local structure alignment - - Local multiple structure alignment construction - Local alignment methods identify regions of local similarity, common substructures etc. - Structure alignment construction (local) - beta12orEarlier - Locally align (superimpose) two or more molecular tertiary structures. - Multiple structure alignment construction (local) - Multiple structure alignment (local) - Structure alignment generation (local) - - - - - - - - - - Global structure alignment - - Structure alignment construction (global) - Multiple structure alignment (global) - Structure alignment generation (global) - Multiple structure alignment construction (global) - beta12orEarlier - Global alignment methods identify similarity across the entire structures. - Global multiple structure alignment construction - Globally align (superimpose) two or more molecular tertiary structures. - - - - - - - - - - Profile-to-profile alignment (pairwise) - - Sequence alignment generation (pairwise profile) - Methods might perform one-to-one, one-to-many or many-to-many comparisons. - Pairwise sequence profile alignment construction - Sequence profile alignment construction (pairwise) - Sequence profile alignment (pairwise) - beta12orEarlier - Align exactly two molecular profiles. - Sequence profile alignment generation (pairwise) - - - - - - - - - - Sequence alignment generation (multiple profile) - - Align two or more molecular profiles. - 1.6 - true - Sequence profile alignment generation (multiple) - beta12orEarlier - Sequence profile alignment (multiple) - Sequence profile alignment construction (multiple) - Multiple sequence profile alignment construction - - - - - - - - - - 3D profile-to-3D profile alignment (pairwise) - - Methods might perform one-to-one, one-to-many or many-to-many comparisons. 
- Pairwise structural (3D) profile alignment construction - Structural (3D) profile alignment (pairwise) - Structural profile alignment construction (pairwise) - Align exactly two molecular Structural (3D) profiles. - beta12orEarlier - Structural profile alignment generation (pairwise) - - - - - - - - - - Structural profile alignment generation (multiple) - - true - Structural profile alignment construction (multiple) - Align two or more molecular 3D profiles. - Multiple structural (3D) profile alignment construction - beta12orEarlier - Structural (3D) profile alignment (multiple) - 1.6 - - - - - - - - - - Data retrieval (tool metadata) - - Data retrieval (tool annotation) - 1.6 - Search and retrieve names of or documentation on bioinformatics tools, for example by keyword or which perform a particular function. - beta12orEarlier - true - Tool information retrieval - - - - - - - - - - Data retrieval (database metadata) - - beta12orEarlier - true - Data retrieval (database annotation) - Search and retrieve names of or documentation on bioinformatics databases or query terms, for example by keyword. - Database information retrieval - 1.6 - - - - - - - - - - PCR primer design (for large scale sequencing) - - 1.13 - Predict primers for large scale sequencing. - beta12orEarlier - true - - - - - - - - - - PCR primer design (for genotyping polymorphisms) - - true - beta12orEarlier - Predict primers for genotyping polymorphisms, for example single nucleotide polymorphisms (SNPs). - 1.13 - - - - - - - - - - PCR primer design (for gene transcription profiling) - - Predict primers for gene transcription profiling. - beta12orEarlier - true - 1.13 - - - - - - - - - - PCR primer design (for conserved primers) - - 1.13 - Predict primers that are conserved across multiple genomes or species. 
- beta12orEarlier - true - - - - - - - - - - PCR primer design (based on gene structure) - - 1.13 - true - beta12orEarlier - - - - - - - - - - PCR primer design (for methylation PCRs) - - true - beta12orEarlier - Predict primers for methylation PCRs. - 1.13 - - - - - - - - - - Mapping assembly - - Sequence assembly by combining fragments using an existing backbone sequence, typically a reference genome. - beta12orEarlier - Sequence assembly (mapping assembly) - The final sequence will resemble the backbone sequence. Mapping assemblers are usually much faster and less memory intensive than de-novo assemblers. - - - - - - - - - - De-novo assembly - - De Bruijn graph - Sequence assembly by combining fragments without the aid of a reference sequence or genome. - Sequence assembly (de-novo assembly) - De-novo assemblers are much slower and more memory intensive than mapping assemblers. - beta12orEarlier - - - - - - - - - - Genome assembly - - The process of assembling many short DNA sequences together such that they represent the original chromosomes from which the DNA originated. - beta12orEarlier - Sequence assembly (genome assembly) - - - - - - - - - - EST assembly - - beta12orEarlier - Sequence assembly (EST assembly) - Sequence assembly for EST sequences (transcribed mRNA). - Assemblers must handle (or be complicated by) alternative splicing, trans-splicing, single-nucleotide polymorphism (SNP), recoding, and post-transcriptional modification. - - - - - - - - - - Tag mapping - - - Tag mapping might assign experimentally obtained tags to known transcripts or annotate potential virtual tags in a genome. - Tag to gene assignment - Make gene to tag assignments (tag mapping) of SAGE, MPSS and SBS data, by annotating tags with ontology concepts. - beta12orEarlier - - - - - - - - - - SAGE data processing - - beta12orEarlier - Serial analysis of gene expression data processing - beta12orEarlier - Process (read and / or write) serial analysis of gene expression (SAGE) data.
- true - - - - - - - - - - MPSS data processing - - beta12orEarlier - Process (read and / or write) massively parallel signature sequencing (MPSS) data. - true - Massively parallel signature sequencing data processing - beta12orEarlier - - - - - - - - - - SBS data processing - - beta12orEarlier - Sequencing by synthesis data processing - beta12orEarlier - Process (read and / or write) sequencing by synthesis (SBS) data. - true - - - - - - - - - - Heat map generation - - - - - - - - - beta12orEarlier - The heat map usually uses a coloring scheme to represent clusters. They can show how expression of mRNA by a set of genes was influenced by experimental conditions. - Heat map construction - Generate a heat map of gene expression from microarray data. - - - - - - - - - - Gene expression profile analysis - - true - Functional profiling - beta12orEarlier - Analyse one or more gene expression profiles, typically to interpret them in functional terms. - 1.6 - - - - - - - - - - Gene expression profile pathway mapping - - - - - - - - - - beta12orEarlier - Map a gene expression profile to known biological pathways, for example, to identify or reconstruct a pathway. - - - - - - - - - - Protein secondary structure assignment (from coordinate data) - - - beta12orEarlier - Assign secondary structure from protein coordinate data. - - - - - - - - - - Protein secondary structure assignment (from CD data) - - - - - - - - Assign secondary structure from circular dichroism (CD) spectroscopic data. - beta12orEarlier - - - - - - - - - - Protein structure assignment (from X-ray crystallographic data) - - true - 1.7 - Assign a protein tertiary structure (3D coordinates) from raw X-ray crystallography data. - beta12orEarlier - - - - - - - - - - Protein structure assignment (from NMR data) - - beta12orEarlier - Assign a protein tertiary structure (3D coordinates) from raw NMR spectroscopy data. 
- true - 1.7 - - - - - - - - - - Phylogenetic tree generation (data centric) - - Phylogenetic tree construction (data centric) - beta12orEarlier - Construct a phylogenetic tree from a specific type of data. - - - - - - - - - - Phylogenetic tree generation (method centric) - - Phylogenetic tree construction (method centric) - Construct a phylogenetic tree using a specific method. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (from molecular sequences) - - - Phylogenetic tree construction from molecular sequences. - beta12orEarlier - Phylogenetic tree construction (from molecular sequences) - Methods typically compare multiple molecular sequences and estimate evolutionary distances and relationships to infer gene families or make functional predictions. - - - - - - - - - - Phylogenetic tree generation (from continuous quantitative characters) - - - - - - - - Phylogenetic tree construction (from continuous quantitative characters) - beta12orEarlier - Phylogenetic tree construction from continuous quantitative character data. - - - - - - - - - - Phylogenetic tree generation (from gene frequencies) - - - - - - - - - - - - - - Phylogenetic tree construction (from gene frequencies) - Phylogenetic tree construction from gene frequency data. - beta12orEarlier - - - - - - - - - - Phylogenetic tree construction (from polymorphism data) - - - - - - - - Phylogenetic tree construction from polymorphism data including microsatellites, RFLP (restriction fragment length polymorphisms), RAPD (random-amplified polymorphic DNA) and AFLP (amplified fragment length polymorphisms) data. - Phylogenetic tree generation (from polymorphism data) - beta12orEarlier - - - - - - - - - - Phylogenetic species tree construction - - Construct a phylogenetic species tree, for example, from a genome-wide sequence comparison.
- Phylogenetic species tree generation - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (parsimony methods) - - Phylogenetic tree construction (parsimony methods) - Construct a phylogenetic tree by computing a sequence alignment and searching for the tree with the fewest number of character-state changes from the alignment. - This includes evolutionary parsimony (invariants) methods. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (minimum distance methods) - - This includes neighbor joining (NJ) clustering method. - beta12orEarlier - Phylogenetic tree construction (minimum distance methods) - Construct a phylogenetic tree by computing (or using precomputed) distances between sequences and searching for the tree with minimal discrepancies between pairwise distances. - - - - - - - - - - Phylogenetic tree generation (maximum likelihood and Bayesian methods) - - Phylogenetic tree construction (maximum likelihood and Bayesian methods) - Construct a phylogenetic tree by relating sequence data to a hypothetical tree topology using a model of sequence evolution. - Maximum likelihood methods search for a tree that maximizes a likelihood function, i.e. that is most likely given the data and model. Bayesian analysis estimates the probability of tree for branch lengths and topology, typically using a Monte Carlo algorithm. - beta12orEarlier - - - - - - - - - - Phylogenetic tree generation (quartet methods) - - beta12orEarlier - Phylogenetic tree construction (quartet methods) - Construct a phylogenetic tree by computing four-taxon trees (4-trees) and searching for the phylogeny that matches most closely. - - - - - - - - - - Phylogenetic tree generation (AI methods) - - Construct a phylogenetic tree by using artificial-intelligence methods, for example genetic algorithms.
- Phylogenetic tree construction (AI methods) - beta12orEarlier - - - - - - - - - - DNA substitution modelling - - - - - - - - - - - - - - - Identify a plausible model of DNA substitution that explains a molecular (DNA or protein) sequence alignment. - Sequence alignment analysis (phylogenetic modelling) - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (shape) - - Phylogenetic tree topology analysis - Analyse the shape (topology) of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree bootstrapping - - - Apply bootstrapping or other measures to estimate confidence of a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (gene family prediction) - - - - - - - - - - - - - - Predict families of genes and gene function based on their position in a phylogenetic tree. - beta12orEarlier - - - - - - - - - - Phylogenetic tree analysis (natural selection) - - beta12orEarlier - Stabilizing/purifying (directional) selection favors a single phenotype and tends to decrease genetic diversity as a population stabilizes on a particular trait, selecting out trait extremes or deleterious mutations. In contrast, balancing selection maintain genetic polymorphisms (or multiple alleles), whereas disruptive (or diversifying) selection favors individuals at both extremes of a trait. - Analyse a phylogenetic tree to identify allele frequency distribution and change that is subject to evolutionary pressures (natural selection, genetic drift, mutation and gene flow). Identify type of natural selection (such as stabilizing, balancing or disruptive). - - - - - - - - - - Phylogenetic tree generation (consensus) - - - Compare two or more phylogenetic trees to produce a consensus tree. - Methods typically test for topological similarity between trees using for example a congruence index. 
- beta12orEarlier - Phylogenetic tree construction (consensus) - - - - - - - - - - Phylogenetic sub/super tree detection - - beta12orEarlier - Compare two or more phylogenetic trees to detect subtrees or supertrees. - - - - - - - - - - Phylogenetic tree distances calculation - - - - - - - - beta12orEarlier - Compare two or more phylogenetic trees to calculate distances between trees. - - - - - - - - - - Phylogenetic tree annotation - - beta12orEarlier - http://www.evolutionaryontology.org/cdao.owl#CDAOAnnotation - Annotate a phylogenetic tree with terms from a controlled vocabulary. - - - - - - - - - - Immunogenicity prediction - - true - 1.12 - beta12orEarlier - Peptide immunogen prediction - Predict and optimise peptide ligands that elicit an immunological response. - - - - - - - - - - DNA vaccine design - - - - - - - - beta12orEarlier - Predict or optimise DNA to elicit (via DNA vaccination) an immunological response. - - - - - - - - - - Sequence formatting - - 1.12 - beta12orEarlier - Reformat (a file or other report of) molecular sequence(s). - true - - - - - - - - - - Sequence alignment formatting - - Reformat (a file or other report of) molecular sequence alignment(s). - beta12orEarlier - true - 1.12 - - - - - - - - - - Codon usage table formatting - - Reformat a codon usage table. - true - beta12orEarlier - 1.12 - - - - - - - - - - Sequence visualisation - - - - - - - - - - - - - - - beta12orEarlier - Visualise, format or render a molecular sequence, possibly with sequence features or properties shown. - Sequence rendering - - - - - - - - - - Sequence alignment visualisation - - - - - - - - - - - - - - - Sequence alignment rendering - Visualise, format or print a molecular sequence alignment. - beta12orEarlier - - - - - - - - - - Sequence cluster visualisation - - - - - - - - Sequence cluster rendering - beta12orEarlier - Visualise, format or render sequence clusters. 
- - - - - - - - - - Phylogenetic tree visualisation - - - - - - - - - Render or visualise a phylogenetic tree. - Phylogenetic tree rendering - beta12orEarlier - - - - - - - - - - RNA secondary structure visualisation - - - - - - - - - RNA secondary structure rendering - Visualise RNA secondary structure, knots, pseudoknots etc. - beta12orEarlier - - - - - - - - - - Protein secondary structure rendering - Protein secondary structure visualisation - - - - - - - - Render and visualise protein secondary structure. - beta12orEarlier - - - - - - - - - - Structure visualisation - - - - - - - - - - - - - - - Structure rendering - Visualise or render a molecular tertiary structure, for example a high-quality static picture or animation. - beta12orEarlier - - - - - - - - - - Microarray data rendering - - - - - - - - - - Visualise microarray data. - beta12orEarlier - - - - - - - - - - Protein interaction network rendering - Protein interaction network visualisation - - - - - - - - - beta12orEarlier - Identify and analyse networks of protein interactions. - - - - - - - - - - Map drawing - - - - - - - - beta12orEarlier - DNA map drawing - Map rendering - Draw or visualise a DNA map. - - - - - - - - - - Sequence motif rendering - - Render a sequence with motifs. - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Restriction map drawing - - - - - - - - - Draw or visualise restriction maps in DNA sequences. - beta12orEarlier - - - - - - - - - - DNA linear map rendering - - beta12orEarlier - beta12orEarlier - true - Draw a linear maps of DNA. - - - - - - - - - - Plasmid map drawing - - beta12orEarlier - DNA circular map rendering - Draw a circular maps of DNA, for example a plasmid map. - - - - - - - - - - Operon drawing - - - - - - - - Visualise operon structure etc. - beta12orEarlier - Operon rendering - - - - - - - - - - Nucleic acid folding family identification - - true - beta12orEarlier - Identify folding families of related RNAs. 
- beta12orEarlier - - - - - - - - - - Nucleic acid folding energy calculation - - beta12orEarlier - Compute energies of nucleic acid folding, e.g. minimum folding energies for DNA or RNA sequences or energy landscape of RNA mutants. - - - - - - - - - - Annotation retrieval - - beta12orEarlier - Use this concepts for tools which retrieve pre-existing annotations, not for example prediction methods that might make annotations. - Retrieve existing annotation (or documentation), typically annotation on a database entity. - beta12orEarlier - true - - - - - - - - - - Protein function prediction - - - - - - - - - beta12orEarlier - Predict general functional properties of a protein. - For functional properties that can be mapped to a sequence, use 'Sequence feature detection (protein)' instead. - - - - - - - - - - Protein function comparison - - - - - - - - - Compare the functional properties of two or more proteins. - beta12orEarlier - - - - - - - - - - Sequence submission - - Submit a molecular sequence to a database. - beta12orEarlier - 1.6 - true - - - - - - - - - - Gene regulatory network analysis - - - - - - - - beta12orEarlier - Analyse a known network of gene regulation. - - - - - - - - - - - Loading - - - - - - - - Data loading - WHATIF:UploadPDB - Prepare or load a user-specified data file so that it is available for use. - beta12orEarlier - - - - - - - - - - Sequence retrieval - - This includes direct retrieval methods (e.g. the dbfetch program) but not those that perform calculations on the sequence. - Data retrieval (sequences) - 1.6 - Query a sequence data resource (typically a database) and retrieve sequences and / or annotation. - beta12orEarlier - true - - - - - - - - - - Structure retrieval - - true - WHATIF:EchoPDB - beta12orEarlier - WHATIF:DownloadPDB - This includes direct retrieval methods but not those that perform calculations on the sequence or structure. 
- Query a tertiary structure data resource (typically a database) and retrieve structures, structure-related data and annotation. - 1.6 - - - - - - - - - - Surface rendering - - - beta12orEarlier - WHATIF:GetSurfaceDots - Calculate the positions of dots that are homogeneously distributed over the surface of a molecule. - A dot has three coordinates (x,y,z) and (typically) a color. - - - - - - - - - - Protein atom surface calculation (accessible) - - beta12orEarlier - 1.12 - true - Calculate the solvent accessibility ('accessible surface') for each atom in a structure. - Waters are not considered. - - - - - - - - - - Protein atom surface calculation (accessible molecular) - - beta12orEarlier - 1.12 - Calculate the solvent accessibility ('accessible molecular surface') for each atom in a structure. - Waters are not considered. - true - - - - - - - - - - Protein residue surface calculation (accessible) - - true - 1.12 - beta12orEarlier - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('accessible surface') for each residue in a structure. - - - - - - - - - - Protein residue surface calculation (vacuum accessible) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - Calculate the solvent accessibility ('vacuum accessible surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. - 1.12 - true - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (accessible molecular) - - Calculate the solvent accessibility ('accessible molecular surface') for each residue in a structure. - true - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). 
- 1.12 - beta12orEarlier - - - - - - - - - - Protein residue surface calculation (vacuum molecular) - - Solvent accessibility might be calculated for the backbone, sidechain and total (backbone plus sidechain). - true - beta12orEarlier - Calculate the solvent accessibility ('vacuum molecular surface') for each residue in a structure. This is the accessibility of the residue when taken out of the protein together with the backbone atoms of any residue it is covalently bound to. - 1.12 - - - - - - - - - - Protein surface calculation (accessible molecular) - - true - 1.12 - beta12orEarlier - Calculate the solvent accessibility ('accessible molecular surface') for a structure as a whole. - - - - - - - - - - Protein surface calculation (accessible) - - Calculate the solvent accessibility ('accessible surface') for a structure as a whole. - beta12orEarlier - 1.12 - true - - - - - - - - - - Backbone torsion angle calculation - - 1.12 - beta12orEarlier - true - Calculate for each residue in a protein structure all its backbone torsion angles. - - - - - - - - - - Full torsion angle calculation - - 1.12 - beta12orEarlier - Calculate for each residue in a protein structure all its torsion angles. - true - - - - - - - - - - Cysteine torsion angle calculation - - beta12orEarlier - Calculate for each cysteine (bridge) all its torsion angles. - 1.12 - true - - - - - - - - - - Tau angle calculation - - beta12orEarlier - Tau is the backbone angle N-Calpha-C (angle over the C-alpha). - 1.12 - For each amino acid in a protein structure calculate the backbone angle tau. - true - - - - - - - - - - Cysteine bridge detection - - WHATIF:ShowCysteineBridge - Detect cysteine bridges (from coordinate data) in a protein structure. - beta12orEarlier - - - - - - - - - - Free cysteine detection - - beta12orEarlier - A free cysteine is neither involved in a cysteine bridge, nor functions as a ligand to a metal. - Detect free cysteines in a protein structure. 
- WHATIF:ShowCysteineFree - - - - - - - - - - Metal-bound cysteine detection - - - beta12orEarlier - WHATIF:ShowCysteineMetal - Detect cysteines that are bound to metal in a protein structure. - - - - - - - - - - Residue contact calculation (residue-nucleic acid) - - beta12orEarlier - 1.12 - true - Calculate protein residue contacts with nucleic acids in a structure. - - - - - - - - - - Protein-metal contact calculation - - beta12orEarlier - Calculate protein residue contacts with metal in a structure. - Residue-metal contact calculation - - - - - - - - - - Residue contact calculation (residue-negative ion) - - Calculate ion contacts in a structure (all ions for all side chain atoms). - beta12orEarlier - true - 1.12 - - - - - - - - - - Residue bump detection - - WHATIF:ShowBumps - beta12orEarlier - Detect 'bumps' between residues in a structure, i.e. those with pairs of atoms whose Van der Waals' radii interpenetrate more than a defined distance. - - - - - - - - - - Residue symmetry contact calculation - - Calculate the number of symmetry contacts made by residues in a protein structure. - true - 1.12 - WHATIF:SymmetryContact - A symmetry contact is a contact between two atoms in different asymmetric unit. - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-ligand) - - true - beta12orEarlier - 1.12 - Calculate contacts between residues and ligands in a protein structure. - - - - - - - - - - Salt bridge calculation - - Salt bridges are interactions between oppositely charged atoms in different residues. The output might include the inter-atomic distance. - WHATIF:HasSaltBridgePlus - WHATIF:ShowSaltBridges - beta12orEarlier - WHATIF:HasSaltBridge - WHATIF:ShowSaltBridgesH - Calculate (and possibly score) salt bridges in a protein structure. 
- - - - - - - - - - Rotamer likelihood prediction - - WHATIF:ShowLikelyRotamers - WHATIF:ShowLikelyRotamers500 - 1.12 - Predict rotamer likelihoods for all 20 amino acid types at each position in a protein structure. - WHATIF:ShowLikelyRotamers600 - WHATIF:ShowLikelyRotamers800 - WHATIF:ShowLikelyRotamers900 - true - Output typically includes, for each residue position, the likelihoods for the 20 amino acid types with estimated reliability of the 20 likelihoods. - WHATIF:ShowLikelyRotamers700 - WHATIF:ShowLikelyRotamers400 - WHATIF:ShowLikelyRotamers300 - WHATIF:ShowLikelyRotamers200 - WHATIF:ShowLikelyRotamers100 - beta12orEarlier - - - - - - - - - - Proline mutation value calculation - - true - 1.12 - Calculate for each position in a protein structure the chance that a proline, when introduced at this position, would increase the stability of the whole protein. - WHATIF:ProlineMutationValue - beta12orEarlier - - - - - - - - - - Residue packing validation - - beta12orEarlier - Identify poorly packed residues in protein structures. - WHATIF: PackingQuality - - - - - - - - - - Protein geometry validation - - WHATIF: ImproperQualitySum - beta12orEarlier - Validate protein geometry, for example bond lengths, bond angles, torsion angles, chiralities, planaraties etc. - WHATIF: ImproperQualityMax - - - - - - - - - - PDB file sequence retrieval - - Extract a molecular sequence from a PDB file. - beta12orEarlier - WHATIF: PDB_sequence - true - beta12orEarlier - - - - - - - - - - HET group detection - - true - Identify HET groups in PDB files. - beta12orEarlier - 1.12 - A HET group usually corresponds to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids. Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. - - - - - - - - - - DSSP secondary structure assignment - - Determine for residue the DSSP determined secondary structure in three-state (HSC). 
- beta12orEarlier - WHATIF: ResidueDSSP - beta12orEarlier - true - - - - - - - - - - Structure formatting - - 1.12 - true - Reformat (a file or other report of) tertiary structure data. - beta12orEarlier - WHATIF: PDBasXML - - - - - - - - - - Protein cysteine and disulfide bond assignment - - - - - - - - Assign cysteine bonding state and disulfide bond partners in protein structures. - beta12orEarlier - - - - - - - - - - Residue validation - - 1.12 - Identify poor quality amino acid positions in protein structures. - beta12orEarlier - true - - - - - - - - - - Structure retrieval (water) - - beta12orEarlier - 1.6 - WHATIF:MovedWaterPDB - true - Query a tertiary structure database and retrieve water molecules. - - - - - - - - - - siRNA duplex prediction - - - - - - - - beta12orEarlier - Identify or predict siRNA duplexes in RNA. - - - - - - - - - - Sequence alignment refinement - - - Refine an existing sequence alignment. - beta12orEarlier - - - - - - - - - - Listfile processing - - 1.6 - Process an EMBOSS listfile (list of EMBOSS Uniform Sequence Addresses). - true - beta12orEarlier - - - - - - - - - - Sequence file editing - - - beta12orEarlier - Perform basic (non-analytical) operations on a report or file of sequences (which might include features), such as file concatenation, removal or ordering of sequences, creation of subset or a new file of sequences. - - - - - - - - - - Sequence alignment file processing - - beta12orEarlier - Perform basic (non-analytical) operations on a sequence alignment file, such as copying or removal and ordering of sequences. - 1.6 - true - - - - - - - - - - Small molecule data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) physicochemical property data for small molecules. - - - - - - - - - - Data retrieval (ontology annotation) - - beta13 - Ontology information retrieval - true - Search and retrieve documentation on a bioinformatics ontology. 
- beta12orEarlier - - - - - - - - - - Data retrieval (ontology concept) - - Query an ontology and retrieve concepts or relations. - true - beta13 - beta12orEarlier - Ontology retrieval - - - - - - - - - - Representative sequence identification - - Identify a representative sequence from a set of sequences, typically using scores from pair-wise alignment or other comparison of the sequences. - beta12orEarlier - - - - - - - - - - Structure file processing - - Perform basic (non-analytical) operations on a file of molecular tertiary structural data. - 1.6 - beta12orEarlier - true - - - - - - - - - - Data retrieval (sequence profile) - - Query a profile data resource and retrieve one or more profile(s) and / or associated annotation. - true - This includes direct retrieval methods that retrieve a profile by, e.g. the profile name. - beta13 - beta12orEarlier - - - - - - - - - - Statistical calculation - - Statistics - Statistical testing - Statistical analysis - Perform a statistical data operation of some type, e.g. calibration or validation. - Gibbs sampling - beta12orEarlier - - - - - - - - - - 3D-1D scoring matrix generation - - - - - - - - - - - - - - - - beta12orEarlier - 3D-1D scoring matrix construction - A 3D-1D scoring matrix scores the probability of amino acids occurring in different structural environments. - Calculate a 3D-1D scoring matrix from analysis of protein sequence and structural data. - - - - - - - - - - Transmembrane protein visualisation - - - - - - - - - Visualise transmembrane proteins, typically the transmembrane regions within a sequence. - beta12orEarlier - Transmembrane protein rendering - - - - - - - - - - Demonstration - - beta12orEarlier - true - An operation performing purely illustrative (pedagogical) purposes. - beta13 - - - - - - - - - - Data retrieval (pathway or network) - - beta12orEarlier - true - Query a biological pathways database and retrieve annotation on one or more pathways. 
- beta13 - - - - - - - - - - Data retrieval (identifier) - - beta12orEarlier - Query a database and retrieve one or more data identifiers. - beta13 - true - - - - - - - - - - Nucleic acid density plotting - - - beta12orEarlier - Calculate a density plot (of base composition) for a nucleotide sequence. - - - - - - - - - - Sequence analysis - - - - - - - - Analyse one or more known molecular sequences. - beta12orEarlier - Sequence analysis (general) - - - - - - - - - - Sequence motif analysis - - Analyse molecular sequence motifs. - beta12orEarlier - Sequence motif processing - - - - - - - - - - Protein interaction data processing - - 1.6 - Process (read and / or write) protein interaction data. - true - beta12orEarlier - - - - - - - - - - Protein structure analysis - - - - - - - - - - - - - - - Structure analysis (protein) - beta12orEarlier - Analyse protein tertiary structural data. - - - - - - - - - - Annotation processing - - true - beta12orEarlier - beta12orEarlier - Process (read and / or write) annotation of some type, typically annotation on an entry from a biological or biomedical database entity. - - - - - - - - - - Sequence feature analysis - - beta12orEarlier - true - Analyse features in molecular sequences. - beta12orEarlier - - - - - - - - - - Data handling - - - - - - - - beta12orEarlier - File processing - Report handling - File handling - Utility operation - Processing - Basic (non-analytical) operations of some data, either a file or equivalent entity in memory, such that the same basic type of data is consumed as input and generated as output. - - - - - - - - - - Gene expression analysis - - Analyse gene expression and regulation data. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Structural profile processing - - beta12orEarlier - 1.6 - Process (read and / or write) one or more structural (3D) profile(s) or template(s) of some type. 
- 3D profile processing - true - - - - - - - - - - Data index processing - - Database index processing - true - Process (read and / or write) an index of (typically a file of) biological data. - 1.6 - beta12orEarlier - - - - - - - - - - Sequence profile processing - - true - beta12orEarlier - Process (read and / or write) some type of sequence profile. - 1.6 - - - - - - - - - - Protein function analysis - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - beta12orEarlier - Analyse protein function, typically by processing protein sequence and/or structural data, and generate an informative report. - - - - - - - - - - Protein folding analysis - - - - - - - - - - - - - - - This is a broad concept and is used a placeholder for other, more specific concepts. - Analyse protein folding, typically by processing sequence and / or structural data, and write an informative report. - Protein folding modelling - beta12orEarlier - - - - - - - - - - Protein secondary structure analysis - - - - - - - - - - - - - - Analyse known protein secondary structure data. - beta12orEarlier - Secondary structure analysis (protein) - - - - - - - - - - Physicochemical property data processing - - beta13 - true - Process (read and / or write) data on the physicochemical property of a molecule. - beta12orEarlier - - - - - - - - - - Primer and probe design - - - - - - - - - Primer and probe prediction - beta12orEarlier - Predict oligonucleotide primers or probes. - - - - - - - - - - Operation (typed) - - true - Process (read and / or write) data of a specific type, for example applying analytical methods. - beta12orEarlier - 1.12 - - - - - - - - - - Database search - - - - - - - - beta12orEarlier - Typically the query is compared to each entry and high scoring matches (hits) are returned. For example, a BLAST search of a sequence database. 
- Search a database (or other data resource) with a supplied query and retrieve entries (or parts of entries) that are similar to the query. - Search - - - - - - - - - - Data retrieval - - - - - - - - Information retrieval - beta12orEarlier - Retrieve an entry (or part of an entry) from a data resource that matches a supplied query. This might include some primary data and annotation. The query is a data identifier or other indexed term. For example, retrieve a sequence record with the specified accession number, or matching supplied keywords. - Retrieval - - - - - - - - - - Prediction and recognition - - beta12orEarlier - Recognition - Prediction - Predict, recognise, detect or identify some properties of a biomolecule. - Detection - - - - - - - - - - Comparison - - beta12orEarlier - Compare two or more things to identify similarities. - - - - - - - - - - Optimisation and refinement - - beta12orEarlier - Refine or optimise some data model. - - - - - - - - - - Modelling and simulation - - - - - - - - beta12orEarlier - Model or simulate some biological entity or system, typically using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - Mathematical modelling - - - - - - - - - - Data handling - - true - beta12orEarlier - Perform basic operations on some data or a database. - beta12orEarlier - - - - - - - - - - Validation - - beta12orEarlier - Validation and standardisation - Quality control - Validate some data. - - - - - - - - - - Mapping - - This is a broad concept and is used a placeholder for other, more specific concepts. - Map properties to positions on an biological entity (typically a molecular sequence or structure), or assemble such an entity from constituent parts. - beta12orEarlier - - - - - - - - - - Design - - beta12orEarlier - Design a biological entity (typically a molecular sequence or structure) with specific properties. 
- - - - - - - - - - Microarray data processing - - beta12orEarlier - Process (read and / or write) microarray data. - beta12orEarlier - true - - - - - - - - - - Codon usage table processing - - Process (read and / or write) a codon usage table. - beta12orEarlier - - - - - - - - - - Data retrieval (codon usage table) - - Retrieve a codon usage table and / or associated annotation. - beta12orEarlier - true - beta13 - - - - - - - - - - Gene expression profile processing - - 1.6 - Process (read and / or write) a gene expression profile. - true - beta12orEarlier - - - - - - - - - - Functional enrichment - - - - - - - - - Analyse a set of genes (genes corresponding to an expression profile, or any other set) to find functional annotations (such as cellular processes or metaobolic pathways) that the sets are significantly associated with, providing biological insight into the a set of genes. - beta12orEarlier - The Gene Ontology (GO) is invariably used, the input is a set of Gene IDs and the output of the analysis is typically a ranked list of GO terms, each associated with a p-value. - GO term enrichment - - - - - - - - - - Gene regulatory network prediction - - - - - - - - - - - - - - - Predict a network of gene regulation. - beta12orEarlier - - - - - - - - - - Pathway or network processing - - Generate, analyse or handle a biological pathway or network. - beta12orEarlier - true - 1.12 - - - - - - - - - - RNA secondary structure analysis - - - - - - - - beta12orEarlier - Process (read and / or write) RNA secondary structure data. - - - - - - - - - - Structure processing (RNA) - - Process (read and / or write) RNA tertiary structure data. - beta12orEarlier - beta13 - true - - - - - - - - - - RNA structure prediction - - - - - - - - beta12orEarlier - Predict RNA tertiary structure. - - - - - - - - - - DNA structure prediction - - - - - - - - Predict DNA tertiary structure. 
- beta12orEarlier - - - - - - - - - - Phylogenetic tree processing - - beta12orEarlier - 1.12 - true - Generate, process or analyse phylogenetic tree or trees. - - - - - - - - - - Protein secondary structure processing - - Process (read and / or write) protein secondary structure data. - 1.6 - true - beta12orEarlier - - - - - - - - - - Protein interaction network processing - - true - beta12orEarlier - Process (read and / or write) a network of protein interactions. - 1.6 - - - - - - - - - - Sequence processing - - Sequence processing (general) - Process (read and / or write) one or more molecular sequences and associated annotation. - true - beta12orEarlier - 1.6 - - - - - - - - - - Sequence processing (protein) - - Process (read and / or write) a protein sequence and associated annotation. - beta12orEarlier - true - 1.6 - - - - - - - - - - Sequence processing (nucleic acid) - - 1.6 - true - beta12orEarlier - Process (read and / or write) a nucleotide sequence and associated annotation. - - - - - - - - - - Sequence comparison - - - - - - - - - - - - - - - Compare two or more molecular sequences. - beta12orEarlier - - - - - - - - - - Sequence cluster processing - - Process (read and / or write) a sequence cluster. - true - beta12orEarlier - 1.6 - - - - - - - - - - Feature table processing - - Process (read and / or write) a sequence feature table. - 1.6 - true - beta12orEarlier - - - - - - - - - - Gene prediction - - - - - - - - - - - - - - Gene component prediction - Detect, predict and identify genes or components of genes in DNA sequences, including promoters, coding regions, splice sites, etc. - Whole gene prediction - Gene and gene component prediction - beta12orEarlier - Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. 
- Gene finding - - - - - - - - - - GPCR classification - - - - - - - - - beta12orEarlier - G protein-coupled receptor (GPCR) classification - Classify G-protein coupled receptors (GPCRs) into families and subfamilies. - - - - - - - - - - GPCR coupling selectivity prediction - - - - - - - - - - Predict G-protein coupled receptor (GPCR) coupling selectivity. - beta12orEarlier - - - - - - - - - - Structure processing (protein) - - true - 1.6 - beta12orEarlier - Process (read and / or write) a protein tertiary structure. - - - - - - - - - - Protein atom surface calculation - - Waters are not considered. - Calculate the solvent accessibility for each atom in a structure. - beta12orEarlier - 1.12 - true - - - - - - - - - - Protein residue surface calculation - - beta12orEarlier - true - Calculate the solvent accessibility for each residue in a structure. - 1.12 - - - - - - - - - - Protein surface calculation - - beta12orEarlier - Calculate the solvent accessibility of a structure as a whole. - 1.12 - true - - - - - - - - - - Sequence alignment processing - - beta12orEarlier - true - Process (read and / or write) a molecular sequence alignment. - 1.6 - - - - - - - - - - Protein-protein interaction prediction - - - - - - - - - - - - - - - Identify or predict protein-protein interactions, interfaces, binding sites etc. - beta12orEarlier - - - - - - - - - - Structure processing - - true - 1.6 - Process (read and / or write) a molecular tertiary structure. - beta12orEarlier - - - - - - - - - - Map annotation - - Annotate a DNA map of some type with terms from a controlled vocabulary. - true - beta12orEarlier - 1.6 - - - - - - - - - - Data retrieval (protein annotation) - - Retrieve information on a protein. - beta13 - true - Protein information retrieval - beta12orEarlier - - - - - - - - - - Data retrieval (phylogenetic tree) - - beta12orEarlier - beta13 - Retrieve a phylogenetic tree from a data resource. 
- true - - - - - - - - - - Data retrieval (protein interaction annotation) - - Retrieve information on a protein interaction. - true - beta13 - beta12orEarlier - - - - - - - - - - Data retrieval (protein family annotation) - - beta12orEarlier - Protein family information retrieval - beta13 - Retrieve information on a protein family. - true - - - - - - - - - - Data retrieval (RNA family annotation) - - true - Retrieve information on an RNA family. - RNA family information retrieval - beta12orEarlier - beta13 - - - - - - - - - - Data retrieval (gene annotation) - - beta12orEarlier - Gene information retrieval - Retrieve information on a specific gene. - true - beta13 - - - - - - - - - - Data retrieval (genotype and phenotype annotation) - - Retrieve information on a specific genotype or phenotype. - Genotype and phenotype information retrieval - beta12orEarlier - beta13 - true - - - - - - - - - - Protein architecture comparison - - - Compare the architecture of two or more protein structures. - beta12orEarlier - - - - - - - - - - Protein architecture recognition - - - - beta12orEarlier - Includes methods that try to suggest the most likely biological unit for a given protein X-ray crystal structure based on crystal symmetry and scoring of putative protein-protein interfaces. - Identify the architecture of a protein structure. - - - - - - - - - - Molecular dynamics simulation - - - - - - - - - - - - - - - - - - - - - - Simulate molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence analysis - - - - - - - - - - - - - - - Analyse a nucleic acid sequence (using methods that are only applicable to nucleic acid sequences). - beta12orEarlier - Sequence analysis (nucleic acid) - - - - - - - - - - Protein sequence analysis - - - - - - - - - Analyse a protein sequence (using methods that are only applicable to protein sequences). 
- Sequence analysis (protein) - beta12orEarlier - - - - - - - - - - Structure analysis - - - - - - - - beta12orEarlier - Analyse known molecular tertiary structures. - - - - - - - - - - Nucleic acid structure analysis - - - - - - - - - - - - - - - Analyse nucleic acid tertiary structural data. - beta12orEarlier - - - - - - - - - - Secondary structure processing - - 1.6 - Process (read and / or write) a molecular secondary structure. - true - beta12orEarlier - - - - - - - - - - Structure comparison - - - - - - - - - beta12orEarlier - Compare two or more molecular tertiary structures. - - - - - - - - - - Helical wheel drawing - - - - - - - - Helical wheel rendering - beta12orEarlier - Render a helical wheel representation of protein secondary structure. - - - - - - - - - - Topology diagram drawing - - - - - - - - Topology diagram rendering - beta12orEarlier - Render a topology diagram of protein secondary structure. - - - - - - - - - - Protein structure comparison - - - - - - - - - - beta12orEarlier - Structure comparison (protein) - Methods might identify structural neighbors, find structural similarities or define a structural core. - Compare protein tertiary structures. - - - - - - - - - - Protein secondary structure comparison - - - - Compare protein secondary structures. - beta12orEarlier - Secondary structure comparison (protein) - Protein secondary structure - - - - - - - - - - Protein subcellular localization prediction - - - - - - - - - The prediction might include subcellular localization (nuclear, cytoplasmic, mitochondrial, chloroplast, plastid, membrane etc) or export (extracellular proteins) of a protein. - Predict the subcellular localization of a protein sequence. - Protein targeting prediction - beta12orEarlier - - - - - - - - - - Residue contact calculation (residue-residue) - - true - beta12orEarlier - Calculate contacts between residues in a protein structure. 
- 1.12 - - - - - - - - - - Hydrogen bond calculation (inter-residue) - - Identify potential hydrogen bonds between amino acid residues. - 1.12 - true - beta12orEarlier - - - - - - - - - - Protein interaction prediction - - - - - - - - - - - - - - - Predict the interactions of proteins with other molecules. - beta12orEarlier - - - - - - - - - - Codon usage data processing - - beta12orEarlier - beta13 - Process (read and / or write) codon usage data. - true - - - - - - - - - - Gene expression data analysis - - - - - - - - Gene expression (microarray) data processing - Gene expression profile analysis - beta12orEarlier - Microarray data processing - Gene expression data processing - Gene expression analysis - Process (read and / or write) gene expression (typically microarray) data, including analysis of one or more gene expression profiles, typically to interpret them in functional terms. - - - - - - - - - - Gene regulatory network processing - - 1.6 - beta12orEarlier - Process (read and / or write) a network of gene regulation. - true - - - - - - - - - - Pathway or network analysis - - - - - - - - Pathway analysis - Generate, process or analyse a biological pathway or network. - Network analysis - beta12orEarlier - - - - - - - - - - Sequencing-based expression profile data analysis - - Analyse SAGE, MPSS or SBS experimental data, typically to identify or quantify mRNA transcripts. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - Splicing model analysis - - - - - - - - - - Analyse, characterize and model alternative splicing events from comparing multiple nucleic acid sequences. - Splicing analysis - beta12orEarlier - - - - - - - - - - Microarray raw data analysis - - beta12orEarlier - beta12orEarlier - true - Analyse raw microarray data. - - - - - - - - - - Nucleic acid analysis - - - - - - - - Process (read and / or write) nucleic acid sequence or structural data. 
- Nucleic acid data processing - beta12orEarlier - - - - - - - - - - Protein analysis - - - - - - - - beta12orEarlier - Protein data processing - Process (read and / or write) protein sequence or structural data. - - - - - - - - - - Sequence data processing - - beta12orEarlier - Process (read and / or write) molecular sequence data. - beta13 - true - - - - - - - - - - Structural data processing - - Process (read and / or write) molecular structural data. - beta13 - true - beta12orEarlier - - - - - - - - - - Text processing - - true - beta12orEarlier - Process (read and / or write) text. - 1.6 - - - - - - - - - - Protein sequence alignment analysis - - - - - - - - - - Analyse a protein sequence alignment, typically to detect features or make predictions. - beta12orEarlier - Sequence alignment analysis (protein) - - - - - - - - - - Nucleic acid sequence alignment analysis - - - - - - - - - - beta12orEarlier - Sequence alignment analysis (nucleic acid) - Analyse a protein sequence alignment, typically to detect features or make predictions. - - - - - - - - - - Nucleic acid sequence comparison - - - - Sequence comparison (nucleic acid) - Compare two or more nucleic acid sequences. - beta12orEarlier - - - - - - - - - - Protein sequence comparison - - - - beta12orEarlier - Sequence comparison (protein) - Compare two or more protein sequences. - - - - - - - - - - DNA back-translation - - - - - - - - beta12orEarlier - Back-translate a protein sequence into DNA. - - - - - - - - - - Sequence editing (nucleic acid) - - 1.8 - true - Edit or change a nucleic acid sequence, either randomly or specifically. - beta12orEarlier - - - - - - - - - - Sequence editing (protein) - - Edit or change a protein sequence, either randomly or specifically. - beta12orEarlier - true - 1.8 - - - - - - - - - - Sequence generation (nucleic acid) - - Generate a nucleic acid sequence by some means. 
- beta12orEarlier - - - - - - - - - - Sequence generation (protein) - - - Generate a protein sequence by some means. - beta12orEarlier - - - - - - - - - - Nucleic acid sequence visualisation - - Visualise, format or render a nucleic acid sequence. - true - Various nucleic acid sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - 1.8 - beta12orEarlier - - - - - - - - - - Protein sequence visualisation - - true - beta12orEarlier - Visualise, format or render a protein sequence. - 1.8 - Various protein sequence analysis methods might generate a sequence rendering but are not (for brevity) listed under here. - - - - - - - - - - Nucleic acid structure comparison - - - - Compare nucleic acid tertiary structures. - beta12orEarlier - Structure comparison (nucleic acid) - - - - - - - - - - Structure processing (nucleic acid) - - 1.6 - beta12orEarlier - true - Process (read and / or write) nucleic acid tertiary structure data. - - - - - - - - - - - DNA mapping - - - - - - - - - - - - - - - - - - - - beta12orEarlier - Generate a map of a DNA sequence annotated with positional or non-positional features of some type. - - - - - - - - - - Map data processing - - DNA map data processing - Process (read and / or write) a DNA map of some type. - beta12orEarlier - true - 1.6 - - - - - - - - - - Protein hydropathy calculation - - - - - - - - - - - - - - beta12orEarlier - Analyse the hydrophobic, hydrophilic or charge properties of a protein (from analysis of sequence or structural information). - - - - - - - - - - Protein binding site prediction - - - - - - - - - beta12orEarlier - Active site prediction - Binding site prediction - Protein binding site detection - Ligand-binding site prediction - Identify or predict catalytic residues, active sites or other ligand-binding sites in protein sequences or structures. 
- - - - - - - - - - Sequence tagged site (STS) mapping - - - - - - - - beta12orEarlier - Sequence mapping - An STS is a short subsequence of known sequence and location that occurs only once in the chromosome or genome that is being mapped. Sources of STSs include 1. expressed sequence tags (ESTs), simple sequence length polymorphisms (SSLPs), and random genomic sequences from cloned genomic DNA or database sequences. - Generate a physical DNA map (sequence map) from analysis of sequence tagged sites (STS). - - - - - - - - - - Alignment - - - - - - - - - Compare two or more entities, typically the sequence or structure (or derivatives) of macromolecules, to identify equivalent subunits. - Alignment generation - beta12orEarlier - Alignment construction - - - - - - - - - - Protein fragment weight comparison - - - Calculate the molecular weight of a protein (or fragments) and compare it to another protein or reference data. Generally used for protein identification. - Peptide mass fingerprinting - Protein fingerprinting - beta12orEarlier - PMF - - - - - - - - - - Protein property comparison - - - - - - - - Compare the physicochemical properties of two or more proteins (or reference data). - beta12orEarlier - - - - - - - - - - Secondary structure comparison - - - - - - - - Compare two or more molecular secondary structures. - beta12orEarlier - - - - - - - - - - Hopp and Woods plotting - - beta12orEarlier - 1.12 - Generate a Hopp and Woods plot of antigenicity of a protein. - true - - - - - - - - - - Microarray cluster textual view generation - - beta12orEarlier - Visualise gene clusters with gene names. - - - - - - - - - - Microarray wave graph plotting - - Microarray wave graph rendering - Microarray cluster temporal graph rendering - beta12orEarlier - This view can be rendered as a pie graph. The distance matrix is sorted by cluster number and typically represented as a diagonal matrix with distance values displayed in different color shades. 
- Visualise clustered gene expression data as a set of waves, where each wave corresponds to a gene across samples on the X-axis. - - - - - - - - - - Microarray dendrograph plotting - - Microarray dendrograph rendering - Generate a dendrograph of raw, preprocessed or clustered microarray data. - beta12orEarlier - Microarray checks view rendering - Microarray view rendering - - - - - - - - - - Microarray proximity map plotting - - beta12orEarlier - Microarray distance map rendering - Generate a plot of distances (distance matrix) between genes. - Microarray proximity map rendering - - - - - - - - - - Microarray tree or dendrogram rendering - - Microarray 2-way dendrogram rendering - beta12orEarlier - Visualise clustered gene expression data using a gene tree, array tree and color coded band of gene expression. - Microarray matrix tree plot rendering - - - - - - - - - - Microarray principal component plotting - - beta12orEarlier - Microarray principal component rendering - Generate a line graph drawn as sum of principal components (Eigen value) and individual expression values. - - - - - - - - - - Microarray scatter plot plotting - - Generate a scatter plot of microarray data, typically after principal component analysis. - beta12orEarlier - Microarray scatter plot rendering - - - - - - - - - - Whole microarray graph plotting - - Visualise gene expression data where each band (or line graph) corresponds to a sample. - beta12orEarlier - Whole microarray graph rendering - - - - - - - - - - Microarray tree-map rendering - - beta12orEarlier - Visualise gene expression data after hierarchical clustering for representing hierarchical relationships. - - - - - - - - - - Microarray Box-Whisker plot plotting - - beta12orEarlier - Visualise raw and pre-processed gene expression data, via a plot showing over- and under-expression along with mean, upper and lower quartiles. 
- - - - - - - - - - Physical mapping - - - - - - - - - - - - - - beta12orEarlier - Generate a physical (sequence) map of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. - - - - - - - - - - Analysis - - Apply analytical methods to existing data of a specific type. - This excludes non-analytical methods that read and write the same basic type of data (for that, see 'Data handling'). - beta12orEarlier - - - - - - - - - - Alignment analysis - - Process or analyse an alignment of molecular sequences or structures. - true - beta12orEarlier - 1.8 - - - - - - - - - - Article analysis - - - - - - - - - - - - - - - - - - - - Analyse a body of scientific text (typically a full text article from a scientific journal.) - beta12orEarlier - - - - - - - - - - Molecular interaction analysis - - Analyse the interactions of two or more molecules (or parts of molecules) that are known to interact. - beta12orEarlier - beta13 - true - - - - - - - - - - Protein interaction analysis - - - - - - - - - - - - - - beta12orEarlier - Analyse known protein-protein, protein-DNA/RNA or protein-ligand interactions. - - - - - - - - - - Residue distance calculation - - WHATIF:HasNegativeIonContacts - Residue contact calculation (residue-ligand) - Residue contact calculation (residue-metal) - WHATIF:SymmetryContact - Residue contact calculation (residue-negative ion) - This includes identifying HET groups, which usually correspond to ligands, lipids, but might also (not consistently) include groups that are attached to amino acids. Each HET group is supposed to have a unique three letter code and a unique name which might be given in the output. It can also include calculation of symmetry contacts, i.e. a contact between two atoms in different asymmetric unit. 
- WHATIF:HasMetalContactsPlus - Calculate contacts between residues, or between residues and other groups, in a protein structure, on the basis of distance calculations. - Residue contact calculation (residue-nucleic acid) - WHATIF: HETGroupNames - HET group detection - WHATIF:ShowDrugContacts - WHATIF:ShowLigandContacts - WHATIF:HasNucleicContacts - WHATIF:ShowDrugContactsShort - WHATIF:ShowProteiNucleicContacts - beta12orEarlier - WHATIF:HasMetalContacts - WHATIF:HasNegativeIonContactsPlus - - - - - - - - - - Alignment processing - - true - Process (read and / or write) an alignment of two or more molecular sequences, structures or derived data. - 1.6 - beta12orEarlier - - - - - - - - - - - Structure alignment processing - - Process (read and / or write) a molecular tertiary (3D) structure alignment. - 1.6 - beta12orEarlier - true - - - - - - - - - - Codon usage bias calculation - - - - - - - - Calculate codon usage bias. - beta12orEarlier - - - - - - - - - - Codon usage bias plotting - - - - - - - - - beta12orEarlier - Generate a codon usage bias plot. - - - - - - - - - - Codon usage fraction calculation - - - - - - - - Calculate the differences in codon usage fractions between two sequences, sets of sequences, codon usage tables etc. - beta12orEarlier - - - - - - - - - - Classification - - beta12orEarlier - Assign molecular sequences, structures or other biological data to a specific group or category according to qualities it shares with that group or category. - - - - - - - - - - Molecular interaction data processing - - beta13 - true - beta12orEarlier - Process (read and / or write) molecular interaction data. - - - - - - - - - - Sequence classification - - - beta12orEarlier - Assign molecular sequence(s) to a group or category. - - - - - - - - - - Structure classification - - - Assign molecular structure(s) to a group or category. 
- beta12orEarlier - - - - - - - - - - Protein comparison - - Compare two or more proteins (or some aspect) to identify similarities. - beta12orEarlier - - - - - - - - - - Nucleic acid comparison - - beta12orEarlier - Compare two or more nucleic acids to identify similarities. - - - - - - - - - - Prediction and recognition (protein) - - beta12orEarlier - Predict, recognise, detect or identify some properties of proteins. - - - - - - - - - - Prediction and recognition (nucleic acid) - - beta12orEarlier - Predict, recognise, detect or identify some properties of nucleic acids. - - - - - - - - - - Structure editing - - - - - - - - beta13 - Edit, convert or otherwise change a molecular tertiary structure, either randomly or specifically. - - - - - - - - - - Sequence alignment editing - - Edit, convert or otherwise change a molecular sequence alignment, either randomly or specifically. - beta13 - - - - - - - - - - Pathway or network visualisation - - - - - - - - - Render (visualise) a biological pathway or network. - Pathway or network rendering - beta13 - - - - - - - - - - Protein function prediction (from sequence) - - beta13 - true - Predict general (non-positional) functional properties of a protein from analysing its sequence. - For functional properties that are positional, use 'Protein site detection' instead. - 1.6 - - - - - - - - - - Protein sequence feature detection - - - - Protein site recognition - Predict, recognise and identify functional or other key sites within protein sequences, typically by scanning for known motifs, patterns and regular expressions. 
- Protein site prediction - Sequence profile database search - Protein site detection - Protein secondary database search - Sequence feature detection (protein) - beta13 - - - - - - - - - - Protein property calculation (from sequence) - - - beta13 - Calculate (or predict) physical or chemical properties of a protein, including any non-positional properties of the molecular sequence, from processing a protein sequence. - - - - - - - - - - Protein feature prediction (from structure) - - beta13 - 1.6 - true - Predict, recognise and identify positional features in proteins from analysing protein structure. - - - - - - - - - - Protein feature detection - - - - - - - - - - - - - - - Features includes functional sites or regions, secondary structure, structural domains and so on. Methods might use fingerprints, motifs, profiles, hidden Markov models, sequence alignment etc to provide a mapping of a query protein sequence to a discriminatory element. This includes methods that search a secondary protein database (Prosite, Blocks, ProDom, Prints, Pfam etc.) to assign a protein sequence(s) to a known protein family or group. - - Predict, recognise and identify positional features in proteins from analysing protein sequences or structures. - beta13 - Protein feature recognition - Protein feature prediction - - - - - - - - - - Database search (by sequence) - - Sequence screening - true - 1.6 - Screen a molecular sequence(s) against a database (of some type) to identify similarities between the sequence and database entries. - beta13 - - - - - - - - - - Protein interaction network prediction - - - - - - - - - - - - - - beta13 - Predict a network of protein interactions. - - - - - - - - - - Nucleic acid design - - - beta13 - Design (or predict) nucleic acid sequences with specific chemical or physical properties. - - - - - - - - - - Editing - - beta13 - Edit a data entity, either randomly or specifically. 
- - - - - - - - - - Sequence assembly validation - - - - - - - - - - - - - - - - - - - - - Assembly quality evaluation - Assembly QC - Sequence assembly quality evaluation - Sequence assembly QC - Evaluate a DNA sequence assembly, typically for purposes of quality control. - 1.1 - - - - - - - - - - Genome alignment - - Align two or more (typically huge) molecular sequences that represent genomes. - Genome alignment construction - 1.1 - - - - - - - - - - Localized reassembly - - Reconstruction of a sequence assembly in a localised area. - 1.1 - - - - - - - - - - Sequence assembly visualisation - - Assembly rendering - Sequence assembly rendering - Render and visualise a DNA sequence assembly. - 1.1 - Assembly visualisation - - - - - - - - - - Base-calling - - - - - - - - Phred base calling - 1.1 - Identify base (nucleobase) sequence from fluorescence 'trace' data generated by an automated DNA sequencer. - Base calling - Phred base-calling - - - - - - - - - - Bisulfite mapping - - 1.1 - Bisulfite mapping follows high-throughput sequencing of DNA which has undergone bisulfite treatment followed by PCR amplification; unmethylated cytosines are specifically converted to thymine, allowing the methylation status of cytosine in the DNA to be detected. - The mapping of methylation sites in a DNA (genome) sequence. - Bisulfite sequence alignment - Bisulfite sequence mapping - - - - - - - - - - Sequence contamination filtering - - - - - - - - beta12orEarlier - Identify and filter a (typically large) sequence data set to remove sequences from contaminants in the sample that was sequenced. - - - - - - - - - - Trim ends - - 1.1 - Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. - 1.12 - For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence.
- true - - - - - - - - - - Trim vector - - true - Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - 1.12 - 1.1 - - - - - - - - - - Trim to reference - - true - 1.1 - 1.12 - Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - - - - - - - - - - Sequence trimming - - 1.1 - Cut (remove) the end from a molecular sequence. - Barcode sequence removal - Trim vector - Trimming - Trim ends - Trim to reference - This includes - -end trimming -Trim sequences (typically from an automated DNA sequencer) to remove misleading ends. -For example trim polyA tails, introns and primer sequence flanking the sequence of amplified exons, or other unwanted sequence. - -trimming to a reference sequence, -Trim sequences (typically from an automated DNA sequencer) to remove the sequence ends that extend beyond an assembled reference sequence. - -vector trimming -Trim sequences (typically from an automated DNA sequencer) to remove sequence-specific end regions, typically contamination from vector sequences. - - - - - - - - - - - Genome feature comparison - - Genomic elements that might be compared include genes, indels, single nucleotide polymorphisms (SNPs), retrotransposons, tandem repeats and so on. - Compare the features of two genome sequences. - 1.1 - - - - - - - - - - Sequencing error detection - - - - - - - - Short read error correction - Short-read error correction - beta12orEarlier - Detect errors in DNA sequences generated from sequencing projects. - - - - - - - - - - Genotyping - - 1.1 - Methods might consider cytogenetic analyses, copy number polymorphism (and calculate copy number calls for copy-number variation (CNV) regions), single nucleotide polymorphism (SNP), rare copy number variation (CNV) identification, loss of heterozygosity data and so on.
- Analyse DNA sequence data to identify differences between the genetic composition (genotype) of an individual compared to other individuals or a reference sequence. - - - - - - - - - - Genetic variation analysis - - - 1.1 - Sequence variation analysis - Genetic variation annotation provides contextual interpretation of coding SNP consequences in transcripts. It allows comparisons to be made between variation data in different populations or strains for the same transcript. - Genetic variation annotation - Analyse a genetic variation, for example to annotate its location, alleles, classification, and effects on individual transcripts predicted for a gene model. - - - - - - - - - - Read mapping - - - Short oligonucleotide alignment - Oligonucleotide mapping - Oligonucleotide alignment generation - Short read mapping - Oligonucleotide alignment construction - The purpose of read mapping is to identify the location of sequenced fragments within a reference genome and assumes that there is, in fact, at least local similarity between the fragment and reference sequences. - Oligonucleotide alignment - Read alignment - 1.1 - Short read alignment - Align short oligonucleotide sequences (reads) to a larger (genomic) sequence. - Short sequence read mapping - - - - - - - - - - Split read mapping - - A variant of oligonucleotide mapping where a read is mapped to two separate locations because of possible structural variation. - 1.1 - - - - - - - - - - Community profiling - - - Analyse DNA sequences in order to identify a DNA 'barcode'; marker genes or any short fragment(s) of DNA that are useful to diagnose the taxa of biological organisms. - 1.1 - DNA barcoding - Sample barcoding - - - - - - - - - - SNP calling - - Identify single nucleotide change in base positions in sequencing data that differ from a reference genome and which might, especially by reference to population frequency or functional data, indicate a polymorphism.
- Operations usually score confidence in the prediction or some other statistical measure of evidence. - 1.1 - - - - - - - - - - Polymorphism detection - - Polymorphism detection - Detect mutations in multiple DNA sequences, for example, from the alignment and comparison of the fluorescent traces produced by DNA sequencing hardware. - 1.1 - Mutation detection - - - - - - - - - - Chromatogram visualisation - - Visualise, format or render an image of a Chromatogram. - Chromatogram viewing - 1.1 - - - - - - - - - - Methylation analysis - - 1.1 - Determine cytosine methylation states in nucleic acid sequences. - - - - - - - - - - Methylation calling - - - 1.1 - Determine cytosine methylation status of specific positions in a nucleic acid sequences. - - - - - - - - - - Methylation level analysis (global) - - 1.1 - Global methylation analysis - Measure the overall level of methyl cytosines in a genome from analysis of experimental data, typically from chromatographic methods and methyl accepting capacity assay. - - - - - - - - - - Methylation level analysis (gene-specific) - - Gene-specific methylation analysis - Many different techniques are available for this. - Measure the level of methyl cytosines in specific genes. - 1.1 - - - - - - - - - - Genome visualisation - - 1.1 - Genome visualization - Visualise, format or render a nucleic acid sequence that is part of (and in context of) a complete genome sequence. - Genome rendering - Genome browser - Genome viewing - Genome browsing - - - - - - - - - - Genome comparison - - Compare the sequence or features of two or more genomes, for example, to find matching regions. - 1.1 - Genomic region matching - - - - - - - - - - Genome indexing - - - - - - - - Genome indexing (Burrows-Wheeler) - Many sequence alignment tasks involving many or very large sequences rely on a precomputed index of the sequence to accelerate the alignment. 
The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. A suffix array consists of the lexicographically sorted list of suffixes of a genome. - Genome indexing (suffix arrays) - Generate an index of a genome sequence. - Suffix arrays - Burrows-Wheeler - 1.1 - - - - - - - - - - Genome indexing (Burrows-Wheeler) - - The Burrows-Wheeler Transform (BWT) is a permutation of the genome based on a suffix array algorithm. - 1.12 - true - Generate an index of a genome sequence using the Burrows-Wheeler algorithm. - 1.1 - - - - - - - - - - Genome indexing (suffix arrays) - - 1.1 - Generate an index of a genome sequence using a suffix array algorithm. - A suffix array consists of the lexicographically sorted list of suffixes of a genome. - true - 1.12 - Suffix arrays - - - - - - - - - - Spectral analysis - - - - - - - - 1.1 - Analyse one or more spectra from mass spectrometry (or other) experiments. - Spectrum analysis - Mass spectrum analysis - - - - - - - - - - Peak detection - - - - - - - - 1.1 - Peak finding - Peak assignment - Identify peaks in a spectrum from a mass spectrometry, NMR, or some other spectrum-generating experiment. - - - - - - - - - - Scaffolding - - - - - - - - - Scaffold construction - Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contigs; contiguous sequences corresponding to read overlaps. - 1.1 - Scaffold may be positioned along a chromosome physical map to create a "golden path". - Scaffold generation - - - - - - - - - - Scaffold gap completion - - Fill the gaps in a sequence assembly (scaffold) by merging in additional sequences. - Different techniques are used to generate gap sequences to connect contigs, depending on the size of the gap. For small (5-20kb) gaps, PCR amplification and sequencing are used. For large (>20kb) gaps, fragments are cloned (e.g.
in BAC (Bacterial artificial chromosomes) vectors) and then sequenced. - 1.1 - - - - - - - - - - Sequencing quality control - - - Raw sequence data quality control. - Analyse raw sequence data from a sequencing pipeline and identify (and possibly fix) problems. - Sequencing QC - 1.1 - - - - - - - - - - Read pre-processing - - - Sequence read pre-processing - Pre-process sequence reads to ensure (or improve) quality and reliability. - For example process paired end reads to trim low quality ends, remove short sequences, identify sequence inserts, detect chimeric reads, or remove low quality sequences including vector, adaptor, low complexity and contaminant sequences. Sequences might come from genomic DNA library, EST libraries, SSH library and so on. - 1.1 - - - - - - - - - - Species frequency estimation - - - - - - - - Estimate the frequencies of different species from analysis of the molecular sequences, typically of DNA recovered from environmental samples. - 1.1 - - - - - - - - - - Peak calling - - Peak-pair calling - ChIP-sequencing combines chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing to generate a set of reads, which are aligned to a genome sequence. The enriched areas contain the binding sites of DNA-associated proteins. For example, a transcription factor binding site. ChIP-on-chip in contrast combines chromatin immunoprecipitation ('ChIP') with microarray ('chip'). "Peak-pair calling" is similar to "Peak calling" in the context of ChIP-exo. - Identify putative protein-binding regions in a genome sequence from analysis of ChIP-sequencing data or ChIP-on-chip data. - Protein binding peak detection - 1.1 - - - - - - - - - - Differential expression analysis - - Identify (typically from analysis of microarray or RNA-seq data) genes whose expression levels are significantly different between two sample groups.
- Differentially expressed gene identification - Differential expression analysis is used, for example, to identify which genes are up-regulated (increased expression) or down-regulated (decreased expression) between a group treated with a drug and a control group. - 1.1 - - - - - - - - - - Gene set testing - - 1.1 - Gene sets can be defined beforehand by biological function, chromosome locations and so on. - Analyse gene expression patterns (typically from DNA microarray datasets) to identify sets of genes that are associated with a specific trait, condition, clinical outcome etc. - - - - - - - - - - Variant classification - - - Classify variants based on their potential effect on genes, especially functional effects on the expressed proteins. - 1.1 - Variants are typically classified by their position (intronic, exonic, etc.) in a gene transcript and (for variants in coding exons) by their effect on the protein sequence (synonymous, non-synonymous, frameshifting, etc.) - - - - - - - - - - Variant prioritization - - Variant prioritization can be used for example to produce a list of variants responsible for 'knocking out' genes in specific genomes. Methods include amino acid substitution, aggregative approaches, probabilistic approach, inheritance and unified likelihood-frameworks. - Identify biologically interesting variants by prioritizing individual variants, for example, homozygous variants absent in control genomes. - 1.1 - - - - - - - - - - Variant calling - - Allele calling - Somatic variant calling - Germ line variant calling - Somatic variant calling is the detection of variations established in somatic cells and hence not inherited as a germ line variant. - Methods often utilise a database of aligned reads. - Variant mapping - 1.1 - Variant detection - Identify and map genomic alterations, including single nucleotide polymorphisms, short indels and structural variants, in a genome sequence.
- - - - - - - - - - Structural variation discovery - - Detect large regions in a genome subject to copy-number variation, or other structural variations in genome(s). - 1.1 - Methods might involve analysis of whole-genome array comparative genome hybridization or single-nucleotide polymorphism arrays, paired-end mapping of sequencing data, or from analysis of short reads from new sequencing technologies. - - - - - - - - - - Exome assembly - Exome analysis - - 1.1 - Exome sequence analysis - Analyse sequencing data from experiments aiming to selectively sequence the coding regions of the genome. - - - - - - - - - - Read depth analysis - - 1.1 - Analyse mapping density (read depth) of (typically) short reads from sequencing platforms, for example, to detect deletions and duplications. - - - - - - - - - - Gene expression QTL analysis - - - - - - - - expression quantitative trait loci profiling - 1.1 - eQTL profiling - Combine classical quantitative trait loci (QTL) analysis with gene expression profiling, for example, to describe cis- and trans-controlling elements for the expression of phenotype associated genes. - expression QTL profiling - - - - - - - - - - Copy number estimation - - Methods typically implement some statistical model for hypothesis testing, and methods estimate total copy number, i.e. do not distinguish the two inherited chromosomes quantities (specific copy number). - Transcript copy number estimation - 1.1 - Estimate the number of copies of loci of particular gene(s) in DNA sequences typically from gene-expression profiling technology based on microarray hybridization-based experiments. For example, estimate copy number (or marker dosage) of a dominant marker in samples from polyploid plant cells or tissues, or chromosomal gains and losses in tumors. - - - - - - - - - - Primer removal - - 1.2 - Remove forward and/or reverse primers from nucleic acid sequences (typically PCR products).
- Adapter removal - - - - - - - - - - Transcriptome assembly - - - - - - - - - - - - - - Infer a transcriptome sequence by analysis of short sequence reads. - 1.2 - - - - - - - - - - Transcriptome assembly (de novo) - - de novo transcriptome assembly - true - 1.6 - 1.2 - Infer a transcriptome sequence without the aid of a reference genome, i.e. by comparing short sequences (reads) to each other. - - - - - - - - - - Transcriptome assembly (mapping) - - Infer a transcriptome sequence by mapping short reads to a reference genome. - 1.6 - 1.2 - true - - - - - - - - - - Sequence coordinate conversion - - - - - - - - - - - - - - 1.3 - Convert one set of sequence coordinates to another, e.g. convert coordinates of one assembly to another, cDNA to genomic, CDS to genomic, protein translation to genomic etc. - - - - - - - - - - Document similarity calculation - - Calculate similarity between 2 or more documents. - 1.3 - - - - - - - - - - Document clustering - - - Cluster (group) documents on the basis of their calculated similarity. - 1.3 - - - - - - - - - - Named entity recognition - - - Entity identification - Entity chunking - Entity extraction - Recognise named entities (text tokens) within documents. - 1.3 - - - - - - - - - - ID mapping - - - Identifier mapping - The mapping can be achieved by comparing identifier values or some other means, e.g. exact matches to a provided sequence. - 1.3 - Accession mapping - Map data identifiers to one another for example to establish a link between two biological databases for the purposes of data integration. - - - - - - - - - - Anonymisation - - Process data in such a way that makes it hard to trace to the person which the data concerns. - 1.3 - Data anonymisation - - - - - - - - - - ID retrieval - - - - - - - - id retrieval - Data retrieval (accession) - Data retrieval (ID) - Identifier retrieval - Data retrieval (id) - Accession retrieval - Search for and retrieve a data identifier of some kind, e.g. a database entry accession. 
- 1.3 - - - - - - - - - - Sequence checksum generation - - - - - - - - - - - - - - Generate a checksum of a molecular sequence. - 1.4 - - - - - - - - - - Bibliography generation - - - - - - - - Bibliography construction - Construct a bibliography from the scientific literature. - 1.4 - - - - - - - - - - Protein quaternary structure prediction - - 1.4 - Predict the structure of a multi-subunit protein and particularly how the subunits fit together. - - - - - - - - - - Molecular surface analysis - - - - - - - - - - - - - - 1.4 - Analyse the surface properties of proteins or other macromolecules, including surface accessible pockets, interior inaccessible cavities etc. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - - - - - - - - - - Ontology comparison - - 1.4 - Compare two or more ontologies, e.g. identify differences. - 1.9 - - - - - - - - - - Format detection - - - - - - - - - - - - - - Recognition of which format the given data is in. - 1.4 - Format identification - Format recognition - 'Format recognition' is not a bioinformatics-specific operation, but of great relevance in bioinformatics. Should be removed from EDAM if/when captured satisfactorily in a suitable domain-generic ontology. - Format inference - - - - - - The has_input "Data" (data_0006) may cause visualisation or other problems although ontologically correct. But on the other hand it may be useful to distinguish from nullary operations without inputs. - - - - - - - - - - - Splitting - - File splitting - Split a file containing multiple data items into many files, each containing one item - 1.4 - - - - - - - - - - Generation - - Construction - beta12orEarlier - For non-analytical operations, see the 'Processing' branch. - Construct some data entity. 
- - - - - - - - - - Nucleic acid sequence feature detection - - - Nucleic acid site prediction - Predict, recognise and identify functional or other key sites within nucleic acid sequences, typically by scanning for known motifs, patterns and regular expressions. - Nucleic acid site recognition - 1.6 - Nucleic acid site detection - - - - - - - - - - Deposition - - Deposit some data in a database or some other type of repository or software system. - 1.6 - Database submission - Submission - Data submission - Data deposition - Database deposition - For non-analytical operations, see the 'Processing' branch. - - - - - - - - - - Clustering - - 1.6 - Group together some data entities on the basis of similarities such that entities in the same group (cluster) are more similar to each other than to those in other groups (clusters). - - - - - - - - - - Assembly - - 1.6 - Construct some entity (typically a molecule sequence) from component pieces. - - - - - - - - - - Conversion - - Convert a data set from one form to another. - 1.6 - - - - - - - - - - Standardization and normalization - - Normalization - 1.6 - Standardization - Standardize or normalize data. - - - - - - - - - - Aggregation - - Combine multiple files or data items into a single file or object. - 1.6 - - - - - - - - - - Article comparison - - Compare two or more scientific articles. - 1.6 - - - - - - - - - - Calculation - - Mathematical determination of the value of something, typically a property of a molecule. - 1.6 - - - - - - - - - - Pathway or network prediction - - - 1.6 - Predict a molecular pathway or network. - - - - - - - - - - Genome assembly - - 1.12 - 1.6 - The process of assembling many short DNA sequences together such that they represent the original chromosomes from which the DNA originated. - true - - - - - - - - - - - Plotting - - Generate a graph, or other visual representation, of data, showing the relationship between two or more variables.
- 1.6 - - - - - - - - - - Image analysis - - - - - - - - 1.7 - The analysis of an image (typically a digital image) of some type in order to extract information from it. - Image processing - - - - - - - - - - - Diffraction data analysis - - 1.7 - Analysis of data from a diffraction experiment. - - - - - - - - - - Cell migration analysis - - - - - - - - 1.7 - Analysis of cell migration images in order to study cell migration, typically in order to study the processes that play a role in the disease progression. - - - - - - - - - - Diffraction data reduction - - 1.7 - Processing of diffraction data into a corrected, ordered, and simplified form. - - - - - - - - - - Neurite measurement - - - - - - - - Measurement of neurites; projections (axons or dendrites) from the cell body of a neuron, from analysis of neuron images. - 1.7 - - - - - - - - - - Diffraction data integration - - 1.7 - Diffraction summation integration - Diffraction profile fitting - The evaluation of diffraction intensities and integration of diffraction maxima from a diffraction experiment. - - - - - - - - - - Phasing - - Phase a macromolecular crystal structure, for example by using molecular replacement or experimental phasing methods. - 1.7 - - - - - - - - - - Molecular replacement - - 1.7 - A technique used to construct an atomic model of an unknown structure from diffraction data, based upon an atomic model of a known structure, either a related protein or the same protein from a different crystal form. - The technique solves the phase problem, i.e. retrieves information concerning the phases of the structure. - - - - - - - - - - Rigid body refinement - - 1.7 - Rigid body refinement usually follows molecular replacement in the assignment of a structure from diffraction data. - A method used to refine a structure by moving the whole molecule or parts of it as a rigid unit, rather than moving individual atoms.
- - - - - - - - - - Single particle analysis - - - - - - - - - An image processing technique that combines and analyzes multiple images of a particulate sample, in order to produce an image with clearer features that are more easily interpreted. - 1.7 - Single particle analysis is used to improve the information that can be obtained by relatively low resolution techniques, e.g. an image of a protein or virus from transmission electron microscopy (TEM). - - - - - - - - - - Single particle alignment and classification - - - Compare (align and classify) multiple particle images from a micrograph in order to produce a representative image of the particle. - 1.7 - A micrograph can include particles in multiple different orientations and/or conformations. Particles are compared and organised into sets based on their similarity. Typically iterations of classification and alignment are performed to optimise the final image; average images produced by classification are used as a reference image for subsequent alignment of the whole image set. - - - - - - - - - - Functional clustering - - - - - - - - 1.7 - Clustering of molecular sequences on the basis of their function, typically using information from an ontology of gene function, or some other measure of functional phenotype. - Functional sequence clustering - - - - - - - - - - Taxonomic classification - - Taxonomy assignment - Classification (typically of molecular sequences) by assignment to some taxonomic hierarchy. - 1.7 - - - - - - - - - - Virulence prediction - - - - - - - - - Pathogenicity prediction - The prediction of the degree of pathogenicity of a microorganism from analysis of molecular sequences. - 1.7 - - - - - - - - - - Gene expression correlation analysis - - - 1.7 - Gene co-expression network analysis - Analyse the correlation patterns among genes across a variety of experiments, microarray samples etc. - - - - - - - - - - - Correlation - - - - - - - - 1.7 - Identify a correlation, i.e.
a statistical relationship between two random variables or two sets of data. - - - - - - - - - - RNA structure covariance model generation - - - - - - - - - Compute the covariance model for (a family of) RNA secondary structures. - 1.7 - - - - - - - - - - RNA secondary structure prediction (shape-based) - - RNA shape prediction - Predict RNA secondary structure by analysis, e.g. probabilistic analysis, of the shape of RNA folds. - 1.7 - - - - - - - - - - Nucleic acid folding prediction (alignment-based) - - 1.7 - Prediction of nucleic-acid folding using sequence alignments as a source of data. - - - - - - - - - - k-mer counting - - Count k-mers (substrings of length k) in DNA sequence data. - 1.7 - k-mer counting is used in genome and transcriptome assembly, metagenomic sequencing, and for error correction of sequence reads. - - - - - - - - - - Phylogenetic tree reconstruction - - - - - - - - Reconstructing the inner node labels of a phylogenetic tree from its leaves. - Note that this is somewhat different from simply analysing an existing tree or constructing a completely new one. - 1.7 - - - - - - - - - - Probabilistic data generation - - Generate some data from a chosen probabilistic model, possibly to evaluate algorithms. - 1.7 - - - - - - - - - - Probabilistic sequence generation - - - 1.7 - Generate sequences from some probabilistic model, e.g. a model that simulates evolution. - - - - - - - - - - Antimicrobial resistance prediction - - - - - - - - - 1.7 - Identify or predict causes for antibiotic resistance from molecular sequence analysis. - - - - - - - - - - Enrichment - - - - - - - - - A relevant ontology will be used. The input is typically a set of identifiers or other data, and the output of the analysis is typically a ranked list of ontology terms, each associated with a p-value. - Term enrichment - 1.8 - Analyse a dataset with respect to concepts from an ontology.
- - - - - - - - - - Chemical class enrichment - - - - - - - - - 1.8 - Analyse a dataset with respect to concepts from an ontology of chemical structure. - - - - - - - - - - Incident curve plotting - - 1.8 - Plot an incident curve such as a survival curve, death curve, mortality curve. - - - - - - - - - - Variant pattern analysis - - Methods often utilise a database of aligned reads. - Identify and map patterns of genomic variations. - 1.8 - - - - - - - - - - Mathematical modelling - - 1.12 - Model some biological system using mathematical techniques including dynamical systems, statistical models, differential equations, and game theoretic models. - true - beta12orEarlier - - - - - - - - - - Microscope image visualisation - - - - - - - - Visualise images resulting from various types of microscopy. - 1.9 - Microscopy image visualisation - - - - - - - - - - Image annotation - - 1.9 - Annotate an image of some sort, typically with terms from a controlled vocabulary. - - - - - - - - - - Imputation - - Data imputation - Replace missing data with substituted values, usually by using some statistical or other mathematical approach. - 1.9 - - - - - - - - - - Ontology visualisation - - 1.9 - Visualise, format or render data from an ontology, typically a tree of terms. - Ontology browsing - - - - - - - - - - Maximum occurence analysis - - A method for making numerical assessments about the maximum percent of time that a conformer of a flexible macromolecule can exist and still be compatible with the experimental data. - beta12orEarlier - - - - - - - - - - Database comparison - - - 1.9 - Data model comparison - Compare the models or schemas used by two or more databases, or any other general comparison of databases rather than a detailed comparison of the entries themselves. - Schema comparison - - - - - - - - - - Network simulation - - - - - - - - Simulate the behaviour of a biological pathway or network.
- Pathway simulation - Network topology simulation - 1.9 - - - - - - - - - - RNA-seq read count analysis - - Analyze read counts from RNA-seq experiments. - 1.9 - - - - - - - - - - Chemical redundancy removal - - 1.9 - Identify and remove redundancy from a set of small molecule structures. - - - - - - - - - - RNA-seq time series data analysis - - 1.9 - Analyze time series data from an RNA-seq experiment. - - - - - - - - - - Simulated gene expression data generation - - 1.9 - Simulate gene expression data, e.g. for purposes of benchmarking. - - - - - - - - - - Relationship inference - - - - - - - - - - - - - - - - - - - - 1.12 - Identify semantic relationships within a text or between two or more texts using text mining techniques. - - - - - - - - - - Mass spectra calibration - - - - - - - - Re-adjust the output of mass spectrometry experiments with shifted ppm values. - 1.12 - - - - - - - - - - Chromatographic alignment - - - - - - - - Align multiple data sets using information from chromatography and/or peptide identification, from mass spectrometry experiments. - 1.12 - - - - - - - - - - Deisotoping - - - - - - - - The removal of isotope peaks in a spectrum, to represent the fragment ion as one data point. - Deconvolution - 1.12 - Deisotoping is commonly done to reduce complexity, and done in conjunction with the charge state deconvolution. - - - - - - - - - - Quantification - - - - - - - - Technique for determining the amount of proteins in a sample. - 1.12 - Quantitation - - - - - - - - - - Peptide identification - - - - - - - - Peptide-spectrum-matching - Determination of peptide sequence from mass spectrum. - 1.12 - - - - - - - - - - Isotopic distributions calculation - - - - - - - - - - - - - - 1.12 - Calculate the isotope distribution of a given chemical species.
- - - - - - - - - - Retention times prediction - - Retention times calculation - Prediction of retention times in a mass spectrometry experiment based on compositional and structural properties of the separated species. - 1.12 - - - - - - - - - - Label-free quantification - - 1.12 - Quantification without the use of chemical tags. - - - - - - - - - - Labeled quantification - - 1.12 - Quantification based on the use of chemical tags. - - - - - - - - - - MRM/SRM - - 1.12 - Quantification by Selected/multiple Reaction Monitoring workflow (XIC quantitation of precursor / fragment mass pair). - - - - - - - - - - Spectral counting - - 1.12 - Calculate number of identified MS2 spectra as approximation of peptide / protein quantity. - - - - - - - - - - SILAC - - Quantification analysis using stable isotope labeling by amino acids in cell culture. - 1.12 - - - - - - - - - - iTRAQ - - 1.12 - Quantification analysis using the AB SCIEX iTRAQ isobaric labelling workflow, wherein 2-8 reporter ions are measured in MS2 spectra near 114 m/z. - - - - - - - - - - 18O labeling - - 1.12 - Quantification analysis using labeling based on 18O-enriched H2O. - - - - - - - - - - TMT-tag - - 1.12 - Quantification analysis using the Thermo Fisher tandem mass tag labelling workflow. - - - - - - - - - - Dimethyl - - 1.12 - Quantification analysis using chemical labeling by stable isotope dimethylation - - - - - - - - - - Tag-based peptide identification - - Peptide sequence tags are used as piece of information about a peptide obtained by tandem mass spectrometry. - 1.12 - - - - - - - - - - de Novo sequencing - - - Analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum (MS/MS) without the assistance of a sequence database. - 1.12 - - - - - - - - - - PTM identification - - Identification of post-translational modifications (PTMs) of peptides/proteins in mass spectrum. 
- 1.12 - - - - - - - - - - Peptide database search - - - 1.12 - Determination of best matches between MS/MS spectrum and a database of protein or nucleic acid sequences. - - - - - - - - - - Blind peptide database search - - Modification-tolerant peptide database search - Unrestricted peptide database search - 1.12 - Peptide database search for identification of known and unknown PTMs looking for mass difference mismatches. - - - - - - - - - - Validation of peptide-spectrum matches - - - Statistical estimation of false discovery rate from score distribution for peptide-spectrum-matches, following a peptide database search. - 1.12 - - - - - - - - - - Target-Decoy - - Estimation of false discovery rate by comparison to search results with a database containing incorrect information. - 1.12 - - - - - - - - - - Statistical inference - - 1.12 - Empirical Bayes - Analyse data in order to deduce properties of an underlying distribution or population. - - - - - - - - - - Regression analysis - - A statistical calculation to estimate the relationships among variables. - Regression - 1.12 - - - - - - - - - - Metabolic network modelling - - - - - - - - Model a metabolic network, for example, to reconstruct pathways or to simulate metabolism. - Metabolic reconstruction - Metabolic network reconstruction - Metabolic network simulation - 1.12 - - - - - - - - - - SNP annotation - - Predict the effect or function of an individual single nucleotide polymorphism (SNP). - 1.12 - - - - - - - - - - Ab-initio gene prediction - - Prediction of genes or gene components from first principles, i.e. without reference to existing genes. - 1.12 - Gene prediction (ab-initio) - - - - - - - - - - Homology-based gene prediction - - Gene prediction (homology-based) - Prediction of genes or gene components by reference to homologous genes. 
- 1.12 - - - - - - - - - - Statistical modelling - - 1.12 - Construction of a statistical model, or a set of assumptions around some observed data, usually by describing a set of probability distributions which approximate the distribution of data. - - - - - - - - - - Molecular surface comparison - - - 1.12 - Compare two or more molecular surfaces. - - - - - - - - - - Gene functional annotation - - 1.12 - Annotate one or more sequences with functional information, such as cellular processes or metabolic pathways, by reference to a controlled vocabulary - invariably the Gene Ontology (GO). - - - - - - - - - - Variant filtering - - - 1.12 - Variant filtering is used to eliminate false positive variants based for example on base calling quality, strand and position information, and mapping info. - - - - - - - - - - Differential binding analysis - - 1.12 - Differential binding analysis identifies binding sites in nucleic acid sequences that are statistically significantly differentially bound between sample groups. - - - - - - - - - - RNA-Seq analysis - - Analyze data from RNA-seq experiments. - 1.13 - - - - - - - - - - Mass spectrum visualisation - - 1.1 - Visualise, format or render a mass spectrum. - - - - - - - - - - Filtering - - Filter a set of files or data items according to some property. - 1.13 - Sequence filtering - - - - - - - - - - Reference identification - - Identification of the best reference for mapping for a specific dataset from a list of potential references, when performing genetic variation analysis. - 1.1 - - - - - - - - - - Ion counting - - Ion current integration - Label-free quantification by integration of ion current (ion counting). - 1.14 - - - - - - - - - - Isotope-coded protein label - - Chemical tagging free amino groups of intact proteins with stable isotopes. - ICPL - 1.14 - - - - - - - - - - Metabolic labeling - - Labeling all proteins and (possibly) all amino acids using C-13 or N-15 enriched grown medium or feed.
- 1.14 - This includes N-15 metabolic labeling (labeling all proteins and (possibly) all amino acids using N-15 enriched grown medium or feed) and C-13 metabolic labeling (labeling all proteins and (possibly) all amino acids using C-13 enriched grown medium or feed). - N-15 metabolic labeling - C-13 metabolic labeling - - - - - - - - - - Topic - - http://purl.org/biotop/biotop.owl#Quality - http://bioontology.org/ontologies/ResearchArea.owl#Area_of_Research - http://www.onto-med.de/ontologies/gfo.owl#Category - http://www.ifomis.org/bfo/1.1/snap#Quality - http://www.onto-med.de/ontologies/gfo.owl#Perpetuant - A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. Topics have no clearly defined borders between each other. - http://www.loa-cnr.it/ontologies/DOLCE-Lite.owl#quality - beta12orEarlier - http://www.ifomis.org/bfo/1.1/snap#Continuant - sumo:FieldOfStudy - http://onto.eva.mpg.de/ontologies/gfo-bio.owl#Method - - - - - - - - - - Nucleic acid analysis - - The processing and analysis of nucleic acid sequence, structural and other data. - Nucleic acid bioinformatics - Nucleic acids - Nucleic acid informatics - http://purl.bioontology.org/ontology/MSH/D017423 - Nucleic acid properties - Nucleic acid physicochemistry - http://purl.bioontology.org/ontology/MSH/D017422 - true - beta12orEarlier - - - - - - - - - - Protein analysis - - Protein informatics - Proteins - http://purl.bioontology.org/ontology/MSH/D020539 - Protein bioinformatics - Protein databases - true - beta12orEarlier - Archival, processing and analysis of protein data, typically molecular sequence and structural data. - - - - - - - - - - Metabolites - - 1.13 - true - The structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. 
- beta12orEarlier - - - - - - - - - - Sequence analysis - - true - beta12orEarlier - Sequence databases - Sequences - http://purl.bioontology.org/ontology/MSH/D017421 - The archival, processing and analysis of molecular sequences (monomer composition of polymers) including molecular sequence data resources, sequence sites, alignments, motifs and profiles. - - - - - - - - - - - Structure analysis - - Computational structural biology - true - The curation, processing and analysis of the structure of biological molecules, typically proteins and nucleic acids and other macromolecules. - http://purl.bioontology.org/ontology/MSH/D015394 - Structural bioinformatics - Structure databases - This includes related concepts such as structural properties, alignments and structural motifs. - Structure data resources - beta12orEarlier - - - - - - - - - - - Structure prediction - - Protein fold recognition - The prediction of molecular structure, including the prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features, and the folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. - - - Nucleic acid structure prediction - beta12orEarlier - Protein structure prediction - true - DNA structure prediction - Nucleic acid design - Nucleic acid folding - RNA structure prediction - This includes the recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s), for example by threading, or the alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - - - - - - - - - - Alignment - - beta12orEarlier - true - The alignment (equivalence between sites) of molecular sequences, structures or profiles (representing a sequence or structure alignment). 
- beta12orEarlier - - - - - - - - - - - Phylogeny - - - Phylogeny reconstruction - Phylogenetic stratigraphy - beta12orEarlier - Phylogenetic dating - Phylogenetic clocks - true - http://purl.bioontology.org/ontology/MSH/D010802 - The study of evolutionary relationships amongst organisms. - Phylogenetic simulation - This includes diverse phylogenetic methods, including phylogenetic tree construction, typically from molecular sequence or morphological data, methods that simulate DNA sequence evolution, a phylogenetic tree or the underlying data, or which estimate or use molecular clock and stratigraphic (age) data, methods for studying gene evolution etc. - - - - - - - - - - - Functional genomics - - - beta12orEarlier - true - The study of gene or protein functions and their interactions in totality in a given organism, tissue, cell etc. - - - - - - - - - - - Ontology and terminology - - true - Terminology - beta12orEarlier - http://purl.bioontology.org/ontology/MSH/D002965 - Applied ontology - Ontology - The conceptualisation, categorisation and nomenclature (naming) of entities or phenomena within biology or bioinformatics. This includes formal ontologies, controlled vocabularies, structured glossary, symbols and terminology or other related resource. - Ontologies - - - - - - - - - - - Information retrieval - - beta12orEarlier - 1.13 - true - The search and query of data sources (typically databases or ontologies) in order to retrieve entries or other information. - VT 1.3.3 Information retrieval - - - - - - - - - - Bioinformatics - - This includes data processing in general, including basic handling of files and databases, datatypes, workflows and annotation. - VT 1.5.6 Bioinformatics - The archival, curation, processing and analysis of complex biological data. 
- http://purl.bioontology.org/ontology/MSH/D016247 - beta12orEarlier - true - - - - - - - - - - - Data visualisation - - Data rendering - Rendering (drawing on a computer screen) or visualisation of molecular sequences, structures or other biomolecular data. - true - VT 1.2.5 Computer graphics - beta12orEarlier - Computer graphics - - - - - - - - - - Nucleic acid thermodynamics - - true - The study of the thermodynamic properties of a nucleic acid. - 1.3 - - - - - - - - - - Nucleic acid structure analysis - - - Includes secondary and tertiary nucleic acid structural data, nucleic acid thermodynamic, thermal and conformational properties including DNA or DNA/RNA denaturation (melting) etc. - DNA melting - Nucleic acid denaturation - RNA alignment - The archival, curation, processing and analysis of nucleic acid structural information, such as whole structures, structural features and alignments, and associated annotation. - beta12orEarlier - RNA structure alignment - Nucleic acid structure - Nucleic acid thermodynamics - RNA structure - - - - - - - - - - RNA - - beta12orEarlier - Small RNA - RNA sequences and structures. - - - - - - - - - - Nucleic acid restriction - - 1.3 - beta12orEarlier - Topic for the study of restriction enzymes, their cleavage sites and the restriction of nucleic acids. - true - - - - - - - - - - Mapping - - The mapping of complete (typically nucleotide) sequences. Mapping (in the sense of short read alignment, or more generally, just alignment) has application in RNA-Seq analysis (mapping of transcriptomics reads), variant discovery (e.g. mapping of exome capture), and re-sequencing (mapping of WGS reads). - Genetic linkage - Linkage - Linkage mapping - true - Synteny - This includes resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. 
It also includes resources for physical (sequence) maps of a DNA sequence showing the physical distance (base pairs) between features or landmarks such as restriction sites, cloned DNA fragments, genes and other genetic markers. It also covers for example the alignment of sequences of (typically millions) of short reads to a reference genome. - DNA mapping - beta12orEarlier - - - - - - - - - Genetic codes and codon usage - - beta12orEarlier - true - 1.3 - Codon usage analysis - The study of codon usage in nucleotide sequence(s), genetic codes and so on. - - - - - - - - - - Protein expression - - Translation - The translation of mRNA into protein and subsequent protein processing in the cell. - beta12orEarlier - - - - - - - - - - - Gene finding - - 1.3 - This includes the study of promoters, coding regions, splice sites, etc. Methods for gene prediction might be ab initio, based on phylogenetic comparisons, use motifs, sequence features, support vector machine, alignment etc. - Gene discovery - Methods that aims to identify, predict, model or analyse genes or gene structure in DNA sequences. - beta12orEarlier - Gene prediction - true - - - - - - - - - - Transcription - - 1.3 - The transcription of DNA into mRNA. - beta12orEarlier - true - - - - - - - - - - Promoters - - true - beta12orEarlier - Promoters in DNA sequences (region of DNA that facilitates the transcription of a particular gene by binding RNA polymerase and transcription factor proteins). - beta13 - - - - - - - - - - Nucleic acid folding - - beta12orEarlier - The folding (in 3D space) of nucleic acid molecules. - true - beta12orEarlier - - - - - - - - - - Gene structure - - This includes the study of promoters, coding regions etc. - beta12orEarlier - Fusion genes - Gene features - true - Gene structure, regions which make an RNA product and features such as promoters, coding regions, gene fusion, splice sites etc. - - This incudes operons (operators, promoters and genes) from a bacterial genome. 
For example the operon leader and trailer gene, gene composition of the operon and associated information. - - - - - - - - - - Proteomics - - beta12orEarlier - Protein and peptide identification, especially in the study of whole proteomes of organisms. - Protein and peptide identification - Peptide identification - Proteomics includes any methods (especially high-throughput) that separate, characterize and identify expressed proteins such as mass spectrometry, two-dimensional gel electrophoresis and protein microarrays, as well as in-silico methods that perform proteolytic or mass calculations on a protein sequence and other analyses of protein expression data, for example in different cells or tissues. - true - http://purl.bioontology.org/ontology/MSH/D040901 - Protein expression - - - - - - - - - - - Structural genomics - - - true - beta12orEarlier - The elucidation of the three dimensional structure for all (available) proteins in a given organism. - - - - - - - - - - - Protein properties - - The study of the physical and biochemical properties of peptides and proteins, for example the hydrophobic, hydrophilic and charge properties of a protein. - Protein hydropathy - true - Protein physicochemistry - beta12orEarlier - - - - - - - - - - Protein interactions - - - Protein-protein, protein-DNA/RNA and protein-ligand interactions, including analysis of known interactions and prediction of putative interactions. - Protein-nucleic acid interactions - Protein-RNA interaction - Protein interaction networks - This includes experimental (e.g. yeast two-hybrid) and computational analysis techniques. 
- Protein-protein interactions - Protein-ligand interactions - beta12orEarlier - Protein-DNA interaction - true - - - - - - - - - - Protein folding, stability and design - - beta12orEarlier - Protein residue interactions - Protein design - true - Protein folding - Protein stability - Protein stability, folding (in 3D space) and protein sequence-structure-function relationships. This includes for example study of inter-atomic or inter-residue interactions in protein (3D) structures, the effect of mutation, and the design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - Rational protein design - - - - - - - - - - Two-dimensional gel electrophoresis - - Two-dimensional gel electrophoresis image and related data. - beta13 - beta12orEarlier - true - - - - - - - - - - Mass spectrometry - - beta12orEarlier - true - 1.13 - An analytical chemistry technique that measures the mass-to-charge ratio and abundance of ions in the gas phase. - - - - - - - - - - Protein microarrays - - Protein microarray data. - true - beta12orEarlier - beta13 - - - - - - - - - - Protein hydropathy - - beta12orEarlier - true - The study of the hydrophobic, hydrophilic and charge properties of a protein. - 1.3 - - - - - - - - - - Protein targeting and localization - - Protein targeting - Protein sorting - The study of how proteins are transported within and without the cell, including signal peptides, protein subcellular localization and export. - Protein localization - beta12orEarlier - - - - - - - - - - Protein cleavage sites and proteolysis - - true - beta12orEarlier - 1.3 - Enzyme or chemical cleavage sites and proteolytic or mass calculations on a protein sequence. - - - - - - - - - - Protein structure comparison - - The comparison of two or more protein structures. - beta12orEarlier - true - Use this concept for methods that are exclusively for protein structure.
- beta12orEarlier - - - - - - - - - - Protein residue interactions - - The processing and analysis of inter-atomic or inter-residue interactions in protein (3D) structures. - true - 1.3 - beta12orEarlier - - - - - - - - - - Protein-protein interactions - - Protein interaction networks - true - Protein-protein interactions, individual interactions and networks, protein complexes, protein functional coupling etc. - beta12orEarlier - 1.3 - - - - - - - - - - Protein-ligand interactions - - beta12orEarlier - true - 1.3 - Protein-ligand (small molecule) interactions. - - - - - - - - - - Protein-nucleic acid interactions - - beta12orEarlier - 1.3 - Protein-DNA/RNA interactions. - true - - - - - - - - - - Protein design - - 1.3 - beta12orEarlier - The design of proteins with specific properties, typically by designing changes (via site-directed mutagenesis) to an existing protein. - true - - - - - - - - - - G protein-coupled receptors (GPCR) - - G-protein coupled receptors (GPCRs). - true - beta12orEarlier - beta12orEarlier - - - - - - - - - - Carbohydrates - - beta12orEarlier - Carbohydrates, typically including structural information. - true - - - - - - - - - - Lipids - - beta12orEarlier - true - Lipidomics - Lipids and their structures. - - - - - - - - - - Small molecules - - Drugs and target structures - Amino acids - Targets - Drug structures - Metabolite structures - Target structures - Small molecules of biological significance, typically archival, curation, processing and analysis of structural information. - Small molecules include organic molecules, metal-organic compounds, small polypeptides, small polysaccharides and oligonucleotides. Structural data is usually included. - true - This concept excludes macromolecules such as proteins and nucleic acids. 
- Toxins and targets - CHEBI:23367 - Toxins - Metabolites - Drug targets - Peptides and amino acids - beta12orEarlier - Chemical structures - This includes the structures of drugs, drug target, their interactions and binding affinities. Also the structures of reactants or products of metabolism, for example small molecules such as including vitamins, polyols, nucleotides and amino acids. Also the physicochemical, biochemical or structural properties of amino acids or peptides. Also structural and associated data for toxic chemical substances. - Peptides - - - - - - - - - - Sequence editing - - beta12orEarlier - true - beta12orEarlier - Edit, convert or otherwise change a molecular sequence, either randomly or specifically. - - - - - - - - - - - Sequence composition, complexity and repeats - - Repeat sequences - This includes short repetitive subsequences (repeat sequences) in a protein sequence. - true - The archival, processing and analysis of the basic character composition of molecular sequences, for example character or word frequency, ambiguity, complexity, particularly regions of low complexity, and repeats or the repetitive nature of molecular sequences. - beta12orEarlier - Protein sequence repeats - Nucleic acid repeats - This includes repetitive elements within a nucleic acid sequence, e.g. -long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a sequence and other types of repeating unit. - Sequence complexity - Low complexity sequences - Sequence repeats - Sequence composition - Protein repeats - - - - - - - - - - Sequence motifs - - beta12orEarlier - Motifs - true - 1.3 - Conserved patterns (motifs) in molecular sequences, that (typically) describe functional or other key sites. - - - - - - - - - - Sequence comparison - - true - The comparison might be on the basis of sequence, physico-chemical or some other properties of the sequences. 
- beta12orEarlier - 1.12 - The comparison of two or more molecular sequences, for example sequence alignment and clustering. - - - - - - - - - - Sequence sites, features and motifs - - Sequence features - true - Functional sites - The archival, detection, prediction and analysis of positional features such as functional and other key sites, in molecular sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Sequence motifs - Sequence profiles - Sequence sites - HMMs - beta12orEarlier - - - - - - - - - - Sequence database search - - beta12orEarlier - Search and retrieve molecular sequences that are similar to a sequence-based query (typically a simple sequence). - beta12orEarlier - true - The query is a sequence-based entity such as another sequence, a motif or profile. - - - - - - - - - - Sequence clustering - - This includes systems that generate, process and analyse sequence clusters. - beta12orEarlier - true - 1.7 - The comparison and grouping together of molecular sequences on the basis of their similarities. - Sequence clusters - - - - - - - - - - Protein structural motifs and surfaces - - This includes conformation of conserved substructures, conserved geometry (spatial arrangement) of secondary structure or protein backbone, solvent-exposed surfaces, internal cavities, the analysis of shape, hydropathy, electrostatic patches, role and functions etc. - Protein structural features - Structural motifs - Protein 3D motifs - true - beta12orEarlier - Protein structural motifs - Structural features or common 3D motifs within protein structures, including the surface of a protein structure, such as biological interfaces with other molecules. - Protein surfaces - - - - - - - - - - Structural (3D) profiles - - The processing, analysis or use of some type of structural (3D) profile or template; a computational entity (typically a numerical matrix) that is derived from and represents a structure or structure alignment. 
- true - beta12orEarlier - 1.3 - Structural profiles - - - - - - - - - - Protein structure prediction - - true - beta12orEarlier - The prediction, modelling, recognition or design of protein secondary or tertiary structure or other structural features. - 1.12 - - - - - - - - - - Nucleic acid structure prediction - - The folding of nucleic acid molecules and the prediction or design of nucleic acid (typically RNA) sequences with specific conformations. - 1.12 - true - beta12orEarlier - - - - - - - - - - Ab initio structure prediction - - 1.7 - The prediction of three-dimensional structure of a (typically protein) sequence from first principles, using a physics-based or empirical scoring function and without using explicit structural templates. - true - beta12orEarlier - - - - - - - - - - Homology modelling - - 1.4 - The modelling of the three-dimensional structure of a protein using known sequence and structural data. - true - beta12orEarlier - - - - - - - - - - Molecular dynamics - - This includes resources concerning flexibility and motion in protein and other molecular structures. - Protein dynamics - true - Molecular flexibility - Molecular motions - beta12orEarlier - The study and simulation of molecular (typically protein) conformation using a computational model of physical forces and computer simulation. - - - - - - - - - - Molecular docking - - beta12orEarlier - true - The modelling the structure of proteins in complex with small molecules or other macromolecules. - true - 1.12 - - - - - - - - - - Protein secondary structure prediction - - beta12orEarlier - 1.3 - The prediction of secondary or supersecondary structure of protein sequences. - true - - - - - - - - - - Protein tertiary structure prediction - - 1.3 - true - The prediction of tertiary structure of protein sequences. 
- beta12orEarlier - - - - - - - - - - Protein fold recognition - - 1.12 - The recognition (prediction and assignment) of known protein structural domains or folds in protein sequence(s). - true - beta12orEarlier - - - - - - - - - - Sequence alignment - - This includes the generation of alignments (the identification of equivalent sites), the analysis of alignments, editing, visualisation, alignment databases, the alignment (equivalence between sites) of sequence profiles (representing sequence alignments) and so on. - beta12orEarlier - 1.7 - The alignment of molecular sequences or sequence profiles (representing sequence alignments). - true - - - - - - - - - - Structure alignment - - The superimposition of molecular tertiary structures or structural (3D) profiles (representing a structure or structure alignment). - This includes the generation, storage, analysis, rendering etc. of structure alignments. - true - 1.7 - beta12orEarlier - - - - - - - - - - Threading - - Sequence-structure alignment - 1.3 - beta12orEarlier - The alignment of molecular sequences to structures, structural (3D) profiles or templates (representing a structure or structure alignment). - true - - - - - - - - - - Sequence profiles and HMMs - - true - Sequence profiles; typically a positional, numerical matrix representing a sequence alignment. - beta12orEarlier - 1.3 - Sequence profiles include position-specific scoring matrix (position weight matrix), hidden Markov models etc. - - - - - - - - - - Phylogeny reconstruction - - The reconstruction of a phylogeny (evolutionary relatedness amongst organisms), for example, by building a phylogenetic tree. - 1.3 - true - Currently too specific for the topic sub-ontology (but might be unobsoleted). - beta12orEarlier - - - - - - - - - - Phylogenomics - - - beta12orEarlier - The integrated study of evolutionary relationships and whole genome data, for example, in the analysis of species trees, horizontal gene transfer and evolutionary reconstruction. 
- true - - - - - - - - - - - Virtual PCR - - beta13 - Polymerase chain reaction - beta12orEarlier - Simulated polymerase chain reaction (PCR). - PCR - true - - - - - - - - - - Sequence assembly - - true - Assembly - The assembly of fragments of a DNA sequence to reconstruct the original sequence. - beta12orEarlier - Assembly has two broad types, de-novo and re-sequencing. Re-sequencing is a specialized case of assembly, where an assembled (typically de-novo assembled) reference genome is available and is about 95% identical to the re-sequenced genome. All other cases of assembly are 'de-novo'. - - - - - - - - - - Genetic variation - - Mutation - beta12orEarlier - Polymorphism - Somatic mutations - Stable, naturally occurring mutations in a nucleotide sequence including alleles, naturally occurring mutations such as single base nucleotide substitutions, deletions and insertions, RFLPs and other polymorphisms. - http://purl.bioontology.org/ontology/MSH/D014644 - DNA variation - true - - - - - - - - - - Microarrays - - true - http://purl.bioontology.org/ontology/MSH/D046228 - Microarrays, for example, to process microarray data or design probes and experiments. - 1.3 - DNA microarrays - beta12orEarlier - - - - - - - - - - Pharmacology - - Computational pharmacology - beta12orEarlier - Pharmacoinformatics - The study of drugs and their effects or responses in living systems. - VT 3.1.7 Pharmacology and pharmacy - true - - - - - - - - - - - Gene expression - - This includes the study of codon usage in nucleotide sequence(s), genetic codes and so on. - Transcription - Gene expression profiling - Expression profiling - beta12orEarlier - http://edamontology.org/topic_0197 - Gene expression levels are analysed by identifying, quantifying or comparing mRNA transcripts, for example using microarrays, RNA-seq, northern blots, gene-indexed expression profiles etc.
- http://purl.bioontology.org/ontology/MSH/D015870 - Gene expression analysis - DNA microarrays - The analysis of levels and patterns of synthesis of gene products (proteins and functional RNA) including interpretation in functional terms of gene expression data. - Codon usage - true - - - - - - - - - - - Gene regulation - - true - Regulatory genomics - beta12orEarlier - The regulation of gene expression. - - - - - - - - - - Pharmacogenomics - - - true - beta12orEarlier - Pharmacogenetics - The influence of genotype on drug response, for example by correlating gene expression or single-nucleotide polymorphisms with drug efficacy or toxicity. - - - - - - - - - - - Medicinal chemistry - - - VT 3.1.4 Medicinal chemistry - The design and chemical synthesis of bioactive molecules, for example drugs or potential drug compounds, for medicinal purposes. - This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - true - Drug design - beta12orEarlier - - - - - - - - - - - Fish - - beta12orEarlier - true - 1.3 - Information on a specific fish genome including molecular sequences, genes and annotation. - - - - - - - - - - Flies - - 1.3 - true - beta12orEarlier - Information on a specific fly genome including molecular sequences, genes and annotation. - - - - - - - - - - Mice or rats - - Information on a specific mouse or rat genome including molecular sequences, genes and annotation. - The resource may be specific to a group of mice / rats or all mice / rats. - beta12orEarlier - - - - - - - - - - Worms - - true - 1.3 - beta12orEarlier - Information on a specific worm genome including molecular sequences, genes and annotation. - - - - - - - - - - Literature analysis - - beta12orEarlier - 1.3 - The processing and analysis of the bioinformatics literature and bibliographic data, such as literature search and query. 
- true - - - - - - - - - - Text mining - - beta12orEarlier - The analysis of the biomedical and informatics literature. - Literature analysis - Literature mining - Text data mining - - - - - - - - - - - Data submission, annotation and curation - - Database curation - Deposition and curation of database accessions, including annotation, typically with terms from a controlled vocabulary. - beta12orEarlier - - - - - - - - - - - Document, record and content management - - true - The management and manipulation of digital documents, including database records, files and reports. - VT 1.3.6 Multimedia, hypermedia - 1.13 - beta12orEarlier - - - - - - - - - - Sequence annotation - - beta12orEarlier - beta12orEarlier - true - Annotation of a molecular sequence. - - - - - - - - - - Genome annotation - - Annotation of a genome. - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - - NMR - - - ROESY - NOESY - Nuclear Overhauser Effect Spectroscopy - An analytical technique that exploits the magnetic properties of certain atomic nuclei to provide information on the structure, dynamics, reaction state and chemical environment of molecules. - HOESY - beta12orEarlier - Heteronuclear Overhauser Effect Spectroscopy - Nuclear magnetic resonance spectroscopy - Spectroscopy - NMR spectroscopy - Rotational Frame Nuclear Overhauser Effect Spectroscopy - - - - - - - - - - - Sequence classification - - 1.12 - true - beta12orEarlier - The classification of molecular sequences based on some measure of their similarity. - Methods including sequence motifs, profile and other diagnostic elements which (typically) represent conserved patterns (of residues or properties) in molecular sequences. - - - - - - - - - - Protein classification - - 1.3 - true - beta12orEarlier - Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc.
- - - - - - - - - - Sequence motif or profile - - beta12orEarlier - true - Sequence motifs, or sequence profiles derived from an alignment of molecular sequences of a particular type. - This includes comparison, discovery, recognition etc. of sequence motifs. - beta12orEarlier - - - - - - - - - - Protein modifications - - GO:0006464 - Protein chemical modifications - Protein post-translational modification - Protein chemical modifications, e.g. post-translational modifications. - true - EDAM does not describe all possible protein modifications. For fine-grained annotation of protein modification use the Gene Ontology (children of concept GO:0006464) and/or the Protein Modifications ontology (children of concept MOD:00000) - Protein post-translational modifications - Post-translation modifications - MOD:00000 - beta12orEarlier - - - - - - - - - - Molecular interactions, pathways and networks - - Networks - Environmental information processing pathways - Pathways - Biological networks - Disease pathways - true - Signal transduction pathways - beta13 - Biological models - Cellular process pathways - Molecular interactions - Gene regulatory networks - Molecular interactions, biological pathways, networks and other models. - Biological pathways - Interactions - Genetic information processing pathways - Signaling pathways - http://edamontology.org/topic_3076 - - - - - - - - - - - Informatics - - true - The study and practice of information processing and use of computer information systems. - VT 1.3.99 Other - Knowledge management - VT 1.3.4 Information management - beta12orEarlier - Information management - VT 1.3.5 Knowledge management - VT 1.3.3 Information retrieval - VT 1.3 Information sciences - Information science - - - - - - - - - Literature data resources - - Data resources for the biological or biomedical literature, either a primary source of literature or some derivative. 
- true - 1.3 - beta12orEarlier - - - - - - - - - - Laboratory information management - - true - Laboratory management and resources, for example, catalogues of biological resources for use in the lab including cell lines, viruses, plasmids, phages, DNA probes and primers and so on. - beta12orEarlier - Laboratory resources - - - - - - - - - - - - Cell and tissue culture - - Tissue culture - 1.3 - true - General cell culture or data on a specific cell lines. - Cell culture - beta12orEarlier - - - - - - - - - - Ecology - - true - The ecological and environmental sciences and especially the application of information technology (ecoinformatics). - http://purl.bioontology.org/ontology/MSH/D004777 - Ecological informatics - VT 1.5.15 Ecology - Computational ecology - beta12orEarlier - Ecoinformatics - Environmental science - - - - - - - - - - - Electron microscopy - - - SEM - Scanning electron microscopy - TEM - The study of matter by studying the interference pattern from firing electrons at a sample, to analyse structures at resolutions higher than can be achieved using light. - - Transmission electron microscopy - beta12orEarlier - Electron crystallography - Electron diffraction experiment - Single particle electron microscopy - - - - - - - - - - - Cell cycle - - beta13 - beta12orEarlier - true - The cell cycle including key genes and proteins. - - - - - - - - - - Peptides and amino acids - - beta12orEarlier - The physicochemical, biochemical or structural properties of amino acids or peptides. - 1.13 - true - - - - - - - - - - Organelles - - Cell membrane - Cytoplasm - Organelle genes and proteins - Smooth endoplasmic reticulum - beta12orEarlier - Lysosome - Centriole - Ribosome - Nucleus - true - A specific organelle, or organelles in general, typically the genes and proteins (or genome and proteome). 
- Mitochondria - Golgi apparatus - Rough endoplasmic reticulum - 1.3 - - - - - - - - - - Ribosomes - - beta12orEarlier - Ribosomes, typically of ribosome-related genes and proteins. - Ribosome genes and proteins - 1.3 - true - - - - - - - - - - Scents - - A database about scents. - beta12orEarlier - beta13 - true - - - - - - - - - - Drugs and target structures - - beta12orEarlier - The structures of drugs, drug targets, their interactions and binding affinities. - true - 1.13 - - - - - - - - - - Model organisms - - This may include information on the genome (including molecular sequences and map, genes and annotation), proteome, as well as more general information about an organism. - beta12orEarlier - A specific organism, or group of organisms, used to study a particular aspect of biology. - true - Organisms - - - - - - - - - - - Genomics - - http://purl.bioontology.org/ontology/MSH/D023281 - Personal genomics - beta12orEarlier - Whole genomes of one or more organisms, or genomes in general, such as meta-information on genomes, genome projects, gene names etc. - true - - - - - - - - - - - Gene and protein families - - - beta12orEarlier - Gene family - A protein families database might include the classifier (e.g. a sequence profile) used to build the classification. - Protein families - Genes, gene family or system - Gene system - Protein sequence classification - Particular gene(s), gene family or other gene group or system and their encoded proteins. Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc., curation of a particular protein or protein family, or any other proteins that have been classified as members of a common group. - true - Gene families - - - - - - - - - - - Chromosomes - - beta12orEarlier - Study of chromosomes.
- 1.13 - true - - - - - - - - - - Genotype and phenotype - - Genotype and phenotype resources - The study of genetic constitution of a living entity, such as an individual, an organism, a cell and so on, typically with respect to particular observable phenotypic traits, or resources concerning such traits, which might be an aspect of biochemistry, physiology, morphology, anatomy, development and so on. - Genotyping - Phenotyping - true - beta12orEarlier - - - - - - - - - - - Gene expression and microarray - - true - beta12orEarlier - beta12orEarlier - Gene expression e.g. microarray data, northern blots, gene-indexed expression profiles etc. - - - - - - - - - - Probes and primers - - Probes - This includes the design of primers for PCR and DNA amplification or the design of molecular probes. - http://purl.bioontology.org/ontology/MSH/D015335 - Primers - true - beta12orEarlier - Molecular probes (e.g. a peptide probe or DNA microarray probe) or PCR primers and hybridization oligos in a nucleic acid sequence. - - - - - - - - - - - Pathology - - Disease - Diseases, including diseases in general and the genes, gene variations and proteins involved in one or more specific diseases. - true - beta12orEarlier - VT 3.1.6 Pathology - - - - - - - - - - - Specific protein resources - - 1.3 - A particular protein, protein family or other group of proteins. - true - Specific protein - beta12orEarlier - - - - - - - - - - Taxonomy - - true - beta12orEarlier - VT 1.5.25 Taxonomy - Organism classification, identification and naming. - - - - - - - - - - Protein sequence analysis - - beta12orEarlier - Archival, processing and analysis of protein sequences and sequence-based entities such as alignments, motifs and profiles. - 1.8 - true - - - - - - - - - - Nucleic acid sequence analysis - - beta12orEarlier - 1.8 - true - The archival, processing and analysis of nucleotide sequences and sequence-based entities such as alignments, motifs and profiles.
- - - - - - - - - - - Repeat sequences - - true - The repetitive nature of molecular sequences. - beta12orEarlier - 1.3 - - - - - - - - - - Low complexity sequences - - true - The (character) complexity of molecular sequences, particularly regions of low complexity. - 1.3 - beta12orEarlier - - - - - - - - - - Proteome - - A specific proteome including protein sequences and annotation. - beta12orEarlier - beta13 - true - - - - - - - - - - DNA - - DNA analysis - beta12orEarlier - Ancient DNA - Chromosomes - DNA sequences and structure, including processes such as methylation and replication. - The DNA sequences might be coding or non-coding sequences. - - - - - - - - - - Coding RNA - - Protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames - 1.13 - beta12orEarlier - true - - - - - - - - - - Functional, regulatory and non-coding RNA - - - true - small interfering RNA - small nucleolar RNA - ncRNA - Non-coding RNA - Functional RNA - snRNA - Non-coding or functional RNA sequences, including regulatory RNA sequences, ribosomal RNA (rRNA) and transfer RNA (tRNA). - Non-coding RNA includes piwi-interacting RNA (piRNA), small nuclear RNA (snRNA) and small nucleolar RNA (snoRNA). Regulatory RNA includes microRNA (miRNA) - short single stranded RNA molecules that regulate gene expression, and small interfering RNA (siRNA). - Regulatory RNA - siRNA - piRNA - snoRNA - small nuclear RNA - beta12orEarlier - miRNA - microRNA - piwi-interacting RNA - - - - - - - - - - rRNA - - 1.3 - One or more ribosomal RNA (rRNA) sequences. - true - - - - - - - - - - tRNA - - 1.3 - true - One or more transfer RNA (tRNA) sequences. - - - - - - - - - - Protein secondary structure - - true - beta12orEarlier - 1.8 - Protein secondary structure or secondary structure alignments. - This includes assignment, analysis, comparison, prediction, rendering etc. of secondary structure data. 
- - - - - - - - - - RNA structure - - 1.3 - RNA secondary or tertiary structure and alignments. - beta12orEarlier - true - - - - - - - - - - Protein tertiary structure - - 1.8 - true - Protein tertiary structures. - beta12orEarlier - - - - - - - - - - Nucleic acid classification - - Classification of nucleic acid sequences and structures. - 1.3 - true - beta12orEarlier - - - - - - - - - - Protein families - - beta12orEarlier - true - Primarily the classification of proteins (from sequence or structural data) into clusters, groups, families etc., curation of a particular protein or protein family, or any other proteins that have been classified as members of a common group. - 1.14 - - - - - - - - - - Protein folds and structural domains - - Protein tertiary structural domains and folds in a protein or polypeptide chain. - This includes topological domains such as cytoplasmic regions in a protein. - Protein transmembrane regions - Protein domains - Protein membrane regions - Intramembrane regions - beta12orEarlier - Protein topological domains - true - This includes trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. For example, the location and size of the membrane spanning segments and intervening loop regions, transmembrane region IN/OUT orientation relative to the membrane, plus the following data for each amino acid: A Z-coordinate (the distance to the membrane center), the free energy of membrane insertion (calculated in a sliding window over the sequence) and a reliability score. The z-coordinate implies information about re-entrant helices, interfacial helices, the tilt of a transmembrane helix and loop lengths. - Protein folds - Transmembrane regions - Protein structural domains - - - - - - - - - - Nucleic acid sequence alignment - - beta12orEarlier - true - 1.3 - Nucleotide sequence alignments. 
- - - - - - - - - - Protein sequence alignment - - 1.3 - Protein sequence alignments. - beta12orEarlier - true - A sequence profile typically represents a sequence alignment. - - - - - - - - - - Nucleic acid sites and features - - beta12orEarlier - 1.3 - true - The archival, detection, prediction and analysis of -positional features such as functional sites in nucleotide sequences. - - - - - - - - - - - Protein sites and features - - beta12orEarlier - The detection, identification and analysis of positional features in proteins, such as functional sites. - 1.3 - true - - - - - - - - - - - Transcription factors and regulatory sites - - - - CpG islands - Proteins that bind to DNA and control transcription of DNA to mRNA (transcription factors) and also transcriptional regulatory sites, elements and regions (such as promoters, enhancers, silencers and boundary elements / insulators) in nucleotide sequences. - Enhancers - Attenuators - CAAT signals - Transcriptional regulatory sites - TFBS - CAT box - CCAAT box - This includes CpG rich regions (isochores) in a nucleotide sequence. - This includes promoters, CAAT signals, TATA signals, -35 signals, -10 signals, GC signals, primer binding sites for initiation of transcription or reverse transcription, enhancer, attenuator, terminators and ribosome binding sites. - -10 signals - Transcription factor proteins either promote (as an activator) or block (as a repressor) the binding to DNA of RNA polymerase. Regulatory sites including transcription factor binding site as well as promoters, enhancers, silencers and boundary elements / insulators. - Terminators - TATA signals - GC signals - Promoters - -35 signals - Transcription factors - Isochores - beta12orEarlier - Transcription factor binding sites - - - - - - - - - - Phosphorylation sites - - 1.0 - Protein phosphorylation and phosphorylation sites in protein sequences. 
- true - beta12orEarlier - - - - - - - - - - - Metabolic pathways - - beta12orEarlier - 1.13 - true - Metabolic pathways. - - - - - - - - - - Signaling pathways - - true - Signaling pathways. - 1.13 - beta12orEarlier - - - - - - - - - - Protein and peptide identification - - 1.3 - beta12orEarlier - true - - - - - - - - - - Workflows - - Pipelines - Biological or biomedical analytical workflows or pipelines. - beta12orEarlier - - - - - - - - - Data types and objects - - Structuring data into basic types and (computational) objects. - beta12orEarlier - 1.0 - true - - - - - - - - - - Theoretical biology - - 1.3 - true - - - - - - - - - - Mitochondria - - beta12orEarlier - true - Mitochondria, typically of mitochondrial genes and proteins. - 1.3 - - - - - - - - - - Plants - - The resource may be specific to a plant, a group of plants or all plants. - Plant science - Plants, e.g. information on a specific plant genome including molecular sequences, genes and annotation. - Plant biology - Botany - VT 1.5.22 Plant science - Plant - VT 1.5.10 Botany - beta12orEarlier - - - - - - - - - - Viruses - - Virology - VT 1.5.28 Virology - beta12orEarlier - Viruses, e.g. sequence and structural data, interactions of viral proteins, or a viral genome including molecular sequences, genes and annotation. - The resource may be specific to a virus, a group of viruses or all viruses. - - - - - - - - - - Fungi - - Mycology - beta12orEarlier - The resource may be specific to a fungus, a group of fungi or all fungi. - Yeast - VT 1.5.21 Mycology - Fungi and molds, e.g. information on a specific fungal genome including molecular sequences, genes and annotation. - - - - - - - - - - Pathogens - - Pathogens, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a pathogen, a group of pathogens or all pathogens. - - - - - - - - - - Arabidopsis - - beta12orEarlier - Arabidopsis-specific data. 
- 1.3 - true - - - - - - - - - - Rice - - Rice-specific data. - true - 1.3 - beta12orEarlier - - - - - - - - - - Genetic mapping and linkage - - Linkage mapping - beta12orEarlier - 1.3 - true - Genetic linkage - Informatics resources that aim to identify, map or analyse genetic markers in DNA sequences, for example to produce a genetic (linkage) map of a chromosome or genome or to analyse genetic linkage and synteny. - - - - - - - - - - Comparative genomics - - The study (typically comparison) of the sequence, structure or function of multiple genomes. - true - beta12orEarlier - - - - - - - - - - - Mobile genetic elements - - Transposons - beta12orEarlier - Mobile genetic elements, such as transposons, Plasmids, Bacteriophage elements and Group II introns. - - - - - - - - - - Human disease - - Human diseases, typically describing the genes, mutations and proteins implicated in disease. - beta13 - true - beta12orEarlier - - - - - - - - - - Immunology - - VT 3.1.3 Immunology - Immunoinformatics - http://purl.bioontology.org/ontology/MSH/D007120 - http://purl.bioontology.org/ontology/MSH/D007125 - beta12orEarlier - true - Computational immunology - The application of information technology to immunology such as immunological processes, immunological genes, proteins and peptide ligands, antigens and so on. - - - - - - - - - - - Membrane and lipoproteins - - Lipoproteins (protein-lipid assemblies), and proteins or region of a protein that spans or are associated with a membrane. - true - beta12orEarlier - Membrane proteins - Lipoproteins - Transmembrane proteins - - - - - - - - - - Enzymes - - Proteins that catalyze chemical reaction, the kinetics of enzyme-catalysed reactions, enzyme nomenclature etc. - beta12orEarlier - Enzymology - true - - - - - - - - - - Primers - - true - 1.13 - PCR primers and hybridization oligos in a nucleic acid sequence. 
- beta12orEarlier - - - - - - - - - - PolyA signal or sites - - beta12orEarlier - 1.13 - true - Regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. - - - - - - - - - - CpG island and isochores - - beta12orEarlier - 1.13 - true - CpG rich regions (isochores) in a nucleotide sequence. - - - - - - - - - - Restriction sites - - Restriction enzyme recognition sites (restriction sites) in a nucleic acid sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Splice sites - - beta12orEarlier - Splice sites in a nucleotide sequence or alternative RNA splicing events. - 1.13 - true - - - - - - - - - - - Matrix/scaffold attachment sites - - 1.13 - true - beta12orEarlier - Matrix/scaffold attachment regions (MARs/SARs) in a DNA sequence. - - - - - - - - - - Operon - - beta12orEarlier - 1.13 - true - Operons (operators, promoters and genes) from a bacterial genome. - - - - - - - - - - Promoters - - true - 1.13 - Whole promoters or promoter elements (transcription start sites, RNA polymerase binding site, transcription factor binding sites, promoter enhancers etc) in a DNA sequence. - beta12orEarlier - - - - - - - - - - Structural biology - - Structural assignment - Structure determination - This includes experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - 1.3 - This includes Informatics concerning data generated from the use of microscopes, including optical, electron and scanning probe microscopy. Includes methods for digitizing microscope images and viewing the produced virtual slides and associated data on a computer screen. - The molecular structure of biological molecules, particularly macromolecules such as proteins and nucleic acids. 
- true - VT 1.5.24 Structural biology - Structural determination - - - - - - - - - - - Protein membrane regions - - 1.8 - 1.13 - true - Trans- or intra-membrane regions of a protein, typically describing physicochemical properties of the secondary structure elements. - - - - - - - - - - Structure comparison - - This might involve comparison of secondary or tertiary (3D) structural information. - true - The comparison of two or more molecular structures, for example structure alignment and clustering. - 1.13 - beta12orEarlier - - - - - - - - - - Function analysis - - true - Protein function prediction - The study of gene and protein function including the prediction of functional properties of a protein. - Protein function analysis - beta12orEarlier - - - - - - - - - - - Prokaryotes and archae - - The resource may be specific to a prokaryote, a group of prokaryotes or all prokaryotes. - VT 1.5.2 Bacteriology - Bacteriology - beta12orEarlier - Specific bacteria or archaea, e.g. information on a specific prokaryote genome including molecular sequences, genes and annotation. - - - - - - - - - - Protein databases - - true - 1.3 - Protein data resources. - beta12orEarlier - Protein data resources - - - - - - - - - - Structure determination - - Experimental methods for biomolecular structure determination, such as X-ray crystallography, nuclear magnetic resonance (NMR), circular dichroism (CD) spectroscopy, microscopy etc., including the assignment or modelling of molecular structure from such data. - beta12orEarlier - true - 1.3 - - - - - - - - - - Cell biology - - beta12orEarlier - true - VT 1.5.11 Cell biology - Cellular processes - Cells, such as key genes and proteins involved in the cell cycle. - - - - - - - - - - Classification - - beta13 - beta12orEarlier - Topic focused on identifying, grouping, or naming things in a structured way according to some schema based on observable relationships. 
- true - - - - - - - - - - Lipoproteins - - true - 1.3 - beta12orEarlier - Lipoproteins (protein-lipid assemblies). - - - - - - - - - - Phylogeny visualisation - - true - Visualise a phylogeny, for example, render a phylogenetic tree. - beta12orEarlier - beta12orEarlier - - - - - - - - - - Cheminformatics - - The application of information technology to chemistry in biological research environment. - Chemical informatics - beta12orEarlier - Chemoinformatics - true - - - - - - - - - - - Systems biology - - http://en.wikipedia.org/wiki/Systems_biology - This includes databases of models and methods to construct or analyse a model. - Biological models - http://purl.bioontology.org/ontology/MSH/D049490 - true - beta12orEarlier - Biological modelling - Biological system modelling - The holistic modelling and analysis of complex biological systems and the interactions therein. - - - - - - - - - - - Statistics and probability - - Biostatistics - Probability - http://en.wikipedia.org/wiki/Biostatistics - beta12orEarlier - The application of statistical methods to biological problems. - Statistics - http://purl.bioontology.org/ontology/MSH/D056808 - - - - - - - - - - - Structure database search - - The query is a structure-based entity such as another structure, a 3D (structural) motif, 3D profile or template. - beta12orEarlier - Search for and retrieve molecular structures that are similar to a structure-based query (typically another structure or part of a structure). - beta12orEarlier - true - - - - - - - - - - Molecular modelling - - Molecular docking - Homology modeling - beta12orEarlier - Comparative modelling - Homology modelling - Molecular modeling - Comparative modeling - true - The construction, analysis, evaluation, refinement etc. of models of a molecules properties or behaviour, including the modelling the structure of proteins in complex with small molecules or other macromolecules (docking). 
- - - - - - - - - - Protein function prediction - - 1.2 - beta12orEarlier - true - The prediction of functional properties of a protein. - - - - - - - - - - SNP - - true - Single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. - beta12orEarlier - 1.13 - - - - - - - - - - Transmembrane protein prediction - - Predict transmembrane domains and topology in protein sequences. - beta12orEarlier - beta12orEarlier - true - - - - - - - - - - - Nucleic acid structure comparison - - The comparison two or more nucleic acid (typically RNA) secondary or tertiary structures. - beta12orEarlier - true - beta12orEarlier - Use this concept for methods that are exclusively for nucleic acid structures. - - - - - - - - - - - Exons - - beta12orEarlier - true - Exons in a nucleotide sequences. - 1.13 - - - - - - - - - - Gene transcription - - Transcription of DNA into RNA including the regulation of transcription. - true - 1.13 - beta12orEarlier - - - - - - - - - - DNA mutation - - - beta12orEarlier - DNA mutation. - - - - - - - - - - Oncology - - beta12orEarlier - VT 3.2.16 Oncology - Cancer - true - The study of cancer, for example, genes and proteins implicated in cancer. - Cancer biology - - - - - - - - - - - Toxins and targets - - 1.13 - beta12orEarlier - true - Structural and associated data for toxic chemical substances. - - - - - - - - - - Introns - - 1.13 - Introns in a nucleotide sequences. - beta12orEarlier - true - - - - - - - - - - Tool topic - - beta12orEarlier - A topic concerning primarily bioinformatics software tools, typically the broad function or purpose of a tool. - true - beta12orEarlier - - - - - - - - - - Study topic - - A general area of bioinformatics study, typically the broad scope or category of content of a bioinformatics journal or conference proceeding. 
- beta12orEarlier - true - beta12orEarlier - - - - - - - - - - Nomenclature - - true - 1.3 - beta12orEarlier - Biological nomenclature (naming), symbols and terminology. - - - - - - - - - - Disease genes and proteins - - 1.3 - true - beta12orEarlier - The genes, gene variations and proteins involved in one or more specific diseases. - - - - - - - - - - Protein structure analysis - - - Protein structure - true - Protein secondary or tertiary structural data and/or associated annotation. - http://edamontology.org/topic_3040 - beta12orEarlier - - - - - - - - - - - Humans - - beta12orEarlier - The human genome, including molecular sequences, genes, annotation, maps and viewers, the human proteome or human beings in general. - - - - - - - - - - Gene resources - - Gene resource - beta12orEarlier - 1.3 - Informatics resource (typically a database) primarily focussed on genes. - Gene database - true - - - - - - - - - - Yeast - - beta12orEarlier - Yeast, e.g. information on a specific yeast genome including molecular sequences, genes and annotation. - true - 1.3 - - - - - - - - - - Eukaryotes - - Eukaryote - Eukaryotes or data concerning eukaryotes, e.g. information on a specific eukaryote genome including molecular sequences, genes and annotation. - The resource may be specific to a eukaryote, a group of eukaryotes or all eukaryotes. - beta12orEarlier - - - - - - - - - - Invertebrates - - The resource may be specific to an invertebrate, a group of invertebrates or all invertebrates. - beta12orEarlier - Invertebrates, e.g. information on a specific invertebrate genome including molecular sequences, genes and annotation. - - - - - - - - - - Vertebrates - - The resource may be specific to a vertebrate, a group of vertebrates or all vertebrates. - Vertebrates, e.g. information on a specific vertebrate genome including molecular sequences, genes and annotation. - beta12orEarlier - - - - - - - - - - Unicellular eukaryotes - - Unicellular eukaryotes, e.g. 
information on a unicellular eukaryote genome including molecular sequences, genes and annotation. - beta12orEarlier - The resource may be specific to a unicellular eukaryote, a group of unicellular eukaryotes or all unicellular eukaryotes. - - - - - - - - - - Protein structure alignment - - Protein secondary or tertiary structure alignments. - beta12orEarlier - true - 1.3 - - - - - - - - - - X-ray diffraction - - - The study of matter and their structure by means of the diffraction of X-rays, typically the diffraction pattern caused by the regularly spaced atoms of a crystalline sample. - beta12orEarlier - X-ray microscopy - Crystallography - X-ray crystallography - - - - - - - - - - - Ontologies, nomenclature and classification - - true - Conceptualisation, categorisation and naming of entities or phenomena within biology or bioinformatics. - 1.3 - http://purl.bioontology.org/ontology/MSH/D002965 - beta12orEarlier - - - - - - - - - - Immunoproteins, genes and antigens - - - Immunopeptides - Immunity-related genes, proteins and their ligands. - Antigens - This includes T cell receptors (TR), major histocompatibility complex (MHC), immunoglobulin superfamily (IgSF) / antibodies, major histocompatibility complex superfamily (MhcSF), etc." - beta12orEarlier - Immunoproteins - Immunogenes - - - - - - - - - - - Molecules - - CHEBI:23367 - beta12orEarlier - beta12orEarlier - Specific molecules, including large molecules built from repeating subunits (macromolecules) and small molecules of biological significance. - true - - - - - - - - - - Toxicology - - - Toxins and the adverse effects of these chemical substances on living organisms. - VT 3.1.9 Toxicology - Toxicoinformatics - true - beta12orEarlier - Computational toxicology - - - - - - - - - - - High-throughput sequencing - - Next-generation sequencing - beta13 - true - beta12orEarlier - Parallelized sequencing processes that are capable of sequencing many thousands of sequences simultaneously. 
- - - - - - - - - - Structural clustering - - The comparison and grouping together of molecular structures on the basis of similarity; generate, process or analyse structural clusters. - 1.7 - Structure classification - true - beta12orEarlier - - - - - - - - - - Gene regulatory networks - - Gene regulatory networks. - true - 1.13 - beta12orEarlier - - - - - - - - - - Disease (specific) - - Informatics resources dedicated to one or more specific diseases (not diseases in general). - beta12orEarlier - true - beta12orEarlier - - - - - - - - - - VNTR - - Variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. - beta12orEarlier - 1.13 - true - - - - - - - - - - Microsatellites - - true - 1.13 - beta12orEarlier - Microsatellite polymorphism in a DNA sequence. - - - - - - - - - - - RFLP - - Restriction fragment length polymorphisms (RFLP) in a DNA sequence. - true - 1.13 - beta12orEarlier - - - - - - - - - - - DNA polymorphism - - - Includes restriction fragment length polymorphisms (RFLP) in a DNA sequence. An RFLP is defined by the presence or absence of a specific restriction site of a bacterial restriction enzyme. - true - RFLP - Single nucleotide polymorphism - Microsatellites - VNTR - SNP - Includes microsatellite polymorphism in a DNA sequence. A microsatellite polymorphism is a very short subsequence that is repeated a variable number of times between individuals. These repeats consist of the nucleotides cytosine and adenosine. - DNA polymorphism. - Variable number of tandem repeat polymorphism - Includes single nucleotide polymorphisms (SNP) and associated data, for example, the discovery and annotation of SNPs. A SNP is a DNA sequence variation where a single nucleotide differs between members of a species or paired chromosomes in an individual. - beta12orEarlier - Includes variable number of tandem repeat (VNTR) polymorphism in a DNA sequence. 
VNTRs occur in non-coding regions of DNA and consists sub-sequence that is repeated a multiple (and varied) number of times. - - - - - - - - - - Nucleic acid design - - Topic for the design of nucleic acid sequences with specific conformations. - 1.3 - beta12orEarlier - true - - - - - - - - - - Primer or probe design - - 1.3 - true - beta13 - The design of primers for PCR and DNA amplification or the design of molecular probes. - - - - - - - - - - Structure databases - - beta13 - true - 1.2 - Structure data resources - Molecular secondary or tertiary (3D) structural data resources, typically of proteins and nucleic acids. - - - - - - - - - - Nucleic acid structure - - true - beta13 - Nucleic acid (secondary or tertiary) structure, such as whole structures, structural features and associated annotation. - 1.2 - - - - - - - - - - Sequence databases - - Molecular sequence data resources, including sequence sites, alignments, motifs and profiles. - true - beta13 - Sequence data resources - Sequence data - Sequence data resource - 1.3 - - - - - - - - - - Nucleic acid sequences - - Nucleotide sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - Nucleotide sequences - - - - - - - - - - Protein sequences - - Protein sequences and associated concepts such as sequence sites, alignments, motifs and profiles. - beta13 - 1.3 - true - - - - - - - - - - Protein interaction networks - - 1.3 - true - - - - - - - - - - Molecular biology - - true - VT 1.5.4 Biochemistry and molecular biology - beta13 - The molecular basis of biological activity, particularly the macromolecules (e.g. proteins and nucleic acids) that are essential to life. - - - - - - - - - - - Mammals - - true - beta13 - 1.3 - Mammals, e.g. information on a specific mammal genome including molecular sequences, genes and annotation. - - - - - - - - - - Biodiversity - - The degree of variation of life forms within a given ecosystem, biome or an entire planet. 
- beta13 - VT 1.5.5 Biodiversity conservation - true - http://purl.bioontology.org/ontology/MSH/D044822 - - - - - - - - - - - Sequence clusters and classification - - This includes the results of sequence clustering, ortholog identification, assignment to families, annotation etc. - The comparison, grouping together and classification of macromolecules on the basis of sequence similarity. - Sequence families - 1.3 - true - Sequence clusters - beta13 - - - - - - - - - - Genetics - - http://purl.bioontology.org/ontology/MSH/D005823 - true - The study of genes, genetic variation and heredity in living organisms. - beta13 - Heredity - - - - - - - - - - - Quantitative genetics - - beta13 - The genes and genetic mechanisms such as Mendelian inheritance that underly continuous phenotypic traits (such as height or weight). - true - - - - - - - - - - Population genetics - - The distribution of allele frequencies in a population of organisms and its change subject to evolutionary processes including natural selection, genetic drift, mutation and gene flow. - true - beta13 - - - - - - - - - - - Regulatory RNA - - 1.3 - Regulatory RNA sequences including microRNA (miRNA) and small interfering RNA (siRNA). - true - beta13 - - - - - - - - - - Documentation and help - - The documentation of resources such as tools, services and databases and how to get help. - true - beta13 - 1.13 - - - - - - - - - - Genetic organisation - - The structural and functional organisation of genes and other genetic elements. - 1.3 - beta13 - true - - - - - - - - - - Medical informatics - - true - Health informatics - Clinical informatics - Biomedical informatics - Translational medicine - The application of information technology to health, disease and biomedicine. - Healthcare informatics - beta13 - Health and disease - Molecular medicine - - - - - - - - - - - Developmental biology - - VT 1.5.14 Developmental biology - true - beta13 - How organisms grow and develop. 
- - - - - - - - - - - Embryology - - true - beta13 - The development of organisms between the one-cell stage (typically the zygote) and the end of the embryonic stage. - - - - - - - - - - - Anatomy - - VT 3.1.1 Anatomy and morphology - beta13 - The form and function of the structures of living organisms. - true - - - - - - - - - - - Literature and reference - - beta13 - true - http://purl.bioontology.org/ontology/MSH/D011642 - The scientific literature, reference information and documentation. - Literature sources - Bibliography - This includes the documentation of resources such as tools, services and databases, user support, how to get help etc. - Documentation - - - - - - - - - - - Biology - - VT 1.5.8 Biology - beta13 - VT 1.5 Biological sciences - VT 1.5.23 Reproductive biology - Cryobiology - Biological rhythms - A particular biological science, especially observable traits such as aspects of biochemistry, physiology, morphology, anatomy, development and so on. - VT 1.5.7 Biological rhythm - Biological science - Aerobiology - VT 1.5.99 Other - Chronobiology - true - VT 1.5.13 Cryobiology - - VT 1.5.1 Aerobiology - VT 1.5.3 Behavioural biology - Reproductive biology - Behavioural biology - - - - - - - - - - - Data management - - The development and use of architectures, policies, practices and procedures for management of data. - true - beta13 - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - VT 1.3.1 Data management - - - - - - - - - - - Sequence feature detection - - 1.3 - true - beta13 - The detection of the positional features, such as functional and other key sites, in molecular sequences. - http://purl.bioontology.org/ontology/MSH/D058977 - - - - - - - - - - Nucleic acid feature detection - - The detection of positional features such as functional sites in nucleotide sequences. 
- true - beta13 - 1.3 - - - - - - - - - - Protein feature detection - - The detection, identification and analysis of positional protein sequence features, such as functional sites. - beta13 - 1.3 - true - - - - - - - - - - Biological system modelling - - 1.2 - true - beta13 - Topic for modelling biological systems in mathematical terms. - - - - - - - - - - Data acquisition - - The acquisition of data, typically measurements of physical systems using any type of sampling system, or by another other means. - beta13 - - - - - - - - - - Genes and proteins resources - - 1.3 - Gene family - beta13 - Gene and protein families - Specific genes and/or their encoded proteins or a family or other grouping of related genes and proteins. - true - - - - - - - - - - Protein topological domains - - 1.13 - Topological domains such as cytoplasmic regions in a protein. - true - 1.8 - - - - - - - - - - Protein variants - - beta13 - true - Protein sequence variants produced e.g. from alternative splicing, alternative promoter usage, alternative initiation and ribosomal frameshifting. - - - - - - - - - - - Expression signals - - beta13 - true - 1.12 - Regions within a nucleic acid sequence containing a signal that alters a biological function. - - - - - - - - - - DNA binding sites - - - Matrix-attachment region - beta13 - Nucleosome exclusion sequences - This includes ribosome binding sites (Shine-Dalgarno sequence in prokaryotes), restriction enzyme recognition sites (restriction sites) etc. - Restriction sites - Ribosome binding sites - Scaffold-attachment region - This includes sites involved with DNA replication and recombination. This includes binding sites for initiation of replication (origin of replication), regions where transfer is initiated during the conjugation or mobilization (origin of transfer), starting sites for DNA duplication (origin of replication) and regions which are eliminated through any of kind of recombination. Also nucleosome exclusion regions, i.e. 
specific patterns or regions which exclude nucleosomes (the basic structural units of eukaryotic chromatin which play a significant role in regulating gene expression). - Nucleic acids binding to some other molecule. - Matrix/scaffold attachment region - - - - - - - - - - - Nucleic acid repeats - - true - beta13 - This includes long terminal repeats (LTRs); sequences (typically retroviral) directly repeated at both ends of a defined sequence and other types of repeating unit. - Repetitive elements within a nucleic acid sequence. - 1,13 - - - - - - - - - - DNA replication and recombination - - DNA replication or recombination. - beta13 - true - - - - - - - - - - Signal or transit peptide - - beta13 - 1.13 - true - Coding sequences for a signal or transit peptide. - - - - - - - - - - Sequence tagged sites - - beta13 - 1.13 - Sequence tagged sites (STS) in nucleic acid sequences. - true - - - - - - - - - - Sequencing - - Resequencing - true - http://purl.bioontology.org/ontology/MSH/D059014 - Chromosome walking - NGS - Next gen sequencing - DNA-Seq - High throughput sequencing - 1.1 - Primer walking - Next generation sequencing - The determination of complete (typically nucleotide) sequences, including those of genomes (full genome sequencing, de novo sequencing and resequencing), amplicons and transcriptomes. - - - - - - - - - - - ChIP-seq - - - Chip sequencing - 1.1 - The analysis of protein-DNA interactions where chromatin immunoprecipitation (ChIP) is used in combination with massively parallel DNA sequencing to identify the binding sites of DNA-associated proteins. - Chip Seq - Chip-sequencing - - - - - - - - - RNA-Seq - - Small RNA-seq - Whole transcriptome shotgun sequencing - RNA-seq - miRNA-seq - 1.1 - A topic concerning high-throughput sequencing of cDNA to measure the RNA content (transcriptome) of a sample, for example, to investigate how different alleles of a gene are expressed, detect post-transcriptional mutations or identify gene fusions. 
- Small RNA-Seq - WTSS - This includes small RNA profiling (small RNA-Seq), for example to find novel small RNAs, characterize mutations and analyze expression of small RNAs. - - - - - - - - - DNA methylation - - true - DNA methylation including bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - 1.3 - http://purl.bioontology.org/ontology/MSH/D019175 - 1.1 - - - - - - - - - - Metabolomics - - The systematic study of metabolites, the chemical processes they are involved, and the chemical fingerprints of specific cellular processes in a whole cell, tissue, organ or organism. - true - http://purl.bioontology.org/ontology/MSH/D055432 - 1.1 - - - - - - - - - - - Epigenomics - - - Epigenetics concerns the heritable changes in gene expression owing to mechanisms other than DNA sequence variation. - 1.1 - http://purl.bioontology.org/ontology/MSH/D057890 - The study of the epigenetic modifications of a whole cell, tissue, organism etc. - true - - - - - - - - - - - Metagenomics - - - http://purl.bioontology.org/ontology/MSH/D056186 - Ecogenomics - Community genomics - Environmental genomics - true - 1.1 - The study of genetic material recovered from environmental samples, and associated environmental data. - - - - - - - - - - - DNA structural variation - - - 1.1 - Variation in chromosome structure including microscopic and submicroscopic types of variation such as deletions, duplications, copy-number variants, insertions, inversions and translocations. - Structural variation - Genomic structural variation - - - - - - - - - - DNA packaging - - Nucleosome positioning - beta12orEarlier - DNA-histone complexes (chromatin), organisation of chromatin into nucleosomes and packaging into higher-order structures. 
- http://purl.bioontology.org/ontology/MSH/D042003 - - - - - - - - - - DNA-Seq - - 1.1 - A topic concerning high-throughput sequencing of randomly fragmented genomic DNA, for example, to investigate whole-genome sequencing and resequencing, SNP discovery, identification of copy number variations and chromosomal rearrangements. - 1.3 - DNA-seq - true - - - - - - - - - - RNA-Seq alignment - - true - 1.3 - RNA-seq alignment - The alignment of sequences of (typically millions) of short reads to a reference genome. This is a specialised topic within sequence alignment, especially because of complications arising from RNA splicing. - beta12orEarlier - - - - - - - - - - ChIP-on-chip - - ChiP - ChIP-Chip - 1.1 - Experimental techniques that combine chromatin immunoprecipitation ('ChIP') with microarray ('chip'). ChIP-on-chip is used for high-throughput study protein-DNA interactions. - ChIP-chip - - - - - - - - - Data security - - 1.3 - Data privacy - The protection of data, such as patient health data, from damage or unwanted access from unauthorized users. - - - - - - - - - - Sample collections - - samples - biobanking - 1.3 - biosamples - Biological samples and specimens. - Specimen collections - - - - - - - - - - - Biochemistry - - - VT 1.5.4 Biochemistry and molecular biology - Chemical biology - 1.3 - Biological chemistry - true - Chemical substances and physico-chemical processes and that occur within living organisms. - - - - - - - - - - - Phylogenetics - - - The study of evolutionary relationships amongst organisms from analysis of genetic information (typically gene or protein sequences). - 1.3 - http://purl.bioontology.org/ontology/MSH/D010802 - true - - - - - - - - - - Epigenetics - - Topic concerning the study of heritable changes, for example in gene expression or phenotype, caused by mechanisms other than changes in the DNA sequence. - This includes sub-topics such as histone modification and DNA methylation. 
DNA methylation includes bisulfite sequencing, methylation sites and analysis, for example of patterns and profiles of DNA methylation in a population, tissue etc. - http://purl.bioontology.org/ontology/MSH/D019175 - DNA methylation - Bisulfite sequencing - Histone modification - true - 1.3 - - - - - - - - - - - Biotechnology - - true - 1.3 - The exploitation of biological process, structure and function for industrial purposes, for example the genetic manipulation of microorganisms for the antibody production. - - - - - - - - - - - Phenomics - - - - Phenomes, or the study of the change in phenotype (the physical and biochemical traits of organisms) in response to genetic and environmental factors. - 1.3 - true - - - - - - - - - - - Evolutionary biology - - VT 1.5.16 Evolutionary biology - true - 1.3 - The evolutionary processes, from the genetic to environmental scale, that produced life in all its diversity. - - - - - - - - - - - Physiology - - The functions of living organisms and their constituent parts. - 1.3 - VT 3.1.8 Physiology - true - - - - - - - - - - - Microbiology - - true - The biology of microorganisms. - 1.3 - VT 1.5.20 Microbiology - - - - - - - - - - - Parasitology - - true - 1.3 - The biology of parasites. - - - - - - - - - - - Medicine - - General medicine - Research in support of healing by diagnosis, treatment, and prevention of disease. - true - 1.3 - VT 3.1 Basic medicine - VT 3.2.9 General and internal medicine - Experimental medicine - Biomedical research - Clinical medicine - VT 3.2 Clinical medicine - Internal medicine - - - - - - - - - - - Neurobiology - - Neuroscience - 1.3 - true - The study of the nervous system and brain; its anatomy, physiology and function. - VT 3.1.5 Neuroscience - - - - - - - - - - - Public health and epidemiology - - VT 3.3.1 Epidemiology - Topic concerning the the patterns, cause, and effect of disease within populations. 
- true - 1.3 - Public health - Epidemiology - - - - - - - - - - - Biophysics - - - 1.3 - true - VT 1.5.9 Biophysics - The use of physics to study biological system. - - - - - - - - - - - Computational biology - - VT 1.5.19 Mathematical biology - VT 1.5.12 Computational biology - This includes the modeling and treatment of biological processes and systems in mathematical terms (theoretical biology). - Mathematical biology - VT 1.5.26 Theoretical biology - Theoretical biology - 1.3 - The development and application of theory, analytical methods, mathematical models and computational simulation of biological systems. - true - Biomathematics - - - - - - - - - - - Transcriptomics - - - Comparative transcriptomics - Metatranscriptomics - The analysis of transcriptomes, or a set of all the RNA molecules in a specific cell, tissue etc. - Transcriptome - 1.3 - true - - - - - - - - - - - Chemistry - - VT 1.7.10 Polymer science - VT 1.7.7 Mathematical chemistry - VT 1.7.3 Colloid chemistry - 1.3 - Mathematical chemistry - Physical chemistry - VT 1.7.9 Physical chemistry - Polymer science - Chemical science - Organic chemistry - VT 1.7.6 Inorganic and nuclear chemistry - VT 1.7 Chemical sciences - VT 1.7.5 Electrochemistry - Inorganic chemistry - VT 1.7.2 Chemistry - Nuclear chemistry - VT 1.7.8 Organic chemistry - The composition and properties of matter, reactions, and the use of reactions to create new substances. - - - - - - - - - - - Mathematics - - The study of numbers (quantity) and other topics including structure, space, and change. - VT:1.1 Mathematics - Maths - VT 1.1.99 Other - 1.3 - - - - - - - - - - - Computer science - - 1.3 - VT 1.2 Computer sciences - VT 1.2.99 Other - The theory and practical use of computer systems. - - - - - - - - - - - Physics - - The study of matter, space and time, and related concepts such as energy and force. 
- 1.3 - - - - - - - - - - - RNA splicing - - - This includes the study of splice sites, splicing patterns, alternative splicing events and variants, isoforms, etc.. - Splice sites - RNA splicing; post-transcription RNA modification involving the removal of introns and joining of exons. - 1.3 - Alternative splicing - true - - - - - - - - - - Molecular genetics - - - 1.3 - The structure and function of genes at a molecular level. - true - - - - - - - - - - - Respiratory medicine - - true - VT 3.2.25 Respiratory systems - Pulmonology - The study of respiratory system. - Pulmonary medicine - Respiratory disease - 1.3 - Pulmonary disorders - - - - - - - - - - - Metabolic disease - - The study of metabolic diseases. - 1.4 - 1.3 - true - - - - - - - - - - Infectious disease - - Transmissable disease - VT 3.3.4 Infectious diseases - Communicable disease - The branch of medicine that deals with the prevention, diagnosis and management of transmissable disease with clinically evident illness resulting from infection with pathogenic biological agents (viruses, bacteria, fungi, protozoa, parasites and prions). - 1.3 - - - - - - - - - - - Rare diseases - - 1.3 - The study of rare diseases. - - - - - - - - - - - Computational chemistry - - - 1.3 - VT 1.7.4 Computational chemistry - true - Topic concerning the development and application of theory, analytical methods, mathematical models and computational simulation of chemical systems. - - - - - - - - - - - Neurology - - Neurological disorders - true - 1.3 - The branch of medicine that deals with the anatomy, functions and disorders of the nervous system. - - - - - - - - - - - Cardiology - - true - Cardiovascular disease - VT 3.2.4 Cardiac and Cardiovascular systems - 1.3 - Cardiovascular medicine - Heart disease - VT 3.2.22 Peripheral vascular disease - The diseases and abnormalities of the heart and circulatory system. - - - - - - - - - - - Drug discovery - - - The discovery and design of drugs or potential drug compounds. 
- This includes methods that search compound collections, generate or analyse drug 3D conformations, identify drug targets with structural docking etc. - 1.3 - true - - - - - - - - - - - Biobank - - true - biobanking - 1.3 - Repositories of biological samples, typically human, for basic biological and clinical research. - Tissue collection - - - - - - - - - - - Mouse clinic - - 1.3 - Laboratory study of mice, for example, phenotyping, and mutagenesis of mouse cell lines. - - - - - - - - - - - Microbial collection - - Collections of microbial cells including bacteria, yeasts and moulds. - 1.3 - - - - - - - - - - - Cell culture collection - - 1.3 - Collections of cells grown under laboratory conditions, specifically, cells from multi-cellular eukaryotes and especially animal cells. - - - - - - - - - - - Clone library - - 1.3 - Collections of DNA, including both collections of cloned molecules, and populations of micro-organisms that store and propagate cloned DNA. - - - - - - - - - - - Translational medicine - - 'translating' the output of basic and biomedical research into better diagnostic tools, medicines, medical procedures, policies and advice. - true - 1.3 - - - - - - - - - - - Compound libraries and screening - - Translational medicine - Chemical library - Collections of chemicals, typically for use in high-throughput screening experiments. - Compound library - Chemical screening - 1.3 - - - - - - - - - - - Biomedical science - - Topic concerning biological science that is (typically) performed in the context of medicine. - true - VT 3.3 Health sciences - Health science - 1.3 - - - - - - - - - - - Data identity and mapping - - Topic concerning the identity of biological entities, or reports on such entities, and the mapping of entities and records in different databases. - 1.3 - - - - - - - - - - - Sequence search - - 1.3 - Sequence database search - true - 1.12 - The search and retrieval from a database on the basis of molecular sequence similarity. 
- - - - - - - - - - Biomarkers - - Diagnostic markers - 1.4 - Objective indicators of biological state often used to assess health, and determinate treatment. - true - - - - - - - - - - Laboratory techniques - - The procedures used to conduct an experiment. - Lab techniques - 1.4 - - - - - - - - - - - Data architecture, analysis and design - - The development of policies, models and standards that cover data acquisitioin, storage and integration, such that it can be put to use, typically through a process of systematically applying statistical and / or logical techniques to describe, illustrate, summarise or evaluate data. - Data analysis - Data design - 1.4 - Data architecture - - - - - - - - - - - Data integration and warehousing - - The combination and integration of data from different sources, for example into a central repository or warehouse, to provide users with a unified view of these data. - - - Data integration - 1.4 - Data warehousing - - - - - - - - - - - Biomaterials - - Any matter, surface or construct that interacts with a biological system. - Diagnostic markers - 1.4 - - - - - - - - - - - Chemical biology - - - true - 1.4 - The use of synthetic chemistry to study and manipulate biological systems. - - - - - - - - - - - Analytical chemistry - - 1.4 - The study of the separation, identification, and quantification of the chemical components of natural and artificial materials. - VT 1.7.1 Analytical chemistry - - - - - - - - - - - Synthetic chemistry - - Synthetic organic chemistry - The use of chemistry to create new compounds. - 1.4 - - - - - - - - - - - Software engineering - - VT 1.2.1 Algorithms - Programming languages - VT 1.2.7 Data structures - Software development - Software engineering - Computer programming - 1.4 - 1.2.12 Programming languages - The process that leads from an original formulation of a computing problem to executable programs. 
- Data structures - Algorithms - VT 1.2.14 Software engineering - - - - - - - - - - - Drug development - - 1.4 - Medicine development - The process of bringing a new drug to market once a lead compounds has been identified through drug discovery. - Drug development science - Medicines development - true - - - - - - - - - - - Drug formulation and delivery - - The process of formulating abd administering a pharmaceutical compound to achieve a therapeutic effect. - Drug delivery - Drug formulation - 1.4 - - - - - - - - - - - Pharmacokinetics and pharmacodynamics - - Pharmacodynamics - Pharmacokinetics - Drug distribution - true - 1.4 - Drug excretion - The study of how a drug interacts with the body. - Drug absorption - ADME - Drug metabolism - Drug metabolism - - - - - - - - - - - Medicines research and development - Medicine research and development - - The discovery, development and approval of medicines. - Health care research - Drug discovery and development - 1.4 - Health care science - - - - - - - - - - - Safety sciences - - 1.4 - Drug safety - The safety (or lack) of drugs and other medical interventions. - - - - - - - - - - - Pharmacovigilence - - 1.4 - Pharmacovigilence concerns safety once a drug has gone to market. - The detection, assesment, understanding and prevention of adverse effects of medicines. - - - - - - - - - - - Preclinical and clinical studies - - - The testing of new medicines, vaccines or procedures on animals (preclinical) and humans (clinical) prior to their approval by regulatory authorities. - Preclinical studies - 1.4 - Clinical study - Preclinical study - Clinical studies - - - - - - - - - - - Imaging - - true - Microscopy imaging - Microscopy - Diffraction experiment - The visual representation of an object. 
- This includes diffraction experiments that are based upon the interference of waves, typically electromagnetic waves such as X-rays or visible light, by some object being studied, typical in order to produce an image of the object or determine its structure. - 1.4 - - - - - - - - - - - Biological imaging - - The use of imaging techniques to understand biology. - 1.4 - - - - - - - - - - - Medical imaging - - VT 3.2.24 Radiology - The use of imaging techniques for clinical purposes for medical research. - 1.4 - Radiology - VT 3.2.14 Nuclear medicine - Nuclear medicine - VT 3.2.13 Medical imaging - - - - - - - - - - - Light microscopy - - The use of optical instruments to magnify the image of an object. - 1.4 - - - - - - - - - - - Laboratory animal science - - 1.4 - The use of animals and alternatives in experimental research. - - - - - - - - - - - Marine biology - - 1.4 - VT 1.5.18 Marine and Freshwater biology - true - The study of organisms in the ocean or brackish waters. - - - - - - - - - - - Molecular medicine - - The identification of molecular and genetic causes of disease and the development of interventions to correct them. - 1.4 - true - - - - - - - - - - - Nutritional science - - 1.4 - VT 3.3.7 Nutrition and Dietetics - Dietetics - The study of the effects of food components on the metabolism, health, performance and disease resistance of humans and animals. It also includes the study of human behaviours related to food choices. - Nutrition science - - - - - - - - - - - Omics - - true - The collective characterisation and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms. - 1.4 - - - - - - - - - - - Quality affairs - - The processes that need to be in place to ensure the quality of products for human or animal use. 
- Good clinical practice - Good manufacturing practice - Quality assurance - Good laboratory practice - 1.4 - - - - - - - - - - - Regulatory affairs - - The protection of public health by controlling the safety and efficacy of products in areas including pharmaceuticals, veterinary medicine, medical devices, pesticides, agrochemicals, cosmetics, and complementary medicines. - 1.4 - - - - - - - - - - - Regnerative medicine - - Stem cell research - Biomedical approaches to clinical interventions that involve the use of stem cells. - true - 1.4 - - - - - - - - - - - Systems medicine - - true - 1.4 - An interdisciplinary field of study that looks at the dynamic systems of the human body as part of an integrted whole, incoporating biochemical, physiological, and environmental interactions that sustain life. - - - - - - - - - - - Veterinary medicine - - Topic concerning the branch of medicine that deals with the prevention, diagnosis, and treatment of disease, disorder and injury in animals. - 1.4 - - - - - - - - - - - Bioengineering - - 1.4 - The application of biological concepts and methods to the analytical and synthetic methodologies of engineering. - Diagnostic markers - - - - - - - - - - - Geriatric medicine - - The branch of medicine dealing with the diagnosis, treatment and prevention of disease in older people, and the problems specific to aging. - VT 3.2.10 Geriatrics and gerontology - true - Ageing - Gerontology - Aging - 1.4 - Geriatrics - - - - - - - - - - - Allergy, clinical immunology and immunotherapeutics. - - VT 3.2.1 Allergy - Health issues related to the immune system and their prevention, diagnosis and mangement. - 1.4 - true - Immune disorders - Clinical immunology - Immunomodulators - Allergy - Immunotherapeutics - - - - - - - - - - - Pain medicine - - 1.4 - Algiatry - true - The prevention of pain and the evaluation, treatment and rehabilitation of persons in pain. 
- - - - - - - - - - - Anaesthesiology - - Anaesthetics - Anaesthesia and anaesthetics. - 1.4 - VT 3.2.2 Anaesthesiology - - - - - - - - - - - Critical care medicine - - Acute medicine - VT 3.2.5 Critical care/Emergency medicine - Emergency medicine - 1.4 - The multidisciplinary that cares for patients with acute, life-threatening illness or injury. - - - - - - - - - - - Dermatology - - The branch of medicine that deals with prevention, diagnosis and treatment of disorders of the skin, scalp, hair and nails. - Dermatological disorders - 1.4 - VT 3.2.7 Dermatology and venereal diseases - - - - - - - - - - - Dentistry - - 1.4 - The study, diagnosis, prevention and treatments of disorders of the oral cavity, maxillofacial area and adjacent structures. - - - - - - - - - - - Ear, nose and throat medicine - - Otolaryngology - 1.4 - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the ear, nose and throat. - Otorhinolaryngology - Head and neck disorders - VT 3.2.20 Otorhinolaryngology - Audiovestibular medicine - - - - - - - - - - - Endocrinology and metabolism - - 1.4 - Metabolic disorders - true - The branch of medicine dealing with diseases of endocrine organs, hormone systems, their target organs, and disorders of the pathways of glucose and lipid metabolism. - Metabolism - Endocrinology - Endocrine disorders - - - - - - - - - - - Haematology - - VT 3.2.11 Hematology - true - The branch of medicine that deals with the blood, blood-forming organs and blood diseases. - Haematological disorders - 1.4 - Blood disorders - - - - - - - - - - - Gastroenterology - - true - The branch of medicine that deals with disorders of the oesophagus, stomach, duodenum, jejenum, ileum, large intestine, sigmoid colon and rectum. 
- Gastrointestinal disorders - VT 3.2.8 Gastroenterology and hepatology - 1.4 - - - - - - - - - - - Gender medicine - - The study of the biological and physiological differences between males and females and how they effect differences in disease presentation and management. - 1.4 - - - - - - - - - - - Gynaecology and obstetrics - - The branch of medicine that deals with the health of the female reproductive system, pregnancy and birth. - true - 1.4 - VT 3.2.15 Obstetrics and gynaecology - Gynaecology - Gynaecological disorders - Obstetrics - - - - - - - - - - - Hepatic and biliary medicine - - Hepatobiliary medicine - Liver disorders - 1.4 - true - The branch of medicine that deals with the liver, gallbladder, bile ducts and bile. - - - - - - - - - - - Infectious tropical disease - - The branch of medicine that deals with the infectious diseases of the tropics. - 1.13 - true - 1.4 - - - - - - - - - - Trauma medicine - - 1.4 - The branch of medicine that treats body wounds or shock produced by sudden physical injury, as from violence or accident. - - - - - - - - - - - Medical toxicology - - true - The branch of medicine that deals with the diagnosis, management and prevention of poisoning and other adverse health effects caused by medications, occupational and environmental toxins, and biological agents. - 1.4 - - - - - - - - - - - Musculoskeletal medicine - - The branch of medicine that deals with the prevention, diagnosis, and treatment of disorders of the muscle, bone and connective tissue. It incorporates aspects of orthopaedics, rheumatology, rehabilitation medicine and pain medicine. 
- VT 3.2.26 Rheumatology - VT 3.2.19 Orthopaedics - Musculoskeletal disorders - Orthopaedics - Rheumatology - 1.4 - - - - - - - - - - - Opthalmology - - Eye disoders - VT 3.2.18 Optometry - 1.4 - Optometry - VT 3.2.17 Ophthalmology - Audiovestibular medicine - The branch of medicine that deals with disorders of the eye, including eyelid, optic nerve/visual pathways and occular muscles. - - - - - - - - - - - Paediatrics - - 1.4 - The branch of medicine that deals with the medical care of infants, children and adolescents. - VT 3.2.21 Paediatrics - Child health - - - - - - - - - - - Psychiatry - - The branch of medicine that deals with the mangement of mental illness, emotional disturbance and abnormal behaviour. - 1.4 - Psychiatric disorders - VT 3.2.23 Psychiatry - Mental health - - - - - - - - - - - Reproductive health - - Reproductive disorders - Audiovestibular medicine - VT 3.2.3 Andrology - Andrology - 1.4 - Family planning - The health of the reproductive processes, functions and systems at all stages of life. - Fertility medicine - - - - - - - - - - - Surgery - - Transplantation - VT 3.2.28 Transplantation - The use of operative, manual and instrumental techniques on a patient to investigate and/or treat a pathological condition or help improve bodily function or appearance. - 1.4 - - - - - - - - - - - Urology and nephrology - - The branches of medicine and physiology focussing on the function and disorders of the urinary system in males and females, the reproductive system in males, and the kidney. - VT 3.2.29 Urology and nephrology - 1.4 - Urology - Kidney disease - Urological disorders - Nephrology - - - - - - - - - - - Complementary medicine - - Medical therapies that fall beyond the scope of conventional medicine but may be used alongside it in the treatment of disease and ill health. 
- VT 3.2.12 Integrative and Complementary medicine - Holistic medicine - 1.4 - Alternative medicine - Integrative medicine - - - - - - - - - - - MRI - - Nuclear magnetic resonance imaging - 1.7 - MRT - Magnetic resonance tomography - Techniques that uses magnetic fields and radiowaves to form images, typically to investigate the anatomy and physiology of the human body. - NMRI - Magnetic resonance imaging - - - - - - - - - - - Neutron diffraction - - - The study of matter by studying the diffraction pattern from firing neutrons at a sample, typically to determine atomic and/or magnetic structure. - Neutron microscopy - Elastic neutron scattering - 1.7 - Neutron diffraction experiment - - - - - - - - - - Tomography - - X-ray tomography - Imaging in sections (sectioning), through the use of a wave-generating device (tomograph) that generates an image (a tomogram). - Electron tomography - 1.7 - - - - - - - - - - Data mining - - 1.7 - VT 1.3.2 Data mining - The discovery of patterns in large data sets and the extraction and trasnsformation of those patterns into a useful format. - true - KDD - Knowledge discovery in databases - - - - - - - - - - Machine learning - - A topic concerning the application of artificial intelligence methods to algorithms, in order to create methods that can learn from data in order to generate an ouput, rather than relying on explicitly encoded information only. - Artificial Intelligence - 1.7 - VT 1.2.2 Artificial Intelligence (expert systems, machine learning, robotics) - - - - - - - - - - Database management - - File management - Document, record and content management - Database administration - This includes databases for the results of scientific experiments, the application of high-throughput technology, computational analysis and the scientific literature. It covers the management and manipulation of digital documents, including database records, files and reports. 
- Document management - Content management - 1.8 - Databases - Data maintenance - The general handling of data stored in digital archives such as databanks, databases proper, web portals and other data resources. - - Record management - Biological databases - - - - - - - - - - Animals - - 1.8 - Animal biology - Animals, e.g. information on a specific animal genome including molecular sequences, genes and annotation. - Zoology - Animal - VT 1.5.29 Zoology - The resource may be specific to a plant, a group of plants or all plants. - Metazoa - - - - - - - - - - Protein sites, features and motifs - - - A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Protein sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in protein sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Signal peptide cleavage sites - - - - - - - - - - Nucleic acid sites, features and motifs - - - Primer binding sites - Nucleic acid functional sites - Sequence tagged sites - Nucleic acid sequence features - 1.8 - The biology, archival, detection, prediction and analysis of positional features such as functional and other key sites, in nucleic acid sequences and the conserved patterns (motifs, profiles etc.) that may be used to describe them. - Sequence tagged sites are short DNA sequences that are unique within a genome and serve as a mapping landmark, detectable by PCR they allow a genome to be mapped via an ordering of STSs. 
- - - - - - - - - - Gene transcripts - - - EST - This includes Introns, and protein-coding regions including coding sequences (CDS), exons, translation initiation sites and open reading frames. Also expressed sequence tag (EST) or complementary DNA (cDNA) sequences. - Transcription - mRNA features - This includes regions or sites in a eukaryotic and eukaryotic viral RNA sequence which directs endonuclease cleavage or polyadenylation of an RNA transcript. A polyA signal is required for endonuclease cleavage of an RNA transcript that is followed by polyadenylation. A polyA site is a site on an RNA transcript to which adenine residues will be added during post-transcriptional polyadenylation. - cDNA - Introns - PolyA site - Fusion transcripts - Exons - Signal peptide coding sequence - This includes coding sequences for a signal or transit peptide. A signal peptide coding sequence encodes an N-terminal domain of a secreted protein, which is involved in attaching the polypeptide to a membrane leader sequence. A transit peptide coding sequence encodes an N-terminal domain of a nuclear-encoded organellar protein; which is involved in import of the protein into the organelle. - Transcription of DNA into RNA and features of a messenger RNA (mRNA) molecules including precursor RNA, primary (unprocessed) transcript and fully processed molecules. - 1.8 - PolyA signal - mRNA - Transit peptide coding sequence - This includes 5'untranslated region (5'UTR), coding sequences (CDS), exons, intervening sequences (intron) and 3'untranslated regions (3'UTR). - Coding RNA - Gene transcript features - - - - - - - - - - Protein-ligand interactions - - true - 1.8 - Protein-ligand (small molecule) interaction(s). - 1.13 - Protein-drug interactions - - - - - - - - - - Protein-drug interactions - - 1.13 - 1.8 - true - Protein-drug interaction(s). - - - - - - - - - - Genotyping experiment - - 1.8 - Genotype experiment including case control, population, and family studies. 
These might use array based methods and re-sequencing methods. - - - - - - - - - - GWAS study - - 1.8 - Genome-wide association study experiments. - Genome-wide association study - - - - - - - - - - Microarray experiment - - ChIP-chip - Microarray experiments including conditions, protocol, sample:data relationships etc. - Microarrays - Tissue microarray - Reverse phase protein array - Methylation array - mRNA microarray - Multichannel microarray - Proprietary platform micoarray - MicroRNA array - 1.8 - Two channel microarray - miRNA array - This might specify which raw data file relates to which sample and information on hybridisations, e.g. which are technical and which are biological replicates. - One channel microarray - ChIP-on-chip - Genotyping array - - - - - - - - - - PCR experiment - - 1.8 - PCR experiments, e.g. quantitative real-time PCR. - - - - - - - - - - Proteomics experiment - - Proteomics experiments. - Northern blot experiment - 2D PAGE experiment - 1.8 - This includes two-dimensional gel electrophoresis (2D PAGE) experiments, gels or spots in a gel. Also mass spectrometry - an analytical chemistry technique that measures the mass-to-charge ratio and abundance of irons in the gas phase. Also Northern blot experiments. - Mass spectrometry - - - - - - - - - - 2D PAGE experiment - - true - Two-dimensional gel electrophoresis experiments, gels or spots in a gel. - 1.8 - 1.13 - - - - - - - - - - Northern blot experiment - - Northern Blot experiments. - true - 1.13 - 1.8 - - - - - - - - - - RNAi experiment - - 1.8 - RNAi experiments. - - - - - - - - - - Simulation experiment - - 1.8 - Biological computational model experiments (simulation), for example the minimum information required in order to permit its correct interpretation and reproduction. - - - - - - - - - - Protein-nucleic acid interactions - - true - 1.8 - Protein-DNA/RNA interaction(s). 
- 1.13 - - - - - - - - - - Protein-protein interactions - - 1.13 - Protein-protein interaction(s), including interactions between protein domains. - 1.8 - true - - - - - - - - - - Cellular process pathways - - 1.8 - Cellular process pathways. - true - 1.13 - - - - - - - - - - Disease pathways - - 1.13 - Disease pathways, typically of human disease. - true - 1.8 - - - - - - - - - - Environmental information processing pathways - - true - Environmental information processing pathways. - 1.8 - 1.13 - - - - - - - - - - Genetic information processing pathways - - true - 1.8 - Genetic information processing pathways. - 1.13 - - - - - - - - - - Protein super-secondary structure - - Super-secondary structure of protein sequence(s). - true - 1.8 - 1.13 - - - - - - - - - - Protein active sites - - 1.8 - 1.13 - true - Catalytic residues (active site) of an enzyme. - - - - - - - - - - Protein binding sites - - Protein functional sites - Enzyme active site - Binding sites in proteins, including cleavage sites (for a proteolytic enzyme or agent), key residues involved in protein folding, catalytic residues (active site) of an enzyme, ligand-binding (non-catalytic) residues of a protein, such as sites that bind metal, prosthetic groups or lipids, RNA and DNA-binding proteins and binding sites etc. - Protein-nucleic acid binding sites - 1.8 - Protein cleavage sites - Protein key folding sites - - - - - - - - - - Protein-nucleic acid binding sites - - RNA and DNA-binding proteins and binding sites in protein sequences. - 1.13 - 1.8 - true - - - - - - - - - - Protein cleavage sites - - Cleavage sites (for a proteolytic enzyme or agent) in a protein sequence. - true - 1.8 - 1.13 - - - - - - - - - - Protein chemical modifications - - true - Chemical modification of a protein. - 1.13 - 1.8 - - - - - - - - - - Protein disordered structure - - Disordered structure in a protein. 
- 1.8 - Protein features (disordered structure) - - - - - - - - - - Protein domains - - true - 1.13 - Structural domains or 3D folds in a protein or polypeptide chain. - 1.8 - - - - - - - - - - Protein key folding sites - - 1.8 - 1.13 - true - Key residues involved in protein folding. - - - - - - - - - - Protein post-translational modifications - - true - 1.13 - Post-translation modifications in a protein sequence, typically describing the specific sites involved. - 1.8 - - - - - - - - - - Protein secondary structure - - The location and size of the secondary structure elements and intervening loop regions is typically given. The report can include disulphide bonds and post-translationally formed peptide bonds (crosslinks). - Secondary structure (predicted or real) of a protein, including super-secondary structure. - Protein super-secondary structure - Super-secondary structures include leucine zippers, coiled coils, Helix-Turn-Helix etc. - Protein features (secondary structure) - 1.8 - - - - - - - - - - Protein sequence repeats - - true - 1.8 - Short repetitive subsequences (repeat sequences) in a protein sequence. - 1.13 - - - - - - - - - - Protein signal peptides - - 1.13 - Signal peptides or signal peptide cleavage sites in protein sequences. - true - 1.8 - - - - - - - - - - Protein interaction experiment - - 1.12 - Yeast one-hybrid - Co-immunoprecipitation - An experiment for studying protein-protein interactions. - Yeast two-hybrid - Phage display - - - - - - - - - - Applied mathematics - - VT 1.1.1 Applied mathematics - The application of mathematics to specific problems in science, typically by the formulation and analysis of mathematical models. - 1.10 - - - - - - - - - - Pure mathematics - - VT 1.1.1 Pure mathematics - The study of abstract mathematical concepts. 
- 1.10 - - - - - - - - - - Data governance - - Data handling - http://purl.bioontology.org/ontology/MSH/D030541 - The control of data entry and maintenance to ensure the data meets defined standards, qualities or constraints. - 1.10 - Data stewardship - - - - - - - - - - Data quality management - - http://purl.bioontology.org/ontology/MSH/D030541 - 1.10 - Data quality - Data integrity - Data clean-up - Data enrichment - The quality, integrity, cleaning up and enrichment of data. - - - - - - - - - - Freshwater biology - - 1.10 - VT 1.5.18 Marine and Freshwater biology - The study of organisms in freshwater ecosystems. - - - - - - - - - - - Human genetics - - true - The study of inheritatnce in human beings. - VT 3.1.2 Human genetics - 1.10 - - - - - - - - - - - Tropical medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - VT 3.3.14 Tropical medicine - - - - - - - - - - - Medical biotechnology - - VT 3.4.1 Biomedical devices - 1.10 - true - VT 3.4.2 Health-related biotechnology - VT 3.4 Medical biotechnology - VT 3.3.14 Tropical medicine - Pharmaceutical biotechnology - Biotechnology applied to the medical sciences and the development of medicines. - - - - - - - - - - - Personalized medicine - - 1.10 - Health problems that are prevalent in tropical and subtropical regions. - Molecular diagnostics - true - VT 3.4.5 Molecular diagnostics - - - - - - - - - - - Immunoprecipitation experiment - - - - Chromatin immunoprecipitation - Experimental techniques to purify a protein-DNA crosslinked complex. Usually sequencing follows e.g. in the techniques ChIP-chip, ChIP-seq and MeDIP-seq. - 1.12 - - - - - - - - - - Whole genome sequencing - - 1.12 - Laboratory technique to sequence the complete DNA sequence of an organism's genome at a single time. 
- WGS - Whole genome resequencing - - - - - - - - - - Methylated DNA immunoprecipitation - - 1.12 - MeDIP-seq - Methylated DNA immunoprecipitation (MeDIP) - Methylation sequencing - Laboratory technique to sequence the methylated regions in DNA. - MeDIP-chip - Bisulfite sequencing - MeDIP - mDIP - - - - - - - - - - Exome sequencing - - 1.1 - Exome capture - Exome sequencing is considered a cheap alternative to whole genome sequencing. - Targeted exome capture - Exome sequence analysis - Laboratory technique to sequence all the protein-coding regions in a genome, i.e., the exome. - Exome analysis - - - - - - - - - - - Experimental design and studies - - Design of experiments - 1.12 - Experimental design - Studies - The design of an experiment intended to test a hypothesis, and describe or explain empirical data obtained under various experimental conditions. - true - - - - - - - - - - - Animal study - - - Challenge study - 1.12 - The design of an experiment involving non-human animals. - - - - - - - - - - Microbial ecology - - - 1.13 - The ecology of microorganisms including their relationship with one another and their environment. - Microbiome - true - Environmental microbiology - - - - - - - - - - Obsolete concept (EDAM) - - 1.2 - Needed for conversion to the OBO format. - An obsolete concept (redefined in EDAM). - true - - - - - - - - - - - - - - diff --git a/stylesheets/github-light.css b/stylesheets/github-light.css new file mode 100644 index 0000000..872a6f4 --- /dev/null +++ b/stylesheets/github-light.css @@ -0,0 +1,116 @@ +/* + Copyright 2014 GitHub Inc. + + Licensed under the Apache License, Version 2.0 (the "License"); + you may not use this file except in compliance with the License. 
+ You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. + +*/ + +.pl-c /* comment */ { + color: #969896; +} + +.pl-c1 /* constant, markup.raw, meta.diff.header, meta.module-reference, meta.property-name, support, support.constant, support.variable, variable.other.constant */, +.pl-s .pl-v /* string variable */ { + color: #0086b3; +} + +.pl-e /* entity */, +.pl-en /* entity.name */ { + color: #795da3; +} + +.pl-s .pl-s1 /* string source */, +.pl-smi /* storage.modifier.import, storage.modifier.package, storage.type.java, variable.other, variable.parameter.function */ { + color: #333; +} + +.pl-ent /* entity.name.tag */ { + color: #63a35c; +} + +.pl-k /* keyword, storage, storage.type */ { + color: #a71d5d; +} + +.pl-pds /* punctuation.definition.string, string.regexp.character-class */, +.pl-s /* string */, +.pl-s .pl-pse .pl-s1 /* string punctuation.section.embedded source */, +.pl-sr /* string.regexp */, +.pl-sr .pl-cce /* string.regexp constant.character.escape */, +.pl-sr .pl-sra /* string.regexp string.regexp.arbitrary-repitition */, +.pl-sr .pl-sre /* string.regexp source.ruby.embedded */ { + color: #183691; +} + +.pl-v /* variable */ { + color: #ed6a43; +} + +.pl-id /* invalid.deprecated */ { + color: #b52a1d; +} + +.pl-ii /* invalid.illegal */ { + background-color: #b52a1d; + color: #f8f8f8; +} + +.pl-sr .pl-cce /* string.regexp constant.character.escape */ { + color: #63a35c; + font-weight: bold; +} + +.pl-ml /* markup.list */ { + color: #693a17; +} + +.pl-mh /* markup.heading */, +.pl-mh .pl-en /* markup.heading entity.name */, +.pl-ms /* meta.separator */ { + color: #1d3e81; + font-weight: bold; +} + +.pl-mq /* 
markup.quote */ { + color: #008080; +} + +.pl-mi /* markup.italic */ { + color: #333; + font-style: italic; +} + +.pl-mb /* markup.bold */ { + color: #333; + font-weight: bold; +} + +.pl-md /* markup.deleted, meta.diff.header.from-file */ { + background-color: #ffecec; + color: #bd2c00; +} + +.pl-mi1 /* markup.inserted, meta.diff.header.to-file */ { + background-color: #eaffea; + color: #55a532; +} + +.pl-mdr /* meta.diff.range */ { + color: #795da3; + font-weight: bold; +} + +.pl-mo /* meta.output */ { + color: #1d3e81; +} + diff --git a/stylesheets/print.css b/stylesheets/print.css new file mode 100644 index 0000000..7da6db0 --- /dev/null +++ b/stylesheets/print.css @@ -0,0 +1,228 @@ +html, body, div, span, applet, object, iframe, +h1, h2, h3, h4, h5, h6, p, blockquote, pre, +a, abbr, acronym, address, big, cite, code, +del, dfn, em, img, ins, kbd, q, s, samp, +small, strike, strong, sub, sup, tt, var, +b, u, i, center, +dl, dt, dd, ol, ul, li, +fieldset, form, label, legend, +table, caption, tbody, tfoot, thead, tr, th, td, +article, aside, canvas, details, embed, +figure, figcaption, footer, header, hgroup, +menu, nav, output, ruby, section, summary, +time, mark, audio, video { + padding: 0; + margin: 0; + font: inherit; + font-size: 100%; + vertical-align: baseline; + border: 0; +} +/* HTML5 display-role reset for older browsers */ +article, aside, details, figcaption, figure, +footer, header, hgroup, menu, nav, section { + display: block; +} +body { + line-height: 1; +} +ol, ul { + list-style: none; +} +blockquote, q { + quotes: none; +} +blockquote:before, blockquote:after, +q:before, q:after { + content: ''; + content: none; +} +table { + border-spacing: 0; + border-collapse: collapse; +} +body { + font-family: 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 13px; + line-height: 1.5; + color: #000; +} + +a { + font-weight: bold; + color: #d5000d; +} + +header { + padding-top: 35px; + padding-bottom: 10px; +} + +header h1 { + font-size: 48px; + 
font-weight: bold; + line-height: 1.2; + color: #303030; + letter-spacing: -1px; +} + +header h2 { + font-size: 24px; + font-weight: normal; + line-height: 1.3; + color: #aaa; + letter-spacing: -1px; +} +#downloads { + display: none; +} +#main_content { + padding-top: 20px; +} + +code, pre { + margin-bottom: 30px; + font-family: Monaco, "Bitstream Vera Sans Mono", "Lucida Console", Terminal; + font-size: 12px; + color: #222; +} + +code { + padding: 0 3px; +} + +pre { + padding: 20px; + overflow: auto; + border: solid 1px #ddd; +} +pre code { + padding: 0; +} + +ul, ol, dl { + margin-bottom: 20px; +} + + +/* COMMON STYLES */ + +table { + width: 100%; + border: 1px solid #ebebeb; +} + +th { + font-weight: 500; +} + +td { + font-weight: 300; + text-align: center; + border: 1px solid #ebebeb; +} + +form { + padding: 20px; + background: #f2f2f2; + +} + + +/* GENERAL ELEMENT TYPE STYLES */ + +h1 { + font-size: 2.8em; +} + +h2 { + margin-bottom: 8px; + font-size: 22px; + font-weight: bold; + color: #303030; +} + +h3 { + margin-bottom: 8px; + font-size: 18px; + font-weight: bold; + color: #d5000d; +} + +h4 { + font-size: 16px; + font-weight: bold; + color: #303030; +} + +h5 { + font-size: 1em; + color: #303030; +} + +h6 { + font-size: .8em; + color: #303030; +} + +p { + margin-bottom: 20px; + font-weight: 300; +} + +a { + text-decoration: none; +} + +p a { + font-weight: 400; +} + +blockquote { + padding: 0 0 0 30px; + margin-bottom: 20px; + font-size: 1.6em; + border-left: 10px solid #e9e9e9; +} + +ul li { + padding-left: 20px; + list-style-position: inside; + list-style: disc; +} + +ol li { + padding-left: 3px; + list-style-position: inside; + list-style: decimal; +} + +dl dd { + font-style: italic; + font-weight: 100; +} + +footer { + padding-top: 20px; + padding-bottom: 30px; + margin-top: 40px; + font-size: 13px; + color: #aaa; +} + +footer a { + color: #666; +} + +/* MISC */ +.clearfix:after { + display: block; + height: 0; + clear: both; + visibility: hidden; + 
content: '.'; +} + +.clearfix {display: inline-block;} +* html .clearfix {height: 1%;} +.clearfix {display: block;} diff --git a/stylesheets/stylesheet.css b/stylesheets/stylesheet.css new file mode 100644 index 0000000..543c951 --- /dev/null +++ b/stylesheets/stylesheet.css @@ -0,0 +1,881 @@ +/*! normalize.css v3.0.2 | MIT License | git.io/normalize */ + +/** + * 1. Set default font family to sans-serif. + * 2. Prevent iOS text size adjust after orientation change, without disabling + * user zoom. + */ + +html { + font-family: sans-serif; /* 1 */ + -webkit-text-size-adjust: 100%; /* 2 */ + -ms-text-size-adjust: 100%; /* 2 */ +} + +/** + * Remove default margin. + */ + +body { + margin: 0; +} + +/* HTML5 display definitions + ========================================================================== */ + +/** + * Correct `block` display not defined for any HTML5 element in IE 8/9. + * Correct `block` display not defined for `details` or `summary` in IE 10/11 + * and Firefox. + * Correct `block` display not defined for `main` in IE 11. + */ + +article, +aside, +details, +figcaption, +figure, +footer, +header, +hgroup, +main, +menu, +nav, +section, +summary { + display: block; +} + +/** + * 1. Correct `inline-block` display not defined in IE 8/9. + * 2. Normalize vertical alignment of `progress` in Chrome, Firefox, and Opera. + */ + +audio, +canvas, +progress, +video { + display: inline-block; /* 1 */ + vertical-align: baseline; /* 2 */ +} + +/** + * Prevent modern browsers from displaying `audio` without controls. + * Remove excess height in iOS 5 devices. + */ + +audio:not([controls]) { + display: none; + height: 0; +} + +/** + * Address `[hidden]` styling not present in IE 8/9/10. + * Hide the `template` element in IE 8/9/11, Safari, and Firefox < 22. + */ + +[hidden], +template { + display: none; +} + +/* Links + ========================================================================== */ + +/** + * Remove the gray background color from active links in IE 10. 
+ */ + +a { + background-color: transparent; +} + +/** + * Improve readability when focused and also mouse hovered in all browsers. + */ + +a:active, +a:hover { + outline: 0; +} + +/* Text-level semantics + ========================================================================== */ + +/** + * Address styling not present in IE 8/9/10/11, Safari, and Chrome. + */ + +abbr[title] { + border-bottom: 1px dotted; +} + +/** + * Address style set to `bolder` in Firefox 4+, Safari, and Chrome. + */ + +b, +strong { + font-weight: bold; +} + +/** + * Address styling not present in Safari and Chrome. + */ + +dfn { + font-style: italic; +} + +/** + * Address variable `h1` font-size and margin within `section` and `article` + * contexts in Firefox 4+, Safari, and Chrome. + */ + +h1 { + margin: 0.67em 0; + font-size: 2em; +} + +/** + * Address styling not present in IE 8/9. + */ + +mark { + color: #000; + background: #ff0; +} + +/** + * Address inconsistent and variable font size in all browsers. + */ + +small { + font-size: 80%; +} + +/** + * Prevent `sub` and `sup` affecting `line-height` in all browsers. + */ + +sub, +sup { + position: relative; + font-size: 75%; + line-height: 0; + vertical-align: baseline; +} + +sup { + top: -0.5em; +} + +sub { + bottom: -0.25em; +} + +/* Embedded content + ========================================================================== */ + +/** + * Remove border when inside `a` element in IE 8/9/10. + */ + +img { + border: 0; +} + +/** + * Correct overflow not hidden in IE 9/10/11. + */ + +svg:not(:root) { + overflow: hidden; +} + +/* Grouping content + ========================================================================== */ + +/** + * Address margin not present in IE 8/9 and Safari. + */ + +figure { + margin: 1em 40px; +} + +/** + * Address differences between Firefox and other browsers. + */ + +hr { + height: 0; + -moz-box-sizing: content-box; + box-sizing: content-box; +} + +/** + * Contain overflow in all browsers. 
+ */ + +pre { + overflow: auto; +} + +/** + * Address odd `em`-unit font size rendering in all browsers. + */ + +code, +kbd, +pre, +samp { + font-family: monospace, monospace; + font-size: 1em; +} + +/* Forms + ========================================================================== */ + +/** + * Known limitation: by default, Chrome and Safari on OS X allow very limited + * styling of `select`, unless a `border` property is set. + */ + +/** + * 1. Correct color not being inherited. + * Known issue: affects color of disabled elements. + * 2. Correct font properties not being inherited. + * 3. Address margins set differently in Firefox 4+, Safari, and Chrome. + */ + +button, +input, +optgroup, +select, +textarea { + margin: 0; /* 3 */ + font: inherit; /* 2 */ + color: inherit; /* 1 */ +} + +/** + * Address `overflow` set to `hidden` in IE 8/9/10/11. + */ + +button { + overflow: visible; +} + +/** + * Address inconsistent `text-transform` inheritance for `button` and `select`. + * All other form control elements do not inherit `text-transform` values. + * Correct `button` style inheritance in Firefox, IE 8/9/10/11, and Opera. + * Correct `select` style inheritance in Firefox. + */ + +button, +select { + text-transform: none; +} + +/** + * 1. Avoid the WebKit bug in Android 4.0.* where (2) destroys native `audio` + * and `video` controls. + * 2. Correct inability to style clickable `input` types in iOS. + * 3. Improve usability and consistency of cursor style between image-type + * `input` and others. + */ + +button, +html input[type="button"], /* 1 */ +input[type="reset"], +input[type="submit"] { + -webkit-appearance: button; /* 2 */ + cursor: pointer; /* 3 */ +} + +/** + * Re-set default cursor for disabled elements. + */ + +button[disabled], +html input[disabled] { + cursor: default; +} + +/** + * Remove inner padding and border in Firefox 4+. 
+ */ + +button::-moz-focus-inner, +input::-moz-focus-inner { + padding: 0; + border: 0; +} + +/** + * Address Firefox 4+ setting `line-height` on `input` using `!important` in + * the UA stylesheet. + */ + +input { + line-height: normal; +} + +/** + * It's recommended that you don't attempt to style these elements. + * Firefox's implementation doesn't respect box-sizing, padding, or width. + * + * 1. Address box sizing set to `content-box` in IE 8/9/10. + * 2. Remove excess padding in IE 8/9/10. + */ + +input[type="checkbox"], +input[type="radio"] { + box-sizing: border-box; /* 1 */ + padding: 0; /* 2 */ +} + +/** + * Fix the cursor style for Chrome's increment/decrement buttons. For certain + * `font-size` values of the `input`, it causes the cursor style of the + * decrement button to change from `default` to `text`. + */ + +input[type="number"]::-webkit-inner-spin-button, +input[type="number"]::-webkit-outer-spin-button { + height: auto; +} + +/** + * 1. Address `appearance` set to `searchfield` in Safari and Chrome. + * 2. Address `box-sizing` set to `border-box` in Safari and Chrome + * (include `-moz` to future-proof). + */ + +input[type="search"] { + -webkit-box-sizing: content-box; /* 2 */ + -moz-box-sizing: content-box; + box-sizing: content-box; + -webkit-appearance: textfield; /* 1 */ +} + +/** + * Remove inner padding and search cancel button in Safari and Chrome on OS X. + * Safari (but not Chrome) clips the cancel button when the search input has + * padding (and `textfield` appearance). + */ + +input[type="search"]::-webkit-search-cancel-button, +input[type="search"]::-webkit-search-decoration { + -webkit-appearance: none; +} + +/** + * Define consistent border, margin, and padding. + */ + +fieldset { + padding: 0.35em 0.625em 0.75em; + margin: 0 2px; + border: 1px solid #c0c0c0; +} + +/** + * 1. Correct `color` not being inherited in IE 8/9/10/11. + * 2. Remove padding so people aren't caught out if they zero out fieldsets. 
+ */ + +legend { + padding: 0; /* 2 */ + border: 0; /* 1 */ +} + +/** + * Remove default vertical scrollbar in IE 8/9/10/11. + */ + +textarea { + overflow: auto; +} + +/** + * Don't inherit the `font-weight` (applied by a rule above). + * NOTE: the default cannot safely be changed in Chrome and Safari on OS X. + */ + +optgroup { + font-weight: bold; +} + +/* Tables + ========================================================================== */ + +/** + * Remove most spacing between table cells. + */ + +table { + border-spacing: 0; + border-collapse: collapse; +} + +td, +th { + padding: 0; +} + +/* LAYOUT STYLES */ +body { + font-family: 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 15px; + font-weight: 400; + line-height: 1.5; + color: #666; + background: #fafafa url(../images/body-bg.jpg) 0 0 repeat; +} + +p { + margin-top: 0; +} + +a { + color: #2879d0; +} +a:hover { + color: #2268b2; +} + +header { + padding-top: 40px; + padding-bottom: 40px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + background: #2e7bcf url(../images/header-bg.jpg) 0 0 repeat-x; + border-bottom: solid 1px #275da1; +} + +header h1 { + width: 540px; + margin-top: 0; + margin-bottom: 0.2em; + font-size: 72px; + font-weight: normal; + line-height: 1; + color: #fff; + letter-spacing: -1px; +} + +header h2 { + width: 540px; + margin-top: 0; + margin-bottom: 0; + font-size: 26px; + font-weight: normal; + line-height: 1.3; + color: #9ddcff; + letter-spacing: 0; +} + +.inner { + position: relative; + width: 940px; + margin: 0 auto; +} + +#content-wrapper { + padding-top: 30px; + border-top: solid 1px #fff; +} + +#main-content { + float: left; + width: 690px; +} + +#main-content img { + max-width: 100%; +} + +aside#sidebar { + float: right; + width: 200px; + min-height: 504px; + padding-left: 20px; + font-size: 12px; + line-height: 1.3; + background: transparent url(../images/sidebar-bg.jpg) 0 0 no-repeat; +} + +aside#sidebar p.repo-owner, +aside#sidebar 
p.repo-owner a { + font-weight: bold; +} + +#downloads { + margin-bottom: 40px; +} + +a.button { + width: 134px; + height: 58px; + padding-top: 22px; + padding-left: 68px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 23px; + line-height: 1.2; + color: #fff; +} +a.button small { + display: block; + font-size: 11px; +} +header a.button { + position: absolute; + top: 0; + right: 0; + background: transparent url(../images/github-button.png) 0 0 no-repeat; +} +aside a.button { + display: block; + width: 138px; + padding-left: 64px; + margin-bottom: 20px; + font-size: 21px; + background: transparent url(../images/download-button.png) 0 0 no-repeat; +} + +code, pre { + margin-bottom: 30px; + font-family: Monaco, "Bitstream Vera Sans Mono", "Lucida Console", Terminal, monospace; + font-size: 13px; + color: #222; +} + +code { + padding: 0 3px; + background-color: #f2f8fc; + border: solid 1px #dbe7f3; +} + +pre { + padding: 20px; + overflow: auto; + text-shadow: none; + background: #fff; + border: solid 1px #f2f2f2; +} +pre code { + padding: 0; + color: #2879d0; + background-color: #fff; + border: none; +} + +ul, ol, dl { + margin-bottom: 20px; +} + + +/* COMMON STYLES */ + +hr { + height: 0; + margin-top: 1em; + margin-bottom: 1em; + border: 0; + border-top: solid 1px #ddd; +} + +table { + width: 100%; + border: 1px solid #ebebeb; +} + +th { + font-weight: 500; +} + +td { + font-weight: 300; + text-align: center; + border: 1px solid #ebebeb; +} + +form { + padding: 20px; + background: #f2f2f2; + +} + + +/* GENERAL ELEMENT TYPE STYLES */ + +#main-content h1 { + margin-top: 0; + margin-bottom: 0; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 2.8em; + font-weight: normal; + color: #474747; + text-indent: 6px; + letter-spacing: -1px; +} + +#main-content h1:before { + padding-right: 0.3em; + margin-left: -0.9em; + color: #9ddcff; + content: "/"; +} + +#main-content h2 { + 
margin-bottom: 8px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 22px; + font-weight: bold; + color: #474747; + text-indent: 4px; +} +#main-content h2:before { + padding-right: 0.3em; + margin-left: -1.5em; + content: "//"; + color: #9ddcff; +} + +#main-content h3 { + margin-top: 24px; + margin-bottom: 8px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 18px; + font-weight: bold; + color: #474747; + text-indent: 3px; +} + +#main-content h3:before { + padding-right: 0.3em; + margin-left: -2em; + content: "///"; + color: #9ddcff; +} + +#main-content h4 { + margin-bottom: 8px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 15px; + font-weight: bold; + color: #474747; + text-indent: 3px; +} + +h4:before { + padding-right: 0.3em; + margin-left: -2.8em; + content: "////"; + color: #9ddcff; +} + +#main-content h5 { + margin-bottom: 8px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: 14px; + color: #474747; + text-indent: 3px; +} +h5:before { + padding-right: 0.3em; + margin-left: -3.2em; + content: "/////"; + color: #9ddcff; +} + +#main-content h6 { + margin-bottom: 8px; + font-family: 'Architects Daughter', 'Helvetica Neue', Helvetica, Arial, serif; + font-size: .8em; + color: #474747; + text-indent: 3px; +} +h6:before { + padding-right: 0.3em; + margin-left: -3.7em; + content: "//////"; + color: #9ddcff; +} + +p { + margin-bottom: 20px; +} + +a { + text-decoration: none; +} + +p a { + font-weight: 400; +} + +blockquote { + padding: 0 0 0 30px; + margin-bottom: 20px; + font-size: 1.6em; + border-left: 10px solid #e9e9e9; +} + +ul { + list-style-position: inside; + list-style: disc; + padding-left: 20px; +} + +ol { + list-style-position: inside; + list-style: decimal; + padding-left: 3px; +} + +dl dd { + font-style: italic; + font-weight: 100; +} + +footer { + padding-top: 20px; + 
padding-bottom: 30px; + margin-top: 40px; + font-size: 13px; + color: #aaa; + background: transparent url('../images/hr.png') 0 0 no-repeat; +} + +footer a { + color: #666; +} +footer a:hover { + color: #444; +} + +/* MISC */ +.clearfix:after { + display: block; + height: 0; + clear: both; + visibility: hidden; + content: '.'; +} + +.clearfix {display: inline-block;} +* html .clearfix {height: 1%;} +.clearfix {display: block;} + +/* #Media Queries +================================================== */ + +/* Smaller than standard 960 (devices and browsers) */ +@media only screen and (max-width: 959px) { } + +/* Tablet Portrait size to standard 960 (devices and browsers) */ +@media only screen and (min-width: 768px) and (max-width: 959px) { + .inner { + width: 740px; + } + header h1, header h2 { + width: 340px; + } + header h1 { + font-size: 60px; + } + header h2 { + font-size: 30px; + } + #main-content { + width: 490px; + } + #main-content h1:before, + #main-content h2:before, + #main-content h3:before, + #main-content h4:before, + #main-content h5:before, + #main-content h6:before { + padding-right: 0; + margin-left: 0; + content: none; + } +} + +/* All Mobile Sizes (devices and browser) */ +@media only screen and (max-width: 767px) { + .inner { + width: 93%; + } + header { + padding: 20px 0; + } + header .inner { + position: relative; + } + header h1, header h2 { + width: 100%; + } + header h1 { + font-size: 48px; + } + header h2 { + font-size: 24px; + } + header a.button { + position: relative; + display: inline-block; + width: auto; + height: auto; + padding: 5px 10px; + margin-top: 15px; + font-size: 13px; + line-height: 1; + color: #2879d0; + text-align: center; + background-color: #9ddcff; + background-image: none; + border-radius: 5px; + -moz-border-radius: 5px; + -webkit-border-radius: 5px; + } + header a.button small { + display: inline; + font-size: 13px; + } + #main-content, + aside#sidebar { + float: none; + width: 100% ! 
important; + } + aside#sidebar { + min-height: 0; + padding: 20px 0; + margin-top: 20px; + background-image: none; + border-top: solid 1px #ddd; + } + aside#sidebar a.button { + display: none; + } + #main-content h1:before, + #main-content h2:before, + #main-content h3:before, + #main-content h4:before, + #main-content h5:before, + #main-content h6:before { + padding-right: 0; + margin-left: 0; + content: none; + } +} + +/* Mobile Landscape Size to Tablet Portrait (devices and browsers) */ +@media only screen and (min-width: 480px) and (max-width: 767px) { } + +/* Mobile Portrait Size to Mobile Landscape Size (devices and browsers) */ +@media only screen and (max-width: 479px) { } + diff --git a/web/EDAM.uris b/web/EDAM.uris deleted file mode 100644 index 1f2e4ef..0000000 --- a/web/EDAM.uris +++ /dev/null @@ -1,73 +0,0 @@ -# =============================================================================================================== -# -# This is the information about the EDAM ontology URI -# -http://edamontology.org -# -# -# =============================================================================================================== -# -# -# For a human-readable Web page about the EDAM ontology, please refer to -# -http://edamontology.org/page -# -# [type: text/html|application/xhtml+xml; ?format=html|htm|xhtml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version of EDAM in RDF/XML (OWL), please refer to -# -http://edamontology.org/EDAM.owl -# -# [type: application/rdf+xml|application/xml|text/xml; ?format=owl|rdf|xml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version of EDAM in OBO format, please refer to -# 
-http://edamontology.org/EDAM.obo -# -# [type: text/plain; ?format=obo|text|txt; charset=us-ascii; language: en] -# -# Note that the OBO-format representation lacks certain details present only in the OWL version -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# Information about the EDAM URI (http://edamontology.org) is in this file right here -# -http://edamontology.org/EDAM.uris -# -# [type: text/uri-list; ?format=uri|url|about; charset=us-ascii; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# The same information about the EDAM URI (http://edamontology.org), in the form of an HTML document, is available at -# -http://edamontology.org/URIs -# -# -# =============================================================================================================== -# -# -# For information about the URIs of the EDAM concepts, please refer to -# -http://edamontology.org/concept.uris -# -# -# -# And for information about the URIs of the EDAM relations and concept properties, please refer to -# -http://edamontology.org/relations-and-properties.uris -# -# -# =============================================================================================================== -# Note that the URI of this file is http://edamontology.org/EDAM.uris, and not http://edamontology.org \ No newline at end of file diff --git a/web/EDAMconcepts.png b/web/EDAMconcepts.png deleted file mode 100644 index 0e7abc7..0000000 Binary files a/web/EDAMconcepts.png and /dev/null differ diff --git a/web/EDAMrelations.png b/web/EDAMrelations.png deleted file mode 100644 index e31e82c..0000000 Binary files a/web/EDAMrelations.png and /dev/null differ diff --git a/web/URIs.html b/web/URIs.html deleted file mode 100644 index f124871..0000000 --- a/web/URIs.html +++ /dev/null @@ -1,36 +0,0 @@ - - - - - - http://edamontology.org - - - - 
- - - - - - - -
-

http://edamontology.org

-
-
- -

http://edamontology.org is the URI of the EDAM ontology.

- -

-

For a human-readable Web page about the EDAM ontology, please refer to http://edamontology.org/page.

-

For a machine-understandable representation of the last stable version of EDAM in RDF/XML (OWL), please refer to http://edamontology.org/EDAM.owl.

-

For a machine-understandable representation of the last stable version of EDAM in OBO format, please refer to http://edamontology.org/EDAM.obo. (Note that the OBO-format representation lacks certain details present only in the OWL version.)

-

For information about the EDAM URI (http://edamontology.org), equivalent to this HTML document, please refer to http://edamontology.org/EDAM.uris.

-

-

For information about the URIs of the EDAM concepts, please refer to http://edamontology.org/concept.uris.

-

And for information about the URIs of the EDAM relations and concept properties, please refer to http://edamontology.org/relations-and-properties.uris.

-

-

Note also that the URI of this document is http://edamontology.org/URIs, and not http://edamontology.org.

- - diff --git a/web/concept.uris b/web/concept.uris deleted file mode 100644 index 3f92419..0000000 --- a/web/concept.uris +++ /dev/null @@ -1,62 +0,0 @@ -# =============================================================================================================== -# -# -# This is the information about the URI of an EDAM concept -# -# -# -# The EDAM concept URI has the form http://edamontology.org/_ -# -# As a regular expression, it is http://edamontology\.org/(data|format|operation|topic)_[0-9]{4} -# -# -# -# NB! This is the only form of URI that is supposed to be used when referring to an EDAM concept. -# (for example in annotation using SAWSDL, or within RDF) -# -# -# =============================================================================================================== -# -# -# For a human-readable Web page about an EDAM concept, please refer to -# -# http://bioportal.bioontology.org/ontologies/1498?p=terms&conceptid=_ -# -# [type: text/html|application/xhtml+xml; ?format=html|htm|xhtml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version of the EDAM concepts in RDF/XML (OWL), please refer to the EDAM OWL file -# -http://edamontology.org/EDAM.owl -# -# [type: application/rdf+xml|application/xml|text/xml; ?format=owl|rdf|xml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version of the EDAM concept in OBO format, please refer to -# -# http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=edam&id=000&format=obo&style=raw -# -# [type: text/plain; ?format=obo|text|txt; charset=utf-8; language: en] -# -# Note that the OBO-format representation lacks certain details present only in the OWL version -# (Note also 
that dbfetch returns raw text in UTF-8) -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# Information about the URIs of EDAM concepts (http://edamontology.org/_) is in this file right here -# -http://edamontology.org/concept.uris -# -# [type: text/uri-list; ?format=uri|url|about; charset=us-ascii; language: en] -# -# -# =============================================================================================================== -# Note that the URI of this file is http://edamontology.org/concept.uris \ No newline at end of file diff --git a/web/favicon.ico b/web/favicon.ico deleted file mode 100644 index 348598a..0000000 Binary files a/web/favicon.ico and /dev/null differ diff --git a/web/favicon.png b/web/favicon.png deleted file mode 100644 index 04218c9..0000000 Binary files a/web/favicon.png and /dev/null differ diff --git a/web/page_1.14.html b/web/page_1.14.html deleted file mode 100644 index 4a8cea0..0000000 --- a/web/page_1.14.html +++ /dev/null @@ -1,1853 +0,0 @@ - - - - - - EDAM: Ontology of bioinformatics operations, types of data, formats, and topics - - - - - - -

EDAM Ontology

-

Bioinformatics operations, types of data, formats, and topics

- -
    -
  1. Introduction
  2. Concepts
  3. Relations
  4. Rules
  5. Sources
  6. Guidelines for annotators
  7. Guidelines for contributors
  8. Existing implementations and annotations with EDAM
- - - - - - - - - - -

Introduction

- - -


Motivation

-

Bioinformaticians handle an increasingly large and diverse set of tools and data. Meanwhile, researchers demand ever more powerful and convenient means to organise, find, understand, compare, select, use and connect the available resources. These tasks often rely on consistent, machine-understandable descriptions of the underlying components, which ad hoc resource descriptions have generally lacked. The urgent need, filled by EDAM, is for an ontology that semantically unifies the bioinformatics concepts in common use, provides the curator with a comprehensive, broadly applicable controlled vocabulary, and supports new and powerful search, browse and query functions.

- - - -


What is EDAM?

- -

EDAM (originally from “EMBRACE Data and Methods”) is an ontology of well established, familiar concepts that are prevalent within bioinformatics, including types of data and data identifiers, data formats, operations and topics. EDAM is a simple ontology - essentially a set of terms with synonyms and definitions - organised into an intuitive hierarchy for convenient use by curators, software developers and end-users.

- - - -


Applications

- -

EDAM is suitable for large-scale semantic annotations and categorization of diverse bioinformatics resources, including:

- -
    -
  • Web services including REST and SOAP APIs
  • Application software
  • Tool collections and packages
  • Workflows / pipelines
  • Databases
  • XML Schemata and data objects
  • Data syntax and file formats
  • Web portals and pages
  • Resource catalogues
  • Training materials
  • Courses, tutorials, and other events
  • Areas of scientific interest
  • Documents, such as scientific publications
- -

EDAM is also suitable for diverse applications, for example within workbenches and workflow-management systems, software distributions, and resource registries. Examples of existing implementations are listed at the end of this document.

- - - - - -


Scope

- -

EDAM includes 4 main sub-ontologies or 'branches' of concepts:

-
    -
  • Data - “Information, represented in an information artefact (data record) that is 'understandable' by dedicated computational tools that can use the data as input or produce it as output.”
  • -
  • Format - “A defined way or layout of representing and structuring data in a computer file, blob, string, message, or elsewhere.”
  • -
  • Operation - “A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs).”
  • -
  • Topic - “A category denoting a rather broad domain or field of interest, of study, application, work, data, or technology. Topics have no clearly defined borders between each other.”
  • -
- -

Noteworthy within the Data sub-ontology is:

-
    -
  • Identifier - “A text token, number or something else which identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things).”
  • -
- - -
-

EDAM concepts

-

Figure 1. The EDAM concepts. Boxes indicate top-level concepts (sub-ontologies or 'branches'), with a couple of specific concepts exemplified.

- - - -
-

As a general rule, the Data, Format, and Operation branches include concepts strictly in the domain of bioinformatics and computational biology: concepts purely concerning biology, computer science, etc. are not included. The Topic branch, however, includes broader inter-disciplinary concepts from the biological and medical domains.

- -

EDAM provides different semantic 'axes' for annotation. For example, annotation of a software tool might include:

-
    -
  • Topic - general scientific domain the software serves, e.g. “Structural biology”
  • -
  • Operation - the precise function of the tool, e.g. “Homology modelling”
  • -
  • Data - the primary input and output, e.g. “Protein structure”
  • -
  • Format - the supported format(s) of the input and output, e.g. “PDB format”
  • -

- - - -
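The four axes listed above can be pictured as a small record attached to a tool. The sketch below is only illustrative: the `ToolAnnotation` class and its field names are hypothetical and not part of EDAM, and the labels are the examples given in the list.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToolAnnotation:
    """Hypothetical record holding one tool's EDAM annotation, by axis."""
    topics: List[str] = field(default_factory=list)
    operations: List[str] = field(default_factory=list)
    data: List[str] = field(default_factory=list)
    formats: List[str] = field(default_factory=list)

    def axes(self) -> Dict[str, List[str]]:
        """Return only the axes that carry at least one concept label."""
        all_axes = {
            "Topic": self.topics,
            "Operation": self.operations,
            "Data": self.data,
            "Format": self.formats,
        }
        return {name: labels for name, labels in all_axes.items() if labels}


# Labels taken from the examples in the list above.
example = ToolAnnotation(
    topics=["Structural biology"],
    operations=["Homology modelling"],
    data=["Protein structure"],
    formats=["PDB format"],
)
```

Keeping each axis as a list reflects that a tool may carry more than one concept per axis.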


Principles

- -

EDAM strives to uphold a few founding principles including:

-
    -
  • Quality - a controlled vocabulary that is moderated and assured via a gatekeeper model
  • -
  • Openness - development in collaboration with the community
  • -
  • Relevance - prioritising use-case-driven development towards comprehensive but practical coverage
  • -
  • Practicality - practical utility is valued over ontological “strictness” or any metaphysical doctrine
  • -
  • Clear scope - respecting the scope of other complementary, well-developed ontologies
  • -
  • Familiarity - including only concepts that are well established, familiar and prevalent; jargon is discouraged
  • -
  • Usability - conceptual hierarchy with sufficient richness but only necessary complexity
  • -
  • Maintainability - development must be efficient and sustainably up to date in the long term
  • -
- -

EDAM is working towards implementing these principles fully and is open to suggestions.

- - - - - -


Architecture

- -

EDAM has 3 components:

-
    -
  • Concepts - All concepts have a name (the term or label) and definition. Further, a concept may have simple relations (see below) to other EDAM concepts, as well as other intrinsic properties, e.g. an identifier may have a regular expression defining its syntax.
  • -
  • Hierarchy - Every concept (excluding top-level concepts) is related to one or more other concepts within the same branch by an is a relation (specialisation). Hence EDAM has 4 primary hierarchies (for Data, Format, Operation, and Topic).
  • -
  • Relations - Concepts are related by defined relation types (see figure below), which reflect well established or self-evident principles, and are used primarily to define internal consistency of EDAM. These have external applications too, e.g. annotations on the Semantic Web.
  • -
- - - -
-

EDAM relations

-

Figure 2. The EDAM architecture is intentionally simple. Boxes indicate top-level concepts (sub-ontologies), and lines indicate types of relations that are maintained between concepts in EDAM.

- - - - -


Download and Status

- -

Version 1.14 of EDAM has been released. Contributions and suggestions are welcome!


- -

Locations for download in OWL format:

-

http://edamontology.org/EDAM.owl (Always points to the last stable version)


- -

Locations for download in OBO format:

-

http://edamontology.org/EDAM.obo (Always points to the last stable version in OBO format. OBO-format version lacks certain details.)

-

Please note that the last stable version of EDAM available in OBO format was version 1.2. Because the conversion from OWL to OBO hasn't yet fully been automated for EDAM, we will only resume providing OBO format in case of substantial demand or full automation of the conversion.


- -

All versions:

-

https://github.com/edamontology/edamontology/releases


- -

The edamontology.org site provides content negotiation with respect to the desired media type (i.e. format, e.g. HTML, OWL, etc.). This also applies to the URIs of EDAM concepts, which are in this way dereferenceable, concise, and stable. As an alternative to requesting the format in the HTTP header, users can retrieve the desired content from a web browser by appending a ?format=<desiredformat> query to the URL.
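The two retrieval routes described above (an HTTP Accept header, or a ?format= query) might be prepared as in the following minimal sketch. It only builds the URL and the request object and does not contact the server. The helper names are hypothetical, and the format value "owl" and the media type "application/rdf+xml" used in the usage lines are assumptions about what the site accepts, not values confirmed by this page.

```python
from urllib.parse import urlencode
from urllib.request import Request

EDAM_BASE = "http://edamontology.org/"


def concept_url(branch, local_id, fmt=None):
    """Build a concept URI, optionally with a ?format=<desiredformat> query."""
    url = f"{EDAM_BASE}{branch}_{local_id}"
    if fmt is not None:
        url += "?" + urlencode({"format": fmt})
    return url


def negotiated_request(branch, local_id, media_type):
    """Prepare (but do not send) a request asking for a given media type."""
    return Request(concept_url(branch, local_id), headers={"Accept": media_type})


# Usage: "owl" and "application/rdf+xml" are assumed values for illustration.
browser_style = concept_url("data", "0863", fmt="owl")
header_style = negotiated_request("topic", "0182", "application/rdf+xml")
```

Passing `header_style` to `urllib.request.urlopen` would perform the actual retrieval.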


- -

EDAM is being actively developed:

-
    -
  • Future versions should not be a fundamental departure from the current sub-ontologies (top-level concepts), relations and hierarchy (is a relation).
  • -
  • EDAM uses numerical identifiers inside the concepts' URIs to uniquely identify concepts. These identifiers will persist between versions: a given identifier and URI are guaranteed to continue identifying the same concept. This does not imply that names (terms), definitions and other fields will remain constant, but they will remain true to the concept.
  • -
  • Concepts that are deprecated will also persist; they will not be removed and will maintain their identifier and URI.
  • -
-

The development of EDAM can be followed at GitHub. For the ways to contribute, please read further below.

- - - - -


Viewing

- - -

EDAM is available in the following Web-based ontology browsers: -

- - - - - -


Publication

-

EDAM is described in the following article. If you use EDAM or any part of it, please reference:

-

Ison, J., Kalaš, M., Jonassen, I., Bolser, D., Uludag, M., McWilliam, H., Malone, J., Lopez, R., Pettifer, S. and Rice, P. (2013). EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats. Bioinformatics, 29(10): 1325-1332.
doi: 10.1093/bioinformatics/btt113   PMID: 23479348

-

The article is freely available (Open Access).

- - - - -


Licence

- -

- - Creative Commons Licence -
-EDAM (a.k.a. the EDAM ontology) is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0). -

- -

We recommend, however, that while EDAM is being actively maintained by its authors, substantial derived work, major modifications (especially conceptual and semantic), and re-definitions of concepts and other content (e.g. additional constraints on EDAM concepts/owl:Class-es within owl:imports meant with universal validity, that would "close" some desired options of the open-world assumption) be discussed with the EDAM core developers beforehand, at the time of consideration, and that consistent solutions be sought in collaboration.

- - - - - - -


Priorities

-

Our core priority is to be responsive to users of EDAM. Furthermore, we aim to establish a more sustainable footing for essential EDAM maintenance and development, including:

-
    -
  • Content review and refactoring to ensure structural and semantic simplicity ensuring high usability
  • -
  • Community build-up and development including more formal, but agile, governance and maintenance models and mechanisms
  • -
  • Agile and responsive development of content in close collaboration with end-users and serving concrete use-cases
  • -
  • Technical refactoring to minimise the cost of routine housekeeping and content development
  • -
  • Implementation of tooling for routine maintenance to serve the needs of end-users, e.g. harvesting change requests and mappings between concepts
  • -
- - - - -


Governance of EDAM

- -

EDAM follows a 'gatekeeper' model with 4 tiers of governance:

- -

1. EDAM Advisory Board - has the purpose of advising the EDAM core developers on how best to uphold the EDAM principles and achieve its current aims. It will include people with diverse skills, experience and expertise. Advisory Board members have no formal responsibilities, but are expected to advocate EDAM and actively offer frank and constructive advice on scientific, technical and strategic issues. The EDAM Core Developers will respect the advice and give quarterly updates on progress via the edamontology-advisory mailing list. The Core Developers aim to meet with the Advisory Board virtually 2 or 3 times a year, or as circumstances dictate, in meetings with an open agenda that are followed up with actions and notes on key recommendations. The Advisory Board will be reconstituted each year and the core developers reserve the right to drop inactive members. Members of the Advisory Board who are committing resources to EDAM may elect to serve on the EDAM Steering Board, which has 4 primary responsibilities: 1) Help the EDAM Core Developers to make strategic decisions. 2) Verify whether stated aims and actions are coherent and wise. 3) Monitor progress and provide feedback. 4) Help seek funding for EDAM.

- -

2. EDAM Core Developers - have GitHub commit rights. Responsible for agreeing strategy and tactics, setting priorities, overseeing and approving developments and routine maintenance. Quasi-democratic with a 'gatekeeper' (Jon Ison by default) having the final say. The gatekeeper may be temporarily appointed from the core developers as necessary, e.g. during holidays. Core Developers must have the intent and some bandwidth to develop EDAM in the long-term. They have 3 primary responsibilities: 1) Understand and uphold the EDAM principles. 2) Advocate EDAM. 3) Develop EDAM as bandwidth permits.

- -

3. Developers - may have temporary 'core developer' status as convenient, but would not normally have GitHub commit rights long-term. They include anyone who makes significant technical or scientific contributions, by whatever means, but have none of the commitments or responsibilities of the core developers.

- -

4. Other contributors - do not have GitHub commit rights, but can still make comments, contribute suggestions for new terms and other changes.

- - - -


Contact

-

Please direct all enquiries to the mailing lists: EDAM core developers by default, or EDAM discussion in case the issue needs a discussion within the community of EDAM users and contributors.

-

Thanks for valuable discussions and contributions to Peter Rice, Inge Jonassen, Dan Bolser, Rodrigo Lopez, Gert Vriend, Steve Pettifer, Hamish McWilliam, Alan Bleasby, Mahmut Uludag, László Kaján and others.

-

EDAM Core Developers: Jon Ison, Matúš Kalaš, Hervé Ménager.

- - - -


Mailing lists

-

Feel free to subscribe to the mailing lists:

- -

Once subscribed, you can mail the edam list: -

- - -

edam-announce is for announcements (very minimal traffic!), while edam is for discussions around the use of EDAM and its concepts.

- - - - - - - - - -



Concepts

- - - -


Operation

-

"A function or process performed by a tool; what is done, but not (typically) how or in what context."

-

e.g. "Sequence alignment", "Pairwise sequence alignment", "Sequence database search".

- -

"Operation" concepts provide mostly fine-grained concepts for annotation of tool functions.

- -

The top-level concepts are:

-
    -
  • "Alignment"
  • -
  • "Analysis and processing"
  • -
  • "Annotation"
  • -
  • "Classification"
  • -
  • "Comparison"
  • -
  • "Editing"
  • -
  • "Mapping and assembly"
  • -
  • "Modelling and simulation"
  • -
  • "Optimisation and refinement"
  • -
  • "Plotting and rendering"
  • -
  • "Prediction, detection and recognition"
  • -
  • "Search and retrieval"
  • -
  • "Validation and standardisation"
  • -
- -

The top-level operations are necessarily coarse-grained (abstract), providing a navigable top level. They serve as placeholders for other, more specific concepts lower down in the tree.

- - -


Data

- - -

"A type of data in common use in bioinformatics."

-

e.g. "Sequence alignment", "Comparison matrix", "Phylogenetic tree" etc.

- -

Data concepts:

-
    -
  • Provide coarse and fine-grained concepts for annotating types of data
  • -
  • Cover everything from primitive types and simple parameters, through to derived types and complex, bioinformatics datatypes
  • -
  • Reflect but do not describe how the data is specified or represented (syntax)
  • -
  • Can be somewhat (necessarily) overlapping
  • -
- -

The top-level concepts are:

-
    -
  • "Core data"
  • -
  • "Identifier"
  • -
  • "Parameter"
  • -
  • "Report"
  • -
- -

Their meaning is:

-
    -
  • "Core data" - Data that typically are the primary input or output of a tool or which correspond to entries from the primary (e.g. sequence or structural) biological databases.
  • -
  • "Identifier" - A short numerical or textual label that identifies (typically uniquely) something such as data, a resource or a biological entity.
  • -
  • "Parameter" - Typically a simple numerical or string value that controls the operation of a tool.
  • -
  • "Report" - A human-readable collection of information that is distinct from primary (e.g. sequence or structural) biological data, including free text, annotation about biological entities and phenomena, computer-generated reports of analysis of primary data and metadata.
  • -
- -

Concepts within "Core data" are:

-
    -
  • "Alignment"
  • -
  • "Article"
  • -
  • "Biological model"
  • -
  • "Classification"
  • -
  • "Codon usage table"
  • -
  • "Data index"
  • -
  • "Data reference"
  • -
  • "Experimental measurement"
  • -
  • "Gene expression profile"
  • -
  • "Image"
  • -
  • "Map"
  • -
  • "Matrix"
  • -
  • "Microarray data"
  • -
  • "Molecular interaction"
  • -
  • "Molecular property"
  • -
  • "Ontology"
  • -
  • "Ontology concept"
  • -
  • "Pathway or network"
  • -
  • "Phylogenetic raw data"
  • -
  • "Phylogenetic tree"
  • -
  • "Reaction data"
  • -
  • "Schema"
  • -
  • "Secondary structure"
  • -
  • "Sequence"
  • -
  • "Sequence motif"
  • -
  • "Sequence profile"
  • -
  • "Structural (3D) profile"
  • -
  • "Structure"
  • -
  • "Workflow"
  • -
- - - - - -


Topic

- - -

"A general bioinformatics subject or category, such as a field of study, data, processing, analysis or technology."

-

e.g. "Sequence analysis", "Alignment", "Sequencing", "Microarrays".

-

"Topic" concepts provide coarse-grained categories for -annotation of diverse bioinformatics resources. They do not cover -biology or computer science exhaustively.

- -

The top-level concepts are:

-
    -
  • "Biological data resources"
  • -
  • "Nucleic acid analysis"
  • -
  • "Protein analysis"
  • -
  • "Sequence analysis"
  • -
  • "Structure analysis"
  • -
  • "Phylogenetics"
  • -
  • "Proteomics"
  • -
  • "Data handling"
  • -
  • "Chemoinformatics"
  • -
  • "Transcriptomics"
  • -
  • "Literature and reference"
  • -
  • "Ontologies, nomenclature and classification"
  • -
  • "Genetics"
  • -
  • "Systems biology"
  • -
  • "Ecoinformatics"
  • -
  • "Genomics"
  • -
  • "Immunoinformatics"
  • -
- -


Format

-

"A specific layout for encoding a specific type of data in a computer file or memory."

-

e.g. "FASTA format", "PDB format", "mmCIF" etc.

- -

"Format" concepts:

-
    -
  • Provide mostly fine-grained concepts for annotation of data formats / syntaxes.
  • -
  • Formats are generally only listed if they are in common use, for example by public databases or multiple tools.
  • -
  • Concept statements may include a reference (typically a URL) to the format specification proper.
  • -
- -

The top-level concepts are:

-
    -
  • "Binary format"
  • -
  • "Format (typed)"
  • -
  • "HTML"
  • -
  • "RDF"
  • -
  • "Textual format"
  • -
  • "XML"
  • -
- -

All concepts are nested under "Binary format", "Textual format" and "XML", with the exception of pure "HTML" or "RDF" (and "BioPAX"). The "Format (typed)" branch arranges formats by type of data and provides an additional axis over (the same set of) concepts under "Binary format", "Textual format" and "XML".

- - -


Identifier

-

"A label that identifies (typically uniquely) something such as data, a resource or a biological entity."

-

e.g. "UniProt accession", "EC number", "Gene symbol" etc.

- -

"Identifier" concepts:

-
    -
  • Provide mostly fine-grained concepts for annotation of identifiers of data.
  • -
  • Typically correspond to simple strings.
  • -
  • Have concept definitions which may include a regular expression defining valid string values.
  • -
- -

The top-level concepts are:

-
    -
  • "Accession"
  • -
  • "Identifier (hybrid)"
  • -
  • "Identifier (typed)"
  • -
  • "Identifier with metadata"
  • -
  • "Name"
  • -
- -

As for "Format", the "Identifier (typed)" branch provides an additional axis over (the same set of) concepts under "Accession" and "Name".

- - - - - -



Relations

- -


is a

-

Defines a concept as a specialisation of another concept. If A is a B, then A is a specialisation of B, and B is a generalisation of A.

-

The is a relation is transitive: if A is a B and B is a C then A is a C.

-

All relations are transitive over is a: e.g. if A has input B and B is a C then A has input C, and if A is a B and B has input C then A has input C.

- -

e.g. "Pairwise sequence alignment" is a "Sequence alignment"
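The transitivity statements above can be sketched in a few lines. This is an illustration under stated assumptions rather than EDAM tooling: the function names are hypothetical, and the small hierarchy is toy data in which the concept marked "(hypothetical)" is invented for the example.

```python
def ancestors(concept, is_a):
    """Return every generalisation reachable via 'is a' (transitive closure)."""
    seen = set()
    stack = list(is_a.get(concept, ()))
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.add(parent)
            stack.extend(is_a.get(parent, ()))
    return seen


def inherited_targets(concept, relation, is_a):
    """Targets of `relation` for `concept`, propagated over 'is a' on both sides."""
    # If A is a B and B has input C, then A has input C.
    sources = {concept} | ancestors(concept, is_a)
    direct = set()
    for source in sources:
        direct |= set(relation.get(source, ()))
    # If A has input B and B is a C, then A has input C.
    result = set(direct)
    for target in direct:
        result |= ancestors(target, is_a)
    return result


# Toy data: the first concept name is made up for this sketch.
IS_A = {
    "Pairwise alignment construction (hypothetical)": ["Sequence alignment construction"],
    "Sequence": ["Core data"],
}
HAS_INPUT = {"Sequence alignment construction": ["Sequence"]}

inputs = inherited_targets(
    "Pairwise alignment construction (hypothetical)", HAS_INPUT, IS_A
)
```

Here `inputs` contains both "Sequence" (inherited from the parent operation) and "Core data" (the generalisation of "Sequence").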

- - - -


has input

-

Defines an "Operation" concept as reading (inputting) a "Data" concept.

-

e.g. "Sequence alignment construction" has input "Sequence"

- - -


has output

-

Defines an "Operation" concept as writing (outputting) a "Data" concept.

-

e.g. "Sequence alignment construction" has output "Sequence alignment"

- - -


has topic

-

Defines a "Data" or "Operation" concept as being within the scope of a "Topic" concept.

-

e.g. "PolyA signal identification" has topic "Nucleic acid sequence analysis"

- - -


is identifier of

-

Defines that an "Identifier" concept identifies a "Data" concept.

-

e.g. "Sequence accession number" is identifier of "Sequence"

- - - -


is format of

-

Defines that a "Format" concept is the format of a "Data" concept.

-

e.g. "Sequence format" is format of "Sequence record"

- - - -



Rules

-

Rules define how concepts are related.

- -


Rules by concept type

-

"Topic"

-
    -
  • "Topic" is a "Topic" -

    ... a specialisation of a topic.

  • -
- - - -

"Operation"

-
    -
  • "Operation" is a "Operation" -

    ... a specialisation of an operation.

  • -
  • "Operation" has input "Data" -

    ... inputs a type of data.

  • -
  • "Operation" has output "Data" -

    ... outputs a type of data.

  • -
  • "Operation" has topic "Topic" -

    ... within a topic.

  • -
- - - - -

"Data"

-
    -
  • "Data" is a "Data" -

    ... a specialisation of a type of data.

  • -
  • "Data" has topic "Topic" -

    ... within a topic.

  • -
- - - - -

"Format"

-
    -
  • "Format" is a "Format" -

    ... a specialisation of a data format.

  • -
  • "Format" is format of "Data" -

    ... a format specification of a datatype.

  • -
- - - - - -

"Identifier"

-
    -
  • "Identifier" is identifier of "Data" -

    ... identifier of a datatype.

  • -
- - - - -


Rules by relation type

- -

is a

-
    -
  • "Topic" is a "Topic"
  • -
  • "Operation" is a "Operation"
  • -
  • "Data" is a "Data" -
  • "Format" is a "Format" -
- - - - - -

has input

-
    -
  • "Operation" has input "Data" -
- - -

has output

-
    -
  • "Operation" has output "Data" -
- -

has topic

-
    -
  • "Operation" has topic "Topic"
  • -
  • "Data" has topic "Topic" -
- - - -

is identifier of

-
    -
  • "Identifier" is identifier of "Data"
  • -
- -

is format of

-
    -
  • "Format" is format of "Data" -
- - - - - - - -
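The rules listed above amount to a table of permitted (subject, object) branch pairs per relation. A minimal sketch of such a check follows; the `RULES` table transcribes the lists above, while the function name is hypothetical.

```python
# Permitted (subject branch, object branch) pairs, transcribed from the rules above.
RULES = {
    "is a": {
        ("Topic", "Topic"),
        ("Operation", "Operation"),
        ("Data", "Data"),
        ("Format", "Format"),
    },
    "has input": {("Operation", "Data")},
    "has output": {("Operation", "Data")},
    "has topic": {("Operation", "Topic"), ("Data", "Topic")},
    "is identifier of": {("Identifier", "Data")},
    "is format of": {("Format", "Data")},
}


def is_allowed(subject_branch, relation, object_branch):
    """Return True if the rules permit `relation` between the two branches."""
    return (subject_branch, object_branch) in RULES.get(relation, set())
```

For example, `is_allowed("Operation", "has input", "Data")` holds, whereas the reversed pair does not.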



Sources

- - -

Various resources were analysed while constructing EDAM and were used as sources listing common bioinformatics concepts in scope.

- -

Web services and applications

- - - -

Domain ontologies, taxonomies, data models

- - -

For database-related concepts

-
    -
  1. dbxref.txt (databases cross-referenced in UniProtKB/Swiss-Prot)
  2. -
  3. List of databases collated by the ELIXIR project
  4. -
  5. Lists of databases from the Web -
  6. -
- - -

Other resources

- - - - - - - - - - - - - - - - - - - - - - -



Guidelines for annotators

-

Annotators may email the EDAM mailing list or the EDAM core developers for help.

- -


General guidelines

- -

Which EDAM sub-ontology to use?

-
    -
  1. "Topic" for coarse-grained annotation of diverse entities
  2. -
  3. "Operation" for fine-grained annotation of tool functions
  4. -
  5. "Data" for annotation of data in semantic terms
  6. -
  7. "Format" for annotation of the syntax or format of data
  8. -
  9. "Identifier" (as a special type of "Data") for annotation of identifiers (names and accessions) of data or other entities
  10. -
- - - - -

Use of other ontologies

-

The expectation is for EDAM to be used alongside other ontologies for annotation where possible and desirable. For example, an operation that predicts specific features of a molecular sequence could be annotated with concepts from SO (Sequence Ontology) for the features.

- - - - - - -

Picking concepts

-

If you have many annotations to do, it will help to familiarise yourself with EDAM first using a browser (see Viewing).

- -
    -
  1. Identify the correct sub-ontology ("Operation", "Data" etc.) of concepts considering what is being annotated (see above)
  2. -
  3. Search EDAM using keywords to find candidate concepts. Multiple searches using synonyms, alternative spellings and so on are preferable.
  4. -
  5. Pick the most specific concept(s) available, bearing in mind some concepts are necessarily overlapping or general.
  6. -
  7. Only pick a correct concept. If it doesn't exist, request that it be added to EDAM.
  8. -
- - - - - - - - - - - - - - - - -
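The keyword search step above could look roughly like the following sketch, which matches keywords against labels and synonyms. The function name is hypothetical and the concept table is toy data: the labels appear elsewhere in this document, but the synonyms are invented for illustration.

```python
def find_candidates(keywords, concepts):
    """Return labels whose label or synonyms contain any keyword (case-insensitive)."""
    wanted = [keyword.lower() for keyword in keywords]
    hits = []
    for label, synonyms in concepts.items():
        names = [label.lower()] + [synonym.lower() for synonym in synonyms]
        if any(keyword in name for keyword in wanted for name in names):
            hits.append(label)
    return sorted(hits)


# Toy table of label -> synonyms; the synonyms are made up for this sketch.
TOY_CONCEPTS = {
    "Sequence alignment": ["Multiple alignment (hypothetical synonym)"],
    "Pairwise sequence alignment": [],
    "Phylogenetic tree": ["Phylogeny (hypothetical synonym)"],
}

alignment_hits = find_candidates(["alignment"], TOY_CONCEPTS)
phylogeny_hits = find_candidates(["phylogeny"], TOY_CONCEPTS)
```

Searching with several keywords at once, as recommended above, simply means passing more than one entry in `keywords`.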


Annotation of Web services

- -

Model of a Web service

-

A Web service is considered as an arbitrary (but usually related) set of one or more operations, reducing the problem of Web service interoperation to one of compatibility between operations.

- -

Operation

-
    -
  • Discrete unit of functionality performing (typically) one or more definite functions
  • -
  • Reads an input
  • -
  • Writes an output
  • -
  • Uses zero or more data resources
  • -
- -

Input

-
    -
  • Payload (e.g. of HTTP or SOAP message) passed in operation call
  • -
  • Name and (ideally) description is given (e.g. in WSDL file)
  • -
  • Input has one or more XML elements which must be set (input values)
  • -
- -

Output

-
    -
  • Payload (e.g. of HTTP or SOAP message) returned from operation call
  • -
  • Name and (ideally) description is given (e.g. in WSDL file)
  • -
  • Output has one or more XML elements which are written (output values)
  • -
- -

XML elements

-
    -
  • Correspond to the inputs (parameters) and outputs of a service
  • -
  • Are simple or complex XSD types given in an XML Schema (within or referenced from a WSDL file)
  • -
  • Have values that are instances of a specific semantic type.
  • -
  • Have values in a specific syntax, either fully specified by the schema, or (occasionally) text in a specific file format which is not specified by the schema.
  • -
- - -

Levels of annotation

- -

Annotation of a WSDL file or associated XSD schema is possible at several levels. Assuming SAWSDL annotation (http://www.w3.org/TR/sawsdl/), the XML elements that may be annotated by EDAM concepts are: -

    - -
  1. Web service (as a whole) (<wsdl:portType>) -
      -
    • One (or more) "Topic" concepts to describe the general area(s) the service concerns
    • -
    • If applicable, one (or more) "Operation" concepts to describe the functions of the service (if all operations perform essentially the same function)
    • -
  2. - -
  3. Operation (<wsdl:operation> inside <wsdl:portType>) -
      -
    • One (or more) "Operation" concepts for each WSDL operation (more than one in exceptional circumstances)
    • -
    -
  4. -
  5. Input parameters and their sub-parts (<xs:element>, <xs:complexType>, <xs:simpleType>, <xs:attribute>) -
      -
    • One (or more) "Data" concepts
    • -
    • One (or more) "Format" concepts
    • - -
    -
  6. -
  7. Output parameters and their sub-parts (<xs:element>, <xs:complexType>, <xs:simpleType>, <xs:attribute>) -
      -
    • One (or more) "Data" concepts
    • -
    • One (or more) "Format" concepts
    • - -
    -
- -

NB. The input and output parameters should be annotated inside the XML Schema that defines them. In case of services that are not following the highly recommended document/literal wrapped SOAP-binding style, the <wsdl:part> inside <wsdl:message> can be annotated (the same applies to faults, but meanings of faults are not modelled by EDAM)

- -

The following annotations might be useful but are not directly recommended by SAWSDL:

- -
    -
  1. Enumerated values of input/output parameters (<xs:enumeration>) -
      -
    • One (or more) "Format" or "Data" concepts defining the particular enumerated value
    • -
    -
- -

For details of incorporating the SAWSDL annotations into WSDLs and XSDs, see EDAM URIs and SAWSDL annotation.

- - - - - - - - - - -


EDAM URIs and SAWSDL annotation

-

SAWSDL mandates the use of sawsdl:modelReference attributes for annotation. The format of EDAM URIs used inside this attribute includes the ontology name (http://edamontology.org), main sub-ontology, and the unique identifier (ID) of the particular concept:

-
 
-<xs:element name="elementName" sawsdl:modelReference="http://edamontology.org/subontology_id">
-
- -

Where ...

-
    -
  • xs:element is the XML element being annotated (can also be xs:attribute, xs:complexType, xs:simpleType, sawsdl:attrExtensions, wsdl:portType, in special cases wsdl:part, or possibly xs:enumeration)
  • -
  • elementName is the name of the XML element
  • -
- -

The value of the sawsdl:modelReference attribute is a URI pointing to the concept definition. In the case of EDAM, the URI includes the concept's sub-ontology:

-
    -
  • sub-ontology is the top-level sub-ontology of the EDAM concept; one of topic, data, format, or operation
  • -
  • id is the unique local identifier of the concept, e.g. "0295"
  • -
- - -

So for these 3 concepts:

-

-EDAM_topic:0182
-
-EDAM_operation:0292
-
-EDAM_data:0863
-
-
- -

We'd have - -

http://edamontology.org/topic_0182
-http://edamontology.org/operation_0292
-http://edamontology.org/data_0863
-
- -

Which can be used in SAWSDL annotation, e.g. -

<wsdl:portType name="myService" sawsdl:modelReference="http://edamontology.org/topic_0182">
-<sawsdl:attrExtensions sawsdl:modelReference="http://edamontology.org/operation_0292">
-<xs:element name="outfile" sawsdl:modelReference="http://edamontology.org/data_0863">
-
- - - -
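The mapping from prefixed identifiers such as EDAM_topic:0182 to concept URIs shown above is mechanical, and can be sketched as follows. The function name and the regular expression are assumptions made for this illustration; only the identifier and URI shapes come from the examples above.

```python
import re

EDAM_BASE = "http://edamontology.org/"
BRANCHES = {"topic", "data", "format", "operation"}

# Shape of identifiers like "EDAM_topic:0182" (assumed from the examples above).
_PREFIXED = re.compile(r"^EDAM_([a-z]+):(\d+)$")


def prefixed_to_uri(prefixed_id):
    """Convert e.g. 'EDAM_topic:0182' to 'http://edamontology.org/topic_0182'."""
    match = _PREFIXED.match(prefixed_id)
    if match is None or match.group(1) not in BRANCHES:
        raise ValueError(f"not a recognised EDAM identifier: {prefixed_id!r}")
    branch, local_id = match.groups()
    # The local identifier is kept as a string so leading zeros survive.
    return f"{EDAM_BASE}{branch}_{local_id}"


uris = [
    prefixed_to_uri(pid)
    for pid in ("EDAM_topic:0182", "EDAM_operation:0292", "EDAM_data:0863")
]
```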

If more than one annotation of an element is required, these can be given in the sawsdl:modelReference attribute delimited by space characters: -

<wsdl:portType name="myService" sawsdl:modelReference="http://edamontology.org/topic_0182 http://edamontology.org/operation_0292">
-
- -

NB. Such multiple annotations need not be in the same namespace, and need not refer to the same ontology at all.
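A consumer of such annotations has to split the space-delimited attribute value back into individual URIs. The sketch below does this with Python's standard xml.etree.ElementTree; the function name and the wrapping wsdl:definitions element in the sample are assumptions added so that the fragment parses on its own.

```python
import xml.etree.ElementTree as ET

SAWSDL_NS = "http://www.w3.org/ns/sawsdl"
MODEL_REF = f"{{{SAWSDL_NS}}}modelReference"

# The portType line is the example above; the wrapper element is added for parsing.
SAMPLE = """
<wsdl:definitions xmlns:wsdl="http://schemas.xmlsoap.org/wsdl/"
                  xmlns:sawsdl="http://www.w3.org/ns/sawsdl">
  <wsdl:portType name="myService"
      sawsdl:modelReference="http://edamontology.org/topic_0182 http://edamontology.org/operation_0292"/>
</wsdl:definitions>
"""


def model_references(xml_text):
    """Map each annotated element's name to its list of modelReference URIs."""
    root = ET.fromstring(xml_text.strip())
    found = {}
    for element in root.iter():
        value = element.get(MODEL_REF)
        if value:
            # Fall back to the tag when the element has no name attribute.
            found[element.get("name", element.tag)] = value.split()
    return found


references = model_references(SAMPLE)
```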

- - -

SAWSDL guidelines for annotating operations

-

One peculiarity of the SAWSDL specification is that annotations on a <wsdl:operation> element inside <wsdl:portType> should be handled using a <sawsdl:attrExtensions> element. This is not a requirement for other elements.

- -

Importantly, the <sawsdl:attrExtensions> element inside the <wsdl:operation> must come before the <wsdl:input>, <wsdl:output> and <wsdl:fault> elements (so typically after the <wsdl:documentation> element).

- -

For example:

-
 <wsdl:portType name="Clustalw2PortType" sawsdl:modelReference="http://edamontology.org/topic_0186 http://edamontology.org/operation_0496">
- <wsdl:operation name="submitClustalw2">
- <wsdl:documentation>Submit a sequence and get a jobID</wsdl:documentation>
- <sawsdl:attrExtensions sawsdl:modelReference="http://edamontology.org/operation_0496"/>
- <wsdl:input message="submitClustalw2Msg"/>
- <wsdl:output message="submitClustalw2ResponseMsg"/>
- </wsdl:operation>
- </wsdl:portType>
-
- -

Some WSDL/XSD validators or SOAP libraries do not check for it, but some do require the strict order of these elements.
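Because not every validator enforces this order, a quick check of one's own WSDL can be useful. The following is a minimal sketch using the standard xml.etree.ElementTree module; the function name is hypothetical, and the sample wraps the example above in a wsdl:definitions element (an addition for this sketch) so that it parses on its own.

```python
import xml.etree.ElementTree as ET

WSDL_NS = "http://schemas.xmlsoap.org/wsdl/"
SAWSDL_NS = "http://www.w3.org/ns/sawsdl"
EXT = f"{{{SAWSDL_NS}}}attrExtensions"
LATER = {f"{{{WSDL_NS}}}{name}" for name in ("input", "output", "fault")}

SAMPLE = """
<wsdl:definitions xmlns:wsdl="http://schemas.xmlsoap.org/wsdl/"
                  xmlns:sawsdl="http://www.w3.org/ns/sawsdl">
  <wsdl:portType name="Clustalw2PortType">
    <wsdl:operation name="submitClustalw2">
      <wsdl:documentation>Submit a sequence and get a jobID</wsdl:documentation>
      <sawsdl:attrExtensions sawsdl:modelReference="http://edamontology.org/operation_0496"/>
      <wsdl:input message="submitClustalw2Msg"/>
      <wsdl:output message="submitClustalw2ResponseMsg"/>
    </wsdl:operation>
  </wsdl:portType>
</wsdl:definitions>
"""


def misordered_operations(wsdl_text):
    """Names of operations whose attrExtensions follows input, output or fault."""
    root = ET.fromstring(wsdl_text.strip())
    bad = []
    for operation in root.iter(f"{{{WSDL_NS}}}operation"):
        tags = [child.tag for child in operation]
        if EXT not in tags:
            continue
        position = tags.index(EXT)
        # Any input/output/fault before attrExtensions violates the order.
        if any(tag in LATER for tag in tags[:position]):
            bad.append(operation.get("name"))
    return bad


problems = misordered_operations(SAMPLE)
```

An empty `problems` list means every annotated operation places the extension element first.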

- - - - - - - - - - - - -



Guidelines for contributors

-

EDAM is a community project, and suggestions for additions, corrections, and other improvements are always welcome. There are 3 ways to contribute suggestions:

- - -

Web form

-

Straightforward requests for one or a few changes can be made straight away on the EDAM Change Request form:

-

http://tinyurl.com/EDAMChangeRequest

-
- -

Requests for many new concepts should, however, first be discussed with the EDAM core developers:

-

edam-core@elixir-dk.org

-
- -

If you agree with the EDAM core developers on substantial additions or other changes and are funded for such developments, we are open to you becoming a temporary EDAM core developer. In any case, we will work with you to find the most efficient way to proceed, depending on your requirements, expertise and bandwidth.

- -

We will make every effort to be responsive to your requests, given our limited resources.

- -

When requesting a new concept:

-
    -
  • The preferred label should be a short name or phrase in common use.
  • -
  • Please consider providing common synonyms of the preferred term.
  • -
  • The definition should be a concise and lucid description of the concept, without acronyms, and avoiding jargon.
  • -
  • Peripheral but important information can go in the comment.
  • -
- - -

Mailing lists

- -

For a low-traffic mailing list for announcements about the EDAM ontology, subscribe to edam-announce:

-

http://elixirmail.cbs.dtu.dk/mailman/listinfo/edam-announce

-
- -

The preferred way to make suggestions that require some discussion is via the edam mailing list. Please subscribe:

-

http://elixirmail.cbs.dtu.dk/mailman/listinfo/edam

-
- -

To post to the list, mail:

-

edam@elixir-dk.org

-
- -

You can use the same list for discussions around the use of EDAM concepts, i.e. for purposes of resource annotation and in software implementations.

-
- - -

GitHub issue tracker

-

The GitHub issue tracker can be used to submit issues for which, for whatever reason, you want a public record: we prefer it is not used for any other purpose.

- -

To open a GitHub issue you must have a GitHub account, and follow these simple steps:

-
    -
  • Go to edamontology issues and click on “New issue”.
  • -
  • If you are not logged in, you will be asked first to log in or create an account.
  • -
  • Provide a title, and a report that is concise but sufficiently detailed to be actionable.
  • -
- - - -



-

NB! The workflow for the core and appointed developers of EDAM is documented in HOW_TO_EDIT.md inside the EDAM GitHub project.

- - - - - - - - - - - - -



Existing implementations and annotations with EDAM

- - - -

EMBOSS

-

EMBOSS applications have been annotated using EDAM and these annotations appear in corresponding Web services. -

-

Annotated WSDL files (and associated XSD data schema) are available from:

- - -

You will see a list of service end-points with WSDL URLs. For example: -

- -

To see the data schema associated with a WSDL, you must replace "?wsdl" with "?xsd=1", "?xsd=2" or "?xsd=3". For example: -

- - - - - -
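The substitution described above can be sketched in a few lines of Python. This is only an illustration: the function name schema_urls and the end-point URL are hypothetical, and the count of three schema documents is taken from the "?xsd=1" to "?xsd=3" suffixes mentioned above.

```python
from urllib.parse import urlsplit, urlunsplit


def schema_urls(wsdl_url, count=3):
    """Return the ?xsd=N URLs for a service whose WSDL URL ends in ?wsdl."""
    parts = urlsplit(wsdl_url)
    if parts.query.lower() != "wsdl":
        raise ValueError("expected a URL ending in ?wsdl")
    # Swap the query string only; scheme, host and path stay untouched.
    return [
        urlunsplit(parts._replace(query=f"xsd={n}"))
        for n in range(1, count + 1)
    ]


# Hypothetical end-point, used for illustration only.
for url in schema_urls("http://example.org/soap/seqret?wsdl"):
    print(url)
```

Not every service necessarily exposes all three schema documents, so a client should be prepared for some of these URLs to return an error.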

BioXSD

-

The BioXSD XML schema (XSD) defines exchange formats of everyday bioinformatics data types. BioXSD aims to serve as the common, canonical data model for bioinformatics Web services. It covers commonly used types, including sequences, sequence annotations, alignments and references to resources:

-

BioXSD has been annotated with EDAM concepts.

- - -

DRCAT biological resource catalogue

-

A catalogue of data resources (DRCAT) is being compiled as part of the EMBOSS project. Each entry in DRCAT gives metadata on a data resource available on the Web. The metadata includes "Query" lines that describe the type(s) of data available, the data format, the data identifier (used to query) and a URL from which data can be retrieved. The "Query" lines and the resources themselves are annotated with EDAM concepts.

- -

A typical entry is shown below:

-

(NB: The format of EDAM IDs in this entry has not been upgraded yet. This will be done as soon as possible.)

-
ID PDB
-Acc DB-0070
-Name The RCSB Protein Data Bank
-Desc A repository for 3D biological macromolecular structure data.
-URL http://www.rcsb.org/pdb/
-Cat 3D structure databases
-EDAMres EDAM:0000693 | Tertiary structure
-EDAMdat EDAM:0000883 | Tertiary structure
-EDAMdat EDAM:0002085 | Structure annotation
-EDAMfmt EDAM:0001476 | pdb
-EDAMfmt EDAM:0001478 | pdbml
-EDAMfmt EDAM:0001477 | mmCIF
-EDAMfmt EDAM:0002331 | HTML 
-EDAMid EDAM:0001127 | PDB ID
-Xref SP_explicit | None
-Xref SP_FT | None
-Xref EMBL_explicit | None
-Query EDAM:0002085 | EDAM:0002331 | EDAM:0001127 | http://www.pdb.org/pdb/explore/explore.do?structureId=%s
-Query EDAM:0000693 | EDAM:0001476 | EDAM:0001127 | http://www.pdb.org/pdb/files/%s.pdb
-Query EDAM:0000693 | EDAM:0001477 | EDAM:0001127 | http://www.pdb.org/pdb/files/%s.cif
-Query EDAM:0000693 | EDAM:0001478 | EDAM:0001127 | http://www.pdb.org/pdb/files/%s.xml
-Example EDAM:0001127 | 1rbp
-Email deposit@deposit.rcsb.org
-CCmisc EMBL DR line example "1OSN", /dbxref="PDB:12GS"
-Status Referenced
-
- -
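As a rough sketch of how such an entry could be consumed, the Python below reads a shortened copy of the entry above and fills the "%s" placeholder of each "Query" URL with an identifier. It assumes that every line is a tag followed by a space and a value, and that fields within a value are separated by a bar ('|'); the function names parse_entry and query_urls are hypothetical and not part of DRCAT or EMBOSS.

```python
from collections import defaultdict

# Shortened copy of the entry shown above.
ENTRY = """\
ID PDB
Name The RCSB Protein Data Bank
EDAMfmt EDAM:0001476 | pdb
EDAMid EDAM:0001127 | PDB ID
Query EDAM:0000693 | EDAM:0001476 | EDAM:0001127 | http://www.pdb.org/pdb/files/%s.pdb
Example EDAM:0001127 | 1rbp
"""


def parse_entry(text):
    """Map each tag to a list of values; each value is a list of bar-separated fields."""
    record = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        tag, _, value = line.partition(" ")
        record[tag].append([field.strip() for field in value.split("|")])
    return dict(record)


def query_urls(record, identifier):
    """Fill the %s placeholder of every Query URL template with an identifier."""
    # Assumed field order: data type, format, identifier type, URL template.
    return [fields[3].replace("%s", identifier) for fields in record.get("Query", [])]


record = parse_entry(ENTRY)
example_id = record["Example"][0][1]  # second field of the Example line
print(query_urls(record, example_id))
```

Keeping repeated tags such as "Query" and "EDAMfmt" as lists preserves every line of the entry, at the cost of a slightly more verbose lookup for single-valued tags such as "ID".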

DRCAT development will proceed in harmony with bioDBCore, which proposes a community-defined, uniform, generic description of the core attributes of biological databases:

- - - -

bioDBCore is under the auspices of the International Society for Biocuration:

- - - -

All enquiries to EDAM developers.

- - - - - - -

Bio-jETI

-

Bio-jETI allows automatic composition of functional units into software systems according to higher-level specifications using EDAM:

- - - - -

iHOP Web service

-

The iHOP Web service is annotated with EDAM concepts, either directly or via its use of BioXSD:

- - - - - - -

CBU Web services

-

The Web services provided by the Computational Biology Unit (CBU) of the University of Bergen and its affiliated Uni Computing are annotated with EDAM concepts:

- - - - - - -

eSysbio

-

eSysbio was a proof-of-concept prototype workbench for sharing and analysing bioinformatics data using public or private Web services and R scripts. eSysbio used EDAM to annotate and denote the type and format of data items submitted to the system.

- - - -

SEQwiki

-

The SEQanswers wiki is an open catalogue of bioinformatics software tools, non-exclusively focussed on sequencing data analysis. The SEQanswers tool wiki uses EDAM to annotate the listed tools where applicable.

- - - - - - -


-

Last update: 2016-Feb-24

- - diff --git a/web/relations-and-properties.uris b/web/relations-and-properties.uris deleted file mode 100644 index af136cd..0000000 --- a/web/relations-and-properties.uris +++ /dev/null @@ -1,61 +0,0 @@ -# =============================================================================================================== -# -# -# This is the information about the URI of an EDAM relation or concept property -# -# -# -# The EDAM relation or property URI has the form http://edamontology.org/ -# -# As a regular expression, it is http://edamontology\.org/(created_in|obsolete_since|regex|example|documentation|has_format|has_function|has_identifier|has_input|has_output|has_topic|is_format_of|is_function_of|is_identifier_of|is_input_of|is_output_of|is_topic_of) -# -# -# -# NB! This is the only form of URI that is supposed to be used when referring to an EDAM relation or concept property. -# (for example within RDF, or when insisting on using triples inside SAWSDL annotation) -# -# -# =============================================================================================================== -# -# -# For a human-readable Web page about the EDAM relations and concept properties, please refer to -# -http://edamontology.org/relations-and-properties.html -# -# [type: text/html|application/xhtml+xml; ?format=html|htm|xhtml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version of the EDAM relations and properties in RDF/XML (OWL), please refer to the EDAM OWL file -# -http://edamontology.org/EDAM.owl -# -# [type: application/rdf+xml|application/xml|text/xml; ?format=owl|rdf|xml; charset=utf-8; language: en] -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# For a machine-understandable representation of the last stable version 
of the EDAM relations and properties in OBO format, please refer to -# -http://edamontology.org/relations-and-properties.obo -# -# [type: text/plain; ?format=obo|text|txt; charset=us-ascii; language: en] -# -# Note that the OBO-format representation lacks certain details present only in the OWL version -# -# -# --------------------------------------------------------------------------------------------------------------- -# -# -# Information about the URIs of EDAM relations and properties (http://edamontology.org/_) is in this file right here -# -http://edamontology.org/relations-and-properties.uris -# -# [type: text/uri-list; ?format=uri|url|about; charset=us-ascii; language: en] -# -# -# =============================================================================================================== -# Note that the URI of this file is http://edamontology.org/relations-and-properties.uris \ No newline at end of file diff --git a/web/relations-and-properties_1.14.html b/web/relations-and-properties_1.14.html deleted file mode 100644 index dbc2acf..0000000 --- a/web/relations-and-properties_1.14.html +++ /dev/null @@ -1,136 +0,0 @@ - - - - - - - - Relations and concept properties defined in EDAM - - - - - - -
-

Relations and concept properties defined in EDAM

-
-
- -

- Note: URIs, definitions, domains and ranges are present in the EDAM.owl file. EDAM relations apply between concepts and/or annotated entities. -

- -
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Relation | Inverse | Maintained in EDAM | Example
has input | is input of | Operation has input Data | Sequence annotation has input Sequence record
has output | is output of | Operation has output Data | RNA structure prediction has output RNA structure record
has topic | is topic of | Operation or Data has topic Topic | Phylogenetic tree has topic Phylogenetics
has format | is format of | Format is format of Data | CHP is format of Processed microarray data
has identifier | is identifier of | Identifier is identifier of Data | InterPro accession is identifier of Protein signature
has function | is function of | not between EDAM concepts | a tool has function Sequence assembly
-
-
-
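Because every relation listed above has an inverse, a statement made in one direction implies the corresponding statement in the other. The Python sketch below illustrates this with plain tuples; the names INVERSE and invert are hypothetical, and the triple layout (subject, relation, object) is an assumption made for the example rather than a format defined by EDAM.

```python
# Relation/inverse pairs as listed above.
INVERSE = {
    "has input": "is input of",
    "has output": "is output of",
    "has topic": "is topic of",
    "has format": "is format of",
    "has identifier": "is identifier of",
    "has function": "is function of",
}
# Make the lookup work in both directions.
INVERSE.update({inverse: relation for relation, inverse in list(INVERSE.items())})


def invert(statement):
    """Return the statement implied by reading a (subject, relation, object) triple backwards."""
    subject, relation, obj = statement
    return (obj, INVERSE[relation], subject)


print(invert(("Sequence annotation", "has input", "Sequence record")))
print(invert(("CHP", "is format of", "Processed microarray data")))
```

Storing only one direction and deriving the other on demand avoids keeping two statements in step by hand.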

Some concepts have additional properties declared in EDAM:

-
-

Citation contains a dereferenceable URI, preferably including a DOI, pointing to a citable publication of the given data format.

-

Created in states which version of EDAM a concept was added in.

-

Documentation includes a URL within a Format concept pointing to its documentation.

-

Example lists one or more examples of valid values, mainly for types of identifiers.

-

File extension lists examples of usual file extensions of a format.

-

Media type includes a link pointing to a page specifying a media type of the given data format.

-

Obsolete since states the version since which an obsolete concept has been deprecated.

-

Regular expression constrains allowed values of types of identifiers (mostly accessions) and is useful for validation of inputs to tools.
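A minimal sketch of such input validation in Python follows. The pattern shown for a PDB ID is an assumption made for illustration and should be replaced by the value of the Regular expression property of the relevant concept in the EDAM OWL file; the function name is_valid is likewise hypothetical.

```python
import re

# Hypothetical pattern for a PDB ID (four word characters), for illustration only.
PDB_ID_PATTERN = r"[a-zA-Z_0-9]{4}"


def is_valid(value, pattern):
    """Return True when the whole value matches the pattern."""
    # fullmatch rejects values that merely contain or start with a match.
    return re.fullmatch(pattern, value) is not None


for candidate in ("1rbp", "1rbpx", ""):
    status = "ok" if is_valid(candidate, PDB_ID_PATTERN) else "rejected"
    print(repr(candidate), status)
```

Matching the whole value, rather than searching within it, matters here: a tool would otherwise accept an input that only contains a valid identifier somewhere inside a longer string.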

- - - -