diff --git a/EDAM_dev.owl b/EDAM_dev.owl index cd9497c..7e0345b 100644 --- a/EDAM_dev.owl +++ b/EDAM_dev.owl @@ -41,7 +41,7 @@ operations "EDAM operations" Bioinformatics operations, data types, formats, identifiers and topics EDAM http://edamontology.org/ "EDAM relations and concept properties" - 3770 + 3777 application/rdf+xml 12.05.2016 18:23 GMT EDAM_data http://edamontology.org/data_ "EDAM types of data" @@ -113,7 +113,7 @@ Example 'Example' concept property ('example' metadata tag) lists examples of valid values of types of identifiers (accessions). Applicable to some other types of data, too. true - Separated by bar ('|'). + Separated by bar ('|'). For more complex data and data formats, it can be a link to a website with examples, instead. concept_properties @@ -125,25 +125,40 @@ File extension 'File extension' concept property ('file_extension' metadata tag) lists examples of usual file extensions of formats. Separated by bar ('|'), without a dot ('.') prefix, preferrably not all capital characters. + N.B.: File extensions that are not correspondigly defined at http://filext.com are recorded in EDAM only if not in conflict with http://filext.com, and/or unique and usual within life-science computing. concept_properties true + + + + + + Information standard + Minimum information standard + Minimum information checklist + 'Information standard' trailing modifier (qualifier, 'information_standard') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to an information standard supported by the given data format. + "Supported by the given data format" here means, that the given format enables representation of data that satisfies the information standard. + true + concept_properties + + isdebtag - When 'true', the term has been proposed or is supported within Debian Med as a tag. + When 'true', the concept has been proposed or is supported within Debian as a tag. concept_properties true - - + + - + Media type MIME type @@ -152,11 +167,23 @@ concept_properties + + + + + + Organisation + Organization + 'Organisation' trailing modifier (qualifier, 'organisation') of 'xref' links of 'Format' concepts. When 'true', the link is pointing to an organisation that developed, standardised, and maintains the given data format. + true + concept_properties + + - + - + @@ -180,6 +207,17 @@ true + + + + + + Ontology used + 'Ontology used' concept property ('ontology_used' metadata tag) of format concepts links to a domain ontology that is used inside the given data format, or contains a note about ontology use within the format. + concept_properties + true + + @@ -27531,8 +27569,9 @@ Generic Feature Format version 3 (GFF3) of sequence features. - - + + + @@ -29506,11 +29545,12 @@ - Format (typed) + Format (by type of data) + Format (typed) This concept exists only to assist EDAM maintenance and navigation in graphical browsers. It does not add semantic information. The concept branch under 'Format (typed)' provides an alternative organisation of the concepts nested under the other top-level branches ('Binary', 'HTML', 'RDF', 'Text' and 'XML'. All concepts under here are already included under those branches. beta12orEarlier - A broad class of format distinguished by the scientific nature of the data that is identified. + A placeholder concept for visual navigation by dividing data formats by the content of the data that is represented. @@ -29520,7 +29560,21 @@ - BioXSD + BioXSD (XML) + BioXSD + BioXSD format + BioXSD data model + BioXSD/GTrack + BioXSD|GTrack + BioXSD|BioJSON|BioYAML + BioXSD in XML + BioXSD XML + BioXSD+XML + BioXSD XML format + BioXSD in XML format + beta12orEarlier + BioXSD schema-based XML format of sequence-based data and some other common data - sequence records, alignments, feature records, references to resources, and more - optimised for integrative bioinformatics, Web services, and object-oriented programming. + 'BioXSD' belongs to the 'BioXSD|GTrack' ecosystem of generic formats. 'BioXSD in XML' is the XML format based on the common, unified 'BioXSD data model', a.k.a. 'BioXSD|BioJSON|BioYAML'. @@ -29544,13 +29598,30 @@ - BioXSD XML format - beta12orEarlier - BioXSD XML format of basic bioinformatics types of data (sequence records, alignments, feature records, references to resources, and more). + + + + + + + + + + + + + + + + + + + Any ontology allowed, none mandatory. Preferrably with URIs but URIs are not mandatory. Non-ontology terms are also allowed as the last resort in case of a lack of suitable ontology. + @@ -30717,12 +30788,25 @@ GTrack + BioXSD/GTrack GTrack + BioXSD|GTrack GTrack + GTrack ecosystem of formats + GTrack|GSuite|BTrack GTrack + GTrack|BTrack|GSuite GTrack + GTrack format + 1.0 - GTrack is an optimised tabular format for genome/sequence feature tracks unifying the power of other tabular formats (e.g. GFF3, BED, WIG). + GTrack is a generic and optimised tabular format for genome or sequence feature tracks. GTrack unifies the power of other track formats (e.g. GFF3, BED, WIG), and while optimised in size, adds more flexibility, customisation, and automation ("machine understandability"). + 'GTrack' belongs to the 'BioXSD|GTrack' ecosystem of generic formats, and particular to its subset, the 'GTrack ecosystem' (GTrack, GSuite, BTrack). 'GTrack' is the tabular format for representing features of sequences and genomes. + + + + + @@ -31448,6 +31532,7 @@ + @@ -31743,12 +31828,28 @@ - MHT - MIME HTML format for Web pages, which can include external resources, including images, Flash animations and so on. + MHTML + MHT + MIME HTML + MHTML format + MHT format + MIME HTML format + HTML email format + HTML email message format + MIME multipart + MIME multipart format + MIME multipart message + MIME multipart message format + MIME HTML format for Web pages, which can include external resources, including images, Flash animations and so on. + MHTML is not strictly an HTML format, it is encoded as an HTML email message (although with multipart/related instead of multipart/alternative). It, however, contains the main HTML block as its core, and thus it is for practical reasons incuded in EDAM as a specialisation of 'HTML'. - EMBL entry format wrapped in HTML elements. 1.9 - MHTML + mhtml|mht|eml + + + + + @@ -33267,9 +33368,9 @@ experiments employing a combination of technologies. YAML Ain't Markup Language YAML (YAML Ain't Markup Language) is a human-readable tree-structured data serialisation language. yaml|yml - - + + @@ -33283,9 +33384,9 @@ experiments employing a combination of technologies. 1.16 Tabular data represented as values in a text file delimited by some character. - Tabular format + Tabular format Delimiter-separated values - https://en.wikipedia.org/wiki/Delimiter-separated_values + @@ -33298,9 +33399,11 @@ experiments employing a combination of technologies. CSV Comma-separated values - http://filext.com/file-extension/CSV + csv + 1.16 - http://www.iana.org/assignments/media-types/text/csv + + Tabular data represented as comma-separated values in a text file. @@ -33353,16 +33456,261 @@ experiments employing a combination of technologies. - UniProt XML format + UniProtKB XML + UniProt XML + UniProtKB XML format + UniProt XML format + 1.16 - XML sequence format used for UniProt entries. + UniProtKB XML sequence features format is an XML format available for downloading UniProt entries. + + + + + + UniProtKB RDF + UniProt RDF + UniProtKB RDF format + UniProt RDF format + UniProtKB RDF/XML + UniProt RDF/XML + UniProtKB RDF/XML format + UniProt RDF/XML format + + + 1.16 + UniProtKB RDF sequence features format is an RDF format available for downloading UniProt entries (in RDF/XML). + + + + + + + + + + + BioJSON (BioXSD) + BioXSD BioJSON + BioXSD BioJSON format + BioJSON (BioXSD data model) + BioXSD/GTrack BioJSON + BioXSD|GTrack BioJSON + BioXSD|BioJSON|BioYAML BioJSON + BioJSON format (BioXSD) + BioXSD in JSON format + BioXSD in JSON + BioXSD JSON format + BioXSD JSON + BioXSD+JSON + 1.16 + BioJSON is a BioXSD schema-based JSON format of sequence-based data and some other common data - sequence records, alignments, feature records, references to resources, and more - optimised for integrative bioinformatics, web applications and APIs, and object-oriented programming. + Work in progress. 'BioXSD' belongs to the 'BioXSD|GTrack' ecosystem of generic formats. 'BioJSON' is the JSON format based on the common, unified 'BioXSD data model', a.k.a. 'BioXSD|BioJSON|BioYAML'. + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + BioYAML + BioXSD BioYAML + BioXSD BioYAML format + BioYAML (BioXSD data model) + BioYAML (BioXSD) + BioYAML format (BioXSD) + BioXSD/GTrack BioYAML + BioXSD|GTrack BioYAML + BioXSD|BioJSON|BioYAML BioYAML + BioXSD in YAML format + BioXSD in YAML + BioXSD YAML format + BioXSD YAML + BioXSD+YAML + BioYAML format + 1.16 + BioYAML is a BioXSD schema-based YAML format of sequence-based data and some other common data - sequence records, alignments, feature records, references to resources, and more - optimised for integrative bioinformatics, web APIs, human readability and editting, and object-oriented programming. + Work in progress. 'BioXSD' belongs to the 'BioXSD|GTrack' ecosystem of generic formats. 'BioYAML' is the YAML format based on the common, unified 'BioXSD data model', a.k.a. 'BioXSD|BioJSON|BioYAML'. + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + BioJSON (Jalview) + BioJSON format (Jalview) + Jalview BioJSON format + Jalview BioJSON + JSON format (Jalview) + JSON (Jalview) + Jalview JSON format + Jalview JSON + 1.16 + BioJSON. + + + + + + + + + + + + + + + + + + + + + + + + + + + GSuite + BioXSD/GTrack GSuite + BioXSD|GTrack GSuite + GSuite (GTrack ecosystem of formats) + GTrack|GSuite|BTrack GSuite + GTrack|BTrack|GSuite GSuite + GSuite format + + + 1.16 + GSuite is a tabular format for collections of genome or sequence feature tracks, suitable for integrative multi-track analysis. GSuite contains links to genome/sequence tracks, with additional metadata. + 'GSuite' belongs to the 'BioXSD|GTrack' ecosystem of generic formats, and particular to its subset, the 'GTrack ecosystem' (GTrack, GSuite, BTrack). 'GSuite' is the tabular format for an annotated collection of individual GTrack files. + + + + + + + + + + + + + + + + + BTrack + BioXSD/GTrack BTrack + BioXSD|GTrack BTrack + BTrack (GTrack ecosystem of formats) + GTrack|GSuite|BTrack BTrack + GTrack|BTrack|GSuite BTrack + BTrack format + + + + + 1.16 + BTrack is an HDF5-based binary format for genome or sequence feature tracks and their collections, suitable for integrative multi-track analysis. BTrack is a binary, compressed alternative to the GTrack and GSuite formats. + 'BTrack' belongs to the 'BioXSD|GTrack' ecosystem of generic formats, and particular to its subset, the 'GTrack ecosystem' (GTrack, GSuite, BTrack). 'BTrack' is the binary, optionally compressed HDF5-based version of the GTrack and GSuite formats. + + + + @@ -44840,14 +45188,14 @@ Trim sequences (typically from an automated DNA sequencer) to remove sequence-sp Format recognition 'Format recognition' is not a bioinformatics-specific operation, but of great relevance in bioinformatics. Should be removed from EDAM if/when captured satisfactorily in a suitable domain-generic ontology. Format inference - + The has_input "Data" (data_0006) may cause visualisation or other problems although ontologically correct. But on the other hand it may be useful to distinguish from nullary operations without inputs. - - + + @@ -46515,7 +46863,6 @@ Trim sequences (typically from an automated DNA sequencer) to remove sequence-sp - Peptide-spectrum-matching Protein inference Identification of protein, for example from one or more peptide identifications by tandem mass spectrometry. 1.16