Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BDQ Core - VOCABULARY of terms #152

Closed
Tasilee opened this issue Aug 28, 2018 · 174 comments
Closed

BDQ Core - VOCABULARY of terms #152

Tasilee opened this issue Aug 28, 2018 · 174 comments

Comments

@Tasilee
Copy link
Collaborator

Tasilee commented Aug 28, 2018

Terms in the bdqffdq namespace are from the Fitness for Use Framework (Viega et al. 2017). Use the reference to the Framework Definitions for more details and examples. The use of a vocabulary term in a test specification without a namespace prefix (sometimes represented in all UPPER CASE), implies that the bdq: or bdqffdq: namespace is applicable. Note that wherever "DQ" is used in a definition it implies "Data Quality" and wherever "FFU Framework" is used it refers to the "Fitness for Use Framework" (Veiga et al. 2017).

Note: There are two tables in this issue, the first is for vocabulary for the standard, the second is for additional terms for supplement files that will go into tables in those documents rather than controlled vocabularies.

Do not edit, moved to csv files

Pending further splits, this vocabulary moved to https://github.com/tdwg/bdq/blob/master/tg2/vocabularies/combined_vocabulary.csv

namespace:Term Term Definition Context Comment
bdqffdq:ActedUpon ActedUpon A subtype of an bdqffdq:InformationElement that is the primary focus of a test. InformationElements may also be Consulted, data quality reports should identify the ActedUpon terms of tests separately from Consulted terms. bdqffdq:InformationElement
bdq:Alien-Species Alien-Species Research uses for occurrence data of alien species where 1) the information elements concern what organism occurred where and when and the means, degree, and pathways of establishment, and 2) that are used for analysis of spatial and/or temporal patterns of biodiversity (see examples in Groom et al. (2019). Improving Darwin Core for research and management of alien species. https://doi.org/10.3897/biss.3.38084). bdqffdq:UseCase
bdq:AllAmendmentTestsRunOnSingleRecord AllAmendmentTestsRunOnSingleRecord A list of Amendments that have been run on a Single Record. bdqffdq:InformationElements Used in Measure of Single Record Tests
bdq:AllValidationTestsRunOnSingleRecord AllValidationTestsRunOnSingleRecord A list of Core Validation Tests that have been run on a Single Record. bdqffdq:InformationElements Used in Measure of Single Record Tests
bdq:Ambiguous Ambiguous Used to report where bdq:Conformance is not satisfied due to bdqffdq:InformationElements not being unambiguously resolvable by a bdq:sourceAuthority. bdqffdq:DataQualityDimension
bdq:AMENDED AMENDED A bdq:Response.status used to indicate that a response for a bdqffdq:Amendment contains a proposed change to a record in the bdq:Response.result. bdq:Response.status Applies only to a bdqffdq:Amendment.
bdqffdq:Amendment Amendment A Data Quality needs level concept that describes a run of a test that proposes changes based on some data quality enhancement. The AMENDMENT concept in the Tests involves data that were amended by modification or addition of a value or values following defined bdqffdq:Criteria of a run result (bdq:Response) that includes a status of (bdq:AMENDED, bdq:FILLED_IN, bdq:TRANSPOSED, etc) as well as the proposed changes to values from the original data (in bdq:Response.value). FFU Framework: Class Formally in the Fitness for Use Framework (Veiga et al. 2017), the description of a test that can propose a change is a bdqffdq:Enhancement, while the corresponding report level concept is a bdqffdq:Amendment.
bdqffdq:AmendmentMethod AmendmentMethod A Data Quality solutions level concept describing the relationship between a bdqffdq:Specification (technical description of a test) and a bdqffdq:Enhancement in the context of bdqffdq:ResourceType (bdqffdq:SingleRecord or bdqffdq:MultiRecord) and associated bdqffdq:InformationElements. FFU Framework: Class Veiga et al. (2017).
bdqffdq:AmendmentPolicy AmendmentPolicy A Data Quality needs level concept that describes how some bdqffdq:contextualizedEnhancement relates to a bdqffdq:UseCase. This relationship defines which amendments are supported by a given use case. FFU Framework: Class Veiga et al. (2017).
bdqffdq:amendmentProperties amendmentProperties Sub properties of bdqffdq:ObjectProperties that apply to amendment concepts such as bdqffdq:AmendmentPolicy (DQ needs), bdqffdq:AmendmentMethod (DQ solutions) and bdqffdq:Amendment (DQ reports). FFU Framework: ObjectProperty Veiga et al. (2017).
bdqffdq:AmendmentReport AmendmentReport A Data Quality report level concept that results from a bdqffdq:Amendment that proposed changes to a bdqffdq:InformationElement. FFU Framework: Class Veiga et al. (2017).
bdq:annotationAlertIf annotationAlertIf Optionally establishes if an annotation exists within a bdq:annotationSystem by describing the criteria for relating annotations in the system to records in a bdq:ParameterizedTest. bdq:Parameter Used in test "ANNOTATION_ISSUE_NOTEMPTY" (fecaa8a3-bbd8-4c5a-a424-13c37c4bb7b1).
bdq:annotationSystem annotationSystem Optionally established a system for annotations within a bdq:ParameterizedTest with the default being the w3c Annotations Data Model's "oa:Annotation" bdq:Parameter Used in test "ANNOTATION_ISSUE_NOTEMPTY" (fecaa8a3-bbd8-4c5a-a424-13c37c4bb7b1).
bdqffdq:Assertion Assertion The bdqffdq:Assertion type in FFDQ is the fundamental concept that makes up a bdqffdq:DataQualityReport. bdqffdq:Assertion can be any one of four types (represented as subClasses), bdqffdq:Measure, bdqffdq:Validation, bdqffdq:Issue, and bdqffdq:Amendement. FFU Framework: Class Veiga et al. (2017). The assertion concept consists of a bdqffdq:Specification (the technical description of a performed test), a bdqffdq:DataResource (initial values of input data expressed in terms of some controlled vocabulary), the bdqffdq:Mechanism (external service, actor, or code that performs the test), and some form of bdqffdq:Result.
bdq:ASSUMEDDEFAULT ASSUMEDDEFAULT A bdqffdq:Amendment that replaces a bdq:EMPTY term with a predefined default bdq:Parameter value. bdqTestField:Term-Actions Would be used only in an extension or in bdq:Response.comment, bdq:Response.status value for this case is bdq:AMENDED.
bdq:assumptionOnUnknownBiome assumptionOnUnknownBiome Used when a bdq:taxonomyIsMarine source authority is unable to assert the marine or non-marine status of a taxon, the biome (marine/nonmarine) to assume instead or noassumption. bdq:Parameter See VALIDATION_COORDINATES_TERRESTRIALMARINE (b9c184ce-a859-410c-9d12-71a338200380).
bdq:Biotic-Relationships Biotic-Relationships Research uses for relationships between organisms where 1) the information elements concern what organisms have a relationship and 2) that are used for analysis of the relationship of one organism to another (see examples in ​​Poelen JH, Simons JD, Mungall CJ. (2014). Global Biotic Interactions: An open infrastructure to share and analyze species-interaction datasets. Ecological Informatics, 24, 148–159. https://doi.org/10.1016/j.ecoinf.2014.08.005) bdqffdq:UseCase
bdq:COMPLETE COMPLETE A bdqffdq:Assertion of a bdqffdq:Measure where data are present and sufficiently comprehensive for use. bdq:Response.result
bdq:Completeness Completeness The extent to which data are present and sufficiently comprehensive for use. bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:Conformance Conformance Conforms to a format, syntax, data type, range, or standard of the bdqffdq:InformationElement. bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:COMPLIANT COMPLIANT A bdq:ExpectedResponse of a bdqffdq:Validation where the data conforms to the test bdqffdq:Criterion. bdq:Response.result Applies only to bdqffdq:Validations.
bdqffdq:ComposedOf ComposedOf Describes the properties from a controlled vocabulary that compose a bdqffdq:InformationElement. For example, a bdqffdq:InformationElement may be bdqffdq:composedOf properties such as dwc:day, dwc:month and dwc:year. FFU Framework: ObjectProperty Veiga et al. (2017).
bdq:Consistency Consistency Agreement among related bdqffdq:InformationElements that are present in the data. Note that missing bdqffdq:InformationElements do not make a test bdq:Inconsistent. bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:CONSISTENT CONSISTENT Identifies inconsistency among values between bdqffdq:InformationElements. bdqTestField:Term-Actions
bdqffdq:Consulted Consulted A bdqffdq:InformationElement that was referenced in a test but was not the primary focus of the test. bdqffdq:InformationElements
bdqffdq:ContextualizedCriterion ContextualizedCriterion Describes an instance of the criterion concept in terms of the associated bdqffdq:InformationElements from some controlled vocabulary (fields actedUpon or consulted), and a bdqffdq:ResourceType of bdqffdq:SingleRecord or bdqffdq:MultiRecord. FFU Framework: Class Veiga et al. (2017).
bdqffdq:ContextualizedDimension ContextualizedDimension Describes an instance of the bdqffdq:Dimension concept in terms of the associated bndqffdq:InformationElements from some controlled vocabulary (fields actedUpon or consulted), and a bdqffdq:ResourceType of bdqffdq:SingleRecord or bdqffdq:MultiRecord. FFU Framework: Class Veiga et al. (2017).
bdqffdq:ContextualizedEnhancement ContextualizedEnhancement Describes an instance of the bdqffdq:Enhancement concept in terms of the associated bdqffdq:InformationElements from some controlled vocabulary (fields actedUpon or consulted), and a bdqffdq:ResourceType of bdqffdq:SingleRecord or bdqffdq:MultiRecord. FFU Framework: Class Veiga et al. (2017).
bdqffdq:ContextualizedIssue ContextualizedIssue Describes an instance of the bdqffdq:Issue concept in terms of the associated bdqffdq:InformationElements from some controlled vocabulary (fields actedUpon or consulted), and a bdqffdq:ResourceType of bdqffdq:SingleRecord or bdqffdq:MultiRecord. FFU Framework: Class Veiga et al. (2017).
bdq:CONVERTED CONVERTED A conversion has been proposed to values in the bdqffdq:InformationElements to conform with a targeted reference system. bdqTestField:Term-Actions See Test "AMENDMENT_COORDINATES_CONVERTED" (620749b9-7d9c-4890-97d2-be3d1cde6da8).
bdq:COORDINATES COORDINATES Represents the combination of the Darwin Core terms dwc:decimalLatitude and dwc:decimalLongitude. bdqffdq:InformationElement
bdqffdq:coversUseCase coversUseCase Used by concepts in the Data Quality needs category to describe the relationship between DQ Policies (bdqffdq:ValidationPolicy, bdqffdq:AmendmentPolicy, bdqffdq:MeasurementPolicy) and an instance of the bdqffdq:UseCase covered by that policy. FFU Framework: ObjectProperty Veiga et al. (2017).
bdqffdq:Criterion Criterion Describes the criterion a bdqffdq:Validation test uses to determine compliance. For example, "The value of dwc:basisOfRecord of bdqffdq:SingleRecords must be in the controlled vocabulary". FFU Framework: Class Veiga et al (2017).
bdqffdq:criterionInContext criterionInContext Describes the relationship between a bdqffdq:Validation concept in the FFU Framework (needs, solutions, reports) and a bdqffdq:contextualizedCriterion. FFU Framework: ObjectProperty Veiga et al. (2017).
bdq:dataID dataID The local (to bdq:ValidationData) integer indentifier for the Validation Data record bdq:ValidationData
bdqffdq:DataQualityDimension DataQualityDimension Describes the aspect of data quality (accuracy, precision, completeness, etc.) that a test examines. For example, "precision" in "coordinate precision of single records". Includes Completeness (q.v.), Conformance (q.v.), Consistency (q.v.), Likeliness (q.v.), Reliability (q.v.), and Resolution (q.v.). FFU Framework: Class Note that the fail (bdq:NOT_COMPLIANT) assertions from running a test are one of: bdq:Ambiguous, bdq:Incomplete, bdq:Inconsistent, bdq:Invalid, or bdq:Unlikely.
bdqffdq:DataQualityReport DataQualityReport A set of bdqffdq:Assertions (bdqffdq:Measures, bdqffdq:Validations bdqffdq:Issues and bdqffdq:Amendments) that represent the output of a test run produced by a bdqffdq:Mechanism designed to assess the fitness for use of the tested data for a particular purpose as. FFU Framework: Class Veiga et al. (2017).
bdqffdq:DataResource DataResource Describes a data resource described in terms of a controlled vocabulary such as dwc and represents the original values of the data operated on by an assertion test (i.e. an instance of dwc:Occurrence). FFU Framework: Class Veiga et al (2017).
bdq:DefaultSourceAuthority DefaultSourceAuthority A default where a required bdq:Parameter or a bdq:sourceAuthority (q.v.) has not been provided. bdq:Parameter
bdq:defaultGeodeticDatum defaultGeodeticDatum Optionally established the default datum in a bdq:ParameterizedTest. A default datum is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter See test AMENDMENT_GEODETICDATUM_ASSUMEDDEFAULT (7498ca76-c4d4-42e2-8103-acacccbdffa7).
bdq:defaultValue defaultValue A preselected value (e.g. year, elevation) where a required bdq:Parameter value has not been provided. bdq:Parameter
bdqffdq:dimensionInContext dimensionInContext Describes the relationship between a bdqffdq:Amendment concept in the FFU Framework (needs, solutions, reports) and a bdqffdq:ContextualizedDimension. FFU Framework: ObjectProperty Veiga et al. (2017).
dwc: dwc: A namespace to indicate Darwin Core terms and which are listed in the dwcffdq:InformationElements of each Test. Data
bdq:earliestValidDate earliestValidDate Optionally establishes the earliest date in a bdq:ParameterizedTest. A default date is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdq:EMPTY EMPTY A bdqffdq:InformationElement that is either not present or does not contain any characters or values other than those in the range U+0000 to U+0020. Data Note: A bdqffdq:InformationElement containing invalid characters (e.g. letters in an information element that would be expected to contain integers) or values (including string serializations of the NULL value) are NOT_EMPTY and may be separately detected.
bdqffdq:Enhancement Enhancement Describes the enhancement to the original data performed by a bdqffdq:Amendment test. For example, "Recommends valid value for taxon name in a single record". FFU Framework: Class Veiga et al (2017).
bdqffdq:enhancementInContext enhancementInContext Describes the relationship between a bdqffdq:Amendment concept in FFU Framweork (needs, solutions, reports) and a bdqffdq:ContextualizedEnhancement. FFU Framework: Object Property Veiga et al. (2017)
EPSG: EPSG A pseudo-namespace referenced in dwc:datum to indicate the EPSG API where the numeric value following the colon is used as the search key. Example: EPSG:4326. Data
bdq:Examples Examples Provide one pass (i.e. COMPLIANT) example and one fail (NON_COMPLIANT) example for each test. bdq:Parameter
bdq:ExpectedResponse ExpectedResponse bdq:ExpectedReponse is one of the properties of a bdqffdq:Specification used in the markdown of the tests in the bdq GitHub. bdq:Specification
bdq:EXTERNAL_PREREQUISITES_NOT_MET EXTERNAL_PREREQUISITES_NOT_MET A bdq;Response.status indicating that a bdq:Response.value was not generated because a bdq:sourceAuthority was not available or was off-line. If the test is run at a later time, it may produce a different result. bdq:Response.status
bdq:FILLED_IN FILLED_IN A Response.status for a bdqffdq:Amendment where a value has been proposed for a bdqffdq:InformationElement that has no value. bdq:Response.status
bdq:FOUND FOUND The value in a bdqffdq:InformationElement that matched a value in a bdq:sourceAuthority. bdqTestField:Term-Actions Use bdq:COMPLIANT for bdq:Response.result, and include this in bdq:Response.comments or bdq:Response.qualifier.
gbif: gbif: A pseudo-namespace referenced in dwc:taxonID to indicate the GBIF API where the numeric value following the colon is used as the search key. Example: gbif:8102122. Data
bdq:GEOGRAPHY GEOGRAPHY A combination of Darwin Core administrative geography terms dwc:continent, dwc:country, dwc:countryCode, dwc:stateProvince, dwc:county, dwc:municipality. bdqffdq:InformationElement
bdq:geospatialLand geospatialLand Polygons derived from a union of Natural Earth vectors for Land and for Minor Islands at 1:10,000,000 resolution. bdq:Parameter See VALIDATION_COORDINATES_TERRESTRIALMARINE (b9c184ce-a859-410c-9d12-71a338200380)
bdq:GUID GUID Gobally Unique Identifier. In this document, the GUID for a test is a UUID (128-bit universally unique identifier) which identifies the test. Data GUID is intended to identify the tests for machine consumption, "Label" is used for human consumption. https://en.wikipedia.org/wiki/Universally_unique_identifier
bdqffdq:hasCriterion hasCriterion Used to link the derived concept of a bdqffdq:ContextualizedCriterion to the fundamental concept of a bdqffdq:Criterion. FFU Framework: ObjectProperty Veiga et al (2017).
bdqffdq:hasDimension hasDimension Used to link the derived concept of a bdqffdq:ContextualizedDimension to the fundamental concept of a bdqffdq:Dimension. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasEnhancement hasEnhancement Used to link the derived concept of a bdqffdq:ContextualizedEnhancement to the fundamental concept of a bdqffdq:Enhancement. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasInformationElement hasInformationElement Provides a relationship between FFDQ concepts and the information elements. For example, bdqffdq:ContextualizedCriterion uses this property along with bdqffdq:hasResourceType to define a criterion in the context of related information elements. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasIssue hasIssue Used to link the derived concept of a bdqffdq:ContextualizedIssue to the fundamental concept of a Problem. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasResourceType hasResourceType Provides additional metadata, along with the bdqffdq:InformationElements, that describes the level (bdqffdq:SingleRecord or bdqffdq:MultiRecord) at which the FFDQ concept operates. For example, a bdqffdq:enhancementInContext with resource type of bdqffdq:MultiRecord could be used to define a bdqffdq:Amendment that applies at the level of multiple record values. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasSpecification hasSpecification Describes the relationship between a derived FFDQ concept and the fundamental concept of a bdqffdq:Specification (technical description of a test). FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:hasStatus hasStatus Used in the bdqffdqReport concept to describe result status. For example, in the case of a bdqffdq:Validation result, values could be bdq:COMPLIANT or bdq:NON_COMPLIANT. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:Implementation Implementation The FFDQ derived concept of a bdqffdq:Implementation describes the relationship between a bdqffdq:Specification (technical description of a test) and the bdqffdq:Mechanism that implements it. FFU Framework: Class Veiga et al (2017).
bdqffdq:implementedBy implementedBy Describes the link between the bdqffdq:Implementation concept in FFDQ and the bdqffdq:Mechanism that implements some bdqffdq:Specification (also defined in bdqffdq:Implementation). FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:ImprovementTarget ImprovementTarget The bdqffdq:ImprovementTarget concept in FFDQ describes which bdqffdq:Measures and bdqqffdq:Validations are improved by some bdqffdq:Amendment. bdqffdq:ImprovementTarget includes relationships between a bdqffdq:contextualizedEnhancement (for a bdqffdq:Amendment) and one or more bdqffdq:contextualizedCriterion (link to bdqffdq:Validations) or bdqffdq:contextualizedDimension (link to bdqffdq:Measures). FFU Framework: Class Veiga et al (2017).
bdqffdq:improvedBy improvedBy Object property that describes a bdqffdq:Enhancement, as part of the bdqffdq:ImprovementTarget, that would improve data acted upon by some set of bdqffdq:Measures or bdqffdq:Validations. FFU Framework: Object Property Veiga et al. (2017).
bdq:includeEventDate bdq:includeEventDate Allows dwc:eventDate to be excluded in a bdq:ParameterizedTest. The default is to include the event date in the test, but it may be excluded to allow an identification to be prior to the event date. bdq:Parameter Used in test "VALIDATION_DATEIDENTIFIED_INRANGE" (dc8aae4b-134f-4d75-8a71-c4186239178e).
bdq:Incomplete Incomplete Where a bdqffdq:InformationElement does not contain sufficient information to satisfy the scope of the test. bdqffdq:DataQualityDimension
bdq:Inconsistent Inconsistent Where the Data Quality Dimension (q.v.): Consistency (q.v.) is not satisfied due to inconsistent values between the different Information Elements (q.v.) of a single record. bdqffdq:DataQualityDimension
bdqffdq:InformationElement InformationElement A bdqffdq:InformationElement identifies a portion of data to which a test pertains. The bdqffdq:InformationElement in FFDQ can be represented as a single or composite element that consists of one or more terms from a controlled vocabulary (fields actedUpon or consulted by an assertion test) that identifies concepts in data relevant to a use case. An abstraction or a concrete term that represents relevant content (e.g., coordinates; dwc.decimalLatitude, dwc:decimalLongitude). FFU Framework: Class For the test descriptions, bdqffdq:InformationElements are concrete Darwin Core terms, to remove ambiguity for implementors. Veiga et al (2017).
bdq:INTERNAL_PREREQUISITES_NOT_MET INTERNAL_PREREQUISITES_NOT_MET A bdq:Response.status where values of the bdqffdq:InformationElement were insufficient to run the test. If the test is run at a later time on unmodified data, it should produce the same bdq:Response. bdq:Response.status
bdq:interpretedAs interpretedAs (1) For Implementors, where Darwin Core data are serialized as strings, but the test refers to data as numeric or other non-string data type, can the string value be parsed into the target data type in the language of implementation (e.g., "1" as the integer 1), (2) matching a representation of a value unambiguously onto a controlled vocabulary (e.g., ‘WGS84’ to ’EPSG:4326’), or (3) interpreting the representation of a numeric value (e.g., a roman numeral) as a number (e.g., an integer). Data
bdq:Invalid Invalid Where the bdqffdq:DataQualityDimension: bdq:Conformance is not satisfied due to bdqffdq:InformationElements containing non-standard values. bdqffdq:DataQualityDimension
bdqffdq:Issue Issue A Data Quality needs level concept that flags issue or problems with the data. In the context of the tests, bdqffdq:Issue(s) are all either bdq:POTENTIAL_ISSUE, bdq:IS_ISSUE where potential problems are flagged and may need examination by the user to determine if data have quality for their use; or bdq:NOT_ISSUE. FFU Framework: Class Veiga et al (2017).
bdqffdq:issueInContext issueInContext Describes the relationship between a bdqffdq:Issue concept in FFU Framework (needs, solutions, reports) and a bdqffdq:ContextualizedIssue. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:IssueMethod IssueMethod A Data Quality solutions level concept describing the relationship between a bdqffdq:Specification (technical description of a test) and a bdqffdq:Issue in the context of bdqffdq:ResourceType (bdqffdq:SingleRecord or bdqffdq:MultiRecord) and associated bdqffdq:InformationElements. FFU Framework: Class In Veiga et al. (2017) this was treated as "ProblemMethod"
bdqffdq:IssuePolicy IssuePolicy A Data Quality needs level concept that describes how some bdqffdq:contextualizedIssue relates to a bdqffdq:UseCase. This relationship defines which bdqffdq:Issues are supported by a given bdqffdq:UseCase. FFU Framework: Class In Veiga et al. (2017) this was treated as "ProblemPolicy"
bdqffdq:issueProperties issueProperties Sub properties of bdqffdq:ObjectProperties that apply to bdqffdq:Issue concepts such as bdqffdq:IssuePolicy (DQ needs), bdqffdq:IssueMethod (DQ solutions) and bdqffdq:Issue (DQ reports). FFU Framework: Object Property In Veiga et al. (2017) treated as "ProblemProperties"
bdqffdq:IssuesReport IssuesReport A Data Quality report level concept that results from a bdqffdq:Issue that flagged a problem in a test as bdq:IS_ISSUE, bdq:POTENTIAL_ISSUE or bdq:NOT_ISSUE. FFU Framework: Class Veiga et al. (2017).
bdq:IS_ISSUE IS_ISSUE A bdq:Response.result for a bdqffdq:Issue that flags where the data do not have sufficient quality for a use. bdq:Response.result
rdfs:label label "See: https://www.w3.org/TR/rdf-schema/#ch_label" RDF representation "skos:prefLabel/skos:label may be prefered."
bdq:latestValidDate latestValidDate Optionally establishes the latest date in a bdq:ParameterizedTest. A default date is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdq:Likeliness Likeliness The likelihood of Darwin Core Term(s) having true or expected values. bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:LineNumber LineNumber The sequence number of the data record in the bdq:ValidationData bdq:ValidationData
bdq:LineForTest LineForTest A local to bdq:ValidationData identifier for test records within one test bdq:ValidationData
bdq:maximumValidDepthInMeters maximumValidDepthInMeters Optionally establishes the maximum depth in a bdq:ParameterizedTest. A default depth is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdq:maximumValidElevationInMeters maximumValidElevationInMeters Optionally establishes the highest elevation in a bdq:ParameterizedTest. A default elevation is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdqffdq:Measure Measure A Data Quality needs level concept that describes a run of a test that performs a measurement according to some data quality dimension. In FFDQ, the Measure concept consists of a run result of COMPLETE or NOT_COMPLETE, a value of the measurement (i.e. a measure of dwc:eventDate duration in seconds) or counts of the number of tests from a run where bdq:Response.result was bdq:NOT_COMPLIANT, or bdq:PREREQUISITES_NOT_MET in bdqffdq:Validation tests, or was bdq:AMENDED in bdqffdq:Amendment tests. FFU Framework: Class Veiga et al. (2017).
bdqffdq:MeasurementMethod MeasurementMethod A Data Quality solutions level concept describing the relationship between a bdqffdq:Specification (technical description of a test) and a bdqffdq:Dimension in the context of bdqffdq:ResourceType (bdqffdq:SingleRecord or bdqffdq:MultiRecord) and associated bdqffdq:InformationElements. FFU Framework: Class Veiga et al (2017).
bdqffdq:MeasurementPolicy MeasurementPolicy A Data Quality needs level concept that describes how some bdqffdq:contextualizedDimension relates to a bdqffdq:UseCase. This relationship defines which bdqffdq:Measures are supported by a given bdqffdq:UseCase. FFU Framework: Class Veiga et al (2017).
bdqffdq:measurementProperties measurementProperties Sub properties of bdqffdq:ObjectProperties that apply to measurement concepts such as bdqffdq:MeasurementPolicy (DQ needs), bdqffdq:MeasurementMethod (DQ solutions) and bdqffdq:Measure (DQ reports). FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:MeasurementReport MeasurementReport A Data Quality report level concept that describes the results of a run of a test that performs a measurement according to some data quality dimension. FFU Framework: Class Veiga et al. (2017).
bdqffdq:Mechanism Mechanism The FFDQ concept of bdqffdq:Mechanism describes the entity that performs an assertion test (code, external service, actor, etc.). Tied to a bdqffdq:Specification via the concept of a bdqffdq:Implementation. FFU Framework: Class Veiga et al (2017).
bdq:minimumValidDepthInMeters minimumValidDepthInMeters Optionally establishes the minimum depth in a bdq:ParameterizedTest. A default depth is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdq:minimumValidElevationInMeters minimumValidElevationInMeters Optionally establishes the lowest elevation in a bdq:ParameterizedTest. A default elevation is supplied in cases where a bdq:Parameter is not set at the time the test is run. bdq:Parameter
bdqffdq:MultiRecord MultiRecord A data set composed of one or more bdqffdq:SingleRecords. FFU Framework: namedIndividual Veiga et al. (2017).
non-printing characters non-printing characters ASCII 0-32 and 127 decimal. Non printing characters or formatting marks that are not displayed at printing. These may include pilcrow, space, non-breaking space, tab character. etc. For the purposes of the tests they are treated as bdq:EMPTY. Data
bdq:NOT_AMENDED NOT_AMENDED A bdq:Result.status that indicates that a bdq:Response for a bdqffdq:Amendment proposed no change. bdq:Response.status
bdq:NOT_COMPLETE NOT_COMPLETE An assertion of a bdqffdq:Measure on a bdqffdq:MultiRecord where not all the bdqffdq:Validation bdq:Response.result from all included records in the dataset are bdq:COMPLIANT. bdq:Response.result The scope in the Fitness of Use Framework (Veiga et al. (20117) is broader.
bdq:NOT_COMPLIANT NOT_COMPLIANT A bdq:Response.result of a bdqffdq:Validation where the data do not conform to the Test bdqffdq:Criterion. bdq:Response.status
bdq:NOT_ISSUE NOT_ISSUE The bdq:Response of a test of type bdqffdq:Issue where no potential problems were detected. bdq:Response.result
bdq:NOTEMPTY NOTEMPTY The value of a bdqffdq:InformationElement that is present and has content (cf. bdq:EMPTY) Data
null null A value that is used in some databases to signify that a value is unknown or missing. It may be represented in serializations by "NULL", "Null", "null". "/n", "9999", etc. These should be treated as bdq:NOTEMPTY. Data
bdq:OUTOFRANGE OUTOFRANGE The value in a bdqffdq:InformationElement that is outside an acceptable range for that bdqffdq:InformationElement. bdqTestField:Term-Actions Use in bdq:Response.qualifier or bdq:Response.comment.
bdq:Parameter Parameter A value provided to a test that changes the behavior of a test to fit a particular user need within the scope of the test. Either 1) a link to a bdq:sourceAuthority to find matching values, or 2) a value used to define limits for a bdqffdq:InformationElement. Data
bdq:paramaterizedTest paramaterizedTest A test that allows a bdq:Parameter to be set prior to the test being run. Where a bdq:Parameter value has not been provided, a default is specified within the test. Test
bdq:POLYNOMIAL POLYNOMIAL Represents a combination of the Darwin Core terms dwc:genericName, dwc:specificEpithet, dwc:infraspecificEpithet. bdqffdq:InformationElement See test "VALIDATION_POLYNOMIAL_CONSISTENT" (17f03f1f-f74d-40c0-8071-2927cfc9487b)
bdq:POTENTIAL_ISSUE POTENTIAL_ISSUE A bdq:Response.result for a bdqffdq:Issue that flags where the data may not have sufficient quality for a use. See also bdq:IS_ISSUE and bdq:NOT_ISSUE. The user will need to evaluate if the data is fit for their particular use or not. bdq:Response.result
bdq:PRECISIONINSECONDS PRECISIONINSECONDS The length of the period of an event in seconds. bdqTestField:Term-Actions This is description of the bdq:Response.result from this bdqffdq:Measure, where the result is a numeric value in seconds. See Test "MEASURE_EVENTDATE_DURATIONINSECONDS" (56b6c695-adf1-418e-95d2-da04cad7be53).
skos:prefLabel Preferred Label See https://www.w3.org/TR/skos-reference/#labels. SKOS Representation from SKOS Simple Knowledge Organization System
bdq:PREREQUISITESNOTMET PREREQUISITESNOTMET A test of type bdqffdq:Measure that counts the number of tests of type bdqffdq:Validation that did not run due to one or more prerequisites not being met (e.g. bdq:INTERNAL_PREREQUISITES_NOTMET and bdq:EXTERNAL_PREREQUISITES_NOTMET) bdqTestField:Term-Actions See test "MEASURE_VALIDATIONTESTS_PREREQUISITESNOTMET" (49a94636-a562-4e6b-803c-665c80628a3d).
bdqffdq:Profile Profile a Data Quality needs level concept describing the bdqffdq:UseCases that make up some data quality operation such as the behavior of a single actor or workflow producing the relevant bdqffdq:Assertions. FFU Framework: Class Veiga et al. (2017).
bdq:PROPOSED PROPOSED A test of type bdqffdq:Measure that pertains to a bdqffdq:Amendment where an action to modify a value in some way through a change or addition is recommended. bdqTestField:Term-Actions Example see test "MEASURE_AMENDMENTS_PROPOSED" (03049fe5-a575-404f-b564-ae63f5a1cf8b).
bdq:Record-Management Record-Management Management of the quality of biodiversity data records (see examples in Rees ER & Nicholls M (2020) Data Quality Use Case Study Results https://doi.org/10.3897/biss.4.50889.suppl2). bdqffdq:UseCase
bdq:Reliability Reliability Measure of how the data values agree with an identified source of truth. The degree to which data correctly describes the truth (object, event or any abstract or real 'thing'). bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:Resolution Resolution Refers to the data having sufficiently detailed information. Measure of the granularity of the data, or the smallest measurable increment. bdqffdq:DataQualityDimension Definition from the Fitness for Use Framework: Data Quality Dimensions Document (**Link needed to RDF document ** - https://github.com/tdwg/bdq/wiki/TG2-Data-Quality-Dimension).
bdq:Response Response The report from a single execution of a single test, consisting of bdq:Response.status, bdq:Response.result, bdq:Response.comment, and optionally, bdq:Response.qualifier. bdq:Response Parent of RESULT and RESULT_STATUS in the Fitness for Use Framework (Viega et al. 2017).
bdq:Response.comment Response.comment Human readable interpretation of the results of the test. bdq:Response
bdq:Response.qualifier Response.qualifier Additional structured information that qualifies the bdq:Response, intended as an extension point for uncertainty. bdq:Response
bdq:Response.result Response.result The element in a bdq:Response containing the value returned by the particular test (bdqffdq:Validation, bdqffdq:Amendment, bdqffdq:Measure, or bdqffdq:Issue) bdq:Response
bdq:Response.status Response.status A metadata element in a bdq:Response indicating whether a particular test (bdqffdq:Validation, bdqffdq:Amendment, bdqffdq:Measure, or bdqffdq:Issue) was able to be performed or not. bdq:Response
bdqffdq:ResourceType ResourceType In FFDQ the concept of bdqffdq:ResourceType has instances for bdqffdq:SingleRecord or bdqffdq:MultiRecord. FFU Framework: Class Veiga et al. (2017).
bdqffdq:Result Result The report bdqffdq:Result concept in FFDQ is represented as a value or a bdqffdq:ResultStatus for bdqffdq:Measures, just a bdqffdq:ResultStatus for bdqffdq:Validations and a bdqffdq:ResultStatus as well as values for changes proposed by bdqffdq:Amendments. FFU Framework: Class Veiga et al. (2017).
bdqffdq:ResultStatus ResultStatus Depending on the assertion type would have values of bdq:COMPLIANT or bdq:NOT_COMPLIANT for a bdqffdq:Validation, bdq:COMPLETE or bdq:NOT_COMPLETE for a bdqffdq:Measure, bdq:AMENDED, bdq:FILLED_IN, bdq:TRANSPOSED, bdq:NOT_AMENDED for a bdqffdq:Amendment and bdq:IS_ISSUE, bdq:POTENTIAL_ISSUE or bdq:NOT_ISSUE for a bdqffdq:Issue FFU Framework: Class Veiga et al. (2017). Note that a separate concept describes the resultstate with values of bdq:INTERNAL_PREREQUISITES_NOT_MET and bdq:EXTERNAL_PREREQUISITES_NOT_MET.
Roman numerals Roman numerals Roman numerals are interpreted as the equivalent integer for months (e.g. "X" as "10") in appropriate tests. Roman numerals may not be unambiguously interpreted for other Darwin Core terms such as dwc:day or in text fields as they may mean unknown or something else entirely. Data
bdq:RUN_HAS_RESULT RUN_HAS_RESULT A bdq:Response.status that implies that a result was correctly generated. bdq:Response.status Applies to bdqffdq:Validations, bdqfdfq:Measures, and bdqffdq:Issues, but not bdqffdq:Amendments. See Fitness for Use Framework definition in Need link to OWL Document @chicoreus. See also bdq:INTERNAL_PREREQUISITES_NOT_MET and bdq:EXTERNAL_PREREQUISITES_NOT_MET
bdqffdq:SingleRecord SingleRecord A record from a dataset without dependencies on any other record. FFU Framework: namedIndividual Veiga et al. (2017). Note that all the current tests are run on a bdqffdq:SingleRecord, of Darwin Core data, and not designed to be run across a bdqffdq:MultiRecord, except for bdqffdq:MultiRecord bdqffdq:Measures.
bdq:sourceAuthority sourceAuthority An authority using the "bdq" namespace that provides a reference for values required for a test evaluation. Where the test is a bdq:ParameterizedTest a bdq:defaultSourceAuthority ("bdq:sourceAuthority default = xxx") is specified. bdq:Parameter
bdq:spatialBufferInMeters spatialBufferInMeters A buffer in meters from a polygon (geopolitical boundary, coastline, etc.). bdq:Parameter
bdq:Spatial-Temporal Patterns Spatial-Temporal Patterns Research uses for biodiversity occurrence data where 1) the information elements concern what organism occurred where and when and 2) that are used for analysis of spatial and/or temporal patterns of biodiversity (see examples in Rees ER & Nicholls M (2020) Data Quality Use Case Study Results https://doi.org/10.3897/biss.4.50889.suppl2). bdqffdq:UseCase
bdqffdq:Specification Specification A technical description of the performed test upon which a bdqffdq:Implementation could be made. FFU Framework: Class
bdq:STANDARD STANDARD A bdqffdq:Amendment where a value in a bdqffdq:InformationElement is proposed from a bdq:sourceAuthority. bdqTestField:Term-Actions Use in bdq:Response.qualifier or bdq:Response.comment.
bdq:STANDARDIZED STANDARDIZED A bdqffdq:Amendment where a bdq:STANDARD value for a bdqffdq:InformationElement is proposed. bdqTestField:Term-Actions Use bdq:AMENDED as the bdq:Response.status, report bdq:STANDARDIZED in a bdq:Response.qualifier or in a bdq:Response.comment.
bdq:targetCRS targetCRS The Coordinate Reference System (CRS) used as the output when converting coordinates from one CRS to another. The default is EPSG:4326. bdq:Parameter Used in the test AMENDMENT_COORDINATES_CONVERTED (620749b9-7d9c-4890-97d2-be3d1cde6da8).
bdqffdq:targetedCriterion targetedCriterion The bdffdq:Criterion targeted by some enhancement via the bdqffdq:ImprovementTarget object. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:targetedDimension targetedDimension The bdqffdq:Dimension targeted by some enhancement via the bdqffdq:ImprovementTarget object. FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:targetedIssue targetedIssue The bdqffdq:Issue targeted by some problem via the bdqffdq:ImprovementTarget object. FFU Framework: Object Property Veiga et al. (2017).
bdq:Taxon-Management Taxon-Management Management of the quality of taxonomic names (see examples in Rees ER & Nicholls M (2020) Data Quality Use Case Study Results https://doi.org/10.3897/biss.4.50889.suppl2). bdqffdq:UseCase
bdq:taxonIsMarine taxonIsMarine Marine/non-marine status obtained from the World Register of Marine Species (WORMS) database bdq:Parameter See VALIDATION_COORDINATES_TERRESTRIALMARINE (b9c184ce-a859-410c-9d12-71a338200380).
bdq:TERRESTRIALMARINE TERRESTRIALMARINE A terrestrial taxon that has geographic coordinates that fall within terrestrial boundaries; or a marine taxon that has geographic coordinates that fall within marine boundaries. bdqTestField:Term-Actions Use bdq:AMENDED as the bdq:Response.status, report bdq:TERRESTRIALMARINE in a bdq:Response.qualifier or in a bdq:Response.comment. See test "VALIDATION_COORDINATES_TERRESTRIALMARINE" (b9c184ce-a859-410c-9d12-71a338200380).
bdq:TestField testFields Column heading in the markdown of the tests in the bdq GitHub that list all the normative and informative metadata elements that describe a Data Quality Test Test
bdq:TestPrerequisite TestPrerequisite Conditions that must be met for a test to be run (e.g., fields having values, tests that need to be run before the current test, availability of a bdq:sourceAuthority) Test See for example, INTERNAL_PREREQUISTES_NOT_MET (q.v.) and EXTERNAL_PREREQUISITES_NOT_MET (q.v.).
bdq:TestType TestType There are four types of tests, viz. bdqffdq:Validation, bdqffdq:Amendment, bdqffdq:Issue, and bdqffdq:Measure. Test  
bdq:TRANSPOSED TRANSPOSED The sign and/or value of one or more bdqffdq:InformationElements were swapped. bdqTestField:Term-Actions Use bdq:AMENDED as the bdq:Response.status, report bdq:TRANSPOSED in a bdq:Response.qualifier or in a bdq:Response.comment. See Test "AMENDMENT_COORDINATES_TRANSPOSED" (f2b4a50a-6b2f-4930-b9df-da87b6a21082).
bdq:Unlikely Unlikely The bdqffdq:DataQualityDimension: bdq:Likeliness is not satisfied due to the bdq:InformationElements containing a value that is not likely to occur (for example where the geographic coordinates are "0", "0"). bdqffdq:DataQualityDimension Needs a bdq:Response.qualifier (q.v.) response for the uncertainty.
bdqffdq:UseCase UseCase The bdqffdq:UseCase concept in FFDQ describes some data quality control use case. The bdqffdq:Amendment, bdqffdq:Measure and bdqffdq:Validation policies that make up a bdqffdq:UseCase define which bdqffdq:Assertions cover a given bdqffdq:UseCase. FFU Framework: Class An example of a bdqffdq:UseCase could be "Check for internal consistency of dates", with bdqffdq:Validation policies for checking consistency between atomic date fields and a bdqffdq:Amendment such as "eventDate filled in from verbatim". Veiga et al. (2017).
bdqffdq:Validation Validation A Data Quality needs level concept that describes a run of a test for validity. The bdqffdq:Validation concept in the Tests consists of a run with a bdq:Response:result of bdq:COMPLIANT or bdq:NOT_COMPLIANT and a bdqffdq:Criterion that describes the conditions for validity that result in a status of bdq:COMPLIANT. FFU Framework: Class Veiga et al. (2017).
bdq:ValidationData ValidationData Test data set established for testing Test Implementations Data
bdqffdq:ValidationMethod ValidationMethod TA Data Quality solutions level concept describing the relationship between a bdqffdq:Specification (technical description of a test) and a bdqffdq:Criterion in the context of a bdqffdq:ResourceType (bdqffdq:SingleRecord or bdqffdq:MultiRecord) and associated bdqffdq:InformationElements. FFU Framework: Class Veiga et al. (2017).
bdqffdq:ValidationPolicy ValidationPolicy A Data Quality needs level concept that describes how some bdqffdq:contextualizedCriterion relates to a bdqffdq:UseCase. This relationship defines which validations are supported by a given bdqffdq:UseCase. FFU Framework: Class Veiga et al. (2017).
bdqffdq:validationProperties validationProperties Sub properties of bdqffdq:ObjectProperties that apply to validation concepts such as bdqffdq:ValidationPolicy (DQ needs), bdqfdq:ValidationMethod (DQ solutions) and bdqffdq:Validation (DQ reports). FFU Framework: Object Property Veiga et al. (2017).
bdqffdq:ValidationReport ValidationReport A Data Quality report level concept that reports the results of a run of a bdqffdq:Validation test on some data. FFU Framework: Class Veiga et al. (2017).
bdq:VERBATIM VERBATIM An original value. bdqffdq:InformationElement
white space white space 1) A field that only includes white space (blanks) is treated as bdq:EMPTY (q.v.). 2) In bdqffdq:Validation tests (q.v.) that require the looking up of a bdq:sourceAuthority, leading and/or trailing white space will cause the test to fail as no preprocessing is carried out on the data. These leading and trailing white spaces may be stripped out in a subsequent bdqffdq:Amendment and thus pass when the bdqffdq:Validation test is run again. Data
bdq:YEARMONTHDAY YEARMONTHDAY Represents a combination of the Darwin Core terms dwc:year, dwc:month, dwc:day. bdqffdq:InformationElement
bdq:YEARSTARTDAYOFYEARENDDAYOFYEAR YEARSTARTDAYOFYEARENDDAYOFYEAR Represents a combination of the Darwin Core terms dwc:year, dwc:startDayOfYear, dwc:endDayofYear. bdqffdq:InformationElement

Supplement: GitHub Label Terms
These are terms that are outside the Standard but that have been used as either GitHub Labels or TestFields in the BDQ GitHub

Do not edit, moved to csv files

**Pending further moves, this vocabulary moved to https://github.com/tdwg/bdq/blob/master/tg2/vocabularies/glossary_terms.csv **

namespace:Term Label Definition Context Comment
bdqtag:Amendment Amendment A label to indicate a test of type AMENDMENT which may propose a change or addition to at least one Darwin Core term that is intended to improve one or more components of the quality of the record. GitHub Label See bdqffdq:Amendment
bdqtag:CORE CORE Tests for evaluating biodiversity data quality as represented by the values of Darwin Core terms. CORE tests address identified user needs, are widely applicable, informative, unambiguous, well defined, and straight forward to implement. GitHub Label
bdqTestField:Darwin Core Class Darwin Core Class The Information Element in the original terms of the framework, the general sort of information this test operates on. TestField
bdqTestField:Data Quality Dimension Data Quality Dimension The data quality dimension for this test. See bbqffdq:DataQualityDimension. TestField
bdqTestField:Description Description A non-technical description of what the test does, intended for consumers of data quality reports in concert with the bdq:Response.comment. TestField
bdqtag:DO NOT IMPLEMENT DO NOT IMPLEMENT Tests that are not CORE (cf. bdqtag:CORE) and not recommended to be implemented with the current level of understanding for one or more reasons: Available vocabularies are ambiguous; the test is too complex to implement concisely; implementation is expected to lead to ambiguous or inaccurate results. GitHub Label
bdqTestField:Example Implementations (Mechanisms) Example Implementations (Mechanisms) Known Mechanisms with implementations of the test. TestField
bdqTestField:Examples Examples A ’pass’ and a ‘fail’ example of test data. All examples listed are present in the the validation data suite. TestField
bdqTestField:Expected Response Expected Response The specification for implementors describing the expected behavior of the test. See bdqffdq:Specification TestField = bdqffdq:Specification
bdqTestField:GUID GUID see bdq:GUID TestField
bdqtag:Immature/Incomplete Immature/Incomplete Tests where substantial work is needed to develop the specification to the point where the test can be reliably and usefully implemented. This may indicate work that is wholly internal to the test specification such as developing a consistent Expected Response, or may indicate that external work is needed to develop an agreed vocabulary for values of the tested term. An immature/incomplete test may be made CORE, Supplementary, or DO NOT IMPLEMENT when relevant criteria are satisfied.
bdqTestField:Information Elements Acted Upon Information Elements Acted Upon A list of the specific Darwin Core terms that are the focus of a test. TestField
bdqTestField:Information Elements Consulted Information Elements Consulted AA list of Darwin Core terms that are consulted in the evaluation of the Information Elements ActedUpon. TestField
bdqtag:ISO/DCMI STANDARD ISO/DCMI STANDARD A reference to either an ISO (International Organization for Standardization) Standard or a DCMI (Dublin Core Metadata Initiative) Standard GitHub Label
bdqtag:Issue Issue A label to indicate a test of type ISSUE, where potential problems are flagged and may need examination by the user to determine if data have quality for their use. GitHub Label see bdqffdq:Issue
bdqTestField:Label Label A human readable label identifying the test. The labels largely follow the pattern TYPE_INFORMATIONELEMENT_STATUS. TestField cf. rdfs:label
bdqTestField:Link to Specification Source Code Link to Specification Source Code A link to code that implements the test. TestField
bdqtag:Measure Measure A label to indicate a test of type MEASURE that performs a measurement according to some data quality dimension. GitHub Label See bdqffdq:Measure
bdqtag:NAME NAME A label to indicate that the test is related to Darwin Core terms in the dwc:Taxon Class. GitHub Label
bdqtag:NEEDS WORK NEEDS WORK A label that indicates that an issue (Test) requires more work before finalising. GitHub Label
bdqTestField:Notes Notes Additional, non-normative comments that the Task Group believed necessary for an accurate understanding of the test or issues that implementers needed to be aware of. TestField
bdqtag:OTHER OTHER A label to indicate that the test is related to Darwin Core terms other than Classes dwc:Taxon, dwc:Location or dwc:Event. GitHub Label
bdqtag:Parameterized Parameterized A label for a test that requires a bdq:Parameter to be set prior to a bdq:parameterizedTest being run. GitHub Label
bdqTestField:Parameter(s) Parameter(s) Any parameters that change the behavior of the test for a subset of users with special data quality needs within the domain. TestField
bdqTestField:References References A list of references pertinent to the test. TestField
bdqTestField:Source Source The origin of the concept of the test. TestField
bdqTestField:Source Authority Source Authority A reference to an external (non-Darwin Core) authority required for the test. See bdq:sourceAuthority TestField
bdqtag:SPACE SPACE A label to indicate that the test is related to Darwin Core terms in the dwc:Location Class. GitHub Label
bdqTestField:Specification Last Updated Specification Last Updated The last date a change was made to a test that affects the operation of the test. TestField
bdqtag:Supplementary Supplementary Tests regarded as not CORE (cf. bdqtag:CORE) because of one or more reasons: Not widely applicable; not clearly matched to an identified data quality need; not informative concerning the 'quality' or lack of quality of the data; likely to return a high percentage of either bdq:NOT_COMPLIANT or bdq:POTENTIAL_ISSUE records. A Supplementary test MAY be implemented in a local implementation if a suitable use case exists. GitHub Label A Supplementary test may be made CORE at a later time.
bdqTestField:Term-Actions Term-Actions Equivalent to the bdqTestField:Label without the leading Test Type. TestField
bdqtag:Test Test Tests descriptions created by TG2, either CORE, Immature/Incomplete, Supplementary, or DO NOT IMPLEMENT. GitHub Label
bdqTestField:Test Type Test Type The Type of assertion that the test produces, Measure, Validation, Amendment, Issue. TestField
bdqtag:TG1 TG1 Issues pertinent to Task Group 1 (Framework on Data Quality) of the TDWG Data Quality Interest Group. GitHub Label
bdqtag:TG2 TG2 Issues including Tests, developed by, or pertinent to Task Group 2 (Data Quality Tests and Assertions) of the TDWG Data Quality Interest Group. GitHub Label
bdqtag:TG3 TG3 Issues pertinent to Task Group 3 (Data Quality Use Cases) of the TDWG Data Quality Interest Group. GitHub Label
bdqtag:TG4 TG4 Issues pertinent to Task Group 4 (Best Practices for Development of Vocabularies of Value) of the TDWG Data Quality Interest Group. GitHub Label
bdqtag:TIME TIME A label to indicate that the test is related to Darwin Core terms in the dwc:Event Class. GitHub Label
bdqtag:Validation Validation A label to indicate a test of type VALIDATION that describes a run of a test for validity against a set of criteria. GitHub Label See bdqffdq:Validation
bdqtag:VOCABULARY VOCABULARY A label to indicate that a bdqlabel:Test requires a Vocabulary GitHub Label
@chicoreus
Copy link
Collaborator

Much (or all) of the vocabulary will come out of the framework as a technical specification, probably with additional supporting vocabularies (such as for values for data quality dimension). There is still a need to express the tests themselves as a formal specification (s.l.) and move this towards a TDWG standard.

@ArthurChapman
Copy link
Collaborator

ArthurChapman commented Aug 29, 2018

There may (probably) be terms associated with the Tests and Assertions beyond the Framework. The Framework's terms are broader than just Darwin Core - the TG2 tests need to define some terms that are outside the Framework (CORE is one that comes to mind. Whether it makes sense or not to expand the Framework terms to cover these is probably worth discussing.

@tucotuco
Copy link
Member

I propose that the framework should have a distinct vocabulary product consisting of the framework terms and their controlled vocabularies of values. I propose that there we a task group spawned specifically to create this. Tests and assertions rely on these for rigorous definition, so to me it has highest priority as a new vocabulary.

@chicoreus
Copy link
Collaborator

@tucotuco sounds like a deliverable from TG1.

@saraiva-usp
Copy link

Agree!

@ArthurChapman
Copy link
Collaborator

Are there terms that we may be using in TG2 that are outside of the Framework such that we need a separate vocabulary (but where most terms link to the Framework, or to Darwin Core)? I will go through the Tests and pull out a list of terms for which I think may need definitions.

@Tasilee
Copy link
Collaborator Author

Tasilee commented Aug 31, 2018

We do have terms in TG2 that are beyond the Framework. I'll work with @ArthurChapman to generate the list.

@ArthurChapman
Copy link
Collaborator

In the PSSR-CORE (Citizen Science) document from 2017 (https://www.wilsoncenter.org/sites/default/files/wilson_171204_meta_data_f2.pdf)

One of the tasks mentioned in that report is
1 Understand and develop a common vocabulary for discussing the range of data quality practices in citizen science.

Peter Brenton is going to send me the name of someone from that working group with whom we should liaise

@ArthurChapman
Copy link
Collaborator

@Tasilee @pzermoglio and others. What columns do we require in our draft dq vocabulary? Some initial suggestions

Term | Definition | Source | Reference | Link | GUID |

@chicoreus
Copy link
Collaborator

Let me suggest the list of columns found in the header column in this Audubon Core source document: https://github.com/tdwg/rs.tdwg.org/blob/master/audubon/audubon-column-mappings.csv

In particular:

label rdfs_comment dcterms_description term_localName term_isDefinedBy term_created term_modified
(=skos:preferedLabel (en)) (=skos_definition) (=dcterms:description) (with term_isDefinedBy forms the guid) (=dcterms:isPartOf) (=dcterms:created) (=dcterms:modified)

We should also check skos for appropriate skos terms for source, reference, and link.

@ArthurChapman
Copy link
Collaborator

Thanks @chicoreus - I will look at that. We have a few different processes - the DQ Vocabulary - and that will depend a lot on what @pzermoglio comes up with. TG1 - Vocabulary will form a major part of the Vocabulary. I am looking at extracting the terms from the tests and just want to make sure we capture what we need at this stage so that we can then add the terms to the main Vocabulary we develop and not have to revisit things later.

@Tasilee
Copy link
Collaborator Author

Tasilee commented Aug 31, 2018

Those columns of @ArthurChapman are 95% the table from my keynote :) and ah yes, there is SKOS

@tucotuco
Copy link
Member

tucotuco commented Sep 2, 2018

For Darwin Core, the full set of columns to manage the term definitions, usages, and examples is:

iri,label,definition,comments,examples,organized_in,issued,status,replaces,rdf_type,term_iri,abcd_equivalence,flags

@Tasilee
Copy link
Collaborator Author

Tasilee commented Sep 4, 2018

From @tucotuco: "To me it is clear that the Framework will result in a vocabulary that should be made into a standard. To me this is separate from a possible standard arising from the tests and assertions. These are two distinct products to me, with the latter relying on the former, thus increasing the priority of the former."

"Steve Baskauf clarified that the TDWG Standards Documentation Standard (SDS; https://www.tdwg.org/standards/sds/, in its single document, the "TDWG Standards Documentation Specification") describes how to create Data standards (for Vocabularies) as well as Best Current Practices documents. The Vocabularies of Values Best Current Practices Document must conform with that document, just as any vocabularies of values must also conform to the specifications set out in the SDS. The DQIG believes that a Vocabularies of Values Best Current Practices document is needed to provide more specific and common guidance on vocabularies of values construction and maintenance - for example, guidance on the type of vocabulary to use (Thesaurus, Vocabulary, Dictionary, Ontology, etc.), and how to deal with synonymy, multiple languages, etc."

I suggest then a rename of this issue to "TG2-" and tags. Alan, Miles...can create a separate issue. :)

@Tasilee Tasilee changed the title Develop a VOCABULARY that covers the terms used under TG1-3 TG2 - Develop a VOCABULARY that covers the terms used under TG2 Sep 4, 2018
@chicoreus
Copy link
Collaborator

But, the text for the CORE project here states that it is: "Links to Confirmed Tests and Assertions arising out of Task Group 2." If we expand CORE beyond this, we still have to define the new use cases, in detail, likely a several year task. Best route is to keep CORE to the research uses of what organisms occurred where when use case that came out of TG2, and specify an additional category of tests that aren't core. But if they are being put forward as part of the standard they will need to have at use cases to hang them off of.

@tucotuco
Copy link
Member

I would like to propose an alternative approach as none of us can afford years more of working on this before proposing a standard. I want to repeat my plea for simplicity and a decoupling of tests from use cases.

I see the set of tests as parallel to the Darwin Core bag of terms and and the use cases as parallel to the distinct Darwin Core Archive cores in terms of combining Darwin Core terms for a particular purpose. I think the tests should not depend on use cases. It should be the other way around. I think a use case should be a level of construct (a profile we called it before) that brings together a set of tests on a declared set of Darwin Core terms and that can declare data quality measures based on their values.

With this approach we could make the one occurrence use case based on the TG3 work as a model to show how that is done and let future work define new use cases as demand arises. These could be the stuff of task groups, and would be more tractable the less monolithic we make the standard. This is how I have thought about the BDQ work since the beginning, and ever more so now that the tribulations of coupled tests and use cases is creating a seemingly insurmountable obstacle.

This would leave us free of the otherwise somewhat arbitrary and controversial distinctions of CORE, SUPPLEMENTARY and DO NOT IMPLEMENT. The ones we finalize for the use case would become part of the standard set of tests. The rest just remain documented in GitHub (with their labels, also documented), but nowhere in the standard documentation. Having all of the rest in the standard just seems like noise to me. Put in the standard that which is mature and useful for a given purpose and leave the rest as a solid basis for future work if demand arises.

@ArthurChapman
Copy link
Collaborator

I 100% agree with @tucotuco. TG3 was a proof of concept and was never meant to be a comprehensive set of tests. I, like @tucotuco "This is how I have thought about the BDQ work since the beginning, and ever more so now that the tribulations of coupled tests and use cases is creating a seemingly insurmountable obstacle." I also have, until recently, seen the tests types as "somewhat arbitrary and controversial distinctions of CORE, SUPPLEMENTARY and DO NOT IMPLEMENT. The ones we finalize for the use case would become part of the standard set of tests." Originally treating these as basically a 1) a good test we can implement practically and that will be useful (CORE) 2). A tests that for the reasons stated in the definition are not ready to be Core tests and we only include them in the GitHub to not waste the work we've already done - not for the Standard but but may be mentioned (as a group, not individually) in the document for documentation purposes (SUPPLEMENTARY) 3). Tests that are close to being CORE but need more work because something is missing - e.g. a suitable Vocabulary (Immature/Incomplete) - note a couple may become CORE before we release the Standard if suitable Vocabularies become available and 4). Tests that for some reason we believe should not be implemented as doing so could lead to ambiguous or misleading results (DO NOT IMPLEMENT).

I know that I, and others, are getting at the limit of my capacity to continue with this work and want to see it finished - so let's keep it simple. The suggestions of @tucotuco I support and would vote to continue in that direction - not getting bogged down on things that will not go in the Standard - including both non CORE tests, and detailed Use Cases for each test. We all know Use Cases exist for the tests but to fully document them all now would take another two years of work at least (103 tests @ conservatively 2 days work per test = 206 days work!).

@chicoreus
Copy link
Collaborator

chicoreus commented Feb 28, 2024 via email

@ymgan
Copy link
Collaborator

ymgan commented Feb 28, 2024

It's interesting to see everyone's perspectives in this, I appreciate this discussion, thank you! I don't know enough of what TG3 did to comment on the use cases, but I would like to share my perspective when I was mapping the checks from OBIS with the tests here:

  • It was very useful for me when @Tasilee told us that the CORE tests should be the first sets of tests that we should look at. We felt reassured that tests labeled with CORE are well developed and should be mature enough for our work. To me (as a user) at least, this is more important:

Concept 2: CORE: The set of mature tests that TG2 is putting forward as part of the standard. This is the bit meant by "Darwin Core terms that are widely applicable, informative, and straight forward to implement"

  • At that time, there was no supplementary tests yet, so I can't comment on this.
  • Notes from DO NOT IMPLEMENT was extremely helpful - save us time so that we know that we do not need to consider those.
  • Adapting the parameters (source authority) of some of the CORE tests are sufficient for our marine biodiversity data use case (e.g. using WoRMS as taxonomic backbone for TG2-VALIDATION_SCIENTIFICNAME_FOUND #46) we did not need to look further.
  • All in all, I agreed with how @ArthurChapman look at the labels in this comment
  • I felt that having example use case could be helpful but eventually, which tests are useful for any use case will depend on what the use case is and what data (and in what form, verbatim or not) the user has in their database.

The use case that we did in the OBIS data quality project team:

  • We came with the question - Can we have a set of standardized tests that could help to evaluate the data quality for records with Darwin Core fields that are deemed important to marine biodiversity data (mostly the required terms and strongly recommended terms for OBIS)?
  • We treated the tests as a grab bag and mapped the checks of OBIS portal to the CORE tests last year: https://github.com/iobis/quality-taskteam/wiki/Mapping-of-checks-in-obis%E2%80%90qc-to-TDWG-BDQ-core-tests-and-assertions
  • Report of those checks: https://r.obis.org/quality/ although not all the tests have been updated with the parameters the team has decided. This is the work of OBIS Secretariat.
  • Data managers of various OBIS nodes are expected to update the data with corrections (amendments).

Another thing that I think MAY be helpful is to clarify what CORE is NOT. I used to think that CORE tests are the minimal set of tests that are needed to evaluate the fitness for use of a record regardless of the use case (basically minimal set of tests that overlap any biodiversity use case), but I believe that it is not the case? (please correct me if I am wrong) For example, the newly added tests for pathway #277, #278

I don't know if these are helpful, please ignore if it doesn't. Thank you all SO MUCH for your hard work, I know time does not come cheap - I am so thankful to have the opportunity to work with you all!

@ArthurChapman
Copy link
Collaborator

Thanks @ymgan - great points and very helpful. One thing that springs to mind from your comments is that we can't document all use cases - if we followed the suggestions of @chicoreus, we would be making a random selection of a use case that certainly would not cover all cases. We currently have Examples that imply a use case, we cite where the test originated (ALA, VertNet, etc.)

@chicoreus - as said before TG3 was never meant to be comprehensive, but an exemplar or proof of concept. I attended all the early meetings of TG3 - in setting it up, and most of the meetings and discussions. It was a proof-of-concept and looking at how Use Cases could be developed and from that came the use of User Stories. Part way through, it was decided to link to the Framework and several were tested in conjunction with @allankv. TG3 was not comprehensive and was never intended to be comprehensive, and the majority of the TG2 tests were never covered by TG3. TG2 from the start was looking at a good set of tests, based on DwC and that would be "Fundamental tests of biodiversity data represented in Darwin Core terms that are widely applicable, informative, and straight forward to implement." We looked t what had been done by ALA, GBIF, iDigBio, CRIA, BISON, VertNet and others. There was never an idea of linking it directly to the Use Cases that came out of TG3. We had most of the TG2 tests prepared long before TG3 started to get any results.

For now we should just accept CORE as : "The set of mature tests that TG2 is putting forward as part of the standard. This is the bit meant by "Darwin Core terms that are widely applicable, informative, and straight forward to implement"

Perhaps, in the Document - we can have a section on adding future tests, that includes a workflow that includes documenting a Use Case, determining if it was "widely applicable, informative, and straight forward to implement" then follow the existing template, develop tests for implementation, then test implementation, etc.

@Tasilee
Copy link
Collaborator Author

Tasilee commented Feb 29, 2024

Just back, briefly. I fully agree with @tucotuco and @ArthurChapman regards the circumscription of TG2 by TG3: Our tests are not bound by TG3 use cases. Our definition of core has been basically, as all have stated, "Tests that are widely applicable, informative, and straight forward to implement" with one exception: tests that we believe are 'aspirational' in encouraging a better best current practice (e.g., annotations).

I (strongly) believe that it is also informative that we define what is not CORE (out of scope of the standard) as it helps to clarify what is CORE, and document the environment to inform future uses. Thanks @ymgan for your comments. Our 'Supplementary', 'Immature/Incomplete' and 'Do not implement' are useful and are now adequately documented.

Like Arthur (as he well knows), I am also close to burnout on this work. We need to 'cut to the chase': Fill in gaps within the current CORE tests (e.g., test data - which I will do, and implementations) and get the standard document prepared.

@ArthurChapman
Copy link
Collaborator

Altered definitions of bdqtag: terms CORE, Supplementary, Immature/Incomplete, and DO NOT IMPLEMENT following recent discussions via email.

@ArthurChapman
Copy link
Collaborator

Added 5 new bdqffdq:UseCase terms for

  1. bdq:Spatial-Temporal Patterns
  2. bdq:Record-Management
  3. bdq:Taxon-Management
  4. bdq:Alien-Species
  5. bdq:Biotic-Relationships

@chicoreus
Copy link
Collaborator

Most of the bdq:Response contexts aren't correct, here are a set of corrections to be applied:

namespace:Term Context Context Should Be
bdq:ASSUMEDDEFAULT bdq:Response bdqTestField:Term-Actions
bdq:CONVERTED bdq:Response bdqTestField:Term-Actions
bdq:ExpectedResponse bdq:Response bdq:Specification
bdq:FOUND bdq:Response bdqTestField:Term-Actions
bdq:OUTOFRANGE bdq:Response bdqTestField:Term-Actions
bdq:PRECISIONINSECONDS bdq:Response bdqTestField:Term-Actions
bdq:PREREQUISITESNOTMET bdq:Response bdqTestField:Term-Actions
bdq:PROPOSED bdq:Response bdqTestField:Term-Actions
bdq:Response bdq:Response bdq:Response
bdq:Response.comment bdq:Response bdq:Response
bdq:Response.qualifier bdq:Response bdq:Response
bdq:Response.result bdq:Response bdq:Response
bdq:Response.status bdq:Response bdq:Response
bdq:STANDARD bdq:Response bdqTestField:Term-Actions
bdq:STANDARDIZED bdq:Response bdqTestField:Term-Actions
bdq:TERRESTRIALMARINE bdq:Response bdqTestField:Term-Actions
bdq:TRANSPOSED bdq:Response bdqTestField:Term-Actions
bdq:CONSISTENT bdq:Response.result bdqTestField:Term-Actions

chicoreus added a commit that referenced this issue Aug 17, 2024
…f 2024-08-17. Edits to python markdown generation scripts to load vocabularies and display description of use cases before use case indexes in markdown documents. Regenerating markdown documents.
@chicoreus
Copy link
Collaborator

I've made an export of the vocabulary markdown table into https://github.com/tdwg/bdq/blob/master/tg2/vocabularies/combined_vocabulary.csv to give us something more easily sorted to look for inconsistencies and problems, and to start setting up to add vocabulary terms into various markdown documents.

As a demonstration of linking in vocabulary terms, I've added the definitions of the use cases to the index by use case section of: https://github.com/tdwg/bdq/blob/master/tg2/core/generation/docs/core_tests.md

For the time being, the markdown table in this issue remains the authoritative copy for editing, and we expect to overwrite the csv export.

@ArthurChapman
Copy link
Collaborator

Following advice from @chicoreus, "Context" changed to bdqTestField:Term-Actions for the following terms

bdq:ASSUMEDDEFAULT
bdq:CONVERTED
bdq:FOUND
bdq:OUTOFRANGE
bdq:PRECISIONINSECONDS
bdq:PREREQUISITESNOTMET
bdq:PROPOSED
bdq:STANDARD
bdq:STANDARDIZED
bdq:TERRESTRIALMARINE
bdq:TRANSPOSED
bdq:CONSISTENT

and Context for
bdq:ExpectedResponse changed to bdq:Specification

@ArthurChapman
Copy link
Collaborator

ArthurChapman commented Aug 18, 2024

Added new term

| bdq:AllValidationTestsRunOnSingleRecord | AllValidationTestsRunOnSingleRecord | A list of Core Validation Tests that have been run on a Single Record. | bdqffdq:InformationElements | Used in Measure of Single Record Tests |

@ArthurChapman
Copy link
Collaborator

Added new term

| bdq:AllAmendmentTestsRunOnSingleRecord | AllAmendmentTestsRunOnSingleRecord | A list of Amendments that have been run on a Single Record. | bdqffdq:InformationElements | Used in Measure of Single Record Tests |

@ArthurChapman
Copy link
Collaborator

Added new term

| bdq:assumptionOnUnknownHabitat | assumptionOnUnknownHabitat | Used when a bdq:taxonomyIsMarine source authority is unable to assert the marine or non-marine status of a taxon, the habitat (Marine/NonMarine) to assume instead or NoAssumption. | bdq:Parameter | See VALIDATION_COORDINATES_TERRESTRIALMARINE (b9c184ce-a859-410c-9d12-71a338200380). |

@ArthurChapman
Copy link
Collaborator

Changed bdq:assumptionOnUnknownHabitat to bdq:assumptionOnUnknownBiome

chicoreus added a commit that referenced this issue Aug 22, 2024
…r markdown tables in issues not quite aligned with the standard exported from #152 as an informal glossary file, to go as tables in a supplemenal document on the rationale management process for test development.
@ymgan
Copy link
Collaborator

ymgan commented Aug 22, 2024

I saw

A MEASURE may return either a Response.result that is a numeric value, or the values COMPLETE or NOT_COMPLETE, or NOT_REPORTED (when the Response.status="INTERNAL_PREREQUISITES_NOTMET"). The principle Measure defined in the core tests is MEASURE_EVENTDATE_DURATIONINSECONDS, it returns a Response.result measuring the amount of time represented by the value in dwc:eventDate, and can be used in QualityAssurance under specific research data quality needs to identify Occurrences where the date observed or collected is known well enough for particular analytical needs (e.g. to at least one day for phenology studies, to at least one year for other purposes) that generally summarises the results of running the VALIDATIONs and AMENDMENTs and in one case provides an indication of the length of the period of the value of dwc:eventDate.

Is NOT_REPORTED being used by any MEASURE please? If so, it is not in the vocab

@ArthurChapman
Copy link
Collaborator

Yes @ymgan Currently only in one test #31. Also one other test that is DO NOT IMPLEMENT (#35)

@ymgan
Copy link
Collaborator

ymgan commented Aug 22, 2024

Thanks @ArthurChapman ! Then I guess we need a bdq:NOT_REPORTED, it is not in the table above

@ArthurChapman
Copy link
Collaborator

We are just taking that term out of the test (#31), because it does not make sense. So that can be deleted from the document.

@chicoreus
Copy link
Collaborator

chicoreus commented Aug 22, 2024 via email

@ArthurChapman
Copy link
Collaborator

The Vocabulary terms in this file have been split into other files - a bdqdim vocabulary, a bdqffdq vocabulary, a bdq:directory, and a glossary and these files are being generated as csv files and markdown for the final BDQ Core Standard. They are currently generated and are in the _review folder. As such this file is no longer maintained.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

9 participants