Review and update vocabulary terms for "dataQualityAssuranceMethod" (DMM) #32

Open
pbrenton opened this issue Aug 8, 2020 · 0 comments
Labels: Dataset Extension Model, help wanted, Vocabularies
pbrenton commented Aug 8, 2020

Vocabularies need to be accurately described and should conform to best-practice criteria for vocabulary definitions. These include the following:

  • The categorical terms list should be as comprehensive as possible (i.e. it should include terms for all possible cases of a particular property). This minimises edge cases and hence the need for an "other" option, which can become an escape for cases that could and should reasonably be placed into a provided category;
  • Terms must be mutually exclusive (i.e. a given case should be interpretable as belonging to exactly one term within a property, never to several terms at once);
  • Terms should be explicit in interpreted meaning, but concise in categorical form;
  • Terms should be expressible in multiple languages while still complying with the above rules;
  • Each term should be clearly defined, particularly with respect to the boundary conditions between it and its neighbouring terms. This ensures that cases close to a boundary can be correctly interpreted and assigned the most appropriate category;
  • Each vocabulary list should be versioned in its own right and appropriately referenced in usage.

The DMM vocabulary "dataQualityAssuranceMethod" requires the following actions to be undertaken:

  1. Review and update the terms list in accordance with the above rules;
  2. Define each term.

The current terms list for dataQualityAssuranceMethod is:

  • Data owner curated
  • Subject matter expert record verification
  • Crowd-sourced record verification
  • Record annotation
  • System supported data attribute configuration
  • No DQ methods used
  • Not applicable
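
For anyone picking this up, here is a minimal sketch of how the current list could be held as a versioned vocabulary with a simple review check. The names `Term`, `Vocabulary` and `review`, and the version string `0.1-draft`, are hypothetical and not part of the DMM. The check only flags duplicate labels and missing definitions; comprehensiveness and mutual exclusivity still need human review.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Term:
    label: str
    definition: str = ""  # left empty until the term has been defined


@dataclass(frozen=True)
class Vocabulary:
    name: str
    version: str  # placeholder; the issue does not prescribe a versioning scheme
    terms: tuple


def review(vocab):
    """Return a list of problems: duplicate labels and undefined terms."""
    problems = []
    seen = set()
    for term in vocab.terms:
        key = term.label.strip().lower()  # compare labels case-insensitively
        if key in seen:
            problems.append(f"duplicate term: {term.label}")
        seen.add(key)
        if not term.definition.strip():
            problems.append(f"missing definition: {term.label}")
    return problems


# Current terms list from this issue; no definitions exist yet.
DQ_METHODS = Vocabulary(
    name="dataQualityAssuranceMethod",
    version="0.1-draft",  # hypothetical version label
    terms=tuple(Term(label) for label in (
        "Data owner curated",
        "Subject matter expert record verification",
        "Crowd-sourced record verification",
        "Record annotation",
        "System supported data attribute configuration",
        "No DQ methods used",
        "Not applicable",
    )),
)

if __name__ == "__main__":
    for problem in review(DQ_METHODS):
        print(problem)
```

Run as-is, this reports a missing definition for each of the seven terms, which matches action 2 above.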

pbrenton added this to To do in PPSR-Core Aug 8, 2020
pbrenton added the Dataset Extension Model, help wanted, and Vocabularies labels Aug 8, 2020