Problem Statement
The VRS standard submitted a proposal to the GA4GH Steering Committee to approve the use of the ga4gh namespace for VRS computed identifier CURIEs. This was approved by the Steering Committee, with the understanding that a governance mechanism would be established for object identifiers under this namespace once an appropriate technical committee was established (TASC).
VRS is preparing to release new data type prefixes under the ga4gh namespace as part of its upcoming minor version releases. We intend to move forward with these as scheduled until a formal mechanism for registering / requesting identifier prefixes is created.
Impact of alignment between standards
Due to the organization-level namespacing, any CURIEs generated under this namespace should have a consistent meaning across products. If two standards (e.g. refget and VRS) develop identifiers with similar prefixes or inconsistent structures under the ga4gh namespace, interoperability between resources implementing multiple GA4GH products expecting ga4gh CURIEs will fail.
Background research and landscape analysis
We have worked to create an identifier scheme that has been reviewed and approved by unanimous vote of the GA4GH Steering Committee.
We have established several type prefixes already, with more that will be generated over the next year as we iteratively expand VRS.
This has direct relevance to the DRS, VRS, and refget standards, and likely others down the road.
Proposed solution
The VRS Project Maintainers are granted authority to build additional type prefixes under the ga4gh namespace without external review, so long as those identifiers begin with V, (e.g. VS for VariationSet, VA for Allele, VH for Haplotype). Data type prefixes outside of this scope must be reviewed and approved by TASC before implementation. Similar project-level subdomains could be approved and allocated by TASC for other standards as needed.
In addition, a TASC-maintained public registry of type prefixes (approved, reserved, and pending) should be provided for quick reference by product developers and implementers.
Problem Statement
The VRS standard submitted a proposal to the GA4GH Steering Committee to approve the use of the
ga4ghnamespace for VRS computed identifier CURIEs. This was approved by the Steering Committee, with the understanding that a governance mechanism would be established for object identifiers under this namespace once an appropriate technical committee was established (TASC).VRS is preparing to release new data type prefixes under the
ga4ghnamespace as part of its upcoming minor version releases. We intend to move forward with these as scheduled until a formal mechanism for registering / requesting identifier prefixes is created.Impact of alignment between standards
Due to the organization-level namespacing, any CURIEs generated under this namespace should have a consistent meaning across products. If two standards (e.g. refget and VRS) develop identifiers with similar prefixes or inconsistent structures under the
ga4ghnamespace, interoperability between resources implementing multiple GA4GH products expectingga4ghCURIEs will fail.Background research and landscape analysis
We have worked to create an identifier scheme that has been reviewed and approved by unanimous vote of the GA4GH Steering Committee.
We have established several type prefixes already, with more that will be generated over the next year as we iteratively expand VRS.
This has direct relevance to the DRS, VRS, and refget standards, and likely others down the road.
Proposed solution
The VRS Project Maintainers are granted authority to build additional type prefixes under the
ga4ghnamespace without external review, so long as those identifiers begin withV, (e.g.VSfor VariationSet,VAfor Allele,VHfor Haplotype). Data type prefixes outside of this scope must be reviewed and approved by TASC before implementation. Similar project-level subdomains could be approved and allocated by TASC for other standards as needed.In addition, a TASC-maintained public registry of type prefixes (approved, reserved, and pending) should be provided for quick reference by product developers and implementers.