New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataset aspects [RDSAT] #60
Comments
I am wondering how 'general' these aspects are. Some of the ones mentioned in the issue seem to be only relevant to particular types of datasets. A lot of datasets are not the result of sensoring. As to dimensions etc., the European StatDCAT-AP (for description of statistical data), includes properties for dimensions, attributes and measures. |
I expect that QB DataStructureDefinition might provide some suitable predicates or patterns for this. Details of the internal structure and semantics are the essence of QB's DSD capability. |
Makx comment above that sensor data is not ubiquitous. The SSN Vocabulary anticipated that issue, and defines the term 'Observation' as
The definitions of Procedure and Sensor are similarly general:
This means that there would be no inconsistency in using some terms from the SSN/SOSA vocabulary to describe some dataset aspects even when classic physical sensors are not involved. In particular the following might be useful There are no general domain/range constraints associated with these properties, so no entailment risks AFAICT. These SSN/SOSA properties would likely match DCAT's goal of providing a 'basic framework for describing datasets' - useful for discovery, though well short of the details needed for actual use, which is the spot that QB aims for. |
I've begun an inventory of predicates from some well-known vocabularies that might be reusable, or at least might guide us here - see https://github.com/w3c/dxwg/wiki/Data-aspects---semantics |
Comment on behalf of Øystein Åsnes on "Fine-grained semantics for datasets and resources within a dataset [ID7]":
|
These are certainly useful, but they do not provide direct access to the dataset semantics. You would have to look inside another artefact (the schema), which you may or may not be able to process. My goal here is to enable some fine-grained semantics to be directly visible in the discovery metadata, particularly things like variables and target features. The predicates mentioned above might be seen as merely specializations of |
Just to note that https://www.w3.org/2007/05/powder-s#describedby This property has also been registered in the IANA Link Relations registry, together with its inverse |
As there has been no further discussion on this issue, I propose to close it. |
Noting no objections, I am closing this issue. |
Dataset aspects [RDSAT]
Provide recommendations and mechanisms for data providers to describe datasets with a formal description of aspects (e.g. instrument/sensor used, spatial feature, observable property, quantity kind).
Finer grained semantics will also allow dataset dimensions to be described, and distributions described using these semantics - for example how a dataset is composed of multiple subsets, such as a set of image bands or tiles, or parameterised filtering/subsetting services
This requirement applies to catalogues of DCAT records, and is thus related to the concept of profiles, which are expected to define classification dimensions (use of controlled vocabularies in mandatory properties)
Related requirements: Profiles listing [RPFL] Distribution schema [RDIS]
Related use cases: Support associating fine-grained semantics for datasets and resources within a dataset [ID7] Profile support for input functions [ID46] Europeana profile ecosystem: representing, publishing and consuming application profiles of the Europeana Data Model (EDM) [ID37] Summarization/Characterization of datasets [ID33]
The text was updated successfully, but these errors were encountered: