You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In many cases it might be useful if it was possible to state that a certain dataset is comprised a set of other datasets.
Consider DBpedia:
There are several files under http://downloads.dbpedia.org/2014/en, each of can be considered as a distribution of a dataset that corresponds to its own content.
Now we may want to state that the dataset that corresponds to the data in the official DBpedia endpoint is comprised of a set of datasets that correspond to the individual files that were loaded into the endpoint.
If I understood correctly, currently this would be accomplished by using void:subset.
However, this would not allow to state multiple enumerations of datasets that make up a specific larger dataset.
Instead, I propose to introduce a DatasetEnumeration:
Such an enumeration explicitly states that a certain targetDataset is exactly comprised of a set of given source datasets (i.e. the triples referred to by the merge of the source dataset must be equal to that of the target dataset).
This would also allow stating multiple enumerations for a single dataset, such as a mixture of official enumerations and custom ones (such as pre-merged files (e.g. types, geo, properties + labels) of DBpedia hosted on a third party server).
Also, metadata can then be attached to enumerations themselves, such as who created them when.
:foo
a dataid:DatasetEnumeration ;
dcterms:modified "someDate"^^xsd:dateTime ;
dcterms:contributer :user1235 ;
skos:theme :official ; // Tag the enumeration as an official one so we can filter by that tag
rdfs:comment "Datasets included in the official DBpedia"@en ;
dataid:targetDataset :official-dbpedia-sparql-endpoint ;
dataid:sourceDatasets (
:dbpedia-types-2014,
:dbpedia-mapped-properties-2014,
:dbpedia-labels-en-2014,
...
) # Don't actually use blank nodes - it is just written like this for convenience.
.
The text was updated successfully, but these errors were encountered:
In many cases it might be useful if it was possible to state that a certain dataset is comprised a set of other datasets.
Consider DBpedia:
There are several files under http://downloads.dbpedia.org/2014/en, each of can be considered as a distribution of a dataset that corresponds to its own content.
Now we may want to state that the dataset that corresponds to the data in the official DBpedia endpoint is comprised of a set of datasets that correspond to the individual files that were loaded into the endpoint.
If I understood correctly, currently this would be accomplished by using void:subset.
However, this would not allow to state multiple enumerations of datasets that make up a specific larger dataset.
Instead, I propose to introduce a DatasetEnumeration:
Such an enumeration explicitly states that a certain targetDataset is exactly comprised of a set of given source datasets (i.e. the triples referred to by the merge of the source dataset must be equal to that of the target dataset).
This would also allow stating multiple enumerations for a single dataset, such as a mixture of official enumerations and custom ones (such as pre-merged files (e.g. types, geo, properties + labels) of DBpedia hosted on a third party server).
Also, metadata can then be attached to enumerations themselves, such as who created them when.
The text was updated successfully, but these errors were encountered: