Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing controlled vocabularies for newly added properties #95

Closed
jakubklimek opened this issue Oct 15, 2019 · 7 comments
Closed

Missing controlled vocabularies for newly added properties #95

jakubklimek opened this issue Oct 15, 2019 · 7 comments

Comments

@jakubklimek
Copy link
Contributor

The newly added properties are missing specifications of controlled vocabularies in Section 5 Controlled Vocabularies in DCAT-AP 2.0.0.

  • dcat:compressFormat should use IANA types, the same as dcat:mediaType
  • dcat:packageFormat should use IANA type, the same as dcat:mediaType
@bertvannuffelen
Copy link
Contributor

proposed resolution: to be added to the specification

@init-dcat-ap-de
Copy link

dcatap:availability (thanks for adding) should have URIs too

image

@sandervd
Copy link

Would it make sense to define controlled vocabulary consisting of a subset of the mediaType vocabulary, with the appropriate concepts?

@jakubklimek
Copy link
Contributor Author

Would it make sense to define controlled vocabulary consisting of a subset of the mediaType vocabulary, with the appropriate concepts?

why a subset? If it was a subset of types that we deem appropriate as compression/package formats, then someone would have to manage the subset in case there are changes/additions. I think it is safer to go either with IANA MediaTypes (which are, however, missing e.g. tar - can this be added there?) or the existing file format controlled vocabulary.

@sandervd
Copy link

Hi @jakubklimek,

why a subset? If it was a subset of types that we deem appropriate as compression/package formats, then someone would have to manage the subset in case there are changes/additions.

Indeed, the list would need to be managed. My concern is the point you made in #100: 'Lots of freedom means low interoperability.' Specifically in this case, it would leave it up to each application to decide on what archive formats should be supported.

I think it is safer to go either with IANA MediaTypes (which are, however, missing e.g. tar - can this be added there?) or the existing file format controlled vocabulary.

I agree that it is less risk on the AP side, but it would be shifting the burden to the implementations?

@costezki
Copy link

Concerning mime types (file types)

A proposal would be to adopt a constraint validating whether the range value is a valid mime type or not. This constraint can be applied to the three properties concerned: dcat:mediaType, dcat:compressFormat, dcat:packageFormat. FileType controlled vocabulary/authority table published by the Publications Office (OP) on EU Vocabularies is a reasonable way to go about it (although the completeness of this list is questionable and I address it below).

In case such a constraint is considered too broad or permissive, then further partitions of the File Type controlled vocabulary may be envisaged (e.g. the group of compression formats, text formats, modelling formats, etc.) and applied accordingly. The currently adopted model for expressing OP controlled vocabularies supports provisions of (sub-)classifications and partitions.

The FileType published by the OP currently comprises of a reduced list if compared to that made available by IANA (although the most frequently used types are already covered). The current work on the DCAT-AP specification, nonetheless, is an excellent occasion for the OP to update the FileType table and extend its coverage. So, we would consider enriching this table anyway for the next publication in December.

@bertvannuffelen
Copy link
Contributor

resolution: since in 1.1 the PO filetype was replaced with the IANA codelist for dcat:mediaType it is logical to apply the same constraint for properties having the same range.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants