mixed node kind in cube data #1469

giacomociti · 2023-10-24T11:33:41Z

Describe the bug

Affected functionalities (all that apply)

CSV Mapping
Transformation
Publishing
Other

Relevant links

query returning an example of inconsistent data (a dimension with
constraint sh:nodeKind sh:IRI but having both IRI and literal values).

To Reproduce
Steps to reproduce the behavior:

Create a new cube from CSV
Apply transformation
Edit metadata linking to a shared dimension some values of a dimension (but not all of them)
Publish

Expected behavior

The constraint for the dimension should have sh:nodeKind sh:IRIOrLiteral instead of sh:IRI.

Screenshots

Desktop (please complete the following information):

OS: Windows 11
Browser: chrome

Additional context

There is a proposal of disallowing mixed node kinds. If applied, cube creator should prevent publishing invalid data.

The text was updated successfully, but these errors were encountered:

tpluscode · 2024-04-26T08:44:48Z

I the correct node kind set when you transform again after editing the metadata?

giacomociti · 2024-04-26T12:28:09Z

by "editing the metadata" you mean for example linking every value to some shared dimension term? Because the node kind is set by the pipeline (<#toCubeShape>) based on the actual observations (and does not cover sh:IRIOrLiteral unlike the new implementation in barnard59). So the only chance to avoid the issue is to ensure all the values for a dimension have the same node kind (either IRI or Literal)

Rdataflow · 2024-04-26T13:08:34Z

@giacomociti WRT the spec at https://cube.link/#null-empty-values
it would be the most obvious to reuse cube:Undefined in this case.

... so nodeKind would be consistently IRI 👍

giacomociti · 2024-04-26T13:30:29Z

agreed, we should be using cube:Undefined.

There is an external application which may be affected by the change, so we'll probably evolve both the cube and the app in the future.

Rdataflow · 2024-04-26T15:06:49Z

@giacomociti to increase the reusability of cube:Undefined it might be good to establish labels using schema:name and maybe some other useful properties.

first brainstorming:

cube:Undefined  schema:name  "Undefined" , "Undefined"@en , "Unbestimmt"@de , "Indéfini"@fr , "Indefinito"@it , "Indefini"@rm .

possibly other attributes i.e. WDYT?

cube:Undefined  schema:identifier  ""^^cube:Undefined ;
    schema:position  ""^^cube:Undefined .

giacomociti added the 🐛 bug Something isn't working label Oct 24, 2023

giacomociti assigned tpluscode Oct 24, 2023

giacomociti mentioned this issue Nov 29, 2023

Constraint builders zazuko/barnard59#226

Merged

tpluscode added this to the s5.9 milestone May 14, 2024

the-zazukoian bot linked a pull request May 23, 2024 that will close this issue

Merge to release #1517

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mixed node kind in cube data #1469

mixed node kind in cube data #1469

giacomociti commented Oct 24, 2023

tpluscode commented Apr 26, 2024

giacomociti commented Apr 26, 2024

Rdataflow commented Apr 26, 2024 •

edited

giacomociti commented Apr 26, 2024

Rdataflow commented Apr 26, 2024

mixed node kind in cube data #1469

mixed node kind in cube data #1469

Comments

giacomociti commented Oct 24, 2023

tpluscode commented Apr 26, 2024

giacomociti commented Apr 26, 2024

Rdataflow commented Apr 26, 2024 • edited

giacomociti commented Apr 26, 2024

Rdataflow commented Apr 26, 2024

Rdataflow commented Apr 26, 2024 •

edited