Add test for lexical space of xsd:decimal #157

ajnelson-nist · 2022-09-20T15:35:19Z

This PR is filed to help diagnose an oddity in SHACL validation: some literals typed as xsd:decimal are not being recognized as xsd:decimal. The behavior arose with the release of pySHACL 0.20.0. (A cross-reference will come momentarily; I needed something to track in the pySHACL repo.)

Before merging this PR:

Resolve runtime error from OWL-RL inferencing.
Discuss if another test should be added to handle JSON-LD default behaviors.

This patch adds a unit test for `xsd:decimal` values, both in PASS and XFAIL cases. There is one issue apparent, left as a TODO in the last test. Signed-off-by: Alex Nelson <alexander.nelson@nist.gov>
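For readers unfamiliar with the term, "lexical space" here means the set of strings that are valid spellings of an `xsd:decimal`. The following is a minimal standalone sketch of such a check, not the test added by this patch. It assumes the pattern given for `xsd:decimal` in XSD 1.1 Part 2, skips whitespace normalization, and uses a hypothetical helper name and sample values chosen for illustration.

```python
import re

# Lexical-space pattern for xsd:decimal (XSD 1.1 Part 2): an optional sign,
# then digits with an optional fraction part, or a bare fraction part.
# Assumption: the input has already had surrounding whitespace collapsed.
XSD_DECIMAL_LEXICAL = re.compile(r"(\+|-)?([0-9]+(\.[0-9]*)?|\.[0-9]+)")


def in_decimal_lexical_space(lexical_form: str) -> bool:
    """Hypothetical helper: True if the whole string is a valid decimal spelling."""
    return XSD_DECIMAL_LEXICAL.fullmatch(lexical_form) is not None


# Illustrative values only; these are not the cases used in the PR's test.
expected_pass = ["48.860346", "-1.", ".5", "+3"]
expected_fail = ["1e3", "NaN", "4.8.6", ""]

results = {
    form: in_decimal_lexical_space(form)
    for form in expected_pass + expected_fail
}
```

Exponent notation such as `1e3` belongs to `xsd:double`, not `xsd:decimal`, which is why it falls in the failing group.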

With the current import of OWL-RL, 6.0.2, this raises a runtime error. Signed-off-by: Alex Nelson <alexander.nelson@nist.gov>

pySHACL 0.20.0, recently released, includes support for incorporating ill-typedness of literals in review of SHACL Datatype Constraints. For unknown reasons, this is now causing some `xsd:decimal` literals to be flagged as non-conformant. This is being discussed further in pySHACL PR 157. References: * RDFLib/pySHACL#157 Signed-off-by: Alex Nelson <alexander.nelson@nist.gov>

ajnelson-nist · 2022-09-20T16:02:02Z

The instigator for this PR was seeing new ValidationResults arise for xsd:decimal values in this patch. From the SHACL validation output, I could not tell what was going on.

When I looked at the JSON-LD being validated, I realized that there is a possibly undefined behavior in the JSON-LD specification. The following value was being interpreted by RDFLib as an xsd:double long enough to trigger a ValidationResult, but by the time the pySHACL report-graph was being generated, it was being interpreted as an xsd:decimal:

{
    "@type": "xsd:decimal",
    "@value": 48.860346
}

The patch preventing the default behavior (with spec. citation) is here, and the follow-on patch demonstrating the validation results are undone is the third in this PR.

@nicholascar - is this something that needs to be fixed or clarified in RDFLib's JSON-LD parsing code?

I'm hesitant to augment this xsd:decimal test with a JSON-LD "default behaviors" test until I understand whether this is truly an undefined-behaviors corner-case of the specification.

ashleysommer · 2022-09-27T00:28:34Z

Hi @ajnelson-nist
In JSON-LD, the lexical value for a Decimal literal must be enclosed in quotation marks.

This structure is ill-typed:

{
    "@type": "xsd:decimal",
    "@value": 48.860346
}

All non-integer numbers in JSON are interpreted by Python as a Float. So when the value is read by RDFLib, the discrepancy between the @type: xsd:decimal and the @value being a Float causes the literal to be flagged as ill-typed.
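The float conversion can be seen with the standard library alone, before any RDF tooling is involved. This is a small sketch using only `json` and `decimal`; it does not call RDFLib or pySHACL, and the variable names are illustrative.

```python
import json
from decimal import Decimal

# Unquoted value, as in the structure above.
unquoted = json.loads('{"@type": "xsd:decimal", "@value": 48.860346}')

# Quoted value: the lexical form is kept as a string.
quoted = json.loads('{"@type": "xsd:decimal", "@value": "48.860346"}')

# The JSON number has already become a Python float at this point,
# so a downstream consumer never sees the original decimal spelling.
unquoted_is_float = isinstance(unquoted["@value"], float)

# The quoted form survives parsing untouched and converts exactly.
quoted_is_str = isinstance(quoted["@value"], str)
exact = Decimal(quoted["@value"])
```

With the quoted form, the string reaches the consumer unchanged, so it can be checked against the `xsd:decimal` lexical space directly.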

PySHACL v0.20.0 introduced the feature to check for ill-typed literals, so that is why these errors are now seen when upgrading to that version.

EDIT: I just noticed you have discovered that already, and documented it in this fix here.

To answer your question, the behaviour of RDFLib in this case is correct. RDFLib v6.2.0 has exactly the same behaviour as previous RDFLib versions, except that it now also flags these discrepancies with the Ill-Typed flag, for greater visibility.

ajnelson-nist · 2022-09-27T13:51:56Z

@ashleysommer Thank you, especially for the part I'd missed, that Python's JSON parser was causing the initial conversion to Float. I'll add a JSON-LD snippet to the test to demonstrate this issue.

Referencing the confusing-looking SHACL validation results again---these were the SHACL validation results from the ill-typed JSON-LD---it looks like this may be a nefarious data issue to explain to users. The Turtle serializes a text snippet that looks, and is, properly typed. I'm guessing it's not possible (or at least not a good idea) to "carry forward" the original ill-typed data into Turtle. Is there anything RDFLib or pySHACL can do to flag this ill typing? I'm guessing flagging would have to happen in the JSON-LD parser.
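One possible way to surface this to users before parsing is a pre-check on the raw JSON-LD. The sketch below is an assumption-laden illustration, not an RDFLib or pySHACL feature: the helper `find_unquoted_decimals` is hypothetical, the `ex:` property names and the second coordinate are made-up sample data, and it only inspects explicit value objects (it does not resolve `@context` type coercion or other prefixes).

```python
import json
from typing import Any, Iterator

# Assumption: decimals are typed with the compact "xsd:decimal" form
# or with the full XML Schema IRI.
DECIMAL_TYPES = {"xsd:decimal", "http://www.w3.org/2001/XMLSchema#decimal"}


def find_unquoted_decimals(node: Any, path: str = "$") -> Iterator[str]:
    """Yield paths of decimal-typed value objects whose @value is not a string."""
    if isinstance(node, dict):
        node_type = node.get("@type")
        if isinstance(node_type, str) and node_type in DECIMAL_TYPES:
            if "@value" in node and not isinstance(node["@value"], str):
                yield path
        # Recurse into every member to reach nested value objects.
        for key, child in node.items():
            yield from find_unquoted_decimals(child, f"{path}.{key}")
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from find_unquoted_decimals(child, f"{path}[{index}]")


# Made-up sample document: one unquoted and one quoted decimal.
doc = json.loads("""
{
    "@id": "ex:place",
    "ex:lat": {"@type": "xsd:decimal", "@value": 48.860346},
    "ex:long": {"@type": "xsd:decimal", "@value": "2.337644"}
}
""")

flagged = list(find_unquoted_decimals(doc))
```

Running the check on the source file, rather than on the parsed graph, keeps the report tied to the original ill-typed spelling that is no longer visible once the data is serialized to Turtle.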

ashleysommer · 2024-10-27T01:11:25Z

@ajnelson-nist This has been open a while, and I forget where we got up to. Can it be closed, or does this test need to be revisited?

ajnelson-nist added 2 commits September 20, 2022 11:31

Add test for lexical space of xsd:decimal

b762d5a

Enable OWL-RL inferencing for bad decimal lexical space values

0f7cb83

ashleysommer force-pushed the master branch from 0227d83 to 6323931 Compare March 31, 2023 13:14

ashleysommer force-pushed the master branch from 98d4044 to 5e93bd4 Compare October 11, 2024 00:39
