New json ld context generator #36

timothee-haudebourg · 2022-09-02T12:14:43Z

The current LD context generator only generates type-scoped contexts with the assumption that incoming LD documents will always advertise the type of every node using a @type property. For instance, take the following TreeLDR schema:

base <https://example.com>;

type Foo {
  bar: Bar
}

type Bar {
  foo: Foo
}

using the command tldrc -i example.tldr json-ld-context https://example.com/Foo https://example.com/Bar, the following LD context is generated:

{
  "Foo": {
    "@id": "https://example.com/Foo",
    "bar": "https://example.com/Bar"
  }
  "Bar": {
    "@id": "https://example.com/Bar",
    "foo": "https://example.com/Foo"
  }
}

This is correct for an input LD document such as this:

{
  "@type": "Foo",
  "bar": {
    "@type": "Bar",
    "foo": {}
  }
}

Note how each node contains a @type entry, which is not specified in the original TreeLDR schema. So we want to be able to handle documents where no type is specified, such as

{
  "bar": {
    "foo": {}
  }
}

However it is not possible to specify the type of all the nodes using the JSON-LD context (specifying a @type entry inside the context only apply for value objects).

The purpose of the PR is to create a new LD context generator algorithm without this limitation. Type scoped contexts are useful to avoid conflicts and ambiguities between term definitions, so they should be generated whenever possible. Otherwise, terms should be defined globally, or inside property scoped contexts to avoid clashes whenever possible (if there is no cycle).

For the above example, the following context should be generated:

{
  "bar": "https://example.com/Foo/bar",
  "foo": "https://example.com/Bar/foo"
}

Ambiguous terms

Without type scoped contexts, ambiguities can arise when two layouts define fields with the same name.

type Foo {
  bar: Bar,
  prop: A
}

type Bar {
  foo: Foo,
  prop: B
}

Here there is an ambiguity on the prop term definition. These ambiguities should be detected by the LD context generator.

To solve this ambiguity, we could define some "main" layout (the expected layout of the input documents) and some included secondary layouts. Then the prop term of the main layout is defined globally while the prop term of the secondary layout is defined in a property-scoped context.

Type scoped contexts

Type scoped contexts can still be generated at the condition that the TreeLDR layout explicitly contains a required field holding its type.

type Foo {
  required rdf:type as myType,
  bar: Bar
}

Then the following context can be generated:

{
  "myType": "@type",
  "Foo": {
    "bar": "https://example.com/Foo/bar"
  }
}

Note: we should allow @type to be a valid field name so we can write:

type Foo {
  required rdf:type as @type,
  bar: Bar
}

which would generate the following context without @type alias:

{
  "Foo": {
    "bar": "https://example.com/Foo/bar"
  }
}

External contexts

Sometimes contexts are loaded alongside other contexts. For now, the context generator assumes the generated context will be the only one loader and should include all the term definitions. Instead, it would be nice to specify a list of contexts that will be loaded before the generated one. The generator can then omit the definitions already present in the input contexts.

Implementation status

json-ld refactor
Simple implementation without caring for ambiguities
Ambiguities detection
Ambiguities resolution using primary/secondary layouts and property scoped contexts
Type scoped contexts
~~External contexts~~

timothee-haudebourg · 2022-09-05T11:22:59Z

I discuss here some current limitations for the generation of type scoped contexts. Consider the following TreeLDR document:

base <https://example.com>;
use <http://www.w3.org/1999/02/22-rdf-syntax-ns#> as rdf;
use <http://www.w3.org/2000/01/rdf-schema#> as rdfs;

type Foo {
	bar: Bar,
	rdf:type: required &rdfs:Class
}

type Bar {
	foo: Foo,
	rdf:type: required &rdfs:Class
}

We want to generate the following JSON-LD context, with two type scoped contexts:

{
  "type": "@type",
  "Foo": {
    "@id": "https://example.com/Foo",
    "bar": "https://example.com/Foo/bar"
  },
  "Bar": {
    "@id": "https://example.com/Bar",
    "foo": "https://example.com/Bar/foo"
  }
}

This should be doable with the command:

tldrc -i example.tldr json-ld-context https://example.com/Foo https://example.com/Bar

Anonymous layouts cause ambiguous term definitions

In this example, each type define a type field for the rdf:type property with the same layout required &rdfs:Class. However because this layout is defined inline, it is anonymous and is given a blank node identifier. One blank node identifier for each occurrence, which means that the two fields in fact refer to two different layouts with different blank node identifiers. This causes the LD context generator to detect an ambiguity for the type term.

This can be solved by generating a unique blank node identifier for structurally equivalent anonymous layouts. This is already the case for references.

Type scoped context term name

In this example, we expect the type scoped context to be defined with the term Foo and Bar, extracted from the layout names. However this is completely inconsistent with the semantics of TreeLDR. The layout of the type field is required &rdfs:Class. The current semantics of TreeLDR dictates that the expected value for this field is hence an IRI, not Foo nor Bar.

One solution to this problem would be to allow the definition of custom reference layouts. One could specify what values the reference can take, and how it maps to actual IRIs. For instance with an enumeration layout (not yet implemented):

layout MyReference [ "Foo" = Foo, "Bar" = Bar ];

This states that a value of the layout MyReference is either the string Foo referring to https://example.com/Foo or Bar referring to https://example.com/Bar. Custom references are a power tool that can be useful outside the scope of simply generating type scoped contexts.

timothee-haudebourg · 2022-09-05T12:05:20Z

Until custom references are implemented, the generation of type scoped contexts is enabled only with the --rdf-type-to-layout-name option that explicitly asks for the value of the rdf:type property to be interpreted as the layout name. Once they are implemented, the option will be deprecated in favor of custom layouts.

tldrc -i example.tldr json-ld-context --rdf-type-to-layout-name https://example.com/Foo https://example.com/Bar

timothee-haudebourg · 2022-09-05T12:21:56Z

External contexts implementation is deferred (I'll probably need to harmonize the way vocabulary is handled before I can do that).

Timothée Haudebourg added 4 commits September 2, 2022 13:40

Generate LD context without handling ambiguities.

ead8464

Fix missing ; in example.

01ef8e6

Detect LD context term ambiguities.

ca7ec48

Generate property-scoped contexts.

43276a7

timothee-haudebourg linked an issue Sep 5, 2022 that may be closed by this pull request

Rewrite the JSON-LD context generator with new json-ld library #33

Closed

LD type scoped context generation.

fe16159

timothee-haudebourg merged commit 7bca43e into main Sep 5, 2022

timothee-haudebourg deleted the new-json-ld-context-generator branch September 5, 2022 14:30

timothee-haudebourg mentioned this pull request Oct 24, 2022

Do not generate type scoped context when no rdf:type property is in the layout. #26

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New json ld context generator #36

New json ld context generator #36

timothee-haudebourg commented Sep 2, 2022 •

edited

timothee-haudebourg commented Sep 5, 2022

timothee-haudebourg commented Sep 5, 2022

timothee-haudebourg commented Sep 5, 2022

New json ld context generator #36

New json ld context generator #36

Conversation

timothee-haudebourg commented Sep 2, 2022 • edited

Ambiguous terms

Type scoped contexts

External contexts

Implementation status

timothee-haudebourg commented Sep 5, 2022

Anonymous layouts cause ambiguous term definitions

Type scoped context term name

timothee-haudebourg commented Sep 5, 2022

timothee-haudebourg commented Sep 5, 2022

timothee-haudebourg commented Sep 2, 2022 •

edited