Clarify confusion around type of context element in ConjunctiveGraphs and context aware stores #167

ghost · 2012-02-20T08:34:15Z

gromgull, 2011-08-20T07:58:07.000Z

In ConjunctiveGraph, the context is assumed to be a Graph whose identifier is set to the URI of the context. This is what happens if you create contexts like this:

from rdflib import ConjunctiveGraph, URIRef

g = ConjunctiveGraph()
uri1 = URIRef("http://example.org/mygraph1")
uri2 = URIRef("http://example.org/mygraph2")

bob = URIRef(u'urn:bob')
likes = URIRef(u'urn:likes')
pizza = URIRef(u'urn:pizza')

# Each get_context() call returns a Graph whose identifier is the given URI.
g.get_context(uri1).add((bob, likes, pizza))
g.get_context(uri2).add((bob, likes, pizza))

Now g.contexts() returns a generator over some graphs.

However, for code working at the store level, such as serializers and parsers, perhaps there should not be any Graph objects at all?

I came across this when looking at the nquad parser, here:

https://github.com/RDFLib/rdflib/blob/master/rdflib/plugins/parsers/nquads.py#L106

This adds the context as a plain URIRef.

I added a "work-around" to make the ConjunctiveGraph.contexts generator work "correctly" here:

https://github.com/RDFLib/rdflib/blob/master/rdflib/graph.py#L1075

Is this ok?

gromgull · 2013-03-11T13:35:13Z

I've semi-fixed this in the last two commits.

If you work with the ConjunctiveGraph class directly:

g = ConjunctiveGraph()
g.get_context('urn:1').add( ( s, p, o ) )

You end up with a Graph object in the store, with its identifier set to the URIRef "urn:1".
I don't know whether saving the whole graph+store in the store is the best way to do it; saving just the identifier seems much cleaner.
And I can now mix and match stores in one ConjunctiveGraph, although I am not even sure what this would mean:

>>> g1=ConjunctiveGraph()
>>> g2=ConjunctiveGraph()
>>> g1.get_context('urn:a').add( (s, p, o) )
>>> g2.addN([(x, y, z, g1.get_context('urn:a'))])
>>> g2.store != g2.get_context('urn:a').store 
True

Hmm ...

In [43]: list(g1.contexts())[0]
Out[43]: <Graph identifier=urn:a (<class 'rdflib.graph.Graph'>)>

In [44]: list(list(g1.contexts())[0])
Out[44]: 
[(rdflib.term.BNode('Nae2ff5ac0056427e852347e0a58ff925'),
  rdflib.term.BNode('N7c8cbd571a51409e8a92c2870a30eddd'),
  rdflib.term.BNode('Nbc486ecd9a5b46d5913dafdc4458fc3f'))]

In [45]: list(g1.get_context('urn:a'))
Out[45]: 
[(rdflib.term.URIRef(u'a'),
  rdflib.term.URIRef(u'b'),
  rdflib.term.URIRef(u'c'))]

This is no good :)

gromgull · 2013-03-11T13:40:24Z

Something else: whatever we decide, the store.add method should check the type of the context parameter and make sure it is either a Graph or an Identifier, so that we avoid very subtle and odd bugs.

Raise exception when trying to rebind a prefix to another ns. Fix broken rebinding when generating prefixes. This fixes #679, but actually it's more like a work-around. The underlying problem is confusion about context and graph objects (#167).

kvjrhall · 2021-07-04T16:20:49Z

Note that this is also observed when using Dataset instances. In particular,

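from rdflib import Dataset
from rdflib.plugins.stores.sparqlstore import SPARQLUpdateStore

ds = Dataset()
quads = ds.quads((None, None, None, None))   # Fourth term is an identifier

store = SPARQLUpdateStore()
store.addN(quads)  # Expects the fourth term to be a Graph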

This seems to have been intended by #409, with the goal of reducing confusion for end users. My personal experience was a large amount of confusion, which required me to review the code of both modules to identify what was going wrong.

As a semantic web developer I've been trained to always interpret a triple or quad as a tuple of graph nodes. Ideally, I'd hope that the interface matches the expectations of an RDF Dataset as closely as possible and behaves as a "set of quads", much the way that a graph can be treated as simply a "set of triples".

The documentation for the referenced methods, their argument names, and their method names are all ambiguous and add to the confusion. If a developer has started working with one of the APIs first (i.e., they are exposed to Dataset) then the first API has "deceived" them by setting their expectations incorrectly.

I'd recommend that usage of Graph instances as contexts be phased out for API consistency. In the interim, I'd suggest that the arguments and documentation for those methods be updated to clarify how they differ from other APIs in the system. For example, if migrating away from using graphs as contexts, arguments named quad or quads that expect graph contexts could be renamed deprecated_quad / deprecated_quads / etc. This would serve as a slap-in-the-face notification to new developers that there's something up. I'm sure there are more elegant ways to document this than what I'm suggesting.

ghost assigned gromgull Feb 20, 2012

gromgull mentioned this issue Apr 16, 2012

ConjunctiveStore.add() accepts values that make serialize() fail badly #200

Closed

gromgull mentioned this issue Sep 7, 2012

Graphs don't serialize new content correctly #228

Closed

gromgull added a commit that referenced this issue Mar 11, 2013

make sure graphs have nodes as identifiers, related #167

8127854

gromgull added a commit that referenced this issue Mar 11, 2013

fix nquads parser graph/uriref as context confusion, related #167

ee9ecf2

gromgull mentioned this issue Jun 19, 2013

Trig serializer output no triples #299

Closed

gromgull mentioned this issue Jul 11, 2014

Move Store API to work with identifiers, not graphs #409

Closed

gromgull added this to the rdflib 5.0.0 milestone Jan 24, 2017

gromgull mentioned this issue Jan 24, 2017

Dataset.graph should not allow adding random graphs to the store #698

Open

gromgull mentioned this issue Oct 9, 2018

fix return of the context of a given identifier #845

Open

oohlaf mentioned this issue Feb 11, 2020

Move Store API to work with identifiers, not graphs #958

Closed

white-gecko modified the milestones: rdflib 5.0.0, rdflib 5.1.0 Apr 6, 2020

white-gecko modified the milestones: rdflib 5.1.0, rdflib 6.0.0 May 1, 2020

mwatts15 added a commit to openworm/owmeta-core that referenced this issue Oct 26, 2021

Fixing the ContextSubsetStore._determineContext

c420d90

- Once again adjusting for RDFLib/rdflib#167 not being resolved

ghost mentioned this issue Dec 14, 2021

Identifier as context #1505

Closed

ghost added the id-as-cntxt tracking related issues label Dec 24, 2021

ghost mentioned this issue Jan 6, 2022

Move Store API to work with identifiers, not graphs #1646

Closed

white-gecko modified the milestones: rdflib 6.x.x, 2022 June release Jun 20, 2022

aucampia added the concept: RDF dataset Relates to the RDF datasets concept. label May 20, 2023
