Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A note or specification for dataset output would be useful #20

Open
sesuncedu opened this issue Dec 14, 2014 · 2 comments
Open

A note or specification for dataset output would be useful #20

sesuncedu opened this issue Dec 14, 2014 · 2 comments

Comments

@sesuncedu
Copy link
Contributor

During the course of processing a document it would be helpful to be to generate a few separate graphs :

  1. a graph for the extracted data. This could be the default graph, but it might be better to have a different name, so that different versions of an html file retrieved at different times have a distinct graph name that can be used with
  2. a provenance graph about that graph (and other resources used in processing it)
  3. A graph for any derived properties and classes (owl ontology for derived sdo "/" subproperties and subclasses)

There could easily be conventions for generating and identifying these graphs. Some function of the document url (including all parameters affecting the content), last-modified, access, and processing start time might work (whichever date is available). Each graph could be a frag of this base.
The named graph semantics would be Carollinian, where the graph name names the graph

A suitable normative format would be Trig or n-quads.

@gkellogg
Copy link
Member

RDFa uses similar notions of graphs, but does not create a dataset. Presumably a Microdata processor could do something similar, and implementations are free to do so. However, requiring every implementation to do this is unrealistic, as the community for Microdata is typically looking for a simple use case.

A hypothetical future version of the spec could allow optional language for doing this, along the lines of the other optional RDFa output graphs. This would need some community support.

@iherman
Copy link
Member

iherman commented Dec 16, 2014

I agree that this is a useful feature for the future, so I marked this as 'postponed' and left it open.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants