Implement query operation #24

jamesaoverton · 2015-05-07T14:01:03Z

The query operation should allow arbitrary SPARQL select and update queries to be run against the ontology, saving the results to files. I'm most familiar with Apache Jena, so I plan to use that. Things will be simpler if we load the ontology into the default RDF graph, and don't use named graphs.

We may want to run multiple queries, but we only want to load the ontology into an RDF graph once. The current chaining implementation passes state as an OWLOntology, and I'd like to stick with that simple solution as long as possible. So I propose this command-line interface:

--select INPUT OUTPUT (-s) take an input SPARQL file, run the select query, and save to a file; the output format will be determined by the file extension
--update INPUT (-u) take an input SPARQL file, run the update query

You can specify these options multiple times. Apache CLI should keep them in the right order. When all queries have been run, we'll load the default RDF graph into an ontology for further processing. Suggestions for the best way to do this are appreciated!

Jena supports these output file formats, and we'll use these file extensions:

text .txt
CSV .csv
TSV .tsv
XML .xml
JSON .js or .json

Note that the text format is close to some of the table formats accepted by various Markdown parsers: http://pandoc.org/README.html#tables

Example:

robot query --input example.owl \
  --select query1.rq result1.csv \
  --select query2.rq result2.csv \
  --update update1.rq \
  --update update2.rq \
  --output updated.owl

The text was updated successfully, but these errors were encountered:

cmungall · 2015-05-07T15:09:18Z

Should this cover cases where a sparql query is used to enforce a build constraint, exiting with a non-zero code if some condition is not met? Or is that best folded into the rest of the ontology unit test framework?

jamesaoverton · 2015-05-07T15:23:05Z

Good idea. If it's a binary condition, we could use "ask" instead of "select", with an option like --ask. The disadvantage is that you probably want to report the results that caused the failure, not just the fact that it failed.

I suggest another option: --verify INPUT OUTPUT (or maybe --assert.) If the query returns no results, we continue without writing OUTPUT. If the query returns one or more results, then we write OUTPUT and exit ROBOT with non-zero status.

cmungall · 2016-06-03T01:09:27Z

For constraints we may prefer something like SHACL, see @balhoff's experiments: https://github.com/balhoff/shacl-tests

As for the main SELECT use case, the simplest way to do this would be to write the OWL to a ttl file, and then run the query via Jena. Kind of hacky... could also try the bridge layer in the OWLAPI?

balhoff · 2016-06-14T19:45:46Z

I would be happy to attempt an implementation of --verify using SHACL. For input the user would provide an RDF file containing SHACL shapes. One issue is that there haven't been official releases of the @TopQuadrant/shacl library, so it is a bit of a moving target. We could host a build in another maven repo I guess.

As @jamesaoverton suggests above an --ask option could be used for running a SPARQL ASK. It would also be nice to provide --construct.

cmungall · 2016-06-14T19:57:23Z

Hmm, I'd like to avoid the need for another repo

We use code.berkeleybop.org for OWLTools, but we really messed people up when that was down for a few days.

I'm torn between having ROBOT be a simple one-stop-shop for their release pipelines vs keeping things more modular. Maybe we should start with a standalone tool?

balhoff · 2016-06-14T20:32:00Z

That makes sense. I'll clean up the SHACL runner I have now so that we can get some experience with in release pipelines.

- load ontology into default graph of Jena Arq DatasetGraph - run select queries, write to CSV - add command, include it in the CLI - add test

cmungall · 2017-01-23T22:34:39Z

Meanwhile, for the basic query option, looks like robot has query but we're lacking something in examples/ for it

jamesaoverton · 2017-01-24T00:33:54Z

Yes, I haven't properly documented query. I've been using it for a while and I like it.

cmungall · 2017-03-07T19:09:46Z

Closing this as the feature has been implemented, some discussion continuing here: #150

jamesaoverton mentioned this issue May 7, 2015

Implement SPARQL server #25

Open

jamesaoverton self-assigned this May 11, 2015

jamesaoverton modified the milestone: ROBOT for OBI May 11, 2015

cmungall mentioned this issue Jun 4, 2016

Create subset exports AgriculturalSemantics/agro#10

Open

jamesaoverton added a commit that referenced this issue Sep 10, 2016

First pass at query command, see #24

a6ad8d3

- load ontology into default graph of Jena Arq DatasetGraph - run select queries, write to CSV - add command, include it in the CLI - add test

cmungall mentioned this issue Mar 7, 2017

Add ability to use a query or verification constraint from a central repository #150

Closed

cmungall closed this as completed Mar 7, 2017

cmungall mentioned this issue Apr 13, 2017

Add ability to do SPARQL construct #159

Closed

jamesaoverton mentioned this issue Aug 18, 2017

Implement consistent set of SPARQL commands #182

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement query operation #24

Implement query operation #24

jamesaoverton commented May 7, 2015

cmungall commented May 7, 2015

jamesaoverton commented May 7, 2015

cmungall commented Jun 3, 2016

balhoff commented Jun 14, 2016

cmungall commented Jun 14, 2016

balhoff commented Jun 14, 2016

cmungall commented Jan 23, 2017

jamesaoverton commented Jan 24, 2017

cmungall commented Mar 7, 2017

Implement query operation #24

Implement query operation #24

Comments

jamesaoverton commented May 7, 2015

cmungall commented May 7, 2015

jamesaoverton commented May 7, 2015

cmungall commented Jun 3, 2016

balhoff commented Jun 14, 2016

cmungall commented Jun 14, 2016

balhoff commented Jun 14, 2016

cmungall commented Jan 23, 2017

jamesaoverton commented Jan 24, 2017

cmungall commented Mar 7, 2017