Skip to content

Failed to build SQLite database from OWL file #59

@vdancik

Description

@vdancik

We are attempting to parse owl files from Panther (http://data.pantherdb.org/ftp/pathway/current_release/BioPAX.tar.gz) using a docker image docker:linkml/semantic-sql. When we tried to parse Thiamin_metabolism.owl, we get the following error:

> semsql make Thiamin_metabolism.db

cat /usr/local/lib/python3.8/dist-packages/semsql/builder//sql_schema/semsql.sql | sqlite3 .template.db.tmp && \
echo .exit | sqlite3 -echo .template.db.tmp -cmd ".mode csv" -cmd ".import /usr/local/lib/python3.8/dist-packages/semsql/builder//prefixes/prefixes.csv prefix" && \
mv .template.db.tmp .template.db
.exit
robot remove -i Thiamin_metabolism.owl --axioms "equivalent disjoint annotation" -o Thiamin_metabolism-min.owl
relation-graph --disable-owl-nothing true \
                       --ontology-file Thiamin_metabolism-min.owl\
                       --output-file Thiamin_metabolism-relation-graph.tsv.ttl.tmp \
                       --equivalence-as-subclass true \
               --output-subclasses true \
                       --reflexive-subclasses true && \
riot --out RDFXML Thiamin_metabolism-relation-graph.tsv.ttl.tmp > Thiamin_metabolism-relation-graph.tsv.owl.tmp && \
sqlite3 Thiamin_metabolism-relation-graph.tsv.db.tmp -cmd ".mode csv" ".import /usr/local/lib/python3.8/dist-packages/semsql/builder//prefixes/prefixes.csv prefix" && \
rdftab Thiamin_metabolism-relation-graph.tsv.db.tmp < Thiamin_metabolism-relation-graph.tsv.owl.tmp && \
sqlite3 Thiamin_metabolism-relation-graph.tsv.db.tmp -cmd '.separator "\t"' -cmd '.header on' "SELECT subject,predicate,object FROM statements " > Thiamin_metabolism-relation-graph.tsv.tmp && \
mv Thiamin_metabolism-relation-graph.tsv.tmp Thiamin_metabolism-relation-graph.tsv && \
rm Thiamin_metabolism-relation-graph.tsv.*.tmp
2022.10.18 20:50:32:527 zio-default-async-1 INFO org.renci.relationgraph.Main.program:57
    Running reasoner
2022.10.18 20:50:33:801 zio-default-async-1 INFO org.renci.relationgraph.Main.program:60
    Done running reasoner
2022.10.18 20:50:37:266 zio-default-async-1 INFO org.renci.relationgraph.Main.program:70
    Computed relations in 3.371s
cp .template.db Thiamin_metabolism.db.tmp && \
rdftab Thiamin_metabolism.db.tmp < Thiamin_metabolism.owl && \
sqlite3 Thiamin_metabolism.db.tmp -cmd '.separator "\t"' ".import Thiamin_metabolism-relation-graph.tsv entailed_edge" && \
gzip -f Thiamin_metabolism-relation-graph.tsv && \
cat /usr/local/lib/python3.8/dist-packages/semsql/builder//indexes/*.sql | sqlite3 Thiamin_metabolism.db.tmp && \
mv Thiamin_metabolism.db.tmp Thiamin_metabolism.db
thread 'main' panicked at 'called `Result::unwrap()` on an `Err` value: RdfXmlError { kind: Other("eco_ECO:0000501_12 is not a valid rdf:ID value") }', src/main.rs:57:5
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
make: *** [/usr/local/lib/python3.8/dist-packages/semsql/builder/build.Makefile:49: Thiamin_metabolism.db] Error 101
rm Thiamin_metabolism-min.owl

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions