Skip to content

Latest commit

 

History

History
67 lines (58 loc) · 3.64 KB

README.md

File metadata and controls

67 lines (58 loc) · 3.64 KB

KGRC-RDF-star

KGRC-RDF-star is an RDF-star dataset converted from KGRC-RDF, which is a Knowledge graph dataset of novel stories.

RDF-star (also known as RDF*) introduces quoted triple (QT), which is a triple used as the subject or object of another triple.

KGRC-RDF-star is a complex RDF-star graph dataset that contains nested structures of statements and scenes, e.g., "Person A said "Person B saw "Person C was in D" " ."

The same s, p, and o combinations may occur in different scenes when the KGRC-RDF is converted to the KGRC-RDF-star. It is necessary to distinguish these QTs and assign different metadata to them. Therefore, we solved this problem by assigning a unique ID to each QT and nested these triples as a QT as follows: << << s p o >> id val >> p' o'.

Statistics

Class 17
Instance 9,524
Property 640
Triple Standard triple 14,180
Single-nested QT 9,765
Double-nested QT 6,409
Triple-nested QT 695
Quadruple-nested QT 43
Total 31,092

Embeddings

If you want to use this dataset for graph embeddings, please see RDF-star2Vec. The gold standard datasets and evaluation framework are here.

Publication

Egami, S., Ugai, T., Oota, M., Matsushita, K., Kawamura, T., Kozaki, K., Fukuda, K.: RDF-star2Vec: RDF-star Graph Embeddings for Data Mining, IEEE Access, EarlyAccess

LICENSE

Creative Commons License
KGRC-RDF-star by Shusaku Egami is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at https://github.com/KnowledgeGraphJapan/KGRC-RDF.