KGRC-RDF-star is an RDF-star dataset converted from KGRC-RDF, which is a Knowledge graph dataset of novel stories.
RDF-star (also known as RDF*) introduces quoted triple (QT), which is a triple used as the subject or object of another triple.
KGRC-RDF-star is a complex RDF-star graph dataset that contains nested structures of statements and scenes, e.g., "Person A said "Person B saw "Person C was in D" " ."
The same s, p, and o combinations may occur in different scenes when the KGRC-RDF is converted to the KGRC-RDF-star. It is necessary to distinguish these QTs and assign different metadata to them. Therefore, we solved this problem by assigning a unique ID to each QT and nested these triples as a QT as follows: << << s p o >> id val >> p' o'.
- rdf-star_ext_ikgrc2023.nt
- All data included.
- rdf-star_ext_ikgrc2023_remove_Person_Place_Object.nt
- Removed the values of rdf:type (Person, Object, and Place).
- rdf-star_ext_ikgrc2023_remove_Situation_Statement_Thought.nt
- Removed the values of rdf:type (Situation, Statement, and Thought).
Class | 17 | |
Instance | 9,524 | |
Property | 640 | |
Triple | Standard triple | 14,180 |
Single-nested QT | 9,765 | |
Double-nested QT | 6,409 | |
Triple-nested QT | 695 | |
Quadruple-nested QT | 43 | |
Total | 31,092 |
If you want to use this dataset for graph embeddings, please see RDF-star2Vec. The gold standard datasets and evaluation framework are here.
Egami, S., Ugai, T., Oota, M., Matsushita, K., Kawamura, T., Kozaki, K., Fukuda, K.: RDF-star2Vec: RDF-star Graph Embeddings for Data Mining, IEEE Access, EarlyAccess
KGRC-RDF-star by Shusaku Egami is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at https://github.com/KnowledgeGraphJapan/KGRC-RDF.