Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SameAs retrieval causes problems in a JUnit test #433

Open
MichaelRoeder opened this issue Jun 5, 2023 · 0 comments
Open

SameAs retrieval causes problems in a JUnit test #433

MichaelRoeder opened this issue Jun 5, 2023 · 0 comments

Comments

@MichaelRoeder
Copy link
Member

Description

The D2KBTest fails because two entities of the OKE Task 1 Example Dataset seem to be handled in a faulty way.

2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.evaluate.impl.ConfidenceBasedFMeasureCalculator] - <${pom_version} 0|||||||||>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((49, 19, [http://dbpedia.org/resource/Columbia_University, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Columbia_University])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((49, 19, [http://dbpedia.org/resource/Columbia_University, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Columbia_University])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.evaluate.impl.ConfidenceBasedFMeasureCalculator] - <${pom_version} 1|||||||||>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((96, 13, [http://dbpedia.org/resource/Donna_Brazile, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Donna_Brazile])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((22, 14, [http://dbpedia.org/resource/James_Carville, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/James_Carville])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((115, 16, [http://dbpedia.org/resource/Campaign_manager, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Campaign_manager])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((184, 7, [http://dbpedia.org/resource/Al_Gore, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Al_Gore])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((57, 17, [http://dbpedia.org/resource/Political_consulting, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Political_adviser])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false negative ((78, 12, [http://dbpedia.org/resource/Bill_Clinton, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Bill_Clinton])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false positive ((78, 12, [http://nl.dbpedia.org/resource/Bill_Clinton, http://nl.wikipedia.org/wiki/Bill_Clinton])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((96, 13, [http://dbpedia.org/resource/Donna_Brazile, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Donna_Brazile])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((22, 14, [http://dbpedia.org/resource/James_Carville, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/James_Carville])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((115, 16, [http://dbpedia.org/resource/Campaign_manager, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Campaign_manager])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((184, 7, [http://dbpedia.org/resource/Al_Gore, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Al_Gore])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((57, 17, [http://dbpedia.org/resource/Political_consulting, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Political_adviser])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false negative ((78, 12, [http://dbpedia.org/resource/Bill_Clinton, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Bill_Clinton])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false positive ((78, 12, [http://nl.dbpedia.org/resource/Bill_Clinton, http://nl.wikipedia.org/wiki/Bill_Clinton])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.evaluate.impl.ConfidenceBasedFMeasureCalculator] - <${pom_version} 2|||||||||>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((0, 20, [http://dbpedia.org/resource/Florence_May_Harding, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Florence_May_Harding])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false negative ((44, 6, [http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Sydney, http://dbpedia.org/resource/Sydney])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false positive ((44, 6, [http://de.dbpedia.org/resource/Sydney, http://de.wikipedia.org/wiki/Sydney])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a true positive ((0, 20, [http://dbpedia.org/resource/Florence_May_Harding, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Florence_May_Harding])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false negative ((44, 6, [http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Sydney, http://dbpedia.org/resource/Sydney])).>
2023-06-05 11:57:52,610 [Thread-7] DEBUG [org.aksw.gerbil.matching.impl.MatchingsCounterImpl] - <Found a false positive ((44, 6, [http://de.dbpedia.org/resource/Sydney, http://de.wikipedia.org/wiki/Sydney])).>
2023-06-05 11:57:52,611 [Thread-7] INFO [org.aksw.gerbil.execute.ExperimentTask] - <Setting experiment results: ExperimentTaskResult(state=0,taskId=-1,EE Macro Precision=0.5555555555555555,EE Macro F1 score=0.6,GSInKB Micro Precision=0.7777777777777778,InKB Macro F1 score=0.8585858585858586,Error Count=0,EE Micro Recall=1.0,InKB Micro Recall=0.7777777777777778,InKB Macro Precision=1.0,Micro Recall=0.8333333333333334,Macro F1 score=0.8611111111111112,Micro Precision=0.8333333333333334,InKB Macro Recall=0.7777777777777778,InKB Micro F1 score=0.8750000000000001,GSInKB Micro F1 score=0.7777777777777778,GSInKB Macro F1 score=0.7777777777777778,Macro Recall=0.8611111111111112,GSInKB Macro Recall=0.7777777777777778,avg millis/doc=0.0,EE Macro Recall=0.6666666666666666,GSInKB Macro Precision=0.7777777777777778,EE Micro Precision=0.6,EE Micro F1 score=0.7499999999999999,GSInKB Micro Recall=0.7777777777777778,Macro Precision=0.8611111111111112,Micro F1 score=0.8333333333333334,confidence threshold=0.0,InKB Micro Precision=1.0)>

The evaluation brings up the following two mismatches in the second and third document, respectively:

  • ((78, 12, [http://dbpedia.org/resource/Bill_Clinton, http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Bill_Clinton])) != ((78, 12, [http://nl.dbpedia.org/resource/Bill_Clinton, http://nl.wikipedia.org/wiki/Bill_Clinton]))
  • ((44, 6, [http://www.ontologydesignpatterns.org/data/oke-challenge/task-1/Sydney, http://dbpedia.org/resource/Sydney])) != ((44, 6, [http://de.dbpedia.org/resource/Sydney, http://de.wikipedia.org/wiki/Sydney]))

This raises several questions:

  1. Why does the sameAs retrieval provide two different sets of IRIs although it gets the same input documents for the dataset and annotator?
  2. Why do we only have language-dependent DBpedia IRIs instead of a larger set of IRIs?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant