Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GtCSVReader problems with jgrapht ConnectivityInspector #44

Closed
florisheijmans opened this issue Jan 14, 2021 · 2 comments
Closed

GtCSVReader problems with jgrapht ConnectivityInspector #44

florisheijmans opened this issue Jan 14, 2021 · 2 comments

Comments

@florisheijmans
Copy link

florisheijmans commented Jan 14, 2021

This issue arose when I attempted to reproduce the workflow in: org.scify.jedai.demoworkflows.CsvDblpAcm.java.

During the reading process of the ground truths in DBLP-ACM_perfectMapping.csv (specifically the GtCSVReader.getDuplicatePairs method), the detection of connected components by the jgrapht package seems to not work.

For some reason I obtain a single cluster of size 2225 and then 5375 more clusters of size 1, which is obviously incorrect since the csv contains about 2225 unique pairs (which should in turn produce 2225 clusters of size 2).

Have you seen this problem before? Maybe the jgrapht package expects a different format than it did previously?

@mthanos
Copy link
Contributor

mthanos commented Jan 14, 2021 via email

@florisheijmans
Copy link
Author

Thank you! That fixes the problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants