Why haven't multiple duplicates been merged? #25
Comments
I am not sure exactly why this happens. Does this cause an issue for you? I guess not, since the program found all cases of duplication, right?
Do you mean that the program says nothing about line 600 in the output?
I mean duplicate lines should maybe be expressed as: "duplicated content": [all the occurrences, such as "file1:1", "file2:2", "file3:4" ...]
Well, I'm currently (slowly) working on a GUI frontend for Duplo: https://github.com/ArsMasiuk/duploq
Good suggestion @faywong. I’ll think a bit about this, but can’t promise anything as of now.
Cool, aggregating Duplo's output in a post-processing step is also a good solution. Also, I have optimized the algorithm to boost performance.
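To illustrate the post-processing idea, here is a minimal sketch (not Duplo's code and not the optimized version mentioned above). It assumes the pairwise reports have already been parsed into hypothetical `(location_a, location_b, block_text)` records; how you extract those from Duplo's actual output is left open. Grouping by the duplicated text itself, rather than by pairs, lets two unrelated pairs that share the same content end up in one entry.

```python
from collections import defaultdict


def group_by_content(reports):
    """Merge pairwise duplicate reports into one entry per duplicated block.

    `reports` is a hypothetical, pre-parsed input: an iterable of
    (loc_a, loc_b, block_text), where each location is a (file, line) tuple.
    """
    groups = defaultdict(set)
    for loc_a, loc_b, block_text in reports:
        # Normalize whitespace per line so indentation differences
        # do not split otherwise identical blocks (an assumption).
        key = "\n".join(line.strip() for line in block_text.splitlines())
        groups[key].update((loc_a, loc_b))
    # One entry per block: "duplicated content" -> all occurrences, sorted.
    return {block: sorted(locs) for block, locs in groups.items()}


# Made-up example data mirroring the case described in this issue.
block = "int x = compute();\nreturn x;"
reports = [
    (("file_a.c", 100), ("file_a.c", 200), block),
    (("file_a.c", 500), ("file_a.c", 600), block),
]
merged = group_by_content(reports)
for content, locations in merged.items():
    print([f"{name}:{line}" for name, line in locations])
```

With the sample data, both pairs collapse into a single group listing all four locations. Keying on the block text is only one possible design; hashing the text would use less memory on large code bases.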
@faywong cool, is that something you can share?
Currently it is only usable in our case and hasn't reached the expected performance yet. Anyway, I will update here when we're ready to contribute a better version.
In the spirit of open source and all future users of Duplo, please do.
@faywong Is there something you can share from your implementation?
For example:
file_a.c line 100 duplicates file_a.c line 200, file_a.c line 500, and file_a.c line 600. I expect the result to be:
file_a.c line 100, file_a.c line 200, and file_a.c line 500 are duplicated
rather than:
line 100 duplicates line 200, line 500 duplicates line 600.