Why multiple duplicates hasn't been merged? #25

faywong · 2020-07-01T09:34:22Z

For example:

file_a.c line 100 duplicates file_a.c line 200, file_a.c line 500, and file_a.c line 600. I expected the result to be:

file_a.c line 100, file_a.c line 200, and file_a.c line 500 are duplicates

rather than:

line 100 duplicates line 200, line 500 duplicates line 600.

dlidstrom · 2020-07-01T13:08:55Z

I am not sure exactly why this happens. Does this cause an issue for you? I guess not, since the program found all cases of duplication, right?

ArsMasiuk · 2020-07-03T09:40:57Z

> For example:
>
> file_a.c line 100 duplicates file_a.c line 200, file_a.c line 500, and file_a.c line 600. I expected the result to be:
>
> file_a.c line 100, file_a.c line 200, and file_a.c line 500 are duplicates
>
> rather than:
>
> line 100 duplicates line 200, line 500 duplicates line 600.

Do you mean that the program says nothing about line 600 in the output?

faywong · 2020-07-24T09:14:03Z

I mean that duplicate lines should perhaps be expressed as:

"duplicated content": [all the occurrences, such as "file1:1", "file2:2", "file3:4", ...]
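To illustrate the format above, here is a minimal sketch in Python. It is not Duplo's actual implementation: it assumes the input is available as (file, line number, content) tuples, and the function name `group_occurrences` and the sample data are hypothetical.

```python
from collections import defaultdict


def group_occurrences(lines):
    """Map each duplicated line's content to every place it occurs.

    `lines` is an iterable of (file_name, line_number, content) tuples.
    This input shape is an assumption made for the sketch.
    """
    occurrences = defaultdict(list)
    for file_name, line_number, content in lines:
        # Ignore leading/trailing whitespace when comparing lines.
        occurrences[content.strip()].append(f"{file_name}:{line_number}")
    # Keep only content that appears more than once.
    return {c: locs for c, locs in occurrences.items() if len(locs) > 1}


# Hypothetical sample input.
sample = [
    ("file1", 1, "int x = compute();"),
    ("file2", 2, "int x = compute();"),
    ("file3", 4, "int x = compute();"),
    ("file3", 9, "return 0;"),
]
groups = group_occurrences(sample)
print(groups)
```

Each key is the duplicated content and each value lists all of its occurrences, so one entry replaces several pairwise reports.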

ArsMasiuk · 2020-08-11T22:19:39Z

> I mean that duplicate lines should perhaps be expressed as:
> "duplicated content": [all the occurrences, such as "file1:1", "file2:2", "file3:4", ...]

Well, I'm currently (slowly) working on a GUI frontend for Duplo: https://github.com/ArsMasiuk/duploq
The tool tries to aggregate Duplo's output as well. I hope that's what you are talking about?

dlidstrom · 2020-08-12T07:37:57Z

Good suggestion, @faywong. I'll think a bit about this, but I can't promise anything as of now.

faywong · 2020-08-13T10:36:35Z

> > I mean that duplicate lines should perhaps be expressed as:
> > "duplicated content": [all the occurrences, such as "file1:1", "file2:2", "file3:4", ...]
>
> Well, I'm currently (slowly) working on a GUI frontend for Duplo: https://github.com/ArsMasiuk/duploq
> The tool tries to aggregate Duplo's output as well. I hope that's what you are talking about?

Cool, aggregating Duplo's output in a post-processing step is also a good solution. Also, I have optimized the algorithm to boost performance.
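As a rough illustration of such a post-processing step, the sketch below merges pairwise duplicate reports into groups with a union-find structure. It is not Duplo's or duploq's code: the input shape (pairs of (file, line) locations) and the name `merge_duplicate_pairs` are assumptions, and parsing Duplo's real output is left out.

```python
def merge_duplicate_pairs(pairs):
    """Merge pairwise duplicate reports into groups of mutual duplicates.

    `pairs` is an iterable of (location_a, location_b) tuples, where each
    location is a hashable, sortable value such as (file_name, line_number).
    """
    parent = {}

    def find(x):
        # Register unseen locations as their own root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # union the two groups

    groups = {}
    for location in list(parent):
        groups.setdefault(find(location), []).append(location)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical pairwise reports for the file_a.c example in this thread.
pairs = [
    (("file_a.c", 100), ("file_a.c", 200)),
    (("file_a.c", 500), ("file_a.c", 600)),
    (("file_a.c", 100), ("file_a.c", 500)),
]
merged = merge_duplicate_pairs(pairs)
print(merged)
```

Merging this way treats duplication as transitive, which is reasonable for identical blocks but may over-merge blocks that only partially overlap.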

dlidstrom · 2020-08-13T11:52:24Z

@faywong cool, is that something you can share?

faywong · 2020-08-19T02:20:13Z

> @faywong cool, is that something you can share?

Currently it is only usable in our case and hasn't reached the expected performance yet. Anyway, I will update here if we're ready to contribute a better version.

dlidstrom · 2020-08-19T11:56:04Z

In the spirit of open source and all future users of Duplo, please do.

dlidstrom · 2021-08-12T11:56:41Z

@faywong Is there something you can share from your implementation?

faywong changed the title ~~Why no multiple duplicates hasn't been merged?~~ Why multiple duplicates hasn't been merged? Jul 1, 2020

dlidstrom closed this as completed Aug 30, 2021
