Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deduplicator: Fix overlapping results between checksum & perceptual hash matches #1006

Merged
merged 1 commit into from
Feb 8, 2024

Conversation

d4rken
Copy link
Member

@d4rken d4rken commented Feb 8, 2024

If there are two sets of images, consisting of two copies each, that are also similar to each other, then let the checksum match take precedence.

Before:

Cluster1:
Group1: (checksum)
Dupe1Copy1
Dupe1Copy2

Cluster2:
Group1: (checksum)
Dupe2Copy1
Dupe2Copy2
Group2: (perceptual)
Dupe1Copy1
Dupe1Copy2

Now:

Cluster1:
Group1: (checksum)
Dupe1Copy1
Dupe1Copy2

Cluster2:
Group1: (checksum)
Dupe2Copy1
Dupe2Copy2

…ash matches

If there are two sets of images, consisting of two copies each, that are also similar to each other,
then let the checksum match take precedence.

Before:

Cluster1:
Group1: (checksum)
Dupe1Copy1
Dupe1Copy2

Cluster2:
Group1: (checksum)
Dupe2Copy1
Dupe2Copy2
Group2: (perceptual)
Dupe1Copy1
Dupe1Copy2

Now:
Cluster1:
Group1: (checksum)
Dupe1Copy1
Dupe1Copy2

Cluster2:
Group1: (checksum)
Dupe2Copy1
Dupe2Copy2
@d4rken d4rken added bug Something isn't working as expected c: Deduplicator labels Feb 8, 2024
@d4rken d4rken merged commit 4a05bf8 into main Feb 8, 2024
16 checks passed
@d4rken d4rken deleted the duplicates_grouping branch February 8, 2024 12:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working as expected c: Deduplicator
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant