Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about deduplication cluster size #23

Closed
everks opened this issue Oct 7, 2022 · 2 comments
Closed

question about deduplication cluster size #23

everks opened this issue Oct 7, 2022 · 2 comments

Comments

@everks
Copy link

everks commented Oct 7, 2022

As shown in following picture, the cluster starting at 0x02954cb9 has the size of 3.
image
but when I count it using bytes.count(), it shows 2.
image

I tried different datasets and observed the same phenomenon.
Did I make a mistake about the size meaning?

@everks
Copy link
Author

everks commented Oct 9, 2022

I also check the overlapping substrings using a for loop, answer is still 2.

@everks
Copy link
Author

everks commented Oct 9, 2022

Little awkward. I mistake the size meaning. Finally i find it in main.rs (because I don't familiar with rust, so I didn't go to the file first time). Just paste a snippet here
image
I sincerely suggest you can add format explain in readme.

@everks everks closed this as completed Oct 9, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant