-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to benchmark dmdedup #30
Comments
Any info or update?? |
What specific info is needed? |
Hi Vasily, For instance, If I copy 100GB of data(includes several files like linux kernels) to dmdedup device, what is amount of meta data and data partition writes. In short, I would like to reproduce the numbers which was tabulated in the dmdedup paper. Could you please tell, How exactly you guys did benchmark it? Thanks in advance, |
Hi, as the paper describes on page 10: "Linux kernels (Figure 6). This dataset contains the source code of 40 We just used dd command to write corresponding data to dmdedup. A word of warning - we did not use two paritions of the same HDD. Instead, we used a separte SSD for metadata. The paper has details on it: https://www.fsl.cs.sunysb.edu/docs/ols-dmdedup/dmdedup-ols14.pdf HTH, |
Thanks alot, that really helped. |
Another question: How do you test the random write? For seq write, dd is enough, though you still need another device to store the data and read from that device into dmdedup device. But I don't know how do you test the random write. Do you write a program to do that? By the way, I'm still wandering how to create a tar ball with alignment as well. Would be helpful if you got any clue. Thanks. |
We used Filebench and modified it to generate data with required deduplication ratio. I'm attaching old FB patch to give you a sense. I'm attaching tar patch to this post as well. |
Hi Team,
I would like to benchmark dmdedup as described in documentation/paper published.
In that, somewhere it is stated that "test exercise is done with 40 linux kernels",to see the level of deduplication with dmdedup.
In the process of learning, i want to reproduce the claimed numbers.
Will share the tabulated values as soon as I accomplish it.
Could you please share me some info about it, and shed some light.
Thanks in advance,
Venkatesh.
The text was updated successfully, but these errors were encountered: