Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

File deduplication mechanism #3030

Closed
rvrignaud opened this issue Apr 19, 2013 · 4 comments
Closed

File deduplication mechanism #3030

rvrignaud opened this issue Apr 19, 2013 · 4 comments

Comments

@rvrignaud
Copy link

Hello,

Does a deduplication files mechanism implementation has been considered ?
What I mean is if I have multiple users that upload the same file it would be great if I could store only once the file (even if it's asynchronous mechanism).
My personal use case is that I run an owncloud instance for a lot of friends and we upload many time same pictures / video. So I have a lot of duplicated files on my server.
It would be very usefull for me to get this feature on application level and not on FS level (as ZFS would be able to give me that ability).

Any thoughts ?

Regards,

@VicDeo
Copy link
Member

VicDeo commented Apr 19, 2013

related to owncloud-archive/apps#903

@davidak
Copy link
Member

davidak commented Apr 19, 2013

@rvrignaud why not using existing functionality of a FS?
like wikipedia says btrfs had this too, but don't know it is usable right now.

i think you get a better performance than coding such function in php.

@karlitschek
Copy link
Contributor

Yes. I suggest to use the functionality of the OS/Filesystem/Storagebackend. Nothing that should be done in ownCloud.

@rvrignaud
Copy link
Author

@davidak : As far as I know btrfs is not considered as production ready (https://btrfs.wiki.kernel.org/index.php/FAQ#Is_btrfs_stable.3F) and I'm not sure btrfs fully support deduplication (https://btrfs.wiki.kernel.org/index.php/Deduplication).
And AFAIK there is no easy way to use ZFS on linux box.
There are maybe other FS with deduplication feature that I don't know.

@karlitschek : backuppc (http://backuppc.sourceforge.net/) have a deduplication mechanism (http://stanlarson.com/wordpress/?p=118) and I don' t see why this shouldn't/couldn't be done on application layer.

@lock lock bot locked as resolved and limited conversation to collaborators Aug 23, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

4 participants