
Re-use duplicate files #625

Open
edemaine opened this issue Sep 8, 2023 · 0 comments


edemaine commented Sep 8, 2023

If you upload a file with the same MD5 as an existing file, check whether the contents are identical and, if so, delete the newly uploaded file and link to the old file ID instead. (Or change the new file ID to "symlink" to the old file ID? Otherwise we need a way to pass the new ID back to the client.)

Probably want to add an MD5 index if there isn't one already.
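
The upload-time check and MD5 index described above could be sketched roughly as follows. This is only an illustration in Python with an in-memory store: `FileStore`, `by_md5`, and `upload` are hypothetical names, not the project's actual API, and the sketch checks for a duplicate before storing, which stands in for deleting the newly uploaded copy afterwards.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class FileStore:
    """Hypothetical in-memory stand-in for the real file collection."""
    contents: dict = field(default_factory=dict)  # file ID -> bytes
    by_md5: dict = field(default_factory=dict)    # MD5 hex digest -> list of file IDs
    next_id: int = 1

    def upload(self, data: bytes) -> str:
        """Store data and return the file ID the client should link to."""
        digest = hashlib.md5(data).hexdigest()
        # A matching MD5 is only a hint; compare bytes to rule out a collision.
        for old_id in self.by_md5.get(digest, []):
            if self.contents[old_id] == data:
                return old_id  # re-use the existing file; nothing new is stored
        new_id = f"file{self.next_id}"
        self.next_id += 1
        self.contents[new_id] = data
        self.by_md5.setdefault(digest, []).append(new_id)
        return new_id
```

Here `upload` returns the ID to link to, which is one way of passing the right ID back to the client; the "symlink" alternative would instead keep the new ID and record a pointer to the old one.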

For existing files, I wonder whether we should have a special "clean up" function (perhaps just a secret method, or a superuser button on root with the Recomputes) that checks for duplicate files, relinks to the oldest copy, and deletes the later copies. Maybe this could go with the file display of #279.
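
A "clean up" pass of this kind might look something like the sketch below. It is a sketch under assumptions, not the project's code: `clean_up_duplicates`, the `files` mapping (file ID to creation time and contents), and the `links` mapping (referrer to file ID) are all hypothetical stand-ins for the real database collections.

```python
import hashlib
from collections import defaultdict


def clean_up_duplicates(files, links):
    """Relink duplicates to the oldest copy and delete the later copies.

    files: dict of file ID -> (created timestamp, bytes)   (hypothetical shape)
    links: dict of referrer -> file ID                      (hypothetical shape)
    Both are mutated in place. Returns dict of deleted ID -> kept ID.
    """
    # Group file IDs by MD5 so only likely duplicates are compared byte by byte.
    groups = defaultdict(list)
    for file_id, (_created, data) in files.items():
        groups[hashlib.md5(data).hexdigest()].append(file_id)

    replaced = {}
    for ids in groups.values():
        ids.sort(key=lambda i: files[i][0])  # oldest first
        kept = []  # oldest file for each distinct content under this MD5
        for file_id in ids:
            match = next((k for k in kept if files[k][1] == files[file_id][1]), None)
            if match is None:
                kept.append(file_id)
            else:
                replaced[file_id] = match

    # Relink references first, then delete the later copies.
    for referrer, file_id in links.items():
        if file_id in replaced:
            links[referrer] = replaced[file_id]
    for file_id in replaced:
        del files[file_id]
    return replaced
```

Relinking before deleting means no reference ever points at a file that has already been removed, which matters if the pass is interrupted partway through.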
