Skip to content
ianwal edited this page Mar 17, 2024 · 22 revisions

Usage:

  1. Backup your database and files unless you trust a stranger to not ruin your database.

  2. Enable the Hydrus Client API and create an access key with all permissions.

    • The program needs access to your whole video library. Do NOT use blacklist/whitelist filters for the API token.
    • Make sure you enable allow non-local connections in manage services->client api if you are using WSL.
    • HTTPS is default for the client API URL.
      • Use --api-url if you need HTTP e.g. python -m hydrusvideodeduplicator --api-url=http://localhost:45869
      • ⚠️ SSL cert is NOT verified by default unless you pass the cert's file path with `--verify-cert
  3. Run and enter your access key as a parameter

python3 -m hydrusvideodeduplicator --api-key="put your Hydrus api key in these quotes here"
2. Example
python3 -m hydrusvideodeduplicator --api-key="78d2fcc9fe1f43c5008959ed1abfe38ffedcfa127d4f051a1038e068d3e32656"

You can select certain files with queries just like Hydrus e.g. --query="character:batman"

To cancel processing, press CTRL+C.

See the FAQ for more information.

See full list of options with --help

How to Update:

pip3 install hydrusvideodeduplicator --upgrade

How Does This Program Work?:

  1. First, the program will perceptually hash all your video files and store them in a database.
  • Initial hashing takes longer than searching for duplicates. It will also probably get slower as it progresses because the API requests are sorted by file size.
  1. Then, it will search the database for potential duplicates.

Similar video files should now be marked as potential duplicates in Hydrus.

You can run the program again when you add more files to find more duplicates.

Note:

Hashes are stored in a database created in ~/.local/share/hydrusvideodeduplicator to speed up processing.