Enhance Duplicate Finder with File Type Filtering and Report Generation #327

Stonebanks-js · 2024-10-12T18:31:18Z

Description:

This pull request introduces two significant enhancements to the duplicate_finder.py script:

File Type Filtering:

Users can now specify a file type filter to limit duplicate detection to certain file types, improving efficiency and precision.
Report Generation:
The script now generates a comprehensive report of detected duplicates and saves it to a duplicates_report.txt file. The report includes details of all duplicate files found, making it easier to review and manage duplicates.

Changes:

Added the ability to filter files based on type.
Integrated functionality to save duplicate file information to a text file (duplicates_report.txt).
Updated find_duplicates() function to support file type filtering.
Improved user interaction with prompts for file type input and report saving.

Tests:

Tested on directories containing images, documents, and other files.

Verified that the filtering works correctly by scanning only for .jpg and .png files.

Confirmed that the report is generated with correct paths for all detected duplicates.

How to test:

Run the script and specify a directory to scan for duplicates.
Provide a file type extension when prompted to filter files (e.g., .jpg).
Select either "delete" or "move" as an action for managing duplicates.
Check the generated duplicates_report.txt for detailed information on found duplicates.

Additional Notes:

These changes improve both user experience and performance for large directories.
Future improvements could involve adding support for additional report formats (e.g., CSV or JSON).

…cate finder

Stonebanks-js · 2024-10-12T18:32:04Z

@DhanushNehru Take a look into it and assign hacktoberfest labels to it

feat: Add file type filtering and report generation features to dupli…

a11e91a

…cate finder

hasan-py approved these changes Oct 13, 2024

View reviewed changes

hasan-py added hacktoberfest hacktoberfest-accepted labels Oct 13, 2024

hasan-py merged commit 4d28b62 into wasmerio:master Oct 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Enhance Duplicate Finder with File Type Filtering and Report Generation #327

Enhance Duplicate Finder with File Type Filtering and Report Generation #327

Stonebanks-js commented Oct 12, 2024

Uh oh!

Stonebanks-js commented Oct 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Enhance Duplicate Finder with File Type Filtering and Report Generation #327

Enhance Duplicate Finder with File Type Filtering and Report Generation #327

Conversation

Stonebanks-js commented Oct 12, 2024

Description:

Uh oh!

Stonebanks-js commented Oct 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants