You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Goal: duplicate files are excluded from the analysis.
To detect duplicates efficiently, the following logic can be used:
Build a file_size_to_paths_map where each file size maps to a list of source files having this size.
Build a set of paths that are duplicate by comparing all files for sizes with multiple paths in file_size_to_paths_map. To compare two files, use filecmp.cmp(..., shallow=False)
The text was updated successfully, but these errors were encountered:
Goal: duplicate files are excluded from the analysis.
To detect duplicates efficiently, the following logic can be used:
file_size_to_paths_map
where each file size maps to a list of source files having this size.file_size_to_paths_map
. To compare two files, usefilecmp.cmp(..., shallow=False)
The text was updated successfully, but these errors were encountered: