New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HADOOP-18056. DistCp: Filter duplicates in the source paths. #3825
Conversation
9766389
to
565b1f8
Compare
🎊 +1 overall
This message was automatically generated. |
565b1f8
to
a232e11
Compare
🎊 +1 overall
This message was automatically generated. |
Set<Path> uniquePaths = new LinkedHashSet<>(); | ||
for (Path path : srcPaths) { | ||
if (!uniquePaths.add(path)) { | ||
LOG.warn("Path: {} added multiple times, Ignoring the redundant entry.", path); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Do we need to change Ignoring
to ignoring
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanx @tomscut, I have changed it.
🎊 +1 overall
This message was automatically generated. |
LGTM. |
Set<Path> uniquePaths = new LinkedHashSet<>(); | ||
for (Path path : srcPaths) { | ||
if (!uniquePaths.add(path)) { | ||
LOG.warn("Path: {} added multiple times, ignoring the redundant entry.", path); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should it log at info? I'm just worried because WARN Messages are often interpreted as a sign of serious problems; if this is happening due to some workflow then it's not going to go away.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, makes sense.
Thanx @steveloughran for the review. I have changed it to INFO
🎊 +1 overall
This message was automatically generated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 from me
… Contributed by Ayush Saxena. Reviewed-by: tomscut <litao@bigo.sg> Reviewed-by: Steve Loughran <stevel@apache.org>
… Contributed by Ayush Saxena. Reviewed-by: tomscut <litao@bigo.sg> Reviewed-by: Steve Loughran <stevel@apache.org>
…3825). Contributed by Ayush Saxena. Reviewed-by: tomscut <litao@bigo.sg> Reviewed-by: Steve Loughran <stevel@apache.org>
Description of PR
Filter duplicates from source paths
How was this patch tested?
UT
For code changes: