Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HDFS-16762. Make the default value of dfs.federation.router.client.allow-partial-listing as false. #4867

Open
wants to merge 1 commit into
base: trunk
Choose a base branch
from

Conversation

smarthanwang
Copy link
Contributor

Description of PR

https://issues.apache.org/jira/browse/HDFS-16762

AS the default value of dfs.federation.router.client.allow-partial-listing is true, the hdfs client will got partial result when one or more of the subclusters are unavailable for no permissions or other Exceptions, but user may not know. It will lead to some fault.

So I think it's better to make the default value as false.

Copy link
Member

@ayushtkn ayushtkn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changing defaults is an incompatible change and this behaviour is there since very starting. I don't think we need to change this

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Logfile Comment
+0 🆗 reexec 0m 45s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 0s No case conflicting files found.
+0 🆗 codespell 0m 1s codespell was not available.
+0 🆗 detsecrets 0m 1s detect-secrets was not available.
+0 🆗 xmllint 0m 1s xmllint was not available.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
-1 ❌ test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 💚 mvninstall 39m 15s trunk passed
+1 💚 compile 0m 58s trunk passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 compile 0m 55s trunk passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 checkstyle 0m 43s trunk passed
+1 💚 mvnsite 1m 2s trunk passed
+1 💚 javadoc 1m 7s trunk passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 javadoc 1m 11s trunk passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 1m 44s trunk passed
+1 💚 shadedclient 20m 54s branch has no errors when building and testing our client artifacts.
_ Patch Compile Tests _
+1 💚 mvninstall 0m 41s the patch passed
+1 💚 compile 0m 43s the patch passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 javac 0m 43s the patch passed
+1 💚 compile 0m 39s the patch passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 javac 0m 39s the patch passed
+1 💚 blanks 0m 0s The patch has no blanks issues.
+1 💚 checkstyle 0m 24s the patch passed
+1 💚 mvnsite 0m 44s the patch passed
+1 💚 javadoc 0m 37s the patch passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 javadoc 0m 54s the patch passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 1m 24s the patch passed
+1 💚 shadedclient 20m 29s patch has no errors when building and testing our client artifacts.
_ Other Tests _
-1 ❌ unit 22m 3s /patch-unit-hadoop-hdfs-project_hadoop-hdfs-rbf.txt hadoop-hdfs-rbf in the patch passed.
+1 💚 asflicense 0m 46s The patch does not generate ASF License warnings.
120m 10s
Reason Tests
Failed junit tests hadoop.hdfs.server.federation.router.TestRouterRpcMultiDestination
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4867/1/artifact/out/Dockerfile
GITHUB PR #4867
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell detsecrets xmllint
uname Linux 073932bb7ce7 4.15.0-191-generic #202-Ubuntu SMP Thu Aug 4 01:49:29 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision trunk / 24f2d79
Default Java Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4867/1/testReport/
Max. process+thread count 2100 (vs. ulimit of 5500)
modules C: hadoop-hdfs-project/hadoop-hdfs-rbf U: hadoop-hdfs-project/hadoop-hdfs-rbf
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4867/1/console
versions git=2.25.1 maven=3.6.3 spotbugs=4.2.2
Powered by Apache Yetus 0.14.0 https://yetus.apache.org

This message was automatically generated.

@goiri
Copy link
Member

goiri commented Sep 7, 2022

@ayushtkn we had a discussion about this in a previous JIRA.
I thought this setting wasn't released yet but 3.3 already has it.

@smarthanwang unfortunately, we cannot do this change according to the defaults policies.

@smarthanwang
Copy link
Contributor Author

@goiri @ayushtkn thanks for comment.
Changing the default is really not a good solution by our previous disscussion. I think the key issue is whether we need and how to inform user the result is not complete to avoid some data faults.
In our production cluster, some proccesses got partital data for the no permission to access destination of one subcluster which lead to some data problem and we didn't know. It make users feel that there is a problem with the cluster.
Do you have any suggestions about how to slove this probelm better?

@ayushtkn
Copy link
Member

ayushtkn commented Sep 8, 2022

Do you have any suggestions about how to slove this probelm better?

the behaviour is configurable, we can't change the default, but if you want this behaviour you can explicitly change the value of this conf in your prod cluster and get things sorted.

@smarthanwang
Copy link
Contributor Author

@ayushtkn thanks for comment. Changing the value of this conf really can solve the problem, but it's not a good way. Because it need to review code and find out this conf, it would wastes some time, and It also means that some faults may have happpend.

I think making the default as true means that the routers prefer to availabilty rather than consistence. It seems to be the opposite of hdfs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants