-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-6297] EventLog permissions are always set to 770 which causes problems #4989
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-6297] EventLog permissions are always set to 770 which causes problems #4989
Conversation
By default permissions are set to 770, which in some cases is more problematic than relying on umask. If we are using spark master and driver application in an containerized application, using different users which don't share user group it will cause process to fail.
|
Can one of the admins verify this patch? |
|
CC @andrewor14 and @vanzin . Although it's annoying to add another property (which I guess should be documented, I understand the question here. You'd have to make the files public in this situation; is that desirable? |
|
Yeah, the main thing here is that you do not want user B to be able to modify user A's files. I understand your patch doesn't change the default case, but I wonder what's your motivation for not configuring your daemons appropriately instead? With HDFS in mind, you'd have:
Or maybe you're not using HDFS to store the logs? With this configuration files / directories created under the event log dir will belong to "user:spark" and everything should work as planned. Note this is a little different than what a POSIX fs would do - a POSIX fs would require "5777" permissions for this but HDFS doesn't support that. |
|
Yes, we are not using HDFS, for file storage we use GlusterFS. One of the requirements of our client was to run every process type with it's own user. This isn't something we would desire as it makes everything more complicated. @adaszko: can you please comment on that? |
|
If GlusterFS supports POSIX semantics, you should be able to set things up as I suggested and use the |
|
I guess we will try with that and report back if it worked well. Many thanks! |
|
Just checking to see whether that worked and if so whether we should close this? |
|
@lustefaniak were you able to see whether that workaround was sufficient? Is this still an issue, or can we close this PR? |
|
Can one of the admins verify this patch? |
|
Do you mind closing this PR? |
|
Want to reopen this as in YARN mode, Spark app will run under user's account. |
By default permissions are set to 770, which in some cases is more problematic than relying on umask.
If we are using spark master and driver application in an containerized application, using different users which don't share user group it will cause process to fail.