Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement] Cannot specify Hadoop username for catalog operation #596

Closed
mchades opened this issue Oct 25, 2023 · 2 comments · Fixed by #1450
Closed

[Improvement] Cannot specify Hadoop username for catalog operation #596

mchades opened this issue Oct 25, 2023 · 2 comments · Fixed by #1450
Assignees
Labels
improvement Improvements on everything

Comments

@mchades
Copy link
Contributor

mchades commented Oct 25, 2023

What would you like to be improved?

Some catalog operations require direct access to HDFS, but there is no way to specify the Hadoop username for the user, which increases the chances of encountering permission issues.

How should we improve?

we should provide a way to specify the Hadoop username for the user during catalog operations that require direct access to HDFS.

@mchades mchades added the improvement Improvements on everything label Oct 25, 2023
@jerryshao
Copy link
Contributor

@mchades can you please create a workaround PR to add documentation to tell user how to workaround this?

@jerryshao
Copy link
Contributor

This work is related to authentication and user impersonation, should be related to #413

jerryshao pushed a commit that referenced this issue Oct 26, 2023
### What changes were proposed in this pull request?
 - add `How to access Hadoop`
 - add runtime environment set up doc
 - Revised some content

### Why are the changes needed?
The workaround of cannot specify the Hadoop username

### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
local test
jerryshao pushed a commit that referenced this issue Oct 26, 2023
### What changes were proposed in this pull request?
 - add `How to access Hadoop`
 - add runtime environment set up doc
 - Revised some content

### Why are the changes needed?
The workaround of cannot specify the Hadoop username

### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
local test
jerryshao pushed a commit that referenced this issue Oct 26, 2023
### What changes were proposed in this pull request?
 - add `How to access Hadoop`
 - add runtime environment set up doc
 - Revised some content

### Why are the changes needed?
The workaround of cannot specify the Hadoop username

### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
local test
@jerryshao jerryshao added this to the Gravitino 0.4.0 milestone Dec 14, 2023
qqqttt123 pushed a commit to qqqttt123/gravitino that referenced this issue Jan 11, 2024
jerryshao pushed a commit that referenced this issue Jan 24, 2024
…ute operations in simple mode. (#1450)

### What changes were proposed in this pull request?
Hive catalog supports to impersonate users to execute operations in
simple mode.
For Kerberos mode, I have created an new issue. I will finish it in the
later pull request.
We use a Hive client cache pool referring to the Iceberg cache pool. We
use user name as the key of cache pool.
### Why are the changes needed?

Fix: #596 

### Does this PR introduce _any_ user-facing change?
Yes, we will add a new document.

### How was this patch tested?
Add a new IT

---------

Co-authored-by: Heng Qin <qqtt@123.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvements on everything
Projects
None yet
3 participants