CSV upload for on-premise Hadoop cluster blocked by erroneous code #14392

Open
3 tasks done
0xBADBAC0N opened this issue Apr 28, 2021 · 7 comments
Labels
#bug Bug report data:connect:hive Related to Hive

Comments

@0xBADBAC0N
Contributor

Hi guys,
I cannot upload a CSV to Hive through Superset because our Hadoop cluster does not run on S3.
I tracked it down to this line in the code: https://github.com/apache/superset/blob/master/superset/db_engine_specs/hive.py#L65
The following settings in the config all relate to the bucket: https://github.com/apache/superset/blob/master/superset/config.py#L781
The problem is that my Hive instance runs on-premise, so I have no bucket.

Expected results

Uploading a CSV through the frontend should put the file on Hadoop and make it accessible through a table.

Actual results

Uploading the CSV fails with an error like:
Unable to upload CSV file "kc_house_data.csv" to table "hendrik_house_test" in database "Hive Production". Error message: No upload bucket specified. You can specify one in the config file.
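
For context, the check behind this message can be sketched roughly as follows. This is a simplified, self-contained illustration and not the actual code in hive.py: the function name `resolve_upload_bucket` is hypothetical, a plain dict stands in for the Superset config, and the key name `CSV_TO_HIVE_UPLOAD_S3_BUCKET` is assumed from the linked config section. Only the error text is taken from the report above.

```python
from typing import Any, Dict


def resolve_upload_bucket(config: Dict[str, Any]) -> str:
    """Return the configured bucket, or fail the way the CSV upload does."""
    # Assumed key name; a plain dict stands in for the real app config.
    bucket = config.get("CSV_TO_HIVE_UPLOAD_S3_BUCKET")
    if not bucket:
        # Same message as in the error reported in this issue.
        raise ValueError(
            "No upload bucket specified. You can specify one in the config file."
        )
    return bucket


# An on-premise setup has no bucket to configure, so the guard always trips.
on_premise_config = {"CSV_TO_HIVE_UPLOAD_S3_BUCKET": None}
try:
    resolve_upload_bucket(on_premise_config)
    upload_error = None
except ValueError as exc:
    upload_error = str(exc)
```

With a guard of this shape, any deployment without an S3 bucket is rejected before the file is written anywhere, which matches the behaviour described here.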

Screenshots

image

How to reproduce the bug

  1. Go to the main dashboard
  2. Click on Data
  3. Click on Upload a CSV
  4. Enter Table name and Choose file etc
  5. Save
  6. See error

Environment

  • superset version: 1
  • python version: Python 3.7.3

Checklist

Make sure to follow these steps before submitting your issue - thank you!

  • I have checked the superset logs for python stack traces and included it here as text if there are any.
  • I have reproduced the issue with at least the latest released version of superset.
  • I have checked the issue tracker for the same issue and I haven't found one similar.

Additional context

It seems the current upload functionality only works when the target cluster is cloud-based. This bug appears to affect only on-premise clusters, so we would highly appreciate a fix :)
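
One possible direction, sketched under stated assumptions, would be to derive the external table location from whichever storage is configured instead of requiring a bucket. Everything here is illustrative: `external_table_location` and the `CSV_TO_HIVE_UPLOAD_HDFS_ROOT` key are hypothetical and do not exist in Superset, the `s3a://` scheme and the other two key names are assumed from the linked config section, and the sketch only computes a location string; it does not transfer the file.

```python
from typing import Any, Dict


def external_table_location(config: Dict[str, Any], table_name: str) -> str:
    """Pick a storage location for the uploaded table based on the config."""
    # Assumed key and default; normalised so joins below stay predictable.
    prefix = config.get(
        "CSV_TO_HIVE_UPLOAD_DIRECTORY", "EXTERNAL_HIVE_TABLES/"
    ).strip("/")

    # Cloud case: keep using the bucket when one is configured.
    bucket = config.get("CSV_TO_HIVE_UPLOAD_S3_BUCKET")
    if bucket:
        return f"s3a://{bucket}/{prefix}/{table_name}"

    # On-premise case: hypothetical setting pointing at an HDFS directory.
    hdfs_root = config.get("CSV_TO_HIVE_UPLOAD_HDFS_ROOT")
    if hdfs_root:
        return f"{hdfs_root.rstrip('/')}/{prefix}/{table_name}"

    raise ValueError("No upload bucket or HDFS root specified.")


location = external_table_location(
    {"CSV_TO_HIVE_UPLOAD_HDFS_ROOT": "hdfs://namenode:8020/user/superset/"},
    "hendrik_house_test",
)
```

A real fix would also need to write the file to that location, for example through an HDFS client, which this sketch leaves out.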

@0xBADBAC0N 0xBADBAC0N added the #bug Bug report label Apr 28, 2021
@srinify
Contributor

srinify commented Apr 29, 2021

Any helpful tips here, @etr2460? I know y'all use CSV upload to Hive a bit.

@0xBADBAC0N
Contributor Author

bump :)

@0xBADBAC0N
Contributor Author

@etr2460 Any ideas on this one, or even plans? :)
If you could give us some pointers, we could also start fixing this ourselves :)

@stale

stale bot commented May 1, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. For admin, please label this issue .pinned to prevent stale bot from closing the issue.

@livwel-populus

Any word on this feature or how to fix it?

@campi01

campi01 commented Nov 9, 2023

Is this even on the radar for getting looked at, or is it not to be supported at all?

@rusackas
Member

> Is this even on the radar for getting looked at, or is it not to be supported at all?

It doesn't seem to be on anyone's radar. Any interest in diagnosing/contributing?

If nobody has interest in picking this up, we might close it as stale before long, as we're trying to steer toward a more actionable backlog of issues.
