Cannot create tables in Hue #131

hemajv · 2021-03-17T19:18:50Z

Describe the bug
I have data stored in the black-flake ceph bucket and I am trying to create a table for it in Hue so that I can visualize the data using Superset.

To Reproduce
Steps to reproduce the behavior:

Go to https://hue-opf-datacatalog.apps.zero.massopen.cloud/
Login generic_user:operatefirst
Try to create a table by executing the following query:

CREATE EXTERNAL TABLE IF NOT EXISTS ocp_ci_analysis.flakes(
timstamp TIMESTAMP,
tab STRING,
grid STRING,
test STRING,
flake BOOLEAN
)
STORED AS PARQUET
LOCATION
's3a://<access key>:<secret key>@black-flake/metrics/flake';

See error:

org.apache.spark.sql.AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:com.amazonaws.AmazonClientException: Unable to execute HTTP request: black-flake.s3.openshift-storage.svc: Name or service not known);

Expected behaviour
The table should be successfully created with the parquet file contents loaded into it.

Is Hue setup to connect to Ceph?

The text was updated successfully, but these errors were encountered:

hemajv · 2021-03-17T19:19:28Z

cc @MichaelClifford @4n4nd @Shreyanand

hemajv · 2021-03-17T20:38:09Z

(as per conversation in chat)
seems like (by default) the opf-datacatalog bucket has been configured to Hue, however I stored a CSV file into this bucket and tried to create a table for it in Hue and I still end up with the same error:

org.apache.spark.sql.AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:com.amazonaws.AmazonClientException: Unable to execute HTTP request: opf-datacatalog.s3.openshift-storage.svc: Name or service not known);

@tumido any idea what the issue might be?

also, since both the black-flake and opf-datacatalog buckets have different access/secret keys, is it possible to configure multiple buckets to be used in Hue?

tumido · 2021-03-18T10:25:13Z

Yes, this is duplicate of: #117

The problem is with s3.openshift-storage.svc not being recognised as a proper hostname by boto in Hue and Thriftserver not using 433 port. I'll experiment with the external route on this, but that one was problematic in Argo due to some SSL errors. I'll look into it.

tumido · 2021-03-18T10:28:02Z

seems like (by default) the opf-datacatalog bucket has been configured to Hue, however I stored a CSV file into this bucket and tried to create a table for it in Hue and I still end up with the same error:

default bucket doesn't change anything. This is a hostname/port problem. And OpenShift Container Storage is not helping us here.

also, since both the black-flake and opf-datacatalog buckets have different access/secret keys, is it possible to configure multiple buckets to be used in Hue?

Of course they can! They must be available on the same S3 cluster though - and the connection to S3 is the problem here. Maybe we can even make Hue/Hive connect to multiple Ceph endpoints? I don't know, we can also try that...

tumido · 2021-03-29T12:47:20Z

So.. after quite some time on this I've managed to fix a sibling issue #117 while this one is still persistent. I need to raise this one back to upstream. I can't make Thriftserver to connect to Openshift Container Storage properly.

hemajv · 2021-03-29T13:11:25Z

ack, @tumido would you happen to know if there is any workaround we could look into meanwhile such as manually attaching the table to the superset/hue pod somehow?

tumido · 2021-03-29T13:17:36Z

Nope I don't know about any workaround as of now. Maybe @rimolive would be able to help...

In general, if you want to work with Hue or Superset, all the tables have to be loaded and their metadata stored in Hive. And currently Thriftserver is the only interface for Hive we've got. 😞

sesheta · 2021-10-13T23:59:21Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

HumairAK · 2021-10-14T12:19:37Z

No one is using hue atm so I think this is no longer relevant.

/close

sesheta · 2021-10-14T12:19:39Z

@HumairAK: Closing this issue.

In response to this:

No one is using hue atm so I think this is no longer relevant.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tumido self-assigned this Mar 18, 2021

hemajv mentioned this issue Mar 18, 2021

Bucket accessible from Superset to host data. aicoe-aiops/ocp-ci-analysis#123

Open

2 tasks

This was referenced Mar 29, 2021

Hue storage not accessible #117

Closed

fix(hue): Set proper endpoint for S3 file explorer operate-first/apps#452

Merged

tumido mentioned this issue Apr 1, 2021

Setup credentials in Hue to access private bucket #183

Closed

sesheta added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 13, 2021

sesheta closed this as completed Oct 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot create tables in Hue #131

Cannot create tables in Hue #131

hemajv commented Mar 17, 2021 •

edited

Loading

hemajv commented Mar 17, 2021 •

edited

Loading

hemajv commented Mar 17, 2021

tumido commented Mar 18, 2021

tumido commented Mar 18, 2021

tumido commented Mar 29, 2021

hemajv commented Mar 29, 2021

tumido commented Mar 29, 2021

sesheta commented Oct 13, 2021

HumairAK commented Oct 14, 2021

sesheta commented Oct 14, 2021

Cannot create tables in Hue #131

Cannot create tables in Hue #131

Comments

hemajv commented Mar 17, 2021 • edited Loading

hemajv commented Mar 17, 2021 • edited Loading

hemajv commented Mar 17, 2021

tumido commented Mar 18, 2021

tumido commented Mar 18, 2021

tumido commented Mar 29, 2021

hemajv commented Mar 29, 2021

tumido commented Mar 29, 2021

sesheta commented Oct 13, 2021

HumairAK commented Oct 14, 2021

sesheta commented Oct 14, 2021

hemajv commented Mar 17, 2021 •

edited

Loading

hemajv commented Mar 17, 2021 •

edited

Loading