[SPARK-32807][SQL] ThriftServer open session use direct API to init current DB #29656

AngersZhuuuu · 2020-09-07T03:36:25Z

What changes were proposed in this pull request?

When init current database, we can use direct API, don't need to call SQL

Why are the changes needed?

No

Does this PR introduce any user-facing change?

No

How was this patch tested?

Not need

AngersZhuuuu · 2020-09-07T03:36:52Z

cc @wangyum @juliuszsompolski

...iftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLSessionManager.scala

SparkQA · 2020-09-07T04:10:16Z

Test build #128331 has finished for PR 29656 at commit a1cf29a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-09-07T04:33:48Z

Test build #128333 has finished for PR 29656 at commit 1f2fa6c.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2020-09-07T06:16:21Z

...iftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLSessionManager.scala

@@ -69,7 +69,7 @@ private[hive] class SparkSQLSessionManager(hiveServer: HiveServer2, sqlContext:
    setConfMap(ctx, hiveSessionState.getOverriddenConfigurations)
    setConfMap(ctx, hiveSessionState.getHiveVariables)
    if (sessionConf != null && sessionConf.containsKey("use:database")) {
-      ctx.sql(s"use ${sessionConf.get("use:database")}")
+      ctx.sessionState.catalog.setCurrentDatabase(sessionConf.get("use:database"))


just to be clear, is this just a small refactoring?

just to be clear, is this just a small refactoring?

In this way we can avoid a lot of process part, but seems I have miss a important problem that user may write a db name wrong such as aa.cc

Can we add extra tests to verify the behaviour in case of failures? Is it the same error thrown? Are we skipping some checks along the way by calling the catalog functions?

juliuszsompolski · 2020-09-07T09:52:31Z

This may change how this skipped processing handles errors. I would add some tests that verify such behaviour.
cc @bogdanghit @CJStuart

AngersZhuuuu · 2020-09-07T10:32:32Z

This may change how this skipped processing handles errors. I would add some tests that verify such behaviour.
cc @bogdanghit @CJStuart

What I most want to do is make HiveClientImpl's client to be thread local var and remove lock to speed up query.

Update SparkSQLSessionManager.scala

a1cf29a

probot-autolabeler bot added the SQL label Sep 7, 2020

AngersZhuuuu changed the title ~~[SPARK-32807][SQL] ThriftServer open session slow when high concurrent when init current DB~~ [SPARK-32807][SQL] ThriftServer open session use direct API to init current DB Sep 7, 2020

wangyum reviewed Sep 7, 2020

View reviewed changes

...iftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkSQLSessionManager.scala Outdated Show resolved Hide resolved

Update SparkSQLSessionManager.scala

1f2fa6c

HyukjinKwon reviewed Sep 7, 2020

View reviewed changes

AngersZhuuuu closed this Sep 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-32807][SQL] ThriftServer open session use direct API to init current DB #29656

[SPARK-32807][SQL] ThriftServer open session use direct API to init current DB #29656

AngersZhuuuu commented Sep 7, 2020

AngersZhuuuu commented Sep 7, 2020

SparkQA commented Sep 7, 2020

SparkQA commented Sep 7, 2020

HyukjinKwon Sep 7, 2020

AngersZhuuuu Sep 7, 2020

bogdanghit Sep 8, 2020

juliuszsompolski commented Sep 7, 2020

AngersZhuuuu commented Sep 7, 2020

[SPARK-32807][SQL] ThriftServer open session use direct API to init current DB #29656

[SPARK-32807][SQL] ThriftServer open session use direct API to init current DB #29656

Conversation

AngersZhuuuu commented Sep 7, 2020

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

AngersZhuuuu commented Sep 7, 2020

SparkQA commented Sep 7, 2020

SparkQA commented Sep 7, 2020

HyukjinKwon Sep 7, 2020

Choose a reason for hiding this comment

AngersZhuuuu Sep 7, 2020

Choose a reason for hiding this comment

bogdanghit Sep 8, 2020

Choose a reason for hiding this comment

juliuszsompolski commented Sep 7, 2020

AngersZhuuuu commented Sep 7, 2020