[SPARK-19157][SQL] should be able to change spark.sql.runSQLOnFiles at runtime #16531

cloud-fan · 2017-01-10T16:55:21Z

What changes were proposed in this pull request?

The analyzer rule that supports to query files directly will be added to Analyzer.extendedResolutionRules when SparkSession is created, according to the spark.sql.runSQLOnFiles flag. If the flag is off when we create SparkSession, this rule is not added and we can not query files directly even we turn on the flag later.

This PR fixes this bug by always adding that rule to Analyzer.extendedResolutionRules.

How was this patch tested?

new regression test

cloud-fan · 2017-01-10T16:55:47Z

cc @gatorsmile

tejasapatil · 2017-01-10T17:05:01Z

sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala

@@ -45,7 +45,7 @@ import org.apache.spark.unsafe.types.UTF8String
 * Replaces generic operations with specific variants that are designed to work with Spark
 * SQL Data Sources.
 */
-case class DataSourceAnalysis(conf: CatalystConf) extends Rule[LogicalPlan] {


I looked at other places in the diff and not clear why you changed this to NOT be a case class.

The same question. What is the reason why we made this change?

SparkQA · 2017-01-10T19:21:50Z

Test build #71136 has finished for PR 16531 at commit 963b66f.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
class DataSourceAnalysis(conf: CatalystConf) extends Rule[LogicalPlan]

gatorsmile · 2017-01-10T21:08:42Z

sql/core/src/main/scala/org/apache/spark/sql/internal/SessionState.scala

@@ -116,8 +116,8 @@ private[sql] class SessionState(sparkSession: SparkSession) {
        AnalyzeCreateTable(sparkSession) ::
        PreprocessTableInsertion(conf) ::
        new FindDataSourceTable(sparkSession) ::
-        DataSourceAnalysis(conf) ::
-        (if (conf.runSQLonFile) new ResolveDataSource(sparkSession) :: Nil else Nil)


+1
This was the root cause why we were unable to change the conf at runtime.

gatorsmile · 2017-01-10T22:21:44Z

LGTM except the question

gatorsmile · 2017-01-11T04:17:24Z

LGTM pending test

SparkQA · 2017-01-11T04:35:41Z

Test build #71174 has finished for PR 16531 at commit 20b2d95.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-01-11T05:34:43Z

Thanks! Merging to master.

…t runtime ## What changes were proposed in this pull request? The analyzer rule that supports to query files directly will be added to `Analyzer.extendedResolutionRules` when SparkSession is created, according to the `spark.sql.runSQLOnFiles` flag. If the flag is off when we create `SparkSession`, this rule is not added and we can not query files directly even we turn on the flag later. This PR fixes this bug by always adding that rule to `Analyzer.extendedResolutionRules`. ## How was this patch tested? new regression test Author: Wenchen Fan <wenchen@databricks.com> Closes apache#16531 from cloud-fan/sql-on-files.

tejasapatil reviewed Jan 10, 2017

View reviewed changes

gatorsmile reviewed Jan 10, 2017

View reviewed changes

should be able to change spark.sql.runSQLOnFiles at runtime

20b2d95

cloud-fan force-pushed the sql-on-files branch from 963b66f to 20b2d95 Compare January 11, 2017 01:57

asfgit closed this in 3b19c74 Jan 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-19157][SQL] should be able to change spark.sql.runSQLOnFiles at runtime #16531

[SPARK-19157][SQL] should be able to change spark.sql.runSQLOnFiles at runtime #16531

cloud-fan commented Jan 10, 2017

cloud-fan commented Jan 10, 2017

tejasapatil Jan 10, 2017

gatorsmile Jan 10, 2017

SparkQA commented Jan 10, 2017

gatorsmile Jan 10, 2017

gatorsmile commented Jan 10, 2017

gatorsmile commented Jan 11, 2017

SparkQA commented Jan 11, 2017

gatorsmile commented Jan 11, 2017

[SPARK-19157][SQL] should be able to change spark.sql.runSQLOnFiles at runtime #16531

[SPARK-19157][SQL] should be able to change spark.sql.runSQLOnFiles at runtime #16531

Conversation

cloud-fan commented Jan 10, 2017

What changes were proposed in this pull request?

How was this patch tested?

cloud-fan commented Jan 10, 2017

tejasapatil Jan 10, 2017

Choose a reason for hiding this comment

gatorsmile Jan 10, 2017

Choose a reason for hiding this comment

SparkQA commented Jan 10, 2017

gatorsmile Jan 10, 2017

Choose a reason for hiding this comment

gatorsmile commented Jan 10, 2017

gatorsmile commented Jan 11, 2017

SparkQA commented Jan 11, 2017

gatorsmile commented Jan 11, 2017