
[WIP][SPARK-40309][PYTHON][PS] Introduce sql_conf context manager for pyspark.sql#37777

Closed
xinrong-meng wants to merge 5 commits into apache:master from xinrong-meng:sql_conf

Conversation

@xinrong-meng
Member

@xinrong-meng xinrong-meng commented Sep 2, 2022

What changes were proposed in this pull request?

Introduce sql_conf context manager for pyspark.sql.

Why are the changes needed?

This simplifies control of the Spark SQL configuration for a code block, as shown below,

from

original_value = spark.conf.get("key")
spark.conf.set("key", "value")
...
spark.conf.set("key", original_value)

to

with sql_conf({"key": "value"}):
    ...
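The set-then-restore pattern above can be sketched as a generic context manager. This is a minimal sketch, not the PR's actual implementation: `_FakeConf` is a hypothetical stand-in for `spark.conf` so the example runs without a Spark session, and `sql_conf_sketch` is a hypothetical name.

```python
from contextlib import contextmanager
from typing import Any, Dict, Iterator, Optional


class _FakeConf:
    """Hypothetical stand-in for spark.conf (get/set/unset on string keys)."""

    def __init__(self) -> None:
        self._store: Dict[str, str] = {}

    def get(self, key: str, default: Optional[str] = None) -> Optional[str]:
        return self._store.get(key, default)

    def set(self, key: str, value: Any) -> None:
        self._store[key] = str(value)

    def unset(self, key: str) -> None:
        self._store.pop(key, None)


@contextmanager
def sql_conf_sketch(pairs: Dict[str, Any], conf: _FakeConf) -> Iterator[None]:
    """Set each key for the duration of the block, then restore the old values."""
    assert isinstance(pairs, dict), "pairs should be a dictionary."
    keys = list(pairs.keys())
    old_values = [conf.get(key) for key in keys]  # snapshot before mutating
    for key, value in pairs.items():
        conf.set(key, value)
    try:
        yield
    finally:
        # Restore previous values; unset keys that were not set before.
        for key, old in zip(keys, old_values):
            if old is None:
                conf.unset(key)
            else:
                conf.set(key, old)


conf = _FakeConf()
conf.set("key", "original")
with sql_conf_sketch({"key": "temporary"}, conf):
    assert conf.get("key") == "temporary"
assert conf.get("key") == "original"  # restored on exit
```

The `try`/`finally` matters: the original value is restored even if the block raises, which the manual set/restore pattern above does not guarantee.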

Such a context manager already exists in the Pandas API on Spark (pyspark.pandas.utils.sql_conf).

We should introduce one in pyspark.sql, and deduplicate code if possible.

Does this PR introduce any user-facing change?

Yes. Users may use the context manager to manage the Spark SQL configuration for a code block.

For example,

>>> from pyspark.sql.utils import sql_conf
>>> with sql_conf({"spark.sql.execution.arrow.pyspark.enabled": True}):
...    pdf = sdf.toPandas()

How was this patch tested?

Unit tests.

"""
from pyspark.sql.session import SparkSession

assert isinstance(pairs, dict), "pairs should be a dictionary."
Member Author


Retain the assertion to stay consistent with pyspark.pandas.utils.sql_conf.

@xinrong-meng xinrong-meng changed the title [SPARK-40309][PYTHON] Introduce sql_conf context manager for pyspark.sql [SPARK-40309][PYTHON][PS] Introduce sql_conf context manager for pyspark.sql Sep 2, 2022
@xinrong-meng xinrong-meng changed the title [SPARK-40309][PYTHON][PS] Introduce sql_conf context manager for pyspark.sql [WIP][SPARK-40309][PYTHON][PS] Introduce sql_conf context manager for pyspark.sql Sep 2, 2022
@AmplabJenkins
Copy link

Can one of the admins verify this patch?



@contextmanager
def sql_conf(pairs: Dict[str, Any], *, spark: Optional["SparkSession"] = None) -> Iterator[None]:
Member


We should probably name it sqlConf, since we follow camelCase in this API. In addition, it should be importable via something like from pyspark.sql import sqlConf. It should also be documented in the API reference at python/docs.
