
[SPARK-52570][PS] Enable divide-by-zero for numeric rmod with ANSI enabled #51275


Closed
wants to merge 5 commits

Conversation

Member

@xinrong-meng commented Jun 24, 2025

What changes were proposed in this pull request?

Enable divide-by-zero for numeric rmod with ANSI enabled

Why are the changes needed?

Part of https://issues.apache.org/jira/browse/SPARK-52169.

Does this PR introduce any user-facing change?

Yes.

>>> ps.set_option("compute.fail_on_ansi_mode", False)
>>> ps.set_option("compute.ansi_mode_support", True)

>>> pdf = pd.DataFrame({"a": [0], "b": [False]})
>>> pdf.dtypes
a    int64
b     bool
dtype: object
>>> psdf = ps.from_pandas(pdf)
>>> 1 % psdf["a"]
0   NaN
Name: a, dtype: float64
>>> 1 % psdf["b"]
0   NaN
Name: b, dtype: float64
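
For comparison, plain pandas also yields NaN for mod by zero, which is the behavior this change matches (continuing the same session as above):

>>> 1 % pdf["a"]
0   NaN
Name: a, dtype: float64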

How was this patch tested?

Unit tests.

(dev3.11) spark (bool_mod_new) % SPARK_ANSI_SQL_MODE=true ./python/run-tests --python-executables=python3.11 --testnames "pyspark.pandas.tests.data_type_ops.test_boolean_ops"
...
Tests passed in 4 seconds

Was this patch authored or co-authored using generative AI tooling?

No.

spark_session = left._internal.spark_frame.sparkSession

def safe_mod(left_col: PySparkColumn, right_val: Any) -> PySparkColumn:
    if is_ansi_mode_enabled(spark_session):
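
For readers without the full diff: the snippet above is truncated at the review anchor. A minimal sketch of the guarded pattern under ANSI mode might look like the following (illustrative names, not the PR's exact code):

from typing import Any
from pyspark.sql import Column
from pyspark.sql import functions as F

def safe_mod_sketch(left_col: Column, right_val: Any) -> Column:
    # Under ANSI mode `%` raises on a zero divisor, so guard it and
    # return NULL (which becomes NaN once converted to pandas),
    # matching plain pandas behavior.
    return F.when(F.lit(right_val) == 0, F.lit(None)).otherwise(
        left_col % F.lit(right_val)
    )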
Contributor

Just curious, why don't we need to check ansi_mode here anymore?

Member Author

Bools are considered numeric and will take the num_ops logic, if that makes sense.
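
(Illustration, in plain Python rather than pandas-on-Spark: bool is a subclass of int, which is why the numeric path applies.)

>>> isinstance(True, int)
True
>>> 1 % True   # bool behaves as 0/1 in arithmetic
0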

Member Author

Merged to unblock PRs; please let me know if you have further concerns :)

@xinrong-meng
Member Author

Merged to master, thanks!

@@ -225,11 +225,12 @@ def test_binary_operator_floordiv(self):

     def test_binary_operator_mod(self):
         # Positive
-        pdf = pd.DataFrame({"a": [3], "b": [2]})
+        pdf = pd.DataFrame({"a": [3], "b": [2], "c": [0]})
Member

Seems like this change broke non-ANSI mode:


======================================================================
ERROR [4.586s]: test_binary_operator_mod (pyspark.pandas.tests.computation.test_binary_ops.FrameBinaryOpsTests.test_binary_operator_mod)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/__w/spark/spark/python/pyspark/pandas/tests/computation/test_binary_ops.py", line 233, in test_binary_operator_mod
    self.assert_eq(1 % psdf["c"], 1 % pdf["c"])
                   ~~^~~~~~~~~~~
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 386, in __rmod__
    return self._dtype_op.rmod(self, other)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/data_type_ops/num_ops.py", line 177, in rmod
    return column_op(safe_rmod)(left, right)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 222, in wrapper
    scol = f(
           ^^
  File "/__w/spark/spark/python/pyspark/pandas/data_type_ops/num_ops.py", line 175, in safe_rmod
    return ((right % left) + left) % left
             ~~~~~~^~~~~~
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 386, in __rmod__
    return self._dtype_op.rmod(self, other)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/data_type_ops/num_ops.py", line 177, in rmod
    return column_op(safe_rmod)(left, right)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 222, in wrapper
    scol = f(
           ^^
  File "/__w/spark/spark/python/pyspark/pandas/data_type_ops/num_ops.py", line 175, in safe_rmod
    return ((right % left) + left) % left
             ~~~~~~^~~~~~
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 386, in __rmod__
    return self._dtype_op.rmod(self, other)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/data_type_ops/num_ops.py", line 177, in rmod
    return column_op(safe_rmod)(left, right)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/__w/spark/spark/python/pyspark/pandas/base.py", line 222, in wrapper
    scol = f(
           ^^

https://github.com/apache/spark/actions/runs/15987607479/job/45094971366

Let me revert it for now.
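
For the record, the repeated frames above show why it recursed: in the non-ANSI path, `right % left` inside `safe_rmod` dispatches back to `__rmod__` on the pandas-on-Spark object rather than to a Spark `Column`, so `rmod` re-enters itself until the recursion limit is hit. A minimal sketch of what the non-ANSI branch would need (illustrative, not the actual fix):

from typing import Any
from pyspark.sql import Column
from pyspark.sql import functions as F

def safe_rmod_sketch(left_col: Column, right_val: Any) -> Column:
    # Lift the Python scalar into a Column first so `%` resolves to
    # Column.__mod__ and never re-enters the Series operator.
    right_lit = F.lit(right_val)
    # Python-style modulo: the result takes the sign of the divisor.
    return ((right_lit % left_col) + left_col) % left_col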
