[SPARK-43302][SQL][FOLLOWUP] Code cleanup for PythonUDAF #41142

cloud-fan · 2023-05-11T15:34:41Z

What changes were proposed in this pull request?

This is a followup of #40739 to do some code cleanup

remove the pattern PYTHON_UDAF as it's not used by any rule.
add PythonFuncExpression.evalType for convenience: catalyst rules (including third-party extensions) may want to get the eval type of a python function, no matter it's UDF or UDAF.
update the python profile to use PythonUDAF.resultId instead of AggregateExpression.resultId, to be consistent with PythonUDF

Why are the changes needed?

code cleanup

Does this PR introduce any user-facing change?

no

How was this patch tested?

existing tests

cloud-fan · 2023-05-11T15:35:17Z

cc @HyukjinKwon

HyukjinKwon · 2023-05-14T07:03:27Z

python/pyspark/sql/udf.py

                sc.profiler_collector.add_profiler(id, memory_profiler)
        else:
            judf = self._judf
            jPythonUDF = judf.apply(_to_seq(sc, cols, _to_java_column))
        return Column(jPythonUDF)

+    def _get_UDF_id(self, jexpr: JavaObject) -> int:


I think should be lowercased

Suggested change

def _get_UDF_id(self, jexpr: JavaObject) -> int:

def _get_udf_id(self, jexpr: JavaObject) -> int:

python/pyspark/sql/udf.py

yaooqinn · 2023-05-17T05:39:06Z

thanks, merged to master

minor

cadc302

github-actions bot added CORE PYTHON SQL labels May 11, 2023

HyukjinKwon reviewed May 14, 2023

View reviewed changes

HyukjinKwon approved these changes May 14, 2023

View reviewed changes

cloud-fan added 2 commits May 15, 2023 17:30

Update udf.py

d2bb649

Update PythonUDF.scala

91ce8bd

cloud-fan commented May 15, 2023

View reviewed changes

python/pyspark/sql/udf.py Outdated Show resolved Hide resolved

cloud-fan and others added 4 commits May 15, 2023 21:11

Update python/pyspark/sql/udf.py

4104e04

fix style

a1ca9ee

fix

7c3fc10

fix

a90b8f0

cloud-fan force-pushed the follow branch from a92615a to a90b8f0 Compare May 16, 2023 13:04

Update udf.py

b0b3f11

yaooqinn approved these changes May 17, 2023

View reviewed changes

yaooqinn closed this in fddf25a May 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-43302][SQL][FOLLOWUP] Code cleanup for PythonUDAF #41142

[SPARK-43302][SQL][FOLLOWUP] Code cleanup for PythonUDAF #41142

cloud-fan commented May 11, 2023

cloud-fan commented May 11, 2023

HyukjinKwon May 14, 2023

yaooqinn commented May 17, 2023

	def _get_UDF_id(self, jexpr: JavaObject) -> int:
	def _get_udf_id(self, jexpr: JavaObject) -> int:

[SPARK-43302][SQL][FOLLOWUP] Code cleanup for PythonUDAF #41142

[SPARK-43302][SQL][FOLLOWUP] Code cleanup for PythonUDAF #41142

Conversation

cloud-fan commented May 11, 2023

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

cloud-fan commented May 11, 2023

HyukjinKwon May 14, 2023

Choose a reason for hiding this comment

yaooqinn commented May 17, 2023