[KYUUBI #7106] Make response.results.columns optional #7107

fbertsch · 2025-06-18T15:32:53Z

Why are the changes needed?

Bugfix. Spark 3.5 is returning None for response.results.columns, while Spark 3.3 returned actual values.

The response is handled here: https://github.com/apache/kyuubi/blob/master/python/pyhive/hive.py#L507

For a query that returns no rows (mine was an add jar s3://a/b/c.jar), here are the responses I received.

Spark 3.3:

TFetchResultsResp(status=TStatus(statusCode=0, infoMessages=None, sqlState=None, errorCode=None, errorMessage=None), hasMoreRows=False, results=TRowSet(startRowOffset=0, rows=[], columns=[TColumn(boolVal=None, byteVal=None, i16Val=None, i32Val=None, i64Val=None, doubleVal=None, stringVal=TStringColumn(values=[], nulls=b'\x00'), binaryVal=None)], binaryColumns=None, columnCount=None))

Spark 3.5:

TFetchResultsResp(status=TStatus(statusCode=0, infoMessages=None, sqlState=None, errorCode=None, errorMessage=None), hasMoreRows=False, results=TRowSet(startRowOffset=0, rows=[], columns=None, binaryColumns=None, columnCount=None))
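To make the difference between the two responses concrete, here is a minimal sketch of how unguarded column handling reacts to each shape. The PR does not show the traceback, so this assumes the failure comes from iterating over None. `rows_unguarded` and `SCHEMA` are hypothetical names, and plain lists stand in for the Thrift TColumn objects.

```python
# Hypothetical stand-in for the cursor's schema: one (name, type) pair per column.
SCHEMA = [("result", "STRING_TYPE")]


def rows_unguarded(columns, schema):
    """Pair each column with its schema entry, then transpose columns to rows."""
    paired = [col for col, _col_schema in zip(columns, schema)]
    return list(zip(*paired))


# Spark 3.3 shape: one string column holding zero values, so there are no rows.
rows_33 = rows_unguarded([[]], SCHEMA)

# Spark 3.5 shape: columns is None, which zip() cannot iterate over.
try:
    rows_unguarded(None, SCHEMA)
    error_35 = None
except TypeError as exc:
    error_35 = exc
```

With the Spark 3.3 shape the result is simply an empty list, while the Spark 3.5 shape raises a TypeError before any rows are built.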

How was this patch tested?

I tested by applying it locally and running my query against Spark 3.5. I was not able to get any unit tests running, sorry!

Was this patch authored or co-authored using generative AI tooling?

No.

fbertsch · 2025-06-18T15:35:52Z

Fixes #7106

codecov-commenter · 2025-06-18T18:51:48Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 0.00%. Comparing base (5237227) to head (13d1440).

Additional details and impacted files

@@          Coverage Diff           @@
##           master   #7107   +/-   ##
======================================
  Coverage    0.00%   0.00%           
======================================
  Files         697     697           
  Lines       43214   43214           
  Branches     5855    5855           
======================================
  Misses      43214   43214

Copilot

Pull Request Overview

A bugfix to handle Spark 3.5 behavior when response.results.columns is None by making columns optional in the fetch results operation.

Introduces a new boolean flag (has_new_data) to check for actual returned data.
Safely updates the request state to finished when no new data is available.
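The behavior summarized above can be sketched as follows. This is a self-contained illustration under stated assumptions, not the merged patch: `absorb_fetch_result` and its parameters are hypothetical names, plain lists stand in for Thrift TColumn objects, and the column-unwrapping step done in pyhive is omitted.

```python
from typing import List, Optional, Sequence, Tuple


def absorb_fetch_result(
    columns: Optional[Sequence[Sequence[object]]],
    data: List[Tuple[object, ...]],
) -> bool:
    """Append fetched rows to `data` and report whether any rows arrived.

    `columns` mimics response.results.columns: a list of per-column value
    lists, or None (as reported for Spark 3.5 with a no-op statement).
    """
    has_new_data = False
    if columns:  # None and [] both mean there is nothing to unpack
        new_data = list(zip(*columns))  # columnar layout -> row tuples
        data += new_data
        has_new_data = bool(new_data)
    return has_new_data


data: List[Tuple[object, ...]] = []
# Spark 3.5 shape: columns is None, so the caller can mark the fetch finished.
finished_35 = not absorb_fetch_result(None, data)
# Spark 3.3 shape: one empty column, which also yields no rows.
finished_33 = not absorb_fetch_result([[]], data)
# A response carrying two columns and two rows.
got_rows = absorb_fetch_result([[1, 2], ["a", "b"]], data)
```

Returning the flag, rather than setting state inside the helper, keeps the sketch independent of any cursor class. In the real code the caller would presumably set the finished state when the flag is false.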

Copilot · 2025-06-18T20:22:39Z

python/pyhive/hive.py

+                       zip(response.results.columns, schema)]
+            new_data = list(zip(*columns))
+            self._data += new_data
+            has_new_data = (True if new_data else False)


[nitpick] Consider simplifying the assignment by using 'bool(new_data)' instead of the ternary operator to improve readability and clarity.

Suggested change

-            has_new_data = (True if new_data else False)
+            has_new_data = bool(new_data)

turboFei

LGTM, thanks

pan3793 · 2025-06-19T05:31:25Z

@fbertsch thank you for fixing this issue. Do you happen to know which Spark PR causes this behavior change?

fbertsch · 2025-06-20T15:01:16Z

> @fbertsch thank you for fixing this issue. Do you happen to know which Spark PR causes this behavior change?

I haven't been able to confirm, but I believe it's this change: https://issues.apache.org/jira/browse/SPARK-39041

That redid all the HiveThriftServer responses, and probably also changed the column responses.

fbertsch · 2025-06-20T15:01:42Z

@turboFei are you able to release a new version of PyHive with this included?

turboFei · 2025-06-20T23:28:58Z

> @turboFei are you able to release a new version of PyHive with this included?

cc @pan3793

pan3793 · 2025-06-23T15:34:15Z

Thanks, merged to master.

> are you able to release a new version of PyHive with this included?

This is definitely on our TODO list; hopefully we can make the first release in July.

Make response.results.columns optional

13d1440

turboFei requested a review from Copilot June 18, 2025 20:21

Copilot AI reviewed Jun 18, 2025

turboFei approved these changes Jun 18, 2025

pan3793 closed this in b49ed02 Jun 23, 2025

pan3793 assigned fbertsch Jun 23, 2025

pan3793 added this to the v1.11.0 milestone Jun 23, 2025
