[SPARK-54014][CONNECT] Support max rows for SparkConnectStatement #52742

pan3793 · 2025-10-27T10:07:56Z

What changes were proposed in this pull request?

This PR implements the following two methods of the java.sql.Statement interface for SparkConnectStatement:

    /**
     * Retrieves the maximum number of rows that a
     * {@code ResultSet} object produced by this
     * {@code Statement} object can contain.  If this limit is exceeded,
     * the excess rows are silently dropped.
     *
     * @return the current maximum number of rows for a {@code ResultSet}
     *         object produced by this {@code Statement} object;
     *         zero means there is no limit
     * @throws SQLException if a database access error occurs or
     * this method is called on a closed {@code Statement}
     * @see #setMaxRows
     */
    int getMaxRows() throws SQLException;

    /**
     * Sets the limit for the maximum number of rows that any
     * {@code ResultSet} object  generated by this {@code Statement}
     * object can contain to the given number.
     * If the limit is exceeded, the excess
     * rows are silently dropped.
     *
     * @param max the new max rows limit; zero means there is no limit
     * @throws SQLException if a database access error occurs,
     * this method is called on a closed {@code Statement}
     *            or the condition {@code max >= 0} is not satisfied
     * @see #getMaxRows
     */
    void setMaxRows(int max) throws SQLException;
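
To make the contract in the Javadoc above concrete, here is a minimal, self-contained sketch of the behavior it describes: zero means no limit, a negative value is rejected with a SQLException, calls on a closed statement fail, and rows beyond the limit are dropped silently. `MaxRowsSketch` and its `limit` helper are hypothetical names used only for illustration; this is not the code added to SparkConnectStatement, which may enforce the limit differently (for example, on the server side).

```java
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration of the java.sql.Statement max-rows contract.
// This is NOT the SparkConnectStatement implementation.
final class MaxRowsSketch {
    private int maxRows = 0;        // 0 means "no limit", per the Javadoc
    private boolean closed = false;

    int getMaxRows() throws SQLException {
        checkOpen();
        return maxRows;
    }

    void setMaxRows(int max) throws SQLException {
        checkOpen();
        // The contract requires max >= 0; anything else is an error.
        if (max < 0) {
            throw new SQLException("max rows must be >= 0, got " + max);
        }
        maxRows = max;
    }

    void close() {
        closed = true;
    }

    // Collects at most maxRows rows; rows past the limit are silently dropped.
    <T> List<T> limit(Iterator<T> rows) throws SQLException {
        checkOpen();
        List<T> out = new ArrayList<>();
        while (rows.hasNext() && (maxRows == 0 || out.size() < maxRows)) {
            out.add(rows.next());
        }
        return out;
    }

    private void checkOpen() throws SQLException {
        if (closed) {
            throw new SQLException("statement is closed");
        }
    }

    public static void main(String[] args) throws SQLException {
        MaxRowsSketch stmt = new MaxRowsSketch();
        stmt.setMaxRows(2);
        // Five input rows, limit of two: prints [1, 2]
        System.out.println(stmt.limit(List.of(1, 2, 3, 4, 5).iterator()));
    }
}
```

The sketch truncates on the client while iterating, which is the simplest way to show the "silently dropped" semantics; a real driver could equally push the limit into the query plan.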

Why are the changes needed?

Implement more JDBC APIs.

Does this PR introduce any user-facing change?

No, it's a new feature.

How was this patch tested?

New unit tests are added.

Was this patch authored or co-authored using generative AI tooling?

No.

pan3793 · 2025-10-27T14:36:26Z

cc @LuciferYang @dongjoon-hyun


pan3793 · 2025-11-05T09:39:33Z

Rebased on master to resolve the Python protobuf version mismatch issue (no code change).

Closes #52742 from pan3793/SPARK-54014.

Authored-by: Cheng Pan <chengpan@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit 07cab00)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

dongjoon-hyun · 2025-11-05T15:44:34Z

Merged to master. Thank you, @pan3793 and @LuciferYang.

LuciferYang · 2025-11-06T03:30:11Z

I'm planning to backport this PR to branch-4.1. Additionally, to make this feature basically available in Spark 4.1, there are nine other pending tickets that I hope can be completed and merged into branch-4.1 by November 15th: https://issues.apache.org/jira/browse/SPARK-53484

If you have any other suggestions, please let me know. @dongjoon-hyun Thanks ~

dongjoon-hyun · 2025-11-06T05:32:51Z

Oh, I already backported this to branch-4.1 when I merged it. Sorry for the confusion caused by my earlier incorrect comment, @LuciferYang. Here is the commit:

7159903

And yes, of course, you can backport all late-arriving patches until November 15th. So feel free to proceed as a member of the Apache Spark PMC.

pan3793 · 2025-11-06T05:53:57Z

@dongjoon-hyun @LuciferYang, many thanks for your help in advancing this new feature.

LuciferYang · 2025-11-06T06:24:12Z

Thank you for your clarification and support. @dongjoon-hyun

Closes apache#52742 from pan3793/SPARK-54014.

Authored-by: Cheng Pan <chengpan@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

github-actions bot added the SQL and CONNECT labels on Oct 27, 2025

pan3793 force-pushed the SPARK-54014 branch 2 times, most recently from 84093f3 to a669c75 on October 27, 2025 12:59

LuciferYang reviewed Oct 29, 2025

pan3793 force-pushed the SPARK-54014 branch 2 times, most recently from 1116ffc to 3540af6 on November 3, 2025 08:37

LuciferYang reviewed Nov 5, 2025

...ent/jdbc/src/main/scala/org/apache/spark/sql/connect/client/jdbc/SparkConnectStatement.scala (outdated, resolved)

LuciferYang reviewed Nov 5, 2025

...ent/jdbc/src/main/scala/org/apache/spark/sql/connect/client/jdbc/SparkConnectStatement.scala (outdated, resolved)

dongjoon-hyun reviewed Nov 5, 2025

...ent/jdbc/src/main/scala/org/apache/spark/sql/connect/client/jdbc/SparkConnectStatement.scala (resolved)

dongjoon-hyun approved these changes Nov 5, 2025

LuciferYang approved these changes Nov 5, 2025

pan3793 added 2 commits November 5, 2025 17:37

[SPARK-54014][CONNECT] Support setMaxRows for SparkConnectStatement

5bc3897

rename limitRow to maxRows

750e5a2

pan3793 force-pushed the SPARK-54014 branch from 9c51d3d to 750e5a2 on November 5, 2025 09:38

dongjoon-hyun closed this in 07cab00 Nov 5, 2025
