Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SNOW-1433018: Seeking Snowpark User Feedback: Experimental Performance Improvement #1612

Open
sfc-gh-jdu opened this issue May 16, 2024 · 0 comments
Assignees
Labels
enhancement The issue is a request for improvement or a new feature

Comments

@sfc-gh-jdu
Copy link
Collaborator

Hi all,

We have an exciting query performance optimization released with Snowpark Python v1.15.0, and we're looking for testing it with production workloads. This optimization will convert duplicate subqueries to CTEs automatically and reduce both query compilation and computation time. We would encourage you to try out this experimental feature if you

  1. use the same Snowpark DataFrame multiple times to build another DataFrame in your workloads, e.g.
df = ...
df1 = df.filter(col("a") == 1)
df2 = df.with_column("c", lit(1))
df3 = df1.join(df2)

OR

  1. previously have manually used df.cache_result() to improve the performance by saving the intermediate result.

Feel free to try this optimization by setting

session.cte_optimization_enabled = True 

at the beginning of code (then no cache_result() is needed), and watch the performance.

If you have any questions, feel free to reach out to me via jianzhun.du@snowflake.com or comment under this issue directly. Any input is appreciated!

@sfc-gh-jdu sfc-gh-jdu added the enhancement The issue is a request for improvement or a new feature label May 16, 2024
@sfc-gh-jdu sfc-gh-jdu self-assigned this May 16, 2024
@sfc-gh-jdu sfc-gh-jdu pinned this issue May 16, 2024
@github-actions github-actions bot changed the title Seeking Feedback: Experimental Performance Improvement SNOW-1433018: Seeking Feedback: Experimental Performance Improvement May 16, 2024
@sfc-gh-jdu sfc-gh-jdu changed the title SNOW-1433018: Seeking Feedback: Experimental Performance Improvement SNOW-1433018: Seeking User Feedback: Experimental Performance Improvement May 16, 2024
@sfc-gh-jdu sfc-gh-jdu changed the title SNOW-1433018: Seeking User Feedback: Experimental Performance Improvement SNOW-1433018: Seeking Snowpark User Feedback: Experimental Performance Improvement May 16, 2024
@sfc-gh-helmeleegy sfc-gh-helmeleegy unpinned this issue May 23, 2024
@sfc-gh-jdu sfc-gh-jdu pinned this issue Jun 3, 2024
@sfc-gh-helmeleegy sfc-gh-helmeleegy unpinned this issue Jun 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement The issue is a request for improvement or a new feature
Projects
None yet
Development

No branches or pull requests

1 participant