Add new function under Class InteractiveQuery in queryservice.py to enable data load using pyspark.sql data frame #10

Open
yoyo6022 opened this issue Jun 8, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

yoyo6022 commented Jun 8, 2023

image

@yoyo6022 yoyo6022 changed the title Add new function under Class InteractiveQuery to enable data load using pyspark.sql data frame Add new function under Class InteractiveQuery in queryservice.py to enable data load using pyspark.sql data frame Jun 8, 2023
@pitchmuc pitchmuc added the enhancement New feature or request label Sep 30, 2024
@pitchmuc
Contributor

Hello @yoyo6022
Sorry for the late response; I'm not sure whether you still want this feature.
However, I feel that returning a PySpark DataFrame would add a dependency on pyspark, which is less commonly used than pandas.
That dependency would apply to all users of the interactive query service.
What I would recommend instead is to describe the conversion to a PySpark DataFrame in our documentation.

# Pandas to Spark
df_sp = spark_session.createDataFrame(df_pd)

I think it should be clear enough.
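
For anyone who wants the one-liner above wrapped up, here is a minimal sketch of a user-side helper. It is not part of this library: the name `pandas_to_spark` and its parameters are hypothetical. It assumes `df_pd` is the pandas DataFrame you already got back from a query, and it imports pyspark lazily so that pandas-only users never need it installed.

```python
from typing import Any, Optional


def pandas_to_spark(df_pd: Any, spark_session: Optional[Any] = None) -> Any:
    """Convert a pandas DataFrame into a PySpark DataFrame.

    Hypothetical helper for illustration only, not an API of this project.
    If no session is passed, one is obtained from pyspark on demand.
    """
    if spark_session is None:
        # Lazy import: pyspark is only required on this code path.
        from pyspark.sql import SparkSession

        spark_session = SparkSession.builder.getOrCreate()
    # Same call as in the comment above: Pandas to Spark.
    return spark_session.createDataFrame(df_pd)
```

With an existing session this would be called as `df_sp = pandas_to_spark(df_pd, spark_session)`. Keeping the conversion on the caller's side is what avoids the extra dependency for everyone else.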
