-
Notifications
You must be signed in to change notification settings - Fork 28.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-32183][DOCS][PYTHON] User Guide - PySpark Usage Guide for Pandas with Apache Arrow #29548
Conversation
cc @rohitmishr1484 and @fhoering FYI |
This comment has been minimized.
This comment has been minimized.
Test build #127925 has finished for PR 29548 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Few typos are found. Otherwise looks good to me. Thanks.
Thank you @viirya for reviewing this! |
GitHub Actions builds passed. Merged to master. |
Test build #127979 has finished for PR 29548 at commit
|
Sorry for the late review.. Looks great though, thanks! |
What changes were proposed in this pull request?
This PR proposes to move Arrow usage guide from Spark documentation site to PySpark documentation site (at "User Guide").
Here is the demo for reviewing quicker: https://hyukjin-spark.readthedocs.io/en/stable/user_guide/arrow_pandas.html
Why are the changes needed?
To have a single place for PySpark users, and better documentation.
Does this PR introduce any user-facing change?
Yes, it will move https://spark.apache.org/docs/latest/sql-pyspark-pandas-with-arrow.html to our PySpark documentation.
How was this patch tested?
cd docs SKIP_SCALADOC=1 SKIP_RDOC=1 SKIP_SQLDOC=1 jekyll serve --watch
and
cd python/docs make clean html