[SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide #5001
Test build #28519 has started for PR 5001 at commit
Test build #28519 has finished for PR 5001 at commit
Test PASSed.
the path of each partition directory. The Parquet data source is now able to discover and infer
partitioning information automatically. For example, we can store all our previously used
population data into a partitioned table using the following directory structure, with two extra
columns, `sex` and `country`, as partitioning columns:
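As a rough illustration of the idea in the quoted passage, partition values can be read off `key=value` directory names in a file path. The sketch below is not Spark's implementation; the function name `infer_partition_values` and the path `path/to/table/...` are hypothetical, chosen only to mirror the `sex` and `country` columns mentioned above.

```python
from pathlib import PurePosixPath


def infer_partition_values(path, base):
    """Return {column: value} parsed from key=value directories under base.

    Hypothetical helper for illustration only; not part of any Spark API.
    """
    relative = PurePosixPath(path).relative_to(PurePosixPath(base))
    values = {}
    # Skip the last component, which is the data file itself.
    for segment in relative.parts[:-1]:
        key, sep, value = segment.partition("=")
        if sep:  # only key=value directories carry partition information
            values[key] = value
    return values


# Assumed example layout: <base>/sex=<...>/country=<...>/<file>
example = infer_partition_values(
    "path/to/table/sex=male/country=US/data.parquet", "path/to/table"
)
print(example)
```

In this sketch every value comes back as a string; inferring column types from those strings would be a separate step.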
can we use "gender"?
Ah, sure.
Test build #28540 has started for PR 5001 at commit
Test build #28540 has finished for PR 5001 at commit
Test PASSed.
lgtm
Also fixed a bunch of minor styling issues.

Author: Cheng Lian <lian@databricks.com>

Closes #5001 from liancheng/parquet-doc and squashes the following commits:

89ad3db [Cheng Lian] Addresses @rxin's comments
7eb6955 [Cheng Lian] Docs for the new Parquet data source
415eefb [Cheng Lian] Some minor formatting improvements

(cherry picked from commit 69ff8e8)
Signed-off-by: Cheng Lian <lian@databricks.com>