-
Notifications
You must be signed in to change notification settings - Fork 28.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MINOR][DOC] Update Partition Discovery
section to enumerate all available file sources
#19139
Conversation
Are the partition discovery applicable to all the built-in data sources? |
Test build #81421 has finished for PR 19139 at commit
|
As you pointed out, I checked it again. Right, |
|
@gatorsmile . |
Test build #81423 has finished for PR 19139 at commit
|
docs/sql-programming-guide.md
Outdated
@@ -733,7 +733,7 @@ SELECT * FROM parquetTable | |||
|
|||
Table partitioning is a common optimization approach used in systems like Hive. In a partitioned | |||
table, data are usually stored in different directories, with partitioning column values encoded in | |||
the path of each partition directory. The Parquet data source is now able to discover and infer | |||
the path of each partition directory. All built-in data sources are able to discover and infer |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-> All built-in file sources (including ...)
You know, it is not applicable to JDBC
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oops. Yep. I'll enumerate it.
BTW, please update the PR title. |
Partition Discovery
section.Partition Discovery
section to enumerate all available data sources
I added |
Test build #81428 has finished for PR 19139 at commit
|
docs/sql-programming-guide.md
Outdated
@@ -733,8 +733,9 @@ SELECT * FROM parquetTable | |||
|
|||
Table partitioning is a common optimization approach used in systems like Hive. In a partitioned | |||
table, data are usually stored in different directories, with partitioning column values encoded in | |||
the path of each partition directory. The Parquet data source is now able to discover and infer | |||
partitioning information automatically. For example, we can store all our previously used | |||
the path of each partition directory. All built-in data sources (including TEXT/CSV/JSON/ORC/Parquet) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
data sources
-> file sources
TEXT
-> Text
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks. It's updated.
LGTM except a minor comment |
Test build #81429 has finished for PR 19139 at commit
|
Partition Discovery
section to enumerate all available data sourcesPartition Discovery
section to enumerate all available file sources
Thanks! Merged to master/2.2 |
…ailable file sources ## What changes were proposed in this pull request? All built-in data sources support `Partition Discovery`. We had better update the document to give the users more benefit clearly. **AFTER** <img width="906" alt="1" src="https://user-images.githubusercontent.com/9700541/30083628-14278908-9244-11e7-98dc-9ad45fe233a9.png"> ## How was this patch tested? ``` SKIP_API=1 jekyll serve --watch ``` Author: Dongjoon Hyun <dongjoon@apache.org> Closes #19139 from dongjoon-hyun/partitiondiscovery. (cherry picked from commit 9e451bc) Signed-off-by: gatorsmile <gatorsmile@gmail.com>
Thank you, @gatorsmile ! |
…ailable file sources ## What changes were proposed in this pull request? All built-in data sources support `Partition Discovery`. We had better update the document to give the users more benefit clearly. **AFTER** <img width="906" alt="1" src="https://user-images.githubusercontent.com/9700541/30083628-14278908-9244-11e7-98dc-9ad45fe233a9.png"> ## How was this patch tested? ``` SKIP_API=1 jekyll serve --watch ``` Author: Dongjoon Hyun <dongjoon@apache.org> Closes apache#19139 from dongjoon-hyun/partitiondiscovery. (cherry picked from commit 9e451bc) Signed-off-by: gatorsmile <gatorsmile@gmail.com>
What changes were proposed in this pull request?
All built-in data sources support
Partition Discovery
. We had better update the document to give the users more benefit clearly.AFTER
How was this patch tested?