Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
ARROW-8058: [Dataset] Relax DatasetFactory discovery validation
This PR aims to improve the latency of the discovery process. Notably, it selects "fast" defaults over "safe" defaults. - Add `InspectOptions` which limits the number of fragments inspected to infer the schema, it defaults to one fragment. - Add `FinishOptions` which toggles if validation of the optional schema and also controls the number of fragments it validates with. It defaults to disabling validation. This gives a noticeable speedup when the fragments have a uniform schema. Closes #6687 from fsaintjacques/ARROW-8058-optional-discovery-validation Authored-by: François Saint-Jacques <fsaintjacques@gmail.com> Signed-off-by: Benjamin Kietzman <bengilgit@gmail.com>
- Loading branch information
1 parent
815531c
commit 17b9980
Showing
8 changed files
with
200 additions
and
106 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.