-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
ARROW-8318: [C++][Dataset] Construct FileSystemDataset from fragments
* Simplified FileSystemDataset to hold a FragmentVector. Each Fragment must be a FileFragment and is checked at `FileSystemDataset::Make`. Fragments are not required to use the same backing filesystem nor the same format. * Removed `FileSystemDataset::format` and `FileSystemDataset::partitions`. * Since FileInfo is not required by neither FileSystemDataset and FileSystemDatasetFactory, it is no possible to create a dataset without any IO involved. * Re-introduced the natural behavior of creating FileFragment with their full partition expressions instead of removing the ancestors common partitions. * Added `Expression::IsSatisfiableWith` method. * Added missing compression cmake options to archery. * Ensure FileSource holds a shared_ptr<FileSystem> pointer. This is required to refactor FileSystemDataset to support Buffer FileSource and heterogeneous FileSystems. * Rename `type` to `id`, following other classes.
- Loading branch information
1 parent
17404e1
commit d2263be
Showing
16 changed files
with
285 additions
and
365 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.