Skip to content

Commit

Permalink
Source S3: Doc fix grammar & typo describing parquet file source (#16264
Browse files Browse the repository at this point in the history
)
  • Loading branch information
Bhupesh-V committed Sep 5, 2022
1 parent e1596bc commit 498d700
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/integrations/sources/s3.md
Original file line number Diff line number Diff line change
Expand Up @@ -185,7 +185,7 @@ Since CSV files are effectively plain text, providing specific reader options is

### Parquet

Apache Parquet file is a column-oriented data storage format of the Apache Hadoop ecosystem. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk. For now this solutiion are iterating through individual files at the abstract-level thus partitioned parquet datasets are unsupported. The following settings are available:
Apache Parquet file is a column-oriented data storage format of the Apache Hadoop ecosystem. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk. For now, the solution involves iterating through individual files at the abstract level thus partitioned parquet datasets are unsupported. The following settings are available:

* `buffer_size` : If positive, perform read buffering when deserializing individual column chunks. Otherwise IO calls are unbuffered.
* `columns` : If not None, only these columns will be read from the file.
Expand Down

0 comments on commit 498d700

Please sign in to comment.