-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Update documentation for COPY command #9931
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| OPTIONS( | ||
| NULL_VALUE 'NAN' | ||
| ); | ||
| my_table(a bigint, b bigint) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I found the existing formatting hard to read, so I added some whitespace
|
|
||
| ### COPY Specific Options | ||
|
|
||
| The following special options are specific to the `COPY` command. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These options are now specified directly in the DML syntax itself, so I removed them from here
| [ OPTIONS( <i><b>option</i></b> [, ... ] ) ] | ||
| </pre> | ||
|
|
||
| `STORED AS` specifies the file format the `COPY` command will write. If this |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ported / reworded this content from write options page
| format parquet, | ||
| compression snappy, | ||
| 'compression::col1' 'zstd(5)', | ||
| partition_by 'column3, column4' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this the correct format? Based on #9927 the partition_by has moved to the DML and it should be something like: COPY t1 TO '/tmp/hive_output/' PARTITIONED BY (col1) OPTIONS (format parquet);
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
that is an excellent point -- I fixed it in af55db8 (I also tested that it works locally):
❯ create table source_table as values ('1','2','3','4');
0 row(s) fetched.
Elapsed 0.021 seconds.
❯ COPY source_table
TO 'test/table_with_options'
PARTITIONED BY (column3, column4)
OPTIONS (
format parquet,
compression snappy,
'compression::column1' 'zstd(5)',
)
;
+-------+
| count |
+-------+
| 1 |
+-------+|
Thank you for the review @hveiga 🙏 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm! Thank you for these changes!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Which issue does this PR close?
closes #9927
Rationale for this change
Looks like I missed a spot while updating the docs in #9754
What changes are included in this PR?
Are these changes tested?
CI doc checks
Are there any user-facing changes?
Docs