Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

importccl: replace the undocumented data format MYSQLOUTFILE with DELIMITED #40482

Merged
merged 1 commit into from Sep 4, 2019

Conversation

spaskob
Copy link
Contributor

@spaskob spaskob commented Sep 4, 2019

MYSQLOUTFILE was originally used to help a client import data that was
not in csv proper format. This turns out to be useful for other users so
we call the new format DELIMITED. It's very fast and simple format for importing
delimited data disregarding issues with quoting. For example the csv format forbids
field1,fieldsth2 as such fields that contain quotes have to be enclosed in
quotes themselves. The whole format is fully described here
https://dev.mysql.com/doc/refman/8.0/en/load-data.html.

Fixes #39820.

Release note (cli change): add a new IMPORT DATA format DELIMITED.

@cockroach-teamcity
Copy link
Member

This change is Reviewable

MYSQLOUTFILE was originally used to help a client import data that was
not in csv proper format. This turns out to be useful for other users so
we call the new format DELIMITED. It's very fast and simple format for importing
delimited data disregarding issues with quoting. For example the csv format forbids
`field1,fieldsth2` as such fields that contain  quotes have to be enclosed in
quotes themselves. The whole format is fully described here
https://dev.mysql.com/doc/refman/8.0/en/load-data.html.

Fixes cockroachdb#39820.

Release note (cli change): add a new IMPORT DATA format DELIMITED.
Copy link
Member

@dt dt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we'll want to work with docs to expand a bit more on this in the docs but change itself LGTM

@spaskob
Copy link
Contributor Author

spaskob commented Sep 4, 2019

bors r+

craig bot pushed a commit that referenced this pull request Sep 4, 2019
40469: opt: fix panic when building indirection exprs r=justinj a=justinj

Previously, we would ask a built array datum to tell us its type when
building an IndirectionExpr. This was problematic when the datum was
NULL, since while the optimizer-level NULLs track their inferred type,
built DNulls do not, so we ended up not knowing the element type.

This is fixed by grabbing the type from the opt expression, rather than
the built datum.

Fixes #40404.
Fixes #37794.

Release note (bug fix): fixed an optimizer panic when building array
access expressions.

40471: exec: add projections of AndExpr r=jordanlewis a=jordanlewis

This adds support for TPCH q19, which sees a 3x speedup with vectorized.

Release note: None

40482: importccl: replace the undocumented data format MYSQLOUTFILE with DELIMITED r=spaskob a=spaskob

MYSQLOUTFILE was originally used to help a client import data that was
not in csv proper format. This turns out to be useful for other users so
we call the new format DELIMITED. It's very fast and simple format for importing
delimited data disregarding issues with quoting. For example the csv format forbids
`field1,fieldsth2` as such fields that contain  quotes have to be enclosed in
quotes themselves. The whole format is fully described here
https://dev.mysql.com/doc/refman/8.0/en/load-data.html.

Fixes #39820.

Release note (cli change): add a new IMPORT DATA format DELIMITED.

Co-authored-by: Justin Jaffray <justin@cockroachlabs.com>
Co-authored-by: Jordan Lewis <jordanthelewis@gmail.com>
Co-authored-by: Spas Bojanov <spas@cockroachlabs.com>
@craig
Copy link
Contributor

craig bot commented Sep 4, 2019

Build succeeded

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

importccl: Inconsistent IMPORT CSV behavior related to quotation marks
3 participants