Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for LARGE_BINARY/LARGE_STRING with Arrow #40293

Merged

Conversation

joshuataylor
Copy link
Contributor

@joshuataylor joshuataylor commented Aug 17, 2022

This adds support for LARGE_UTF8 with Arrow. See the issue here #32401 for a full rundown of how to replicate this issue is there.

Unsure of how to add a test for this, I think we might need to change test.arrow to add a LARGE_UTF8 column there, and also for Parquet. Thoughts?

Changelog category (leave one):

  • Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Add support for LARGE_BINARY/LARGE_STRING with Arrow (Closes #32401)

@CLAassistant
Copy link

CLAassistant commented Aug 17, 2022

CLA assistant check
All committers have signed the CLA.

@robot-clickhouse robot-clickhouse added the pr-improvement Pull request with some product improvements label Aug 17, 2022
@yakov-olkhovskiy yakov-olkhovskiy self-assigned this Aug 17, 2022
@yakov-olkhovskiy yakov-olkhovskiy added the can be tested Allows running workflows for external contributors label Aug 17, 2022
@Avogar Avogar merged commit 09a2ff8 into ClickHouse:master Aug 18, 2022
@joshuataylor
Copy link
Contributor Author

Thanks so much for cleaning this up!

@joshuataylor joshuataylor deleted the feature/arrow-large-binary-string branch August 19, 2022 01:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
can be tested Allows running workflows for external contributors pr-improvement Pull request with some product improvements
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support large_utf8 column format on parquet
5 participants