Input ORC format #6454

Merged 5 commits on Aug 22, 2019

akonyaev90
Contributor

I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en

Category (leave one):

  • New Feature

Short description (up to few sentences):
Add input ORC format
...

Adds support for reading data in ORC format.
The implementation is modeled on the Parquet input format.
The file being read must fit in memory.
...
#4903
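
The in-memory requirement in the description can be pictured with a small sketch. It assumes, hypothetically, that the whole input stream is drained into one buffer before decoding starts; `readWholeInput` is an illustrative name and not a function from this PR.

```cpp
#include <istream>
#include <sstream>
#include <string>

// Hypothetical helper (not part of this PR): copy everything that is left
// in the input stream into one contiguous in-memory buffer. Memory use
// grows with the size of the input, which is why the file has to fit in RAM.
std::string readWholeInput(std::istream & in)
{
    std::ostringstream buffer;
    buffer << in.rdbuf();   // copies all remaining bytes, including '\0'
    return buffer.str();
}
```

A decoder that needs random access to the file can then work on the returned buffer; the cost is that peak memory is at least the size of the input.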

include_directories(SYSTEM ${ORC_INCLUDE_DIR})
include_directories(SYSTEM ${ORC_SOURCE_SRC_DIR})
include_directories(SYSTEM ${ORC_SOURCE_WRAP_DIR})
include_directories(SYSTEM ${GOOGLE_PROTIBUF_DIR})
Contributor

PROTIBUF ?

if (!in.eof())
{
if (row_group_current < row_group_total)
throw Exception{"Got new data, but data from previous chunks not readed " + std::to_string(row_group_current) + "/" + std::to_string(row_group_total), ErrorCodes::CANNOT_READ_ALL_DATA};
Member

not readed -> was not read
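
For reference, the check in the snippet above with the suggested wording can be sketched in isolation. This is only an illustration under assumptions: `RowGroupState` and `checkPreviousChunkConsumed` are hypothetical names, and `std::runtime_error` stands in for the project's `Exception` class and error code.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for the reader's row group counters.
struct RowGroupState
{
    std::size_t row_group_current = 0;
    std::size_t row_group_total = 0;
};

// Throws if new input arrives while row groups from the previous chunk
// are still unread; otherwise does nothing.
void checkPreviousChunkConsumed(const RowGroupState & state, bool has_new_data)
{
    if (has_new_data && state.row_group_current < state.row_group_total)
        throw std::runtime_error(
            "Got new data, but data from previous chunks was not read "
            + std::to_string(state.row_group_current) + "/"
            + std::to_string(state.row_group_total));
}
```

Here `has_new_data` plays the role of `!in.eof()` in the original condition.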

@alexey-milovidov
Member

Is it possible to generate protobuf on the fly? (auto-generated file does not look good).

@@ -10,6 +10,13 @@ endif ()

set_property(DIRECTORY PROPERTY EXCLUDE_FROM_ALL 1)

#===== add orc subdir
Contributor

if (USE_INTERNAL_ORC_LIBRARY) ...

@akonyaev90
Contributor Author

Regarding "Is it possible to generate protobuf on the fly?": Alexey, which file are you talking about?

@akonyaev90
Contributor Author

orc_proto.pb.cc ?

@akonyaev90
Contributor Author

orc_proto is now built by a proto command (I added the command to the arrow-cmake CMake file).
The typos are corrected.

@akonyaev90
Contributor Author

The common part of parquetBlockInput and orcBlockInput has been moved into the ArrowColumnToCHColumn class.

@alexey-milovidov alexey-milovidov merged commit 51d4ab5 into ClickHouse:master Aug 22, 2019
@@ -0,0 +1,8 @@
##TODO replace hardcode to find procedure

Contributor

@akonyaev90 when are you going to implement the proper build logic for ORC contrib?

@KochetovNicolai KochetovNicolai added the pr-feature Pull request with new product feature label Sep 19, 2019
5 participants