Skip to content

[Pipeline](exec) support async writer in pipelien query engine#22901

Merged
Gabriel39 merged 1 commit intoapache:masterfrom
HappenLee:lambda
Aug 15, 2023
Merged

[Pipeline](exec) support async writer in pipelien query engine#22901
Gabriel39 merged 1 commit intoapache:masterfrom
HappenLee:lambda

Conversation

@HappenLee
Copy link
Contributor

Proposed changes

Issue Number: close #xxx

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

@github-actions
Copy link
Contributor

clang-tidy review says "All clean, LGTM! 👍"

}

void AsyncResultWriter::start_writer() {
ExecEnv::GetInstance()->fragment_mgr()->get_thread_pool()->submit_func(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe this need a lot of threads

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need another thread pool with a configurable size

@HappenLee HappenLee changed the title save code [Pipeline](exec) support async writer in pipelien query engine Aug 14, 2023
@HappenLee HappenLee marked this pull request as ready for review August 14, 2023 16:03
@HappenLee
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@hello-stephen
Copy link
Contributor

(From new machine)TeamCity pipeline, clickbench performance test result:
the sum of best hot time: 45.19 seconds
stream load tsv: 514 seconds loaded 74807831229 Bytes, about 138 MB/s
stream load json: 21 seconds loaded 2358488459 Bytes, about 107 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 31 seconds loaded 861443392 Bytes, about 26 MB/s
insert into select: 98.6 seconds inserted 10000000 Rows, about 101K ops/s
storage size: 17162432315 Bytes

@github-actions
Copy link
Contributor

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added approved Indicates a PR has been approved by one committer. reviewed labels Aug 15, 2023
@github-actions
Copy link
Contributor

PR approved by anyone and no changes requested.

}

void AsyncResultWriter::start_writer() {
ExecEnv::GetInstance()->fragment_mgr()->get_thread_pool()->submit_func(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need another thread pool with a configurable size


void AsyncResultWriter::process_block() {
if (!_is_open) {
_writer_status = open();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

process_block will not be executed twice so we do not need to do this check

Copy link
Contributor

@Gabriel39 Gabriel39 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Gabriel39 Gabriel39 merged commit 9b2323b into apache:master Aug 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants