[Bug] Enable AO/AOCO insert to multiple files even enable_parallel is off #39

avamingli · 2023-07-24T07:41:37Z

Cloudberry Database version

No response

What happened

When enable_parallel is off, we will insert into only one AO segfile even gp_appendonly_insert_files is > 1.

Think about the case: user set enable_parallel to on, have some data inserted, query and reset it to false.

That will make data skew after user set enable_parallel to off, and there are a lot of data inserted later or an online-steaming ETL(all data would be inserted into only one segfile).

And that make our parallel plan has a bottleneck.

We should take it back, insert into multiple files according to gp_appendonly_insert_files whatever enable_parallel is.

In general, we should try to make AO segfiles as much as gp_appendonly_insert_files and avoid data skew for users, no matter users use parallel or not.

And only keep gp_appendonly_insert_files default value to 4 is enough.

What you think should happen instead

No response

How to reproduce

Need to create cases.

Operating System

Ubuntu

Anything else

By fixing this, to make regression pass , we need to set GUC gp_appendonly_insert_files = 0 when deploying CBDB at CI pipeline. Need help from @sandiandian .

Are you willing to submit PR?

Yes, I am willing to submit a PR!

Code of Conduct

I agree to follow this project's Code of Conduct.

The text was updated successfully, but these errors were encountered:

avamingli added type: Bug Something isn't working help wanted Extra attention is needed labels Jul 24, 2023

This was referenced Jul 24, 2023

Add icw parallel test in cicd pipeline #72

Merged

Enable AO/AOCO insert to multiple files even enable_parallel is off #83

Merged

avamingli self-assigned this Jul 24, 2023

avamingli closed this as completed in #83 Jul 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Enable AO/AOCO insert to multiple files even enable_parallel is off #39

[Bug] Enable AO/AOCO insert to multiple files even enable_parallel is off #39

avamingli commented Jul 24, 2023

[Bug] Enable AO/AOCO insert to multiple files even enable_parallel is off #39

[Bug] Enable AO/AOCO insert to multiple files even enable_parallel is off #39

Comments

avamingli commented Jul 24, 2023

Cloudberry Database version

What happened

What you think should happen instead

How to reproduce

Operating System

Anything else

Are you willing to submit PR?

Code of Conduct