[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events #398

zuston · 2022-12-12T08:18:33Z

Code of Conduct

I agree to follow this project's Code of Conduct

Search before asking

I have searched in the issues and found no similar issues.

What would you like to be improved?

[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events.

As the problem mentioned by #378 (comment), the small event will benefit more after the PR of #396 is applied.

How should we improve?

No response

Are you willing to submit PR?

Yes I am willing to submit a PR!

zuston · 2022-12-12T08:18:42Z

cc @jerqi

jerqi · 2022-12-12T08:56:31Z

Could we avoid huge event by using this pr #176 ?

zuston · 2022-12-12T08:58:18Z

Could we avoid huge event by using this pr #176 ?

Yes. But I don't hope so, detailed reason could be found in #378 (comment) of second case.

jerqi · 2022-12-12T08:59:59Z

Could we avoid huge event by using this pr #176 ?

Yes. But I don't hope so, detailed reason could be found in #378 (comment) of second case.

I can't get your point.

zuston · 2022-12-12T09:09:19Z

If the single buffer flush mechanism is enabled, it will be applied all partition data. But when the memory is free and the concurrent apps number is small, there is unnecessary to flush data to localfile or disk. In my view, it will make the local disk free if the single buffer flush size > cold storage threshold size. And it will make memory free if the single buffer flush size < cold storage threshold size.

Overall, this mechanism will not make full use of memory and then leads to performance regression.

jerqi · 2022-12-13T12:50:43Z

We find that it won't matter. Because if we write data to storage, and then we read the data quickly, the data may be in the page cache. It has similar performance with the data in memory. Single buffer mechnism can make full use of network card.

zuston · 2022-12-14T02:35:57Z

Because if we write data to storage, and then we read the data quickly, the data may be in the page cache.

But in the most cases, the read wont happen quickly after writing, especially for many tasks.

jerqi · 2022-12-14T03:13:07Z

Because if we write data to storage, and then we read the data quickly, the data may be in the page cache.

But in the most cases, the read wont happen quickly after writing, especially for many tasks.

Maybe we should some tests prove the effect.

zuston · 2022-12-14T03:51:02Z

I have tested in our online cluster #378 (comment).

jerqi · 2022-12-14T06:21:18Z

I have tested in our online cluster #378 (comment).

The test don't have the pr d3aa5dc

zuston · 2022-12-14T06:29:05Z

It's still useless after #396. Single huge event only can be handled by single thread to flush to HDFS, which don't benefit from concurrent written mechanism.

jerqi · 2022-12-14T06:35:42Z

It's still useless after #396. Single huge event only can be handled by single thread to flush to HDFS, which don't benefit from concurrent written mechanism.

If you use single buffer limit at the same time, you will benefit the feature.

zuston · 2022-12-15T08:22:06Z

If you use single buffer limit at the same time, you will benefit the feature.

I know this, but I don't hope all large buffer flush to persist storage, especially for having enough free memory.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events #398

[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events #398

zuston commented Dec 12, 2022

zuston commented Dec 12, 2022

jerqi commented Dec 12, 2022

zuston commented Dec 12, 2022

jerqi commented Dec 12, 2022

zuston commented Dec 12, 2022

jerqi commented Dec 13, 2022 •

edited

Loading

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 15, 2022

[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events #398

[Improvement] Allow split huge data ShuffleDataFlushEvent to multiple small events #398

Comments

zuston commented Dec 12, 2022

Code of Conduct

Search before asking

What would you like to be improved?

How should we improve?

Are you willing to submit PR?

zuston commented Dec 12, 2022

jerqi commented Dec 12, 2022

zuston commented Dec 12, 2022

jerqi commented Dec 12, 2022

zuston commented Dec 12, 2022

jerqi commented Dec 13, 2022 • edited Loading

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 14, 2022

jerqi commented Dec 14, 2022

zuston commented Dec 15, 2022

jerqi commented Dec 13, 2022 •

edited

Loading