Skip to content

HIVE-25373 : Modify buildColumnStatsDesc to send configured number of stats for updation#2520

Merged
maheshk114 merged 2 commits intoapache:masterfrom
maheshk114:HIVE-25373
Aug 3, 2021
Merged

HIVE-25373 : Modify buildColumnStatsDesc to send configured number of stats for updation#2520
maheshk114 merged 2 commits intoapache:masterfrom
maheshk114:HIVE-25373

Conversation

@maheshk114
Copy link
Contributor

What changes were proposed in this pull request?

Group the stats objects into different batches to avoid large message size to HMS.

Why are the changes needed?

If the number of stats exceeds limit, the operation may fail.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Unit test

request.setValidWriteIdList(validWriteIdList.toString());
}
}
db.setPartitionColumnStatistics(request);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Info level logging would be good to have here. Since it will be printed for every 10K updates, it should be fine and will be helpful for debugging as well. And if possible provide time taken for overall buildColumnStatsDesc.

Copy link
Contributor

@rbalamohan rbalamohan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

minor comment. Otherwise LGTM.

@rbalamohan
Copy link
Contributor

LGTM. +1

@maheshk114 maheshk114 merged commit 75dea70 into apache:master Aug 3, 2021
@maheshk114 maheshk114 deleted the HIVE-25373 branch August 3, 2021 03:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants