Considers all tablet metadata columns in split code #4323

keith-turner · 2024-02-29T16:52:31Z

Made the following changes to the split code that adds new tablets and updates the existing tablet.

* Fixed a potential NPE in the tablet operation id check by reversing the order of the equals check
* Throws IllegalStateException when attempting to split a tablet with merged or cloned markers
* Removed adding WALs when creating new tablets in split; it's not expected that the parent tablet would have WALs, and this is checked earlier
* Deleted any user compaction requested, hosting requested, suspended, or last columns in the parent tablet
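
The first two items can be illustrated with a minimal sketch. It is not the code from this PR: `TabletState`, `SplitChecks`, and their fields are hypothetical stand-ins, and the real check operates on Accumulo's tablet metadata. The sketch only shows why calling `equals` on the value known to be non-null avoids the NPE, and what failing fast on an unexpected marker might look like.

```java
import java.util.Objects;

// Hypothetical stand-in for a tablet's metadata; this is not Accumulo's TabletMetadata.
final class TabletState {
  final String operationId; // null when no operation is set on the tablet
  final boolean merged;     // true when a merged marker is present
  final boolean cloned;     // true when a cloned marker is present

  TabletState(String operationId, boolean merged, boolean cloned) {
    this.operationId = operationId;
    this.merged = merged;
    this.cloned = cloned;
  }
}

final class SplitChecks {
  // Calls equals on the value known to be non-null, so a tablet without an
  // operation id yields false instead of a NullPointerException.
  static boolean hasOperation(TabletState tablet, String expectedOpId) {
    Objects.requireNonNull(expectedOpId, "expectedOpId");
    return expectedOpId.equals(tablet.operationId);
  }

  // Refuses to continue when a marker is present that split does not handle.
  static void checkNoUnexpectedMarkers(TabletState tablet) {
    if (tablet.merged || tablet.cloned) {
      throw new IllegalStateException(
          "Unexpected marker on tablet: merged=" + tablet.merged + " cloned=" + tablet.cloned);
    }
  }
}
```

Writing the comparison the other way around, as `tablet.operationId.equals(expectedOpId)`, would throw whenever the tablet has no operation id set.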

Added a unit test that attempts to exercise the split code with all tablet columns. The unit test also has a set of tablet columns that were verified to work with split, and this set is checked against the set of columns in the code. The purpose of this check is to fail when a new column is added, so that whoever adds it has to consider how split should handle it.
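
The idea behind the column check can be sketched as follows. This is an illustration under assumptions, not the test from this PR: the `Column` enum is a hypothetical, shortened stand-in for the real column type enum, and `ColumnCoverage` is an invented helper name.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical, shortened column enum; the real enum has many more values.
enum Column { LOCATION, LAST, SUSPEND, FILES, LOGS, MERGED, CLONED }

final class ColumnCoverage {
  // Columns a developer has reviewed against the split code. A newly added
  // enum value is absent from this set until someone reviews it.
  static final Set<Column> HANDLED_BY_SPLIT = EnumSet.of(Column.LOCATION, Column.LAST,
      Column.SUSPEND, Column.FILES, Column.LOGS, Column.MERGED, Column.CLONED);

  // Returns the columns that exist in the enum but are missing from the reviewed set.
  static Set<Column> unreviewed(Set<Column> handled) {
    EnumSet<Column> missing = EnumSet.allOf(Column.class);
    missing.removeAll(handled);
    return missing;
  }
}
```

A test would then assert that `ColumnCoverage.unreviewed(ColumnCoverage.HANDLED_BY_SPLIT)` is empty. Adding a value to the enum without updating the reviewed set makes that assertion fail.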

Was a bit uncertain about deleting the last location and suspend. Those columns either need to be deleted from the parent tablet or added to the new tablets being created. The current code was doing neither. Decided to delete them as the new tablets have a different range and are conceptually different tablets than the parent.


cshannon

I am still reviewing this, but one thought I had: with all the state checks that this PR introduces (such as throwing an ISE if there's an unexpected merged marker on split), we should probably add some tests to see what happens if we encounter that situation. I'm wondering if we could get into a weird state where we just start rapidly throwing exceptions in a loop as it tries to split over and over and keeps failing. It would be good to test what the outcome is when unexpected columns show up, and to confirm that it's acceptable behavior for the system. The other alternative would be to clean up the unexpected markers instead of doing the state check, but if we end up with columns that should never exist, then forcing manual intervention to investigate is probably better, so I think the state checks make sense.

cshannon · 2024-03-01T17:04:37Z

This is probably something to do as a follow-on when the merge test is looked at, but you could try to move the common code related to mocking TabletMetadata to a central place, like a utility class, so it can be reused. My assumption is that a lot of the setup for the mocks could be reused for testing that merge handles all the columns. It should make it easier to update the tests when adding new columns that need to be considered for both split and merge.

cshannon · 2024-03-01T17:14:48Z

server/manager/src/test/java/org/apache/accumulo/manager/tableOps/split/UpdateTabletsTest.java

+  @Test
+  public void checkColumns() {
+    for (ColumnType columnType : ColumnType.values()) {
+      assertTrue(COLUMNS_HANDLED_BY_SPLIT.contains(columnType),


I don't know if this is easily possible with EasyMock but I was wondering if the mock API supports a way to check that all methods have been mocked with expectations? I suppose we could just use reflection to iterate over the methods and check each one, but I'm curious whether there's a simpler way to detect methods on the mocked object that haven't been mocked. Then we could throw an error if a new method showed up without a mocked return value, which would of course imply it was not considered.
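
The reflection approach mentioned above could look roughly like the sketch below. It uses only the JDK and is not an EasyMock feature. `TabletView` is a hypothetical interface invented for the example (the real metadata type is a class, so `java.lang.reflect.Proxy` would not apply to it directly), and `StrictStub` is an invented helper name.

```java
import java.lang.reflect.Method;
import java.lang.reflect.Proxy;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical interface standing in for the metadata accessors under test.
interface TabletView {
  String getLocation();
  String getLast();
  boolean hasMerged();
}

final class StrictStub {
  // Lists the interface methods that have no stubbed return value.
  static Set<String> unstubbed(Class<?> iface, Map<String, Object> stubs) {
    Set<String> missing = new TreeSet<>();
    for (Method m : iface.getMethods()) {
      if (!stubs.containsKey(m.getName())) {
        missing.add(m.getName());
      }
    }
    return missing;
  }

  // Builds a proxy that answers from the stub map and fails on any other call.
  // Object methods such as toString are also routed here and fail unless stubbed.
  static <T> T create(Class<T> iface, Map<String, Object> stubs) {
    Object proxy = Proxy.newProxyInstance(iface.getClassLoader(), new Class<?>[] {iface},
        (self, method, args) -> {
          if (!stubs.containsKey(method.getName())) {
            throw new AssertionError("Unstubbed method called: " + method.getName());
          }
          return stubs.get(method.getName());
        });
    return iface.cast(proxy);
  }
}
```

A test could assert that `StrictStub.unstubbed(TabletView.class, stubs)` is empty before exercising the code, so that a new accessor added to the interface fails the test until someone decides how it should be handled. Keying stubs by method name ignores overloads, which is a simplification made for this sketch.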

keith-turner · 2024-03-02

> I don't know if this is easily possible with EasyMock but I was wondering if the mock API supports a way to check that all methods have been mocked with expectations?

For this case I was kinda covering it with that set of columns reviewed by a developer in the test. Not sure how to do that in EasyMock.

keith-turner · 2024-03-01T18:25:09Z

> I'm wondering if we could get into a weird state where we just start rapidly throwing exceptions in a loop as it tries to split over and over and keeps failing so it would be good to test what the outcome is if there's unexpected columns that show up and that it's acceptable behavior for the system.

It would be good to add an IT for this. Could manually add an unexpected column to a tablet in an IT and then attempt to split it. I'll add that in this PR.

keith-turner linked an issue Feb 29, 2024 that may be closed by this pull request

Ensure split and merge handle bulk import and compaction metadata #4111

Closed

This was referenced Feb 29, 2024

Ensure split and merge handle bulk import and compaction metadata #4111

Closed

Periodically clean up compacted and loaded markers in tablets #4324

Closed

cshannon reviewed Mar 1, 2024

code review update

439b3bc

EdColeman approved these changes Mar 4, 2024

keith-turner merged commit 1d7f693 into apache:elasticity Mar 4, 2024

cshannon mentioned this pull request Mar 8, 2024

Move the tracking of unsplittable tablets to metadata table #4317

Merged

keith-turner mentioned this pull request Mar 15, 2024

Need a way to test UNKNOWN result for conditional mutations #4376

Closed

keith-turner added this to the 4.0.0 milestone Jul 12, 2024
