Skip to content

Conversation

@keith-turner
Copy link
Contributor

This change lowers the time it takes to host ondemand tablets by moving this functionality into TabletGroupWatcher.

The client RPC thread processing the hosting request can now directly call a function in TGW that will immediately start on the work of hosting the tablets.

Updated SplitMillionIT to request hosting of 200 tablets all at once instead of one by one. This was done by using a BatchScanner instead of lots of scanners.

This change lowers the time it takes to host ondemand tablets
by moving this functionality into TabletGroupWatcher.

The client RPC thread processing the hosting request can now
directly call a function in TGW that will immediately start on
the work of hosting the tablets.

Updated SplitMillionIT to request hosting of 200 tablets all
at once instead of one by one.  This was done by using a
BatchScanner instead of lots of scanners.
@keith-turner keith-turner linked an issue May 20, 2024 that may be closed by this pull request
@keith-turner
Copy link
Contributor Author

In SplitMillionIT this changes lowered the time to read 100 tablets after a clone operation from 20 -30 secs to 1-2 secs. This is because the hosting operation is no longer impeded by the hosting operation. There may still be further room for improvement, need to dig in and understand where the code is spending its time after making this improvement.

Copy link
Contributor

@dlmarion dlmarion left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good. I also noticed that manageTablets could be called from different threads. I tried to parallelize the calls to it from the EventCoordinator. But, it didn't provide any real speed increase.

@keith-turner
Copy link
Contributor Author

This looks good. I also noticed that manageTablets could be called from different threads. I tried to parallelize the calls to it from the EventCoordinator. But, it didn't provide any real speed increase.

Working through this made me wonder what will happen when like 1000 clients processes all try to get some tablets hosted at around the same time. Will it work through it quickly, slowly, or get jammed up. Need to test this eventually.

@keith-turner keith-turner merged commit 5aca710 into apache:elasticity May 21, 2024
@keith-turner keith-turner deleted the accumulo-4571 branch May 22, 2024 15:02
@keith-turner keith-turner added this to the 4.0.0 milestone Jul 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Lower time to host ondemand tablets

2 participants