External compaction design #266

Closed
dlmarion wants to merge 8 commits into apache:next-release from dlmarion:external-compaction-design

Conversation


@dlmarion dlmarion commented Mar 2, 2021

No description provided.

@milleruntime milleruntime left a comment

Looks good overall. I provided some thoughts for consideration.

Contributor

Suggested change

```diff
- * `tserver.compaction.major.service.<service>.planner.opts.executors`: a json array where each object in the array has the fields name, maxSize, and numThreads. For example:
+ * `tserver.compaction.major.service.default.planner.opts.executors`: a json array where each object in the array has the fields name, maxSize, and numThreads. For example:
```

The default name should be "default", right?

Contributor Author

If you can modify the behavior of the default compaction planner by using these properties, then yes.
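
For reference, the value of such an executors property is a JSON array along these lines (the executor names and sizes here are illustrative, not taken from the design doc):

```json
[
  {"name": "small", "maxSize": "32M", "numThreads": 2},
  {"name": "medium", "maxSize": "128M", "numThreads": 2},
  {"name": "large", "numThreads": 2}
]
```

An executor without a maxSize would catch compactions too large for the other executors.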

Contributor

Suggested change

```diff
- * `tserver.compaction.major.service.<service>.planner.opts.maxOpen`: number of files that will be included in a single compaction
+ * `tserver.compaction.major.service.default.planner.opts.maxOpen`: number of files that will be included in a single compaction
```

The default name should be "default", right?

Contributor Author

If you can modify the behavior of the default compaction planner by using these properties, then yes.

Contributor

I feel like at least one of these methods should include a timeout; I am not sure where that would be determined, though.

Contributor

Would it be better to have the tservers push the summary information when it changes? That way we wouldn't burden a tserver with polling while it is busy.

Contributor

> Would it be better to have the tservers push the summary information when it changes?

I am uncertain which is better, but I am leaning a bit towards pushing. One advantage to tservers pushing is that tservers w/ nothing in their external queues would not push anything. With polling, the coordinator would keep polling tservers that have no tablets in their external queues.

I was thinking polling is better in the case when the coordinator starts up. It could do something like the following:

  • poll all tservers
  • start accepting requests for work from compactors

With push, I suppose it could do something like the following:

  • wait X minutes for tservers to report external compaction summary info
  • start accepting requests for work from compactors

So we need to figure out the coordinator startup behavior w/ push.

Contributor

What about only polling at startup so compactors can get started right away? I suppose a hybrid approach would then require 2 different RPC endpoints though. Maybe that's not such a big deal?

Contributor

The hybrid approach may be best, I was hoping to think of something simpler.

Contributor

If the tserver push frequency is known, like tservers push at least every 30 secs, then the coordinator could wait 2x that, and I think it would be close to optimal. If the coordinator misses a few updates from tservers before handing out work, then it may start some lower priority compactions running before higher priority ones. That is probably ok.
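
As a minimal sketch of that startup rule (the class and method names here are hypothetical, and the push interval is assumed):

```java
// Sketch of the startup rule with a push model, assuming a known push
// interval; class and method names are hypothetical.
public class CompactionCoordinator {
    // Assumed: tservers push their external queue summaries at least this often.
    private static final long PUSH_INTERVAL_MS = 30_000;

    public void start() throws InterruptedException {
        // Wait ~2x the push interval so nearly every tserver has reported once.
        Thread.sleep(2 * PUSH_INTERVAL_MS);
        // Start handing out work; a few missed updates only risks running
        // lower priority compactions before higher priority ones.
        acceptCompactorRequests();
    }

    private void acceptCompactorRequests() {
        // hypothetical: serve "get work" requests from compactor processes
    }
}
```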

Contributor

Would the priority be system generated (like age + size)?

Contributor

The default planner currently uses the compaction type and the number of files to construct a priority. The following are links to the code that currently creates the priority.

https://github.com/apache/accumulo/blob/b12d1103d788473168616f28ae87a65bb76aa880/core/src/main/java/org/apache/accumulo/core/spi/compaction/DefaultCompactionPlanner.java#L236

https://github.com/apache/accumulo/blob/b12d1103d788473168616f28ae87a65bb76aa880/core/src/main/java/org/apache/accumulo/core/util/compaction/CompactionJobPrioritizer.java#L33

@dlmarion and I were talking and concluded that a long is probably too high cardinality for priority. Maybe a 16-bit or 8-bit integer would be better. Then maybe the prio could be type:log2(numFiles).
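
As a sketch, packing the type into the high byte and log2 of the file count into the low byte of a 16-bit priority might look like this (the names and bit layout are invented for illustration, not from the PR):

```java
// Illustrative only: pack compaction type and ~log2(numFiles) into a
// 16-bit priority, where a higher value means a higher priority.
public static short createPriority(boolean userInitiated, int numFiles) {
    // High byte: compaction type, ranking USER above SYSTEM.
    int type = userInitiated ? 1 : 0;
    // Low byte: bit length of numFiles, i.e. floor(log2(numFiles)) + 1.
    int log2Files = 32 - Integer.numberOfLeadingZeros(Math.max(numFiles, 1));
    return (short) ((type << 8) | Math.min(log2Files, 0xFF));
}
```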

Contributor

I think it would be better to call cancel on any compactions if a table is deleted.

Contributor

Maybe it would be good to have different configurable options for handling this scenario. Option 1 would be to cancel any compactions on the arrival of new files. Option 2 would be to allow the compaction to finish (not cancel on arrival). This would allow users to tune their cluster for faster scans or for cleaning up old data. This might also be complicated by whether the compaction is a USER or SYSTEM type. For example, we may want to allow user compactions to finish vs. cancelling a system one.
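
Hypothetically, those two options could be expressed as a policy setting along these lines (all names here are invented for illustration):

```java
// Hypothetical sketch of the two options as a configurable policy;
// the class, enum, and method names are invented for illustration.
class NewFileHandling {
    enum Policy {
        CANCEL, // Option 1: cancel compactions when new files arrive
        FINISH  // Option 2: allow the compaction to finish
    }

    // The choice could also depend on the compaction kind, e.g. let USER
    // compactions finish but cancel SYSTEM ones.
    static Policy policyFor(boolean isUserCompaction) {
        return isUserCompaction ? Policy.FINISH : Policy.CANCEL;
    }
}
```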

Contributor

Also, I am not sure if we want to cancel compactions with the arrival of new data. I don't think we currently do that at all.

Contributor

The wording needs to be updated. The compaction is not yet running, and cancel just means it's canceled in the queue. The current code never cancels a running compaction when the plan changes, only queued compactions.

Contributor

Ah OK. So something more like "EE2 is removed from the queue"

@keith-turner keith-turner Mar 3, 2021

"Removed from the queue" would be better wording. In the code it actually just cancels it, possibly leaving it in the queue for efficiency. Something canceled in the queue would never run. However those are just implementation details, its doing lazy removal.

Below is where only queued compactions are canceled

https://github.com/apache/accumulo/blob/b12d1103d788473168616f28ae87a65bb76aa880/server/tserver/src/main/java/org/apache/accumulo/tserver/compactions/CompactionService.java#L185

Below is where a task is atomically transitioned to canceled, but only if it's currently in the queued state.

https://github.com/apache/accumulo/blob/b12d1103d788473168616f28ae87a65bb76aa880/server/tserver/src/main/java/org/apache/accumulo/tserver/compactions/CompactionExecutor.java#L112

Below is where a task atomically transitions from the queued state to running. If the task is in the canceled state, then the transition will fail.

https://github.com/apache/accumulo/blob/b12d1103d788473168616f28ae87a65bb76aa880/server/tserver/src/main/java/org/apache/accumulo/tserver/compactions/CompactionExecutor.java#L88
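
Condensed, the lazy-cancel pattern described above looks roughly like this (a sketch, not the actual Accumulo code):

```java
import java.util.concurrent.atomic.AtomicReference;

// Sketch of lazy cancellation: a canceled task may remain in the queue,
// but its queued -> running transition will fail, so it never runs.
class CompactionTask {
    enum State { QUEUED, RUNNING, CANCELED }

    private final AtomicReference<State> state = new AtomicReference<>(State.QUEUED);

    // Plan changed: cancel the task, but only if it is still queued.
    boolean cancel() {
        return state.compareAndSet(State.QUEUED, State.CANCELED);
    }

    // An executor thread picked the task up: start only if still queued.
    boolean tryStart() {
        return state.compareAndSet(State.QUEUED, State.RUNNING);
    }
}
```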

Contributor

I know this design is strictly for online compactions, but we could make offline compactions configurable. Maybe even have a separate queue for handling offline compactions.

@dlmarion dlmarion changed the base branch from main to next-release April 20, 2021 17:09
@dlmarion dlmarion closed this May 11, 2021
@dlmarion dlmarion deleted the external-compaction-design branch May 11, 2021 13:41