fixes #473 avoids scanning entire table metadata for bulk import #3336
Conversation
Noticed this issue again while working on #3337 and created a fix for it. I think this fix may be good for 2.1, since it could offer nice performance benefits for bulk imports into really large tables.
If you decide to do this against 2.1 instead and update the PR, then please also update the target version project from 3.0.0 to 2.1.1.
I asked for thoughts about merging this to 2.1 in Slack. Ed brought up a good point there: I need to investigate what impact, if any, the changes have on the bulk v1 code.
I looked at bulk import v1 in the 2.1 branch, and I don't think this optimization can apply to bulk v1 because there is no good way to know which files are imported to which tablets. In bulk v1, intermediate tservers do all of the work of figuring out which files go where, so those tservers know something about the metadata; the manager, however, does not know enough to limit the cleanup scans. We could make this change in 2.1, but it would only benefit bulk import v2. Bulk import v1 would continue to scan the entire table for each bulk import when doing cleanup.
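To make that distinction concrete, here is a small hypothetical sketch (plain Python, not Accumulo code; all names are invented) of the difference between a full metadata scan and a cleanup scan bounded to the contiguous range of tablets that actually received files, which is the information bulk v2 has and bulk v1 lacks:

```python
from bisect import bisect_left

def cleanup_scan(metadata_rows, loaded_range=None):
    """Return the metadata rows a cleanup pass would visit.

    metadata_rows: sorted list of tablet end rows for the table.
    loaded_range:  (first, last) tablet rows touched by the bulk import,
                   or None to emulate a full-table scan (bulk v1 style,
                   where the manager does not know which tablets were hit).
    """
    if loaded_range is None:
        # Bulk v1 style: no per-tablet knowledge, scan every tablet.
        return list(metadata_rows)
    # Bulk v2 style: restrict the scan to the tablets that received files.
    first, last = loaded_range
    lo = bisect_left(metadata_rows, first)
    hi = bisect_left(metadata_rows, last)
    return metadata_rows[lo:hi + 1]

# A table with 10,000 tablets, where one import touched only 11 of them.
tablets = [f"row{i:04d}" for i in range(10000)]
full = cleanup_scan(tablets)                             # visits 10000 rows
bounded = cleanup_scan(tablets, ("row4200", "row4210"))  # visits 11 rows
```

The savings grow with table size: the bounded scan is proportional to the number of tablets the import touched, not the total number of tablets in the table.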
I rebased this onto 2.1 |
The GitHub Actions QA checks didn't run on this PR; not sure why. Going to try adding an empty commit to the PR branch to force them to run.