Skip to content

suggest index parallel for native batch reindexing > 1GB#10788

Merged
jihoonson merged 1 commit intoapache:masterfrom
techdocsmith:remove_outdated_rec
Jan 23, 2021
Merged

suggest index parallel for native batch reindexing > 1GB#10788
jihoonson merged 1 commit intoapache:masterfrom
techdocsmith:remove_outdated_rec

Conversation

@techdocsmith
Copy link
Contributor

Removes outdated recommendation to use Hadoop for production.


This PR has:

  • been self-reviewed.

cc: @petermarshallio @druid-matt

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

index_parallel behaves almost the same as index when maxNumConcurrentSubTasks is 1. So, I think we can suggest to always use index_parallel, but change maxNumConcurrentSubTasks depending on data size.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @jihoonson . I changed it PTAL

Copy link
Contributor

@jihoonson jihoonson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @techdocsmith!

@jihoonson
Copy link
Contributor

The integration test failures should be irrelevant to the doc change.

@jihoonson jihoonson merged commit 99494e3 into apache:master Jan 23, 2021
@clintropolis clintropolis added this to the 0.22.0 milestone Aug 12, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants