-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Separate hadoop and native batch docs more #6120
Conversation
76edf45
to
790a8e3
Compare
👍 |
If you are having dependency conflicts between Druid and your version of Hadoop, you can try | ||
searching for a solution in the [Druid user groups](https://groups.google.com/forum/#!forum/druid- | ||
user), or reading the Druid [Different Hadoop Versions](../operations/other-hadoop.html) documentation. | ||
Hadoop can be used for batch ingestion. The Hadoop-based batch ingestion will be faster and more scalable than the native batch ingestion. See [here](../ingestion/hadoop.html) for more details. | ||
|
||
## Command Line Hadoop Indexer | ||
|
||
If you don't want to use a full indexing service to use Hadoop to get data into Druid, you can also use the standalone command line Hadoop indexer. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel like this 'option' for the command line Hadoop indexer makes more sense as a note at the beginning of the Hadoop indexing page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moved this to the beginning of the hadoop page
docs/content/toc.md
Outdated
@@ -17,6 +17,8 @@ layout: toc | |||
* [Schema Design](/docs/VERSION/ingestion/schema-design.html) | |||
* [Schema Changes](/docs/VERSION/ingestion/schema-changes.html) | |||
* [Batch File Ingestion](/docs/VERSION/ingestion/batch-ingestion.html) | |||
* [Native Batch Ingestion](docs/VERSION/native-batch.html) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/ingestion/native-batch.html?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks, fixed
docs/content/toc.md
Outdated
@@ -17,6 +17,8 @@ layout: toc | |||
* [Schema Design](/docs/VERSION/ingestion/schema-design.html) | |||
* [Schema Changes](/docs/VERSION/ingestion/schema-changes.html) | |||
* [Batch File Ingestion](/docs/VERSION/ingestion/batch-ingestion.html) | |||
* [Native Batch Ingestion](docs/VERSION/native-batch.html) | |||
* [Hadoop Batch Ingestion](docs/VERSION/hadoop.html) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/ingestion/hadoop.html?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed
|
||
## Segment publishing modes | ||
|
||
While ingesting data using the Index task, it creates segments from the input data and publishes them. For segment publishing, the Index task supports two segment publishing modes, i.e., _bulk publishing mode_ and _incremental publishing mode_ for [perfect rollup and best-effort rollup](./design/index.html), respectively. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should be ../design
instead of ./design
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i updated this in ingestion/native_tasks.md
(current page for native batch indexing with the parallel index task)
Slight adjustment to batch ingestion docs, creating a separate page for hadoop and restructuring the existing batch ingestion page to be a directory page pointing to specific task type docs