Push parallelism isn't implemented in Hadoop batch ingestion

In `HadoopSegmentTarPushJobRunner.run()`, this is the code at the very end of the method:

``` java
    int pushParallelism = _spec.getPushJobSpec().getPushParallelism();
    if (pushParallelism < 1) {
      pushParallelism = segmentsToPush.size();
    }
    // Push from driver
    try {
      SegmentPushUtils.pushSegments(_spec, outputDirFS, segmentsToPush);
    } catch (RetriableOperationException | AttemptsExceededException e) {
      throw new RuntimeException(e);
    }
```

So `pushParallelism` is computed but never used: the push happens from the driver via `SegmentPushUtils.pushSegments()`, which pushes the segments sequentially on a single thread.
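For illustration, here is a minimal sketch of how the computed value could bound a thread pool, so that each segment is pushed as its own task. This is not Pinot code: `ParallelPushSketch`, `pushInParallel`, and the `pushOne` callback are hypothetical names, segments are represented as plain strings, and the sketch assumes the per-segment push is safe to call from several threads.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Consumer;

// Hypothetical sketch, not part of Pinot.
public class ParallelPushSketch {

  /**
   * Pushes every segment using at most pushParallelism threads.
   * pushOne stands in for whatever pushes a single segment (an assumption).
   */
  public static void pushInParallel(List<String> segmentsToPush, int pushParallelism,
      Consumer<String> pushOne) {
    if (segmentsToPush.isEmpty()) {
      return;
    }
    // Same fallback as the snippet above: a value below 1 means one thread per segment.
    int threads = pushParallelism < 1 ? segmentsToPush.size() : pushParallelism;
    // Never create more threads than there are segments.
    threads = Math.min(threads, segmentsToPush.size());

    ExecutorService executor = Executors.newFixedThreadPool(threads);
    try {
      List<Future<?>> futures = new ArrayList<>();
      for (String segment : segmentsToPush) {
        futures.add(executor.submit(() -> pushOne.accept(segment)));
      }
      // Wait for every push and surface the first failure to the caller.
      for (Future<?> future : futures) {
        future.get();
      }
    } catch (ExecutionException e) {
      throw new RuntimeException(e.getCause());
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new RuntimeException(e);
    } finally {
      // Stops the pool; cancels any tasks still pending after a failure.
      executor.shutdownNow();
    }
  }
}
```

In the runner, the callback would presumably wrap a single-segment push. Whether `SegmentPushUtils.pushSegments()` can safely be called concurrently with one-element lists is not something this issue establishes, so treat that as an open question.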

Push parallelism isn't implemented in Hadoop batch ingestion #6505
