Minion Batch ingestion scheduling bottleneck #11282

Closed
t0mpere opened this issue Aug 7, 2023 · 4 comments · Fixed by #11315

Comments

@t0mpere
Contributor

t0mpere commented Aug 7, 2023

Hello, I've been trying to debug why scheduling SegmentGenerationAndPushTask Minion jobs takes so long, and I've narrowed the problem down to this part of the code.

JobConfig.Builder jobBuilder =
    new JobConfig.Builder().addTaskConfigs(helixTaskConfigs).setInstanceGroupTag(minionInstanceTag)
        .setTimeoutPerTask(taskTimeoutMs).setNumConcurrentTasksPerInstance(numConcurrentTasksPerInstance)
        .setIgnoreDependentJobFailure(true).setMaxAttemptsPerTask(1).setFailureThreshold(Integer.MAX_VALUE)
        .setExpiry(_taskExpireTimeMs);
_taskDriver.enqueueJob(getHelixJobQueueName(taskType), parentTaskName, jobBuilder);
// Wait until task state is available
while (getTaskState(parentTaskName) == null) {
  Uninterruptibles.sleepUninterruptibly(100, TimeUnit.MILLISECONDS);
}
return parentTaskName;

I'm currently using the POST /tasks/execute API to schedule the job.
The culprit seems to be the while loop waiting for the task to get a state. I'm not familiar with how Helix handles this in the background. Do you think it would be possible to avoid looping on the synchronized getTaskState() and instead implement a callback to get the result of a job scheduling?
This is a big deal for us, since scheduling takes longer than the ingestion itself and prevents us from keeping up with new data and scaling.
It might also be a misconfiguration problem, but in that case I will need your help to find it.
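
To illustrate what I mean, here is a rough sketch (not a patch; the 30-second deadline is just an arbitrary number I picked for the example, and all other names come from the snippet above) of how the tail of that code could at least fail fast instead of blocking the scheduler indefinitely. A callback-based notification would of course be even better:

_taskDriver.enqueueJob(getHelixJobQueueName(taskType), parentTaskName, jobBuilder);
// Sketch: wait for Helix to materialize the task state, but give up after an
// arbitrary example deadline instead of looping indefinitely.
long deadlineMs = System.currentTimeMillis() + 30_000L;
while (getTaskState(parentTaskName) == null) {
  if (System.currentTimeMillis() >= deadlineMs) {
    throw new IllegalStateException("Timed out waiting for task state of: " + parentTaskName);
  }
  Uninterruptibles.sleepUninterruptibly(100, TimeUnit.MILLISECONDS);
}
return parentTaskName;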

Current configuration:
GKE
version 0.12.1
GCS for deep storage
3 ZK - 8 CPU and 18GB ram
6 Servers - 16CPU and 32 64GB ram 1.45TB SSD
2 Controllers - 16 CPU and 32GB ram
2 Brokers - 5 CPU 16.25GB ram
32 Minions - 2 CPU and 2GB of ram

1M Segments 4TB of data

@Jackie-Jiang
Contributor

cc @snleee

Do you see the log line Submitting parent task... before the scheduling call returns? Usually the task state should be available very soon, so we need to figure out whether it is creating the tasks (in SegmentGenerationAndPushTaskGenerator) or scheduling them that takes the time.

@t0mpere
Contributor Author

t0mpere commented Aug 8, 2023

OK, so these are the logs from one job scheduling run. As you can see, task generation is very quick and the scheduling seems to be the bottleneck: 19 seconds pass between generation and the response. Let me know if you need any more info.

2023-08-04 16:50:00.118 BST Trying to create tasks of type: SegmentGenerationAndPushTask, table: TABLE
2023-08-04 16:50:00.434 BST Submitting ad-hoc task for task type: SegmentGenerationAndPushTask with task configs: [...]
2023-08-04 16:50:00.452 BST Submitting parent task: Task_SegmentGenerationAndPushTask_TABLE_c826881f-a0c5-48d0-bb58-8a43b3037b60 of type: SegmentGenerationAndPushTask with 1 child task configs
2023-08-04 16:50:00.456 BST Add job configuration TaskQueue_SegmentGenerationAndPushTask_Task_SegmentGenerationAndPushTask_TABLE_c826881f-a0c5-48d0-bb58-8a43b3037b60
2023-08-04 16:50:19.144 BST Handled request from 10.00.00.00 POST http://prod.host:80/tasks/execute, content-type application/json status code 200 OK
2023-08-04 16:50:19.376 BST Trying to create tasks of type: SegmentGenerationAndPushTask, table: TABLE_OTHER

@Jackie-Jiang
Contributor

19 seconds is normal for Helix to create the task state after a task is submitted, because several steps are driven by ZK watcher callbacks.
We shouldn't need to wait for the task state to show up, though. It seems to be a workaround, introduced in #1894, for a Helix bug. Since we have already upgraded to a newer Helix version, let me see if we can remove the workaround.
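
For reference, removing the workaround would reduce the snippet from the issue description to roughly the following (just a sketch of the direction, not necessarily the actual change):

// Sketch: enqueue the job and return right away; rely on Helix to create the
// task state asynchronously instead of blocking the scheduler on it.
_taskDriver.enqueueJob(getHelixJobQueueName(taskType), parentTaskName, jobBuilder);
return parentTaskName;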

@t0mpere
Contributor Author

t0mpere commented Aug 10, 2023

Thanks, this will help a lot 🚀
