[Feature]: Overlap batch scheduling, model running, and node communications.

### Is your feature request related to a problem?

Performance is not optimal.

### Describe the Solution you'd like

Though we implemented `micro_batch` in the batch scheduling part, we haven't refactored `run_loop` to overlap `process_batch` with `send_multipart`. Also, similar to zero-overhead scheduling, we should hide the CPU latency introduced by the scheduling itself as well.

### Alternatives Considered (Optional)

_No response_

### Additional Context (Optional)

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature]: Overlap batch scheduling, model running, and node communications. #123

Is your feature request related to a problem?

Describe the Solution you'd like

Alternatives Considered (Optional)

Additional Context (Optional)

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature]: Overlap batch scheduling, model running, and node communications. #123

Description

Is your feature request related to a problem?

Describe the Solution you'd like

Alternatives Considered (Optional)

Additional Context (Optional)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions