Skip to content

Conversation

@guojinrong-nn
Copy link
Collaborator

@guojinrong-nn guojinrong-nn commented Sep 8, 2025

Notice: this is the part-1/4 of multi_stream_parallel feature.

  1. New added
  • gflag : enable_multi_stream_parallel, default as false.
  • add member variables all_seqs_in_prefill_ in class Batch, to ensure the whole batch, all sequences in prefill or chunked prefill stage, to use this feature. If there is decode sequence, will not use this feature.
  • add concat method in class SamplingParameters, we split two micro batched to execute model execution individually, after this, will execute the sample operation together, to avoid extra small kernel launch.
  1. Besides, refactor two places:
  • delete useless function definition split() in xllm/core/framework/batch/batch.h.
  • delete useless set_dp_rank() logic in xllm/core/framework/batch/batch_factory.cpp, dp_rank is already set in block manager.

@guojinrong-nn guojinrong-nn changed the title feat: support multi_stream_parallel part-1 add flag and modify related batch sampler modules. feat: support multi_stream_parallel and modify related batch sampler modules. [1/4] Sep 9, 2025
@liutongxuan liutongxuan changed the title feat: support multi_stream_parallel and modify related batch sampler modules. [1/4] feat: support multi_stream_parallel and modify batch sampler. [1/4] Sep 9, 2025
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Submodule xllm_ops now uses a8e8b1, no need to fallback to 98b515

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated.

@guojinrong-nn guojinrong-nn force-pushed the feat_prefill_multi_streams_parallel branch 2 times, most recently from 7dd5e35 to c6e0d53 Compare September 9, 2025 08:32
@guojinrong-nn guojinrong-nn force-pushed the feat_prefill_multi_streams_parallel branch from 9b82d3f to f883018 Compare September 10, 2025 10:58
@liutongxuan liutongxuan merged commit 06addc9 into jd-opensource:main Sep 10, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants