[Feature]refactor ucconnector #167
Conversation
I can't find any location where `_load_req_to_blocks` is assigned, so the blocks that failed to load cannot be properly returned.
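The issue raised above can be illustrated with a minimal sketch. All class and method names here are hypothetical, chosen only to show the failure mode; they are not the actual connector API:

```python
# Minimal sketch of the concern above: if _load_req_to_blocks is never
# assigned anywhere, the failure path can only ever return an empty list.
# ConnectorSketch, schedule_load, and failed_blocks are hypothetical names.

class ConnectorSketch:
    def __init__(self):
        # request_id -> list of block ids scheduled for loading
        self._load_req_to_blocks: dict[str, list[int]] = {}

    def schedule_load(self, request_id: str, block_ids: list[int]) -> None:
        # Without an assignment like this somewhere in the connector,
        # the dict stays empty forever, which is the bug being pointed out.
        self._load_req_to_blocks[request_id] = list(block_ids)

    def failed_blocks(self, request_id: str) -> list[int]:
        # Blocks recorded for a request whose load failed; empty if the
        # request was never registered.
        return self._load_req_to_blocks.get(request_id, [])

c = ConnectorSketch()
c.schedule_load("req-1", [3, 5, 7])
print(c.failed_blocks("req-1"))  # [3, 5, 7]
print(c.failed_blocks("req-2"))  # []
```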
Force-pushed 50f88e0 to 50ab55d
The PR title should include a tag such as [feature] / [xx].
assert len(fetch_block_ids) == len(fetch_block_hashes)
blocks_len = len(fetch_block_ids)

storage_block_ids = [block[0] for block in request.load_blocks]
It would be nice to add some comments to help understand this code; the fact that the `block_hash` of `ReqMeta.load_blocks` is actually `storage_block_ids` could be confusing.
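One way to resolve the confusion is an inline comment spelling out the tuple layout. The following is purely illustrative; the `(storage_block_id, vllm_block_id)` layout is an assumption inferred from the snippet above, not taken from the real `ReqMeta` definition:

```python
# Illustrative sketch of the kind of comment being requested.
# The tuple layout (storage_block_id, vllm_block_id) is an assumption
# inferred from the diff, not the actual ReqMeta definition.
load_blocks = [("hash_a", 0), ("hash_b", 1), ("hash_c", 2)]

# Each entry of load_blocks is (storage_block_id, vllm_block_id):
# block[0] is the key used in external storage, so despite being called
# a "block_hash" it is really the storage-side block id.
storage_block_ids = [block[0] for block in load_blocks]
vllm_block_ids = [block[1] for block in load_blocks]
print(storage_block_ids)  # ['hash_a', 'hash_b', 'hash_c']
```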
Already added comments.
)

- if request.load_async:
+ if request.load_async and request.request_id in self.layerwise_load_tasks:
Is the check `request.request_id in self.layerwise_load_tasks` necessary? Same for the check below.
Just in case.
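The defensive check being discussed can be sketched as follows. The shape of the task table is an assumption for illustration only:

```python
# Sketch of the membership guard under discussion: touching the task
# table only when the request id is present avoids a KeyError for a
# request that was never registered (e.g. aborted before loading began).
layerwise_load_tasks: dict[str, list[str]] = {"req-1": ["layer0", "layer1"]}

def finish_load(request_id: str, load_async: bool) -> list[str]:
    # Guard: skip requests that have no registered layerwise tasks.
    if load_async and request_id in layerwise_load_tasks:
        return layerwise_load_tasks.pop(request_id)
    return []

print(finish_load("req-1", True))  # ['layer0', 'layer1']
print(finish_load("req-2", True))  # []  (no KeyError thanks to the guard)
```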
    + save_param.num_blocks_to_save
]
blocks_len = len(vllm_block_ids)
storage_block_ids = [block[0] for block in request.dump_blocks]
It would be nice to add some comments to help understand this code too.
Already added comments.
ucm/integration/vllm/uc_connector.py (outdated)
need_load_tokens = max(num_external_computed_tokens - num_computed_tokens, 0)
- # Load async when Decode instance need to load.
+ # Load async when Decode instance need to load.kv_consumer"
Remove the stray `kv_consumer`.
Fixed.
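The `max(..., 0)` clamp in the snippet above can be shown with a tiny worked example (the function wrapper is just for illustration):

```python
# Worked example of the clamp in the snippet above: the number of tokens
# to load from external storage is the excess of externally computed
# tokens over locally computed ones, floored at zero.
def need_load_tokens(num_external_computed_tokens: int,
                     num_computed_tokens: int) -> int:
    return max(num_external_computed_tokens - num_computed_tokens, 0)

print(need_load_tokens(128, 48))  # 80 -> 80 tokens must be loaded
print(need_load_tokens(48, 128))  # 0  -> local cache already covers it
```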
Force-pushed e475a01 to 628f6fe
Force-pushed 628f6fe to ac7b6a3
The log format should be unified.
Purpose
What this PR does / why we need it?
Fix the hole-match problem during lookup and create
Remove some unnecessary parameters when building metadata
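The PR does not spell out the hole-match fix, but the general idea can be sketched. This is entirely an assumption about hole matching in a block-cache lookup, not this PR's implementation:

```python
# Rough illustration of "hole matching" in a block-cache lookup (an
# assumption about the general idea, not this PR's actual code): only a
# contiguous prefix of cache hits is usable, so the scan must stop at
# the first missing block instead of skipping over holes.
def contiguous_hits(block_hashes: list[str], cache: set[str]) -> list[str]:
    hits: list[str] = []
    for h in block_hashes:
        if h not in cache:
            break  # a hole ends the usable prefix
        hits.append(h)
    return hits

cache = {"a", "b", "d"}
print(contiguous_hits(["a", "b", "c", "d"], cache))  # ['a', 'b'] ('c' is a hole)
```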
Modifications
Does this PR introduce any user-facing change?
No, just a metadata change.
Test
Tested by offline inference (screenshots attached).
Also used a benchmark to test the online service.
How was this patch tested?
No new test patch was added.