[V0 Deprecation] Remove `vllm.worker` and update according imports #25901
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Code Review
This pull request effectively removes the deprecated `vllm.worker` directory and migrates its functionality to `vllm.v1`, successfully deprecating the V0 worker implementation. The changes are clean and well-executed. Key improvements include:
- Consolidating worker base classes into `vllm/v1/worker/worker_base.py`.
- Removing the insecure and deprecated usage of `cloudpickle` for dynamic worker class loading, replacing it with a clear error for unsupported configurations.
- Simplifying the worker selection logic in platform-specific files (`cuda.py`, `rocm.py`) to always use the V1 worker.
- Updating all relevant imports across the codebase to point to the new `vllm.v1.worker` path.
The refactoring makes the codebase cleaner, more secure, and easier to maintain by removing the old V0 logic. I have reviewed the changes and found no issues. The implementation is solid.
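To make the `cloudpickle` point above concrete, here is a minimal sketch of the general pattern: accept only a fully qualified class name and fail loudly on anything else, rather than deserializing an arbitrary payload. This is not the code from this PR; `resolve_worker_class` and its error messages are hypothetical names chosen for illustration, and the real vLLM logic may differ.

```python
import importlib
from typing import Any


def resolve_worker_class(worker_cls: Any) -> type:
    """Resolve a worker class from a 'package.module.ClassName' string.

    Hypothetical helper (not vLLM's API): anything that is not a string,
    such as bytes produced by a pickling library, is rejected with an
    explicit error instead of being deserialized.
    """
    if not isinstance(worker_cls, str):
        raise TypeError(
            "worker_cls must be a fully qualified class name string, "
            f"got {type(worker_cls).__name__}; serialized worker classes "
            "are not supported"
        )
    # Split 'pkg.mod.ClassName' into module path and class name.
    module_name, _, class_name = worker_cls.rpartition(".")
    if not module_name:
        raise ValueError(f"expected 'module.ClassName', got {worker_cls!r}")
    module = importlib.import_module(module_name)
    return getattr(module, class_name)
```

With this shape, `resolve_worker_class("collections.OrderedDict")` returns the class via a normal import, while passing a bytes payload raises `TypeError` up front. The design choice is that an import path can only name code already installed on the worker, whereas unpickling can execute arbitrary code.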
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Will wait to trigger CI to make sure there aren't any failures.
def execute_model(
    self,
    execute_model_req: Optional[ExecuteModelRequest] = None
) -> Optional[list[SamplerOutput]]:
    raise NotImplementedError
FYI, this method is also outdated. V1 doesn't use `ExecuteModelRequest` at all.
…25901) Signed-off-by: yewentao256 <zhyanwentao@126.com>
This PR removes the `vllm/worker` directory and migrates the relevant classes to `vllm/v1`.
Some behaviour changes include: `init_worker` (cc @russellb).
I haven't touched the ray executor part yet.
Signed-off-by: Aaron Pham contact@aarnphm.xyz