Motivation
Support profiling of workload for different frameworks.
References
vllm
sglang
Proposed Solution
Goal is to enable profiling for a phase by a boolean flag, and have the client send a start/end profile request around the requests.
Alternatives Considered
No response
Additional Context
No response
Motivation
Support profiling of workload for different frameworks.
References
vllm
sglang
Proposed Solution
Goal is to enable profiling for a phase by a boolean flag, and have the client send a start/end profile request around the requests.
Alternatives Considered
No response
Additional Context
No response