Ensure metrics are logged regardless of requests #2347

ichernev · 2024-01-04T17:39:29Z

Metrics are currently logged at the end of each step, but when there are no requests nothing new is logged, so the last recorded values keep being reported to Prometheus indefinitely.

Also, for some reason, it always reports one running request.
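
To illustrate the stale-value problem described above, here is a minimal sketch rather than vLLM's actual code. `ToyEngine`, `log_stats_forever`, and the `logged` list are hypothetical names made up for this example; only the method name `do_log_stats` comes from this PR. The idea is that a background task records stats on a timer, so an idle engine still reports fresh (zero) values instead of repeating the last step's numbers.

```python
import asyncio


class ToyEngine:
    """Hypothetical stand-in for an engine; not vLLM's real class."""

    def __init__(self):
        self.num_running = 0
        self.logged = []  # stands in for the values exported as metrics

    def step(self):
        # Old behaviour: stats are only recorded at the end of a step.
        self.logged.append(self.num_running)

    def do_log_stats(self):
        # Recording stats outside step() lets an idle engine report zeros.
        self.logged.append(self.num_running)


async def log_stats_forever(engine, interval_s):
    # Runs on a timer, whether or not any request triggers step().
    while True:
        await asyncio.sleep(interval_s)
        engine.do_log_stats()


async def demo():
    engine = ToyEngine()
    engine.num_running = 1
    engine.step()           # last step, with one request in flight
    engine.num_running = 0  # the request finishes; step() is never called again
    task = asyncio.create_task(log_stats_forever(engine, 0.01))
    await asyncio.sleep(0.05)
    task.cancel()
    return engine.logged


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

Without the background task, `logged` would stay at `[1]` forever, which mirrors the "last values are reported indefinitely" symptom.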

simon-mo · 2024-01-04T18:16:35Z

Nice catch! A better UX, though, would be to reset it just once, so vLLM doesn't spam the same idle-status log message every 10s. I'm having trouble coming up with a clean way to do that.

ichernev · 2024-01-04T19:42:40Z

If the instance is running in Kubernetes, /metrics is queried every 15s and each query shows up in the logs. If you really want to avoid the vLLM log part (where it logs zeros over and over), I can add some ifs. I don't think it's needed, but let me know.

ichernev · 2024-01-04T19:59:03Z

@simon-mo I'm a bit confused about the ray/remote logic. Should I call do_log_stats via self.engine.do_log_stats.remote(), or is there something I'm missing?


simon-mo · 2024-01-05T13:21:47Z

vllm/engine/async_llm_engine.py

+        if self.engine_use_ray:
+            await self.engine.do_log_stats.remote()


Yeah this is right.
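
For readers unfamiliar with the pattern in the snippet above, here is a self-contained sketch of the dispatch it implies. It does not import Ray: `FakeRemoteMethod` and `FakeActorHandle` are hypothetical stand-ins that only imitate the shape of a Ray actor method, whose `.remote()` call returns something awaitable. `AsyncWrapper` and `LocalEngine` are likewise illustrative names, and the `else` branch is an assumption, since the diff shown here only includes the Ray branch.

```python
import asyncio


class LocalEngine:
    """Hypothetical in-process engine that counts stats-logging calls."""

    def __init__(self):
        self.calls = 0

    def do_log_stats(self):
        self.calls += 1


class FakeRemoteMethod:
    """Imitates a Ray actor method: .remote() returns an awaitable."""

    def __init__(self, fn):
        self._fn = fn

    def remote(self, *args):
        async def _call():
            return self._fn(*args)

        return _call()


class FakeActorHandle:
    """Hypothetical stand-in for a Ray actor handle wrapping an engine."""

    def __init__(self, engine):
        self.do_log_stats = FakeRemoteMethod(engine.do_log_stats)


class AsyncWrapper:
    """Illustrative wrapper; the attribute names follow the diff above."""

    def __init__(self, engine, engine_use_ray):
        self.engine = engine
        self.engine_use_ray = engine_use_ray

    async def do_log_stats(self):
        if self.engine_use_ray:
            # Remote engine: go through .remote() and await the result.
            await self.engine.do_log_stats.remote()
        else:
            # Assumed local path: call the method directly.
            self.engine.do_log_stats()
```

Either way the caller simply awaits `AsyncWrapper.do_log_stats()`, so the periodic logging code does not need to know where the engine lives.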

simon-mo · 2024-01-05T13:25:25Z

@ichernev thanks again. I think the current state is fine, so I've merged it. We can revisit this if people find the logs too spammy.

simon-mo self-assigned this Jan 4, 2024

ichernev force-pushed the fix-metrics branch from 96cd969 to f775cf1 Compare January 4, 2024 18:43

ichernev marked this pull request as draft January 4, 2024 19:42

Ensure metrics are logged regardless of requests

3d39d1f

Metrics are currently logged at the end of each step, but if there are no requests there are no new logs/metrics, so the last values are reported to prometheus indefinitely. Also, for some reason, it always reports one running request.

ichernev force-pushed the fix-metrics branch from 7f374af to 3d39d1f Compare January 4, 2024 21:09

ichernev marked this pull request as ready for review January 4, 2024 21:11

simon-mo approved these changes Jan 5, 2024


simon-mo merged commit d0215a5 into vllm-project:main Jan 5, 2024
2 checks passed

jedibrillo pushed a commit to jedibrillo/vllm that referenced this pull request Jan 5, 2024

Ensure metrics are logged regardless of requests (vllm-project#2347)

94433e1

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Jan 18, 2024

Ensure metrics are logged regardless of requests (vllm-project#2347)

edebc90

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Ensure metrics are logged regardless of requests (vllm-project#2347)

209c767
