feat(logging): Distinguish logs from different models #1302

vtaskow · 2023-07-17T15:12:20Z

Why

Inspecting logs from different ML models and knowing which log comes from which model is difficult. Currently, all such logs are grouped under the same logger - mlserver.

How

Generally, there are several ways to accomplish this behaviour (using a built-in adapter, formatter, or filter). Using a custom log formatter with context variables (already used in the metrics package via a context manager) is a simple solution to this problem. Furthermore, no external libraries or custom config files are needed.

Outcome

Within a context scope where the model name and version are set through the context vars, every log line has the format [mlserver][model_name:model_version] LOGLEVEL - MSG. E.g.
2023-07-17 14:23:10,102 [mlserver][mushroom-xgboost:v0.1.0] INFO - Loaded model 'mushroom-xgboost' succesfully.
Or, if the model_version is not present:
2023-07-17 14:23:10,102 [mlserver][mushroom-xgboost] INFO - Loaded model 'mushroom-xgboost' succesfully.
Furthermore, the model context is now enabled when loading, reloading, and unloading models, and when predicting.
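
For readers who want a concrete picture, here is a minimal sketch of the approach described above: a formatter that reads the model name and version from context variables, plus a context manager that sets them. It is not the code merged in this PR. The names ModelContextFormatter, model_context, and LOG_FORMAT are hypothetical, and the empty-string defaults on the context variables are an assumption made to keep the example short.

```python
import logging
from contextlib import contextmanager
from contextvars import ContextVar

# Stand-ins for the context variables discussed in this PR.
# The empty-string defaults are an assumption of this sketch.
model_name_var: ContextVar[str] = ContextVar("model_name", default="")
model_version_var: ContextVar[str] = ContextVar("model_version", default="")

# Hypothetical format string; %(model)s is filled in by the formatter below.
LOG_FORMAT = "%(asctime)s [%(name)s]%(model)s %(levelname)s - %(message)s"


class ModelContextFormatter(logging.Formatter):
    """Log formatter that adds the current model name and version."""

    def format(self, record: logging.LogRecord) -> str:
        name = model_name_var.get()
        version = model_version_var.get()
        if not name:
            record.model = ""
        elif not version:
            record.model = f"[{name}]"
        else:
            record.model = f"[{name}:{version}]"
        return super().format(record)


@contextmanager
def model_context(name: str, version: str = ""):
    """Set the model name and version for the duration of the block."""
    name_token = model_name_var.set(name)
    version_token = model_version_var.set(version)
    try:
        yield
    finally:
        # Restore whatever values were active before entering the block.
        model_version_var.reset(version_token)
        model_name_var.reset(name_token)


formatter = ModelContextFormatter(LOG_FORMAT)
```

Attaching `formatter` to a handler with `handler.setFormatter(formatter)` and logging inside `with model_context("mushroom-xgboost", "v0.1.0"):` should produce lines in the format shown above. Outside the block the model segment is simply omitted.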

Resolves #602

mlserver/metrics/__init__.py

adriangonz

Great work @vtaskow !! I've added a small comment, but this should be ready to go. 🚀

mlserver/logging.py

agrski

Reviewing after the fact, but leaving a few suggestions that I think are worth discussing.

These are nice, simple changes - nice work @vtaskow 😄

I'm personally a big fan of structured, contextual logging, although I normally prefer it to be explicit--passing a configured logger into the appropriate logic. That's purely personal preference and a thought for offline discussion, rather than a comment on the changes in this PR.

agrski · 2023-07-19T11:37:57Z

mlserver/metrics/context.py

-
-model_name_var: ContextVar[str] = ContextVar("model_name")
-model_version_var: ContextVar[str] = ContextVar("model_version")
+from ..context import model_name_var, model_version_var

 SELDON_MODEL_NAME_LABEL = "model_name"


💭 Given that the name and version are used in both logging and metrics, it'd be nice to have constants (as far as Python allows) to ensure the same labels are available in both contexts. If the context vars are the source of truth, that would be var.name and var.get(), which I think is pretty straightforward.
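
One way to read this suggestion, as a sketch only: derive the label keys from each context variable's name attribute and the values from get(), so that logging and metrics cannot drift apart. The helper name context_labels is hypothetical, and the variables are redeclared here only to keep the example self-contained.

```python
from contextvars import ContextVar
from typing import Dict

# Redeclared for the example; in the PR these live in a shared context module.
model_name_var: ContextVar[str] = ContextVar("model_name")
model_version_var: ContextVar[str] = ContextVar("model_version")


def context_labels() -> Dict[str, str]:
    """Build a label dict keyed by each context variable's own name."""
    labels = {}
    for var in (model_name_var, model_version_var):
        # get(None) returns None instead of raising LookupError when unset.
        value = var.get(None)
        if value is not None:
            labels[var.name] = value
    return labels
```

With this shape, a label such as "model_name" is written once, in the ContextVar constructor, and reused everywhere else through var.name.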

agrski · 2023-07-19T11:58:57Z

mlserver/logging.py

+    A logging formatter that uses context variables to inject
+    the model name and version in the log message.


I thought the general pattern for Python docstrings is that the summary should be a single line, even for multi-line docstrings. E.g.

"""Log formatter incorporating model details, e.g. name and version."""

vtaskow · 2023-07-20

I wasn't aware of this PEP rule. Good spot, I will refactor 👍

agrski · 2023-07-19T12:11:14Z

mlserver/logging.py

+        if not name:
+            return ""
+        model_fmt = f"{name}:{version}" if version else name


🙃 Personally I find this slightly hard to read, even if it's concise. Having 3 explicit cases would be much more immediately obvious to me. Assuming Python 3.10, this could use match-case:

    def fmt(name, version):
        match name, version:
            case n, _ if not n:
                return ""
            case n, v if not v:
                return f"[{n}]"
            case n, v:
                return f"[{n}:{v}]"

This also only needs 1 format string per case--minor performance improvement and slightly easier to understand what's output.

For older Python versions, this could be:

    if not name:
        return ""
    elif not version:
        return f"[{name}]"
    else:
        return f"[{name}:{version}]"

This is arguably the most legible and broadly compatible definition, thanks to the concise use of Boolean contexts.

vtaskow · 2023-07-20

Oh, the pattern matching looks very elegant. Thanks, Alex! I will refactor it.

agrski · 2023-07-19T17:37:23Z

tests/test_logging.py

+
+
+@pytest.mark.parametrize(
+    "name, version, expected_fmt",


🔧 The implementation of the model logger allows the model name to be empty. This case should either be captured by a test case or it shouldn't be permitted because it indicates something has gone wrong (or hasn't been wired up correctly).

vtaskow · 2023-07-20

Good point! I missed that. Will fix in an upcoming PR.
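
As an illustration of the missing coverage, the sketch below lists the cases a table-driven test could exercise, including an empty or missing model name. The function fmt follows the if/elif form suggested earlier in this review and is not necessarily the helper in the PR. The plain loop stands in for pytest.mark.parametrize so the example runs without extra dependencies.

```python
def fmt(name, version):
    """Return the bracketed model segment, or "" when no name is set."""
    if not name:
        return ""
    elif not version:
        return f"[{name}]"
    else:
        return f"[{name}:{version}]"


# (name, version, expected_fmt), mirroring the parametrize signature above.
CASES = [
    ("mushroom-xgboost", "v0.1.0", "[mushroom-xgboost:v0.1.0]"),
    ("mushroom-xgboost", "", "[mushroom-xgboost]"),
    ("mushroom-xgboost", None, "[mushroom-xgboost]"),
    ("", "v0.1.0", ""),  # empty name: version alone is not logged
    ("", "", ""),
    (None, None, ""),
]


def test_fmt():
    for name, version, expected in CASES:
        assert fmt(name, version) == expected, (name, version)
```

If an empty name should instead be treated as a wiring error, the first branch could raise an exception rather than return an empty string, and the corresponding cases would assert on that.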

dtpryce · 2023-12-14T10:08:44Z

I was just testing with the master branch (1.4.0.dev3 or similar) and found that the model name appears in the logs for loading but not for inference. This is because the REST and gRPC servers have their own loggers, which are not configured via the mlserver logger, so you need to update the mlserver.rest and mlserver.grpc loggers to use the same configuration, if you want the model name and version in those log lines and it makes sense to do so.
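
A rough sketch of what such an update could look like, using only the standard logging module. The helper apply_formatter is hypothetical, and the sketch assumes the mlserver.rest and mlserver.grpc loggers have their own handlers attached. How MLServer actually wires these loggers is not shown in this thread.

```python
import logging
from typing import Iterable


def apply_formatter(logger_names: Iterable[str], formatter: logging.Formatter) -> None:
    """Set the given formatter on every handler of the named loggers."""
    for name in logger_names:
        for handler in logging.getLogger(name).handlers:
            handler.setFormatter(formatter)
```

It might be called as `apply_formatter(["mlserver.rest", "mlserver.grpc"], formatter)`, where `formatter` is the same model-aware formatter instance used by the mlserver logger. Loggers without handlers of their own are left untouched by this helper.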

vtaskow added 4 commits July 17, 2023 15:45

602 Distinguish logs from different models by introducing a new log f…

76f8651

…ormatter

Merge branch 'master' into 602-distinguish-logs-from-different-models

b809448

602 Move logging test to package

05d3228

602 Black

3f64211

vtaskow marked this pull request as ready for review July 17, 2023 15:20

vtaskow commented Jul 17, 2023

adriangonz self-requested a review July 17, 2023 15:24

vtaskow added 2 commits July 17, 2023 16:26

602 Fixed lint problem

8902b03

602 Fix import

61f3e93

adriangonz requested a review from agrski July 18, 2023 08:21

602 Flatten test logging folder and remove redundant import

4f27dca

adriangonz approved these changes Jul 18, 2023

602 Access method with self

c37065c

adriangonz merged commit 0258a64 into SeldonIO:master Jul 19, 2023
27 checks passed

agrski reviewed Jul 19, 2023
