Support custom mergeable metrics in whylogs #241

andyndang · 2021-06-24T00:33:48Z

There are three kinds of metrics that whylogs users track:

1. Derived metrics from customers. These are typically numerical data. You can use the approach above to track them, because they show up as a “whylogs” column.
2. Custom metrics that are mergeable: metrics that can be “summed” or “aggregated” across different profiles. This is a feature request that we are tracking from other customers as well.
3. One-off metrics: sometimes users have one-off metrics that they want to piggyback on top of whylogs. These metrics are not aggregatable, but users want to use the whylogs object to store them.

andyndang · 2021-06-24T00:39:54Z

For 1, we're already doing this.

For 2, we'll need to support:

- Storing metrics (probably in binary form)
- A class that can handle the metric in Python
- A class that can handle the metric in Java

These classes will implement methods for serde (serialization/deserialization) and for merging metrics. So we get something like obj.to_bytes(), obj.parse_bytes(), obj.merge(another). I believe these should be sufficient for most use cases.

For 3, we can start with supporting numbers. But what's the behavior when merging two objects with "unmergeable" metrics? Should we throw an exception? Emit warnings? Drop the fields?
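
To make the obj.to_bytes() / obj.parse_bytes() / obj.merge(another) shape concrete, here is a minimal sketch. It is not whylogs code: the class name CountSumMetric, the track() helper, and the struct-based byte layout are all assumptions chosen for illustration.

```python
import struct


class CountSumMetric:
    """Hypothetical mergeable metric: tracks a count and a running sum."""

    # Assumed wire layout: little-endian int64 count followed by float64 total.
    _FORMAT = "<qd"

    def __init__(self, count: int = 0, total: float = 0.0):
        self.count = count
        self.total = total

    def track(self, value: float) -> None:
        self.count += 1
        self.total += value

    def to_bytes(self) -> bytes:
        return struct.pack(self._FORMAT, self.count, self.total)

    @classmethod
    def parse_bytes(cls, data: bytes) -> "CountSumMetric":
        count, total = struct.unpack(cls._FORMAT, data)
        return cls(count, total)

    def merge(self, another: "CountSumMetric") -> "CountSumMetric":
        # Counts and sums are additive, so merging is a field-wise sum.
        return CountSumMetric(self.count + another.count, self.total + another.total)


a = CountSumMetric()
for v in (1.0, 2.0):
    a.track(v)
b = CountSumMetric()
b.track(4.0)

# Round-trip one side through bytes before merging, as a stored profile would.
restored = CountSumMetric.parse_bytes(a.to_bytes())
merged = restored.merge(b)
```

A Java class would need to read and write the same byte layout for profiles to be portable between the two implementations.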

lalmei · 2021-06-24T01:01:37Z

I would focus on a simple custom metric class API, like:

class CustomMetric:
    def track(self, inputs): ...
    def merge(self, right_metric): ...

If there is no merge method then we can't merge, unless it inherits from something like numberTracker.
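
As a rough illustration of that rule, the sketch below shows one metric that defines merge and one that does not, plus a check for the difference. MaxLengthMetric, LastSeenMetric, and is_mergeable are hypothetical names, not part of whylogs.

```python
class MaxLengthMetric:
    """Hypothetical custom metric: length of the longest string seen."""

    def __init__(self):
        self.max_length = 0

    def track(self, inputs):
        for text in inputs:
            self.max_length = max(self.max_length, len(text))

    def merge(self, right_metric):
        merged = MaxLengthMetric()
        merged.max_length = max(self.max_length, right_metric.max_length)
        return merged


class LastSeenMetric:
    """Hypothetical one-off metric: it has track() but no merge()."""

    def __init__(self):
        self.last = None

    def track(self, inputs):
        for item in inputs:
            self.last = item


def is_mergeable(metric) -> bool:
    # A metric counts as mergeable only if it exposes a callable merge().
    return callable(getattr(metric, "merge", None))


left, right = MaxLengthMetric(), MaxLengthMetric()
left.track(["a", "abcd"])
right.track(["abcdefg"])
combined = left.merge(right)
```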

andyndang · 2021-06-24T01:03:51Z

That might work. We still need to:

- Decide whether to throw an error or drop the metrics when they can't be merged
- Decide how to store this (you'll need to convert it to and from bytes)
- Store information about the class and the metrics for later parsing
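
One way to keep the first decision open is to make it a caller-selected policy. The sketch below assumes custom metrics live in a plain dict keyed by name; merge_metric_maps, UnmergeableMetricError, the on_unmergeable flag, and the choice to carry over metrics present on only one side are all assumptions for illustration, not whylogs behavior.

```python
import warnings
from typing import Any, Dict


class UnmergeableMetricError(Exception):
    """Raised when a metric has no merge() and the policy is 'raise'."""


class Count:
    """Tiny mergeable metric used only to exercise the policy."""

    def __init__(self, n: int):
        self.n = n

    def merge(self, other: "Count") -> "Count":
        return Count(self.n + other.n)


def merge_metric_maps(left: Dict[str, Any], right: Dict[str, Any],
                      on_unmergeable: str = "raise") -> Dict[str, Any]:
    if on_unmergeable not in ("raise", "warn", "drop"):
        raise ValueError(f"unknown policy: {on_unmergeable!r}")
    merged: Dict[str, Any] = {}
    for name in sorted(set(left) | set(right)):
        if name not in left or name not in right:
            # Assumption: a metric present on only one side is carried over.
            merged[name] = left[name] if name in left else right[name]
            continue
        merge = getattr(left[name], "merge", None)
        if callable(merge):
            merged[name] = merge(right[name])
        elif on_unmergeable == "raise":
            raise UnmergeableMetricError(f"cannot merge metric {name!r}")
        elif on_unmergeable == "warn":
            warnings.warn(f"dropping unmergeable metric {name!r}")
        # "drop": omit the metric silently.
    return merged


# Plain numbers have no merge(), so "auc" is treated as unmergeable here.
left = {"rows": Count(2), "auc": 0.91}
right = {"rows": Count(3), "auc": 0.88}
dropped = merge_metric_maps(left, right, on_unmergeable="drop")
```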

ramannanda9 · 2021-06-29T17:59:59Z

A base class should be able to handle serialization pretty easily.

import abc
import dataclasses
from dataclasses import dataclass


@dataclass
class CustomMetric(abc.ABC):
    @abc.abstractmethod
    def track(self, inputs):
        pass

    @abc.abstractmethod
    def merge(self, right_metric: 'CustomMetric') -> 'CustomMetric':
        pass

    @abc.abstractmethod
    def name(self) -> str:
        pass

    @staticmethod
    def deserialize(name: str) -> 'CustomMetric':
        # implementation here: traverse the subclasses of 'CustomMetric'
        # and call the matching constructor
        raise NotImplementedError

    def serialize(self) -> dict:
        return {"name": self.name(), "params": dataclasses.asdict(self)}

lalmei · 2021-07-08T01:10:47Z

Except we need at least the protobuf de/serialization.

jamie256 · 2021-09-02T15:37:29Z

The protobuf message packing and then serialization is how our datasetprofile serializes all the associated metrics for a dataset when the logger writes, but I took your suggestion using a dataclass and doing most of that work in a base class. Here is a draft PR if you have comments: #300

jamie256 · 2022-05-31T21:45:06Z

Custom metrics participate in merging in v1:
https://github.com/whylabs/whylogs/blob/mainline/python/examples/basic/Merging_Profiles.ipynb
https://github.com/whylabs/whylogs/blob/dev/richard/custom-example/python/examples/extensions/Custom_Metrics.ipynb

andyndang added the feature label Jun 24, 2021

jamie256 self-assigned this May 3, 2022

jamie256 added the v1 label May 3, 2022

jamie256 closed this as completed May 31, 2022
