ml-cube · alelavml3 · Nov 4, 2024 · Oct 14, 2024 · Oct 14, 2024 · Oct 16, 2024
diff --git a/md-docs/imgs/monitoring/drift-explainability/concept-fi.svg b/md-docs/imgs/monitoring/drift-explainability/concept-fi.svg
diff --git a/md-docs/imgs/monitoring/drift-explainability/fi.svg b/md-docs/imgs/monitoring/drift-explainability/fi.svg
diff --git a/md-docs/imgs/monitoring/drift-explainability/score.svg b/md-docs/imgs/monitoring/drift-explainability/score.svg
diff --git a/md-docs/imgs/monitoring/overview.svg b/md-docs/imgs/monitoring/overview.svg
diff --git a/md-docs/imgs/monitoring/states.svg b/md-docs/imgs/monitoring/states.svg
diff --git a/md-docs/stylesheets/extra.css b/md-docs/stylesheets/extra.css
@@ -30,4 +30,12 @@
   background-color: rgb(43, 155, 70);
   -webkit-mask-image: var(--md-admonition-icon--code-block);
           mask-image: var(--md-admonition-icon--code-block);
-}
+}
+
+.nice-list ul{
+    list-style-type: circle;
+}
+
+.mermaid {
+    text-align: center;
+}
diff --git a/md-docs/user_guide/data.md b/md-docs/user_guide/data.md
@@ -19,7 +19,7 @@ Available categories are:
 The [Data Schema] created for the [Task] contains a list of Column objects, each of which has a _Role_.
 Naturally, there is a relationship between the Column's Role and the Data Category.
 In fact, each Data Category comprises a set of Column objects with certain Roles.
-So that, when you upload samples belonging to a Data Category, they must contains all the Columns objects declared on the Data Schema to be considered valid.
+When you upload samples belonging to a Data Category, they must contain all the Columns objects declared on the Data Schema to be considered valid.
 
 The following table shows these relationships:
 
@@ -130,7 +130,7 @@ For RAG Tasks, reference data can be used to indicate the type of data expected
     You can set reference data as follow:
 
     ``` py
-    job_id = job_id = client.set_model_reference(
+    job_id = client.set_model_reference(
         model_id=model_id,
         from_timestamp=from_timestamp,
         to_timestamp=to_timestamp,

diff --git a/md-docs/user_guide/detection_event_rules.md b/md-docs/user_guide/detection_event_rules.md
diff --git a/md-docs/user_guide/index.md b/md-docs/user_guide/index.md
@@ -53,7 +53,7 @@ A **Task** is specified by several attributes, the most important are:
 
 - `type`: regression, classification, object detection ...
 - `data structure`: tabular data, image data, ...
-- `optional target`: if the target is not always available. This happen when input samples are labeled and the most part of production data do not have a label
+- `optional target`: whether the target is not always available. This happens when some input samples are labeled but most production data do not have a label
 - `data schema`: specifies the inputs and the target of the task, see [Data Schema](data_schema.md) section for more details
 - `cost info`: information about the economic costs of the error on the target
 
@@ -110,7 +110,7 @@ Now that you have clear the basic concepts we invite you to explore the other ML
 
     Discover how to setup automation rules to increase your reactivity.
 
-    [:octicons-arrow-right-24: More info](detection_event_rules.md)
+    [:octicons-arrow-right-24: More info](monitoring/detection_event_rules.md)
 
 -   :material-lock:{ .lg .middle } **Roles and access**
 

diff --git a/md-docs/user_guide/model.md b/md-docs/user_guide/model.md
@@ -1 +1,20 @@
-# Model
+# Model
+
+
+
+
+[//]: # ()
+[//]: # ()
+[//]: # (What is additional probabilistic output?)
+
+[//]: # ()
+[//]: # (What is metric?)
+
+[//]: # ()
+[//]: # (What is suggestion type?)
+
+[//]: # ()
+[//]: # (What is retraining cost?)
+
+[//]: # ()
+[//]: # (What is retraining trigger?)
diff --git a/md-docs/user_guide/modules/index.md b/md-docs/user_guide/modules/index.md
@@ -13,15 +13,15 @@ Modules can be always active or on-demand: Monitoring module and Drift Explainab
 
     Data drift detection over data.
 
-    [:octicons-arrow-right-24: More info](user_guide/company.md)
+    [:octicons-arrow-right-24: More info](../monitoring/index.md)
 
 -   :material-compare:{ .lg .middle } **Drift Explainability**
 
     ---
 
     Understand the nature of detected drift.
 
-    [:octicons-arrow-right-24: More info](user_guide/modules/index.md)
+    [:octicons-arrow-right-24: More info](../monitoring/drift_explainability.md)
 
 -   :material-speedometer:{ .lg .middle } **Retraining**
 

diff --git a/md-docs/user_guide/monitoring/detection_event.md b/md-docs/user_guide/monitoring/detection_event.md
@@ -0,0 +1,49 @@
+# Detection Event
+
+A Detection Event is raised by the ML cube Platform when a significant change is detected in one of the entities being monitored.
+
+An event is characterized by the following attributes:
+
+- `Event Type`: the type of the event. Its possible values are:
+        <div class="nice-list">
+            <ul>
+                <li> `Warning On`: the monitoring entity is experiencing slight changes that might lead to a drift.</li>
+                <li> `Warning Off`: the monitoring entity has returned to the reference distribution. </li>
+                <li> `Drift On`: the monitoring entity has drifted from the reference distribution.</li>
+                <li> `Drift Off`: the monitoring entity has returned to the reference distribution.</li>
+            </ul>
+        </div>
+- `Severity`: the severity of the event. It is provided only for drift events and can be `Low`, `Medium`, or `High`.
+- `Monitoring Target`: the [Monitoring Target](index.md#monitoring-targets) being monitored.
+- `Monitoring Metric`: the [Monitoring Metric](index.md#monitoring-metrics) being monitored.
+- `Model Name`: the name of the model that raised the event. It's present only if the event is related to a model.
+- `Model Version`: the version of the model that raised the event. It's present only if the event is related to a model.
+- `Insert datetime`: the time when the event was raised.
+- `Sample timestamp`: the timestamp of the sample that triggered the event.
+- `Sample customer ID`: the id of the customer that triggered the event.
+- `User feedback`: the feedback provided by the user on whether the event was expected or not.
+
+## Retrieve Detection Events
+
+You can access the detection events generated by the Platform in two ways:
+
+- **SDK**: it can be used to retrieve all detection events for a specific task programmatically.
+- **WebApp**: navigate to the **`Detection`** section in the sidebar of the task page. All detection events are displayed there in a table, 
+   with multiple filtering options for easier event management. The latest detection events are also shown on the Task homepage,
+   in the section named "Latest Detection Events".
+
+## User Feedback
+
+When a detection event is raised, you can provide feedback on whether the event was expected or not. This feedback is then used 
+to tune the monitoring algorithms and improve their performance. The feedback can be provided through the WebApp, in the
+**`Detection`** section of the task page, or through the SDK.
+
+
+## Detection Event Rules
+
+To automate actions upon the reception of a detection event, you can set up detection event rules. 
+You can learn more about how to configure them in the [Detection Event Rules] section.
+
+
+[Monitoring]: index.md
+[Detection Event Rules]: detection_event_rules.md
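
The attribute list added in `detection_event.md` can be read as a small data structure. The following is a minimal sketch and not the SDK's own model: the class and field names (`EventType`, `Severity`, `DetectionEvent`) are hypothetical, and only a subset of the documented attributes is included. It encodes the one constraint the page states explicitly, namely that severity is provided only for drift events.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional


class EventType(Enum):
    # The four event types listed in the documentation.
    WARNING_ON = "warning_on"
    WARNING_OFF = "warning_off"
    DRIFT_ON = "drift_on"
    DRIFT_OFF = "drift_off"


class Severity(Enum):
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"


@dataclass(frozen=True)
class DetectionEvent:
    """Hypothetical container mirroring a subset of the documented attributes."""

    event_type: EventType
    monitoring_target: str
    monitoring_metric: str
    insert_datetime: datetime
    severity: Optional[Severity] = None       # only for drift events
    model_name: Optional[str] = None          # only for model-related events
    model_version: Optional[str] = None       # only for model-related events

    def __post_init__(self) -> None:
        is_drift = self.event_type in (EventType.DRIFT_ON, EventType.DRIFT_OFF)
        if self.severity is not None and not is_drift:
            raise ValueError("severity is only provided for drift events")
```

Making the optional attributes default to `None` keeps warning events and events unrelated to a model representable without placeholder values.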
diff --git a/md-docs/user_guide/monitoring/detection_event_rules.md b/md-docs/user_guide/monitoring/detection_event_rules.md
@@ -0,0 +1,66 @@
+# Detection Event Rules
+
+This section outlines how to configure automation to receive notifications or start retraining after a [Detection Event] occurs.
+
+When a detection event is produced, the ML cube Platform reviews all the detection event rules you have set 
+and triggers those matching the event.
+
+Rules are specific to a task and are characterized by the following attributes:
+
+- `Name`: a descriptive label of the rule.
+- `Detection Event Type`: the type of event that triggers the rule.
+- `Severity`: the severity of the event that triggers the rule. It is only applicable to drift events. If not specified, the rule will be triggered by drift events of any severity.
+- `Monitoring Target`: the [Monitoring Target](index.md#monitoring-targets) whose event should trigger the rule. 
+- `Monitoring Metric`: the [Monitoring Metric](index.md#monitoring-metrics) whose event should trigger the rule.
+- `Model name`: the name of the model to which the rule applies. This is only required when the monitoring target is related to a model
+  (such as `ERROR` or `PREDICTION`).
+- `Actions`: A list of actions to be executed sequentially when the rule is triggered.
+
+## Detection Event Actions
+Three types of actions are currently supported: notification, plot configuration and retrain.
+
+### Notifications
+
+These actions send notifications to external services when a detection event is triggered. The following notification actions are available:
+
+- `Slack Notification`: sends a notification to a Slack channel via webhook.
+- `Discord Notification`: sends a notification to a Discord channel via webhook.
+- `Email Notification`: sends an email to the provided email address.
+- `Teams Notification`: sends a notification to Microsoft Teams via webhook.
+- `Mqtt Notification`: sends a notification to an MQTT broker.
+
+### Plot Configuration
+
+This action creates two plot configurations when a detection event is triggered: the first one includes
+data preceding the event, while the second one includes data following the event.
+
+### Retrain
+
+The Retrain Action enables the automatic retraining of your model. Therefore, it is only available when the target of the rule is related to a model.
+The retrain action does not need any parameter because the model to retrain is automatically inferred from the `Model Name` attribute of the rule.
+The model must already have a retrain trigger associated with it before you set up this action.
+
+!!! example
+    The following code snippet demonstrates how to create a rule that matches high severity drift events on the error of a model. 
+    When triggered, it first sends a notification to the `ml3-platform-notifications` channel on your Slack workspace, using the 
+    provided webhook URL, and then starts the retraining of the model.
+
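+    ```py
+    # Import paths assume the `ml3_platform_sdk` package layout; adjust them to your SDK version.
+    from ml3_platform_sdk.enums import DetectionEventSeverity, DetectionEventType, MonitoringTarget
+    from ml3_platform_sdk.models import RetrainAction, SlackNotificationAction
+
+    # `client` is an already authenticated SDK client.
+    rule_id = client.create_detection_event_rule(
+        name='Retrain model with notification',
+        task_id='my-task-id',
+        model_name='my-model',
+        severity=DetectionEventSeverity.HIGH,
+        detection_event_type=DetectionEventType.DRIFT_ON,
+        monitoring_target=MonitoringTarget.ERROR,
+        # Actions are executed sequentially: notify first, then retrain.
+        actions=[
+            SlackNotificationAction(
+                webhook='https://hooks.slack.com/services/...',
+                channel='ml3-platform-notifications'
+            ),
+            RetrainAction()
+        ],
+    )
+    ```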
+
+[Detection Event]: detection_event.md
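
The matching behaviour described in `detection_event_rules.md` (every rule of the task is reviewed and those matching the event are triggered) can be sketched as follows. This is an illustration under stated assumptions, not the Platform's implementation: `Event`, `Rule`, and `triggered_rules` are hypothetical names, plain strings stand in for the SDK enums, and treating an unset `monitoring_metric` or `model_name` as a wildcard is an assumption. Only the "unspecified severity matches any severity" behaviour comes from the page itself.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Event:
    event_type: str
    monitoring_target: str
    monitoring_metric: str
    severity: Optional[str] = None
    model_name: Optional[str] = None


@dataclass(frozen=True)
class Rule:
    name: str
    event_type: str
    monitoring_target: str
    monitoring_metric: Optional[str] = None  # assumption: None matches any metric
    severity: Optional[str] = None           # documented: None matches any severity
    model_name: Optional[str] = None         # assumption: None matches any model

    def matches(self, event: Event) -> bool:
        if self.event_type != event.event_type:
            return False
        if self.monitoring_target != event.monitoring_target:
            return False
        # Optional attributes constrain the match only when they are set.
        optional_pairs = (
            (self.monitoring_metric, event.monitoring_metric),
            (self.severity, event.severity),
            (self.model_name, event.model_name),
        )
        return all(want is None or want == got for want, got in optional_pairs)


def triggered_rules(rules: List[Rule], event: Event) -> List[Rule]:
    """Return the rules matching the event, in their original order."""
    return [rule for rule in rules if rule.matches(event)]


rules = [
    Rule("any-severity", "DRIFT_ON", "ERROR", model_name="my-model"),
    Rule("high-only", "DRIFT_ON", "ERROR", severity="HIGH", model_name="my-model"),
]
event = Event("DRIFT_ON", "ERROR", "some-metric", severity="LOW", model_name="my-model")
print([rule.name for rule in triggered_rules(rules, event)])  # ['any-severity']
```

A low severity drift triggers only the rule that leaves severity unset, which is the behaviour the `Severity` attribute describes.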
diff --git a/md-docs/user_guide/monitoring/drift_explainability.md b/md-docs/user_guide/monitoring/drift_explainability.md
@@ -0,0 +1,64 @@
+# Drift Explainability
+
+[Monitoring] is a crucial aspect of the machine learning lifecycle, as it enables tracking the model's performance and its data over time,
+ensuring the model continues to function as expected. However, monitoring alone is not enough when it comes to the adaptation phase.
+
+To make the right decisions, you need to understand the main factors that led to the drift in the first place, so that
+the correct actions can be taken to mitigate it.
+
+The ML cube Platform supports this process by offering what we refer to as **Drift Explainability Reports**, 
+which are automatically generated upon the detection of a drift and contain several elements that should help you diagnose the root causes 
+of the change that occurred.
+
+You can access the reports in the WebApp, by navigating to the `Drift Explainability` tab in the sidebar of the Task page.
+
+## Structure
+
+A Drift Explainability Report compares the reference data with the portion of production data where the drift was identified, that is, 
+the samples belonging to the new data distribution. Notice that these reports are generated only after a sufficient number of samples has been collected 
+after the drift, in order to ensure the statistical reliability of the results.
+If the data distribution moves back to the reference before enough samples are collected, the report might not be generated.
+
+Each report is composed of several entities, each providing a different perspective on the data and the drift that occurred. 
+Most of them are specific to a certain `Data Structure`, so they might not be available for all tasks.
+
+These entities can take the form of tables, plots, or textual explanations. 
+Observed and analyzed together, they should provide a comprehensive understanding of the drift and its underlying causes.
+These are the entities currently available:
+
+- `Feature Importance`: a bar plot that illustrates how the importance of each feature differs between the reference 
+ and the production datasets. Variations in a feature's importance might suggest that its contribution to the model's predictions 
+ has changed over time. This entity is available only for tasks with tabular data.
+
+<figure markdown>
+  ![Feature Importance](../../imgs/monitoring/drift-explainability/fi.svg)
+  <figcaption>Example of a feature importance plot.</figcaption>
+</figure>
+
+- `Variable discriminative power`: a bar plot that displays the influence of each feature, as well as the target, 
+ in differentiating between the reference and the production datasets. 
+ The values represent how strongly a given variable helps to distinguish the datasets, with higher values representing stronger 
+ separating power. This entity is available only for tasks with tabular data.
+
+<figure markdown>
+  ![Variable discriminative power](../../imgs/monitoring/drift-explainability/concept-fi.svg)
+  <figcaption>Example of a variable discriminative power plot.</figcaption>
+</figure>
+
+- `Drift Score`: a line plot that shows the evolution of the drift score over time. The drift score is a 
+  measure of the statistical distance between a sliding window of the production data and the reference data. The plot also shows the threshold,
+  which is the value that the drift score must exceed to raise a drift alarm, and all the [Detection Events] that were triggered in
+  the time frame of the report. This plot helps in understanding how the drift evolved over time and when the difference
+  between the two datasets was largest. Notice that some postprocessing is applied to the events to account for how the drift detection algorithms work. 
+  Specifically, we shift the `Drift On` events back by a certain offset, aiming to point at the precise time when the drift actually started. As a result,
+  `Drift On` events might be shown before the threshold is exceeded. This explainability entity is available for all tasks.
+
+
+<figure markdown style="width: 100%">
+  ![Drift score](../../imgs/monitoring/drift-explainability/score.svg)
+  <figcaption style="width: 100%; text-align: center;">Example of a drift score plot with detection events of increasing severity displayed.</figcaption>
+</figure>
+
+[Monitoring]: index.md
+[Detection Events]: detection_event.md
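
The `Drift Score` entity in `drift_explainability.md` is described as a statistical distance between a sliding window of production data and the reference data, compared against a threshold. The sketch below illustrates that idea on a single numeric feature. The page does not say which distance the Platform uses, so the two-sample Kolmogorov-Smirnov statistic is a stand-in chosen for illustration, and all function names, the window size, and the threshold are hypothetical.

```python
from bisect import bisect_right
from typing import List, Optional, Sequence


def ks_distance(reference: Sequence[float], window: Sequence[float]) -> float:
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the empirical CDFs."""
    ref = sorted(reference)
    win = sorted(window)
    gap = 0.0
    # The gap between two step functions can only change at observed values.
    for x in ref + win:
        cdf_ref = bisect_right(ref, x) / len(ref)
        cdf_win = bisect_right(win, x) / len(win)
        gap = max(gap, abs(cdf_ref - cdf_win))
    return gap


def drift_scores(
    reference: Sequence[float], production: Sequence[float], window_size: int
) -> List[float]:
    """Score every full window of `window_size` consecutive production samples."""
    return [
        ks_distance(reference, production[end - window_size:end])
        for end in range(window_size, len(production) + 1)
    ]


def first_exceedance(scores: Sequence[float], threshold: float) -> Optional[int]:
    """Index of the first score strictly above the threshold, or None."""
    for index, score in enumerate(scores):
        if score > threshold:
            return index
    return None


reference = [0.0, 1.0, 2.0, 3.0]
# The distribution changes from the fifth production sample onwards.
production = [0.0, 1.0, 2.0, 3.0, 10.0, 11.0, 12.0, 13.0]

scores = drift_scores(reference, production, window_size=4)
print(scores)                         # [0.0, 0.25, 0.5, 0.75, 1.0]
print(first_exceedance(scores, 0.6))  # 3
```

In this toy series the score crosses the threshold only once most of the window lies after the change, a few samples later than the change itself. This lag is one reason why moving a drift event back in time, as the page describes, can place it before the threshold crossing.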