Skip to content

Commit f43b459

Browse files
committed
Updated troubleshooting panel docs for COO 1.3 release.
1 parent c0a73b7 commit f43b459

7 files changed

+235
-107
lines changed
137 KB
Loading

images/coo-troubleshooting-panel-graph.svg

Lines changed: 207 additions & 0 deletions
Loading
85.4 KB
Loading
364 KB
Loading

modules/coo-troubleshooting-ui-plugin-creating-alert.adoc

Lines changed: 0 additions & 46 deletions
This file was deleted.

modules/coo-troubleshooting-ui-plugin-install.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,4 +24,4 @@ metadata:
2424
name: troubleshooting-panel
2525
spec:
2626
type: TroubleshootingPanel
27-
----
27+
----

modules/coo-troubleshooting-ui-plugin-using.adoc

Lines changed: 27 additions & 60 deletions
Original file line numberDiff line numberDiff line change
@@ -5,81 +5,48 @@
55
:_mod-docs-content-type: PROCEDURE
66
[id="coo-troubleshooting-ui-plugin-using_{context}"]
77
= Using the {coo-full} troubleshooting UI plugin
8-
9-
include::snippets/unified-perspective-web-console.adoc[]
8+
include::_attributes/common-attributes.adoc[]
9+
:icons: font
1010

1111
.Prerequisites
1212
* You have access to the {ocp-product-title} cluster as a user with the `cluster-admin` cluster role. If your cluster version is 4.17+, you can access the troubleshooting UI panel from the Application Launcher {launch}.
1313
* You have logged in to the {ocp-product-title} web console.
14-
* You have installed {ocp-product-title} Logging, if you want to visualize correlated logs.
15-
* You have installed {ocp-product-title} Network Observability, if you want to visualize correlated netflows.
1614
* You have installed the {coo-full}.
1715
* You have installed the {coo-full} troubleshooting UI plugin.
18-
+
16+
1917
[NOTE]
2018
====
21-
The troubleshooting panel relies on the observability signal stores installed in your cluster.
22-
Kuberenetes resources, alerts and metrics are always available by default in an {ocp-product-title} cluster.
23-
Other signal types require optional components to be installed:
19+
The troubleshooting panel can display additional signals when the following Red Hat provided components are be installed in the installer-recommended namespace.
2420
25-
* **Logs:** Red Hat Openshift Logging (collection) and Loki Operator provided by Red Hat (store)
26-
* **Network events:** Network observability provided by Red Hat (collection) and Loki Operator provided by Red Hat (store)
21+
* **Logs:** `{ocp-product-title} Logging` (collection) and `{ocp-product-title} Loki Operator` (store)
22+
* **Network events:**` {ocp-product-title} Network observability` (collection) and `{ocp-product-title} Loki Operator` (store)
23+
* **Traces:** `{ocp-product-title} Distributed Tracing` (collection and store)
2724
====
28-
.Procedure
2925

30-
. In the web console, go to *Observe* -> *Alerting* and then select an alert. If the alert has correlated items, a **Troubleshooting Panel** link will appear above the chart on the alert detail page.
31-
+
32-
image::coo-troubleshooting-panel-link.png[Troubleshooting Panel link]
33-
+
34-
Click on the **Troubleshooting Panel** link to display the panel.
35-
. The panel consists of query details and a topology graph of the query results. The selected alert is converted into a Korrel8r query string and sent to the `korrel8r` service.
36-
The results are displayed as a graph network connecting the returned signals and resources. This is a _neighbourhood_ graph, starting at the current resource and including related objects up to 3 steps away from the starting point.
37-
Clicking on nodes in the graph takes you to the corresponding web console pages for those resouces.
38-
. You can use the troubleshooting panel to find resources relating to the chosen alert.
39-
+
4026
[NOTE]
4127
====
42-
Clicking on a node may sometimes show fewer results than indicated on the graph. This is a known issue that will be addressed in a future release.
28+
Users with `ClusterRole/view` (for example `kube:admin`) will get the most complete results.
29+
Other users will see more restricted graphs, showing only data that they have permission to see.
4330
====
44-
+
45-
image::coo-troubleshooting-panel-graph.png[Troubleshooting panel]
46-
[arabic]
47-
.. **Alert (1):** This node is the starting point in the graph and represents the `KubeContainerWaiting` alert displayed in the web console.
48-
49-
.. **Pod (1):** This node indicates that there is a single `Pod` resource associated with this alert. Clicking on this node will open a console search showing the related pod directly.
50-
51-
.. **Event (2):** There are two Kuberenetes events associated with the pod. Click this node to see the events.
52-
53-
.. **Logs (74):** This pod has 74 lines of logs, which you can access by clicking on this node.
54-
55-
.. **Metrics (105):** There are many metrics associated with the pod.
56-
57-
.. **Network (6):** There are network events, meaning the pod has communicated over the network. The remaining nodes in the graph represent the `Service`, `Deployment` and `DaemonSet` resources that the pod has communicated with.
5831

59-
.. **Focus:** Clicking this button updates the graph. By default, the graph itself does not change when you click on nodes in the graph. Instead, the main web console page changes, and you can then navigate to other resources using links on the page, while the troubleshooting panel itself stays open and unchanged. To force an update to the graph in the troubleshooting panel, click **Focus**. This draws a new graph, using the current resource in the web console as the starting point.
32+
.Procedure
33+
The **Troubleshooting Panel** is a side-panel that shows a graph of resources and signals related to the information displayed in the main console.
34+
You can use the troubleshooting panel on most console pages - resources, alerts, metrics, logs, traces, network observability and more.
6035

61-
.. **Show Query:** Clicking this button enables some experimental features:
36+
. Open the Application Launcher {launch}, click the **Signal Correlation** menu item.
6237
+
63-
image::coo-troubleshooting-experimental.png[Experimental features]
64-
[arabic]
65-
... **Hide Query** hides the experimental features.
66-
67-
... The query that identifies the starting point for the graph.
68-
The query language, part of the link:https://korrel8r.github.io/korrel8r[Korrel8r] correlation engine used to create the graphs, is experimental and may change in future.
69-
The query is updated by the **Focus** button to correspond to the resources in the main web console window.
70-
71-
... **Neighbourhood depth** is used to display a smaller or larger neighbourhood.
38+
image::coo-troubleshooting-panel-link.png[Open panel]
7239
+
73-
[NOTE]
74-
====
75-
Setting a large value in a large cluster might cause the query to fail, if the number of results is too big.
76-
====
77-
... **Goal class** results in a goal directed search instead of a neighbourhood search. A goal directed search shows all paths from the starting point to the goal class, which indicates a type of resource or signal. The format of the goal class is experimental and may change. Currently, the following goals are valid:
78-
**** `k8s:__RESOURCE[VERSION.[GROUP]]__` identifying a kind of kuberenetes resource. For example `k8s:Pod` or `k8s:Deployment.apps.v1`.
79-
**** `alert:alert` representing any alert.
80-
81-
**** `metric:metric` representing any metric.
82-
83-
**** `netflow:network` representing any network observability network event.
84-
85-
**** `log:__LOG_TYPE__` representing stored logs, where `__LOG_TYPE__` must be one of `application`, `infrastructure` or `audit`.
40+
. Click **Focus** to show a correlation graph starting from the resources or signals in the main console display.
41+
+
42+
image::coo-troubleshooting-panel-graph.png[Troubleshooting panel]
43+
+
44+
<1> **Focus** will re-calculate the graph from the main console at any time.
45+
<2> **Sync** refreshes the graph that is currently displayed.
46+
<3> The root node represents the resource in the main console window.
47+
<4> "First degree" neighbors - directly related to the initial resource.
48+
<5> "Second degree" neighbors - indirectly related via first-degree neighbors.
49+
+
50+
. Clicking on a node opens the data in the main console.
51+
+
52+
image::coo-troubleshooting-panel-log.png[Troubleshooting panel]

0 commit comments

Comments
 (0)