Skip to content
This repository has been archived by the owner on Jun 28, 2023. It is now read-only.

Latest commit

 

History

History
36 lines (31 loc) · 9.2 KB

Container-Insights-metrics-EKS.md

File metadata and controls

36 lines (31 loc) · 9.2 KB

Amazon EKS and Kubernetes Container Insights metrics

The following table lists the metrics and dimensions that Container Insights collects for Amazon EKS and Kubernetes. These metrics are in the ContainerInsights namespace. For more information, see Metrics.

If you do not see any Container Insights metrics in your console, be sure that you have completed the setup of Container Insights. Metrics do not appear before Container Insights has been set up completely. For more information, see Setting up Container Insights.

When you use Container Insights to collect the following metrics, the metrics are charged as custom metrics. For more information about CloudWatch pricing, see Amazon CloudWatch Pricing.

Metric name Dimensions Description
cluster_failed_node_count ClusterName The number of failed worker nodes in the cluster. A node is considered failed if it is suffering from any node conditions. For more information, see Conditions in the Kubernetes documentation.
cluster_node_count ClusterName The total number of worker nodes in the cluster.
namespace_number_of_running_pods Namespace ClusterName ClusterName The number of pods running per namespace in the resource that is specified by the dimensions that you're using.
node_cpu_limit ClusterName The maximum number of CPU units that can be assigned to a single node in this cluster.
node_cpu_reserved_capacity NodeName, ClusterName, InstanceId ClusterName The percentage of CPU units that are reserved for node components, such as kubelet, kube-proxy, and Docker. Formula: node_cpu_request / node_cpu_limit node_cpu_request is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
node_cpu_usage_total ClusterName The number of CPU units being used on the nodes in the cluster.
node_cpu_utilization NodeName, ClusterName, InstanceId ClusterName The total percentage of CPU units being used on the nodes in the cluster. Formula: node_cpu_usage_total / node_cpu_limit
node_filesystem_utilization NodeName, ClusterName, InstanceId ClusterName The total percentage of file system capacity being used on nodes in the cluster. Formula: node_filesystem_usage / node_filesystem_capacity node_filesystem_usage and node_filesystem_capacity are not reported directly as metrics, but are fields in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
node_memory_limit ClusterName The maximum amount of memory, in bytes, that can be assigned to a single node in this cluster.
node_memory_reserved_capacity NodeName, ClusterName, InstanceId ClusterName The percentage of memory currently being used on the nodes in the cluster. Formula: node_memory_request / node_memory_limit node_memory_request is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
node_memory_utilization NodeName, ClusterName, InstanceId ClusterName The percentage of memory currently being used by the node or nodes. It is the percentage of node memory usage divided by the node memory limitation. Formula: node_memory_working_set / node_memory_limit.
node_memory_working_set ClusterName The amount of memory, in bytes, being used in the working set of the nodes in the cluster.
node_network_total_bytes NodeName, ClusterName, InstanceId ClusterName The total number of bytes per second transmitted and received over the network per node in a cluster. Formula: node_network_rx_bytes + node_network_tx_bytes node_network_rx_bytes and node_network_tx_bytes are not reported directly as metrics, but are fields in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
node_number_of_running_containers NodeName, ClusterName, InstanceId ClusterName The number of running containers per node in a cluster.
node_number_of_running_pods NodeName, ClusterName, InstanceId ClusterName The number of running pods per node in a cluster.
pod_cpu_reserved_capacity PodName, Namespace, ClusterName ClusterName The CPU capacity that is reserved per pod in a cluster. Formula: pod_cpu_request / node_cpu_limit pod_cpu_request is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_cpu_utilization PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The percentage of CPU units being used by pods. Formula: pod_cpu_usage_total / node_cpu_limit pod_cpu_usage_total is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_cpu_utilization_over_pod_limit PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The percentage of CPU units being used by pods relative to the pod limit. Formula: pod_cpu_usage_total / pod_cpu_limit pod_cpu_usage_total and pod_cpu_limit are not reported directly as metrics, but are fields in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_memory_reserved_capacity PodName, Namespace, ClusterName ClusterName The percentage of memory that is reserved for pods. Formula: pod_memory_request / node_memory_limit pod_memory_request is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_memory_utilization PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The percentage of memory currently being used by the pod or pods. Formula: pod_memory_working_set / node_memory_limit pod_memory_working_set is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_memory_utilization_over_pod_limit PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The percentage of memory that is being used by pods relative to the pod limit. If any containers in the pod don't have a memory limit defined, this metric doesn't appear. Formula: pod_memory_working_set / pod_memory_limit pod_memory_working_set is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_network_rx_bytes PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The number of bytes per second being received over the network by the pod. Formula: sum(pod_interface_network_rx_bytes) pod_interface_network_rx_bytes is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_network_tx_bytes PodName, Namespace, ClusterName Namespace, ClusterName Service, Namespace, ClusterName ClusterName The number of bytes per second being transmitted over the network by the pod. Formula: sum(pod_interface_network_tx_bytes) pod_interface_network_tx_bytes is not reported directly as a metric, but is a field in performance log events. For more information, see Relevant fields in performance log events for Amazon EKS and Kubernetes.
pod_number_of_container_restarts PodName, Namespace, ClusterName The total number of container restarts in a pod.
service_number_of_running_pods Service, Namespace, ClusterName ClusterName The number of pods running the service or services in the cluster.