---
title: Infrastructure hosts UI
tags:
  - Infrastructure
  - Infrastructure monitoring UI
  - Infrastructure UI
  - Hosts page
metaDescription: For New Relic infrastructure monitoring, use the Hosts UI page to view the most important metrics from across your server/host environment.
redirects:
  - /docs/infrastructure/infrastructure-ui-pages/hosts-new-view
  - /docs/infrastructure/infrastructure-ui-pages/hosts-new-view/infrastructure-ui-entities
  - /docs/infrastructure/new-relic-infrastructure/filter-group/group-infrastructure-results-specific-attributes
  - /docs/infrastructure-group-results-specific-attributes
  - /docs/infrastructure/new-relic-infrastructure/scope-filter/group-infrastructure-results-specific-attributes
  - /docs/infrastructure/infrastructure-ui-pages/classic-infra-ui
  - /docs/infrastructure/new-relic-infrastructure/filter-group/filter-sets-organize-your-infrastructure-hosts
  - /docs/scopes-organize-your-infrastructure
  - /docs/infrastructure-organize-your-resources-scopes
  - /docs/infrastructure-overview-page
  - /docs/infrastructure/new-relic-infrastructure/infrastructure-ui-pages/infrastructure-overview-page-check-health-all-your-systems
  - /docs/infrastructure/new-relic-infrastructure/scope-filter/organize-infrastructure-hosts-scopes
  - /docs/infrastructure/new-relic-infrastructure/filter-group/organize-infrastructure-hosts-filter-sets
  - /docs/infrastructure/infrastructure-ui-pages/infrastructure-ui-entities
  - /docs/infrastructure/infrastructure-ui-pages/infrastructure-hosts-page
freshnessValidatedDate: never
---

import infrastructureHostsUiDiagram from 'images/infrastructure_screenshot-crop_infrastructure-hosts-ui-diagram.webp'

import infrastructureHostsNewSummaryAnimated from 'images/infrastructure_screenshot-crop_host-page-tiles.gif'

import infrastructureHostsUiOptions from 'images/infrastructure_screenshot-crop_hosts-ui-options.webp'

import infrastructureMainHostsUi from 'images/infrastructure_screenshot-crop_infrastructure-main-hosts-ui.webp'

import infrastructureHostNewSystem from 'images/infrastructure_screenshot-full_host-new-system.webp'

import infrastructureHostNewNetwork from 'images/infrastructure_screenshot-full_host-new-network.webp'

import infrastructureHostNewProcesses from 'images/infrastructure_screenshot-full_host-new-processes.webp'

import infrastructureHostNewStorage from 'images/infrastructure_screenshot-full_host-new-storage.webp'

import infrastructureHostNewContainers from 'images/infrastructure_screenshot-full_host-new-containers.webp'

In our infrastructure monitoring UI, the Hosts UI page is the main place for you to understand and troubleshoot issues with your infrastructure.

These docs describe the new **Hosts** UI added in April of 2023. You can switch between the new view and the **Classic** view.

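<img
    title="Infrastructure UI - Main hosts UI"
    alt="Infrastructure UI - Main hosts UI"
    src={infrastructureMainHostsUi}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure**</DoNotTranslate>
</figcaption>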

## An example use case [#use-case]

The Hosts page is the place to go to investigate issues in your infrastructure. Here's one example of a common workflow our customers use to diagnose a problem:

  1. On the APM summary page, you notice an issue with your Kafka service. Throughput has dropped and many errors are present.
  2. Based on the infrastructure data you see in APM, you think it's a host issue so you click through to the infrastructure Hosts page.
  3. The Hosts page automatically filters to your Kafka service. You see that half of the hosts running that service have maxed out their CPU %.
  4. If you suspect a host configuration change could be the source of your issue, go to the Inventory tab and check your system's configurations per host.

## UI features for exploring and analyzing your hosts [#ui-features]

There are some important UI features available across the Hosts UI. Understanding how to use these will help you get the most out of this UI.

<img
    title="Infrastructure UI - Hosts UI options"
    alt="Infrastructure UI - Hosts UI options"
    src={infrastructureHostsUiOptions}
/>

  • A. The filter bar: Filter for the hosts you want to see. For tips, see Filter bar.
  • B. Saved views: When you create a view you like (which can include filters, and custom chart and table selections), click Save.
  • C. Chart options: You can switch to different metrics for each chart, and you can click ... to see more chart options, including getting a shareable link and creating an alert for that metric.
  • D. Group by: Use this to group chart results by specific attributes. For example, you might group by awsRegion to see which AWS regions have the highest CPU usage.
  • E. Attribute filter: This lets you filter your hosts to those with a specific attribute value. For example,
  • F. Host table: By default, 25 hosts are selected for display. Use the table columns and the attribute filter to explore the hosts, and select different hosts for display. You can export the table as a CSV.
  • G. Host details: For a specific host in the host table, click the icon to see more options to learn about it, including options for viewing logs, seeing an entity map, and more.

## Tabs on the Hosts UI page [#host-views]

Here are more details about the tabs available on the Hosts UI page:

On the Hosts page, the Summary tab gives you an overview of your hosts' performance.

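<img
    title="Hosts Summary page"
    alt="Hosts Summary page"
    src={infrastructureHostsNewSummaryAnimated}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure**</DoNotTranslate>
</figcaption>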

The tiles at the top of this page give you an overview of your hosts, applications, events, and alerts. Use these tiles to see more detail about a host or filter the entire Summary page to a specific host.

On the Summary page, the time picker adjusts only the metric tiles and charts; it does not adjust the host table. On the other tabs in the Hosts UI, the tables do reflect the time range set by the time picker.

<Collapser
id="system"
title="System UI"
>

The System tab gives you an overview of your system. It's somewhat similar to the Summary tab but allows you to search for more in-depth details about some aspects of your system.

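<img
    title="Hosts System page"
    alt="Hosts System page"
    src={infrastructureHostNewSystem}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > System**</DoNotTranslate>
</figcaption>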

Here are the default charts shown on the System page:

  • CPU %: A metric derived from several other metrics on the SystemSample event. Specifically, the cpuPercent attribute is an aggregation of cpuUserPercent, cpuSystemPercent, cpuIoWaitPercent, and cpuStealPercent.
  • Load average five minute: The average number of tasks waiting to do work on a system over the last 5 minutes.
  • Memory free %: Compares the amount of free memory bytes to the amount of used memory bytes.

For details about the data used to generate these charts, see SystemSample.
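
As a rough illustration of the chart definitions above, the sketch below derives both percentages from a single sample. The `sample` dictionary and its values are invented for illustration. Treating the CPU aggregation as a plain sum, and total memory as free plus used bytes, are assumptions made for this sketch, not a description of how the agent computes these attributes.

```python
# Hypothetical SystemSample-like record; all values are invented.
sample = {
    "cpuUserPercent": 12.5,
    "cpuSystemPercent": 4.0,
    "cpuIoWaitPercent": 1.5,
    "cpuStealPercent": 0.0,
    "memoryFreeBytes": 2_000_000_000,
    "memoryUsedBytes": 6_000_000_000,
}

# The four component attributes named in the CPU % description.
CPU_COMPONENTS = (
    "cpuUserPercent",
    "cpuSystemPercent",
    "cpuIoWaitPercent",
    "cpuStealPercent",
)


def cpu_percent(sample):
    """Aggregate the CPU components (assumed here to be a simple sum)."""
    return sum(sample[name] for name in CPU_COMPONENTS)


def memory_free_percent(sample):
    """Free memory as a percentage of free plus used memory (assumption)."""
    free = sample["memoryFreeBytes"]
    used = sample["memoryUsedBytes"]
    return 100.0 * free / (free + used)


print(cpu_percent(sample))          # 18.0
print(memory_free_percent(sample))  # 25.0
```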

</Collapser>

<Collapser
id="network"
title="Network UI"
>

The Network page provides real-time visibility into the health and performance of individual hosts, web servers, or other groups of resources across your network. Default charts show bandwidth metrics by packet, bandwidth by bytes, and errors per second.

<img
    title="Hosts Network page"
    alt="Hosts Network page"
    src={infrastructureHostNewNetwork}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Network**</DoNotTranslate>
</figcaption>

Details about individual interfaces can help you:

  • Examine saturation levels across your network or for selected hosts.
  • Compare load balances between specific resources.
  • Identify unexpected differences in capacity patterns between similar hosts.
  • Evaluate the top five network errors detected.

This real-time network data can then help you determine whether to resolve errors by reconfiguring your network, rebalancing the loads, or taking other preventative maintenance actions before needing to make a purchase decision.

For details about the data used to generate these charts, see NetworkSample.

</Collapser>

<Collapser
id="processes"
title="Processes UI"
>
By default, the infrastructure agent doesn't send [data about the operating system's processes](/attribute-dictionary/?event=ProcessSample) unless you use guided install. To enable sending process data, set [`enable_process_metrics`](/docs/infrastructure/install-infrastructure-agent/configuration/infrastructure-agent-configuration-settings/) to `true`. To fine-tune which processes you want to monitor, configure [`include_matching_metrics`](/docs/infrastructure/install-configure-manage-infrastructure/configuration/infrastructure-configuration-settings#include-matching-metrics).

Use the Processes tab to get information about processes running on your hosts, and to set alerts on process metrics. The Processes tab shows data such as CPU percentage, I/O bytes, and memory usage for individual processes or groupings of processes.

<img
    title="Hosts Processes page"
    alt="Hosts Processes page"
    src={infrastructureHostNewProcesses}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Processes**</DoNotTranslate>
</figcaption>

On the Processes tab, CPU percent is scoped to individual processes, rather than hosts. Because of this, the CPU percent metric does not take into account the resources of the entire system. Instead, it shows how much of a single CPU core each process is taking.
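
To make the per-core scoping concrete, here is a minimal sketch that converts a process-level CPU percent into an approximate share of the whole host. The function name, the core count, and the sample value are hypothetical. The conversion assumes that 100% corresponds to one fully used core.

```python
def host_share_percent(process_cpu_percent, core_count):
    """Approximate share of total host CPU capacity used by one process.

    Assumes process_cpu_percent is relative to a single core, so a
    multithreaded process can report more than 100.
    """
    if core_count <= 0:
        raise ValueError("core_count must be positive")
    return process_cpu_percent / core_count


# A process using one and a half cores on a hypothetical 8-core host.
print(host_share_percent(150.0, 8))  # 18.75
```

Under this assumption, a process that looks alarming at 150% may account for less than a fifth of the capacity of a larger host, so compare it against the host-level CPU % before drawing conclusions.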

For details about the data used to generate these charts, see ProcessSample.

Here's an example of how to pinpoint an issue and set an alert:

  1. You notice on the Hosts page that a cluster has a CPU percentage much higher than normal.
  2. You go to the Processes page and filter down to that cluster's processes.
  3. You notice that several instances of the same process have excessive CPU percentage.
  4. After fixing the issue, you decide to create an alert condition that triggers when the CPU percentage for that process type exceeds a certain threshold.

You cannot start, stop, or restart processes from the UI. Instead, follow standard procedures to [start, stop, check, or restart](/docs/infrastructure/new-relic-infrastructure/configuration/start-stop-restart-check-infrastructure-agent-status) the infrastructure agent manually.
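
The logic behind the example workflow in this section can be sketched in a few lines: group process samples by name, then flag any group whose average CPU percent exceeds a threshold. The sample records, the way `processDisplayName` and `cpuPercent` are used here, and the threshold value are illustrative assumptions. In practice you would define the alert condition in New Relic rather than in your own code.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical ProcessSample-like records; values are invented.
samples = [
    {"processDisplayName": "java", "cpuPercent": 95.0},
    {"processDisplayName": "java", "cpuPercent": 85.0},
    {"processDisplayName": "nginx", "cpuPercent": 5.0},
]


def processes_over_threshold(samples, threshold):
    """Return the average CPU percent per process name, keeping only
    the names whose average exceeds the threshold."""
    by_name = defaultdict(list)
    for record in samples:
        by_name[record["processDisplayName"]].append(record["cpuPercent"])
    averages = {name: mean(values) for name, values in by_name.items()}
    return {name: avg for name, avg in averages.items() if avg > threshold}


print(processes_over_threshold(samples, 80.0))  # {'java': 90.0}
```
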
</Collapser>

<Collapser
id="storage"
title="Storage UI"
>
<img
    title="Hosts Storage page"
    alt="Hosts Storage page"
    src={infrastructureHostNewStorage}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Storage**</DoNotTranslate>
</figcaption>

The Storage tab allows you to monitor your resources' capacity and efficiency, including your devices' overall utilization, disk usage, or I/O operations. This can help you to:

  • Examine unexpected patterns; for example, a cluster of mirrored machines that do not process I/O tasks uniformly.
  • Monitor usage levels before disk space completely runs out.
  • Set alert conditions to notify you about problems with one or more hosts; for example, long processing times for read or write operations, or disk availability or utilization based on percentage or bytes.
  • Make informed decisions about where to redistribute hosts with higher or lower than normal processing requests.
  • Use data to help plan for additions or changes to your infrastructure budget before an emergency arises.

For details about the data used to generate these charts, see StorageSample.

</Collapser>

<Collapser
id="containers"
title="Containers"
>

Use the Containers tab to get information about the containers running on your hosts, and to set alerts on container metrics. This tab shows data such as CPU percentage, I/O bytes, and memory usage for individual containers or groupings of containers. Use this information to:

  • Identify containers that are experiencing performance issues.
  • Create alerts for conditions such as CPU percentage and memory usage.

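<img
    title="Hosts Containers page"
    alt="Hosts Containers page"
    src={infrastructureHostNewContainers}
/>

<figcaption>
    <DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts (New) > Containers**</DoNotTranslate>
</figcaption>
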
</Collapser>

## Infrastructure data [#data]

To learn more about infrastructure data, see Infrastructure data.