title | tags | metaDescription | redirects | freshnessValidatedDate | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Infrastructure hosts UI |
|
For New Relic infrastructure monitoring, use the Hosts UI page to view the most important metrics from across your server/host environment. |
|
never |
import infrastructureHostsUiDiagram from 'images/infrastructure_screenshot-crop_infrastructure-hosts-ui-diagram.webp'
import infrastructureHostsNewSummaryAnimated from 'images/infrastructure_screenshot-crop_host-page-tiles.gif'
import infrastructureHostsUiOptions from 'images/infrastructure_screenshot-crop_hosts-ui-options.webp'
import infrastructureMainHostsUi from 'images/infrastructure_screenshot-crop_infrastructure-main-hosts-ui.webp'
import infrastructureHostNewSystem from 'images/infrastructure_screenshot-full_host-new-system.webp'
import infrastructureHostNewNetwork from 'images/infrastructure_screenshot-full_host-new-network.webp'
import infrastructureHostNewProcesses from 'images/infrastructure_screenshot-full_host-new-processes.webp'
import infrastructureHostNewStorage from 'images/infrastructure_screenshot-full_host-new-storage.webp'
import infrastructureHostNewContainers from 'images/infrastructure_screenshot-full_host-new-containers.webp'
In our infrastructure monitoring UI, the Hosts UI page is the main place for you to understand and troubleshoot issues with your infrastructure.
These docs describe the new **Hosts** UI added in April of 2023. You can switch between the new view and the **Classic** view. **[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure**The Hosts page is the place to go to investigate issues in your infrastructure. Here's one example of a common workflow our customers use to diagnose a problem:
- On the APM summary page, you notice an issue with your Kafka service. Throughput has dropped and many errors are present.
- Based on the infrastructure data you see in APM, you think it's a host issue so you click through to the infrastructure Hosts page.
- The Hosts page automatically filters to your Kafka service. You see that half of the hosts running that service have maxed out their CPU %.
- If you suspect a host configuration change could be the source of your issue, go to the Inventory tab and check your system's configurations per host.
There are some important UI features available across the Hosts UI. Understanding how to use these will help you get the most out of this UI.
- A. The filter bar: Filter for the hosts you want to see. For tips, see Filter bar.
- B. Saved views: When you create a view you like (which can include filters, and custom chart and table selections), click Save.
- C. Chart options: You can switch to different metrics for each chart, and you can click
...
to see more chart options, including getting a shareable link and creating an alert for that metric. - D. Group by: Use this to group chart results by specific attributes. For example, you might display the AWS regions with the highest CPU usage grouped by
awsRegion
. - E. Attribute filter: This lets you filter your hosts to those with a specific attribute value. For example,
- F. Host table: By default, 25 hosts are selected for display. Use the table columns and the attribute filter to explore the hosts, and select different hosts for display. You can export the table as a CSV.
- G. Host details: For a specific host in the host table, click the icon to see more options to learn about it, including options for viewing logs, seeing an entity map, and more.
Here are more details about the tabs available on the Hosts UI page:
On the Hosts page, the Summary tab gives you an overview of your hosts' performance.
**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure**The tiles at the top of this page give you an overview of your hosts, applications, events, and alerts. Use these tiles to see more detail about a host or filter the entire Summary page to a specific host.
The time picker on the Summary page only adjusts the metrics tiles and charts but does not adjust the host table. The other tabs in the Hosts UI have tables that reflect the time range set by the time picker.
<Collapser
id="system"
title="System UI"
>
The System tab gives you an overview of your system. It's somewhat similar to the Summary tab but allows you to search for more in-depth details about some aspects of your system.
**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > System**Here are the default charts shown on the System page:
- CPU %: This is a metric derived from several other metrics on the
SystemSample
event. Specifically, thecpuPercent
attribute is an aggregation ofcpuUserPercent
,cpuSystemPercent
,cpuIoWaitPercent
andcpuStealPercent
. - Load average five minute: represents an average of the number of tasks waiting to do work on a system over the last 5 minutes.
- Memory free %: compares the amount of free memory bytes to the amount of used memory bytes.
For details about the data used to generate these charts, see SystemSample.
</Collapser>
<Collapser
id="network"
title="Network UI"
>
The Network page provides real-time visibility into the health and performance of individual hosts, web servers, or other groups of resources across your network. Default charts show bandwidth metrics by packet, bandwidth by bytes, and errors per second.
<img
title="Hosts Network page"
alt="Hosts Network page"
src={infrastructureHostNewNetwork}
/>
<figcaption>
<DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Network**</DoNotTranslate>
</figcaption>
Details about individual interfaces can help you:
- Examine saturation levels across your network or for selected hosts.
- Compare load balances between specific resources.
- Identify unexpected differences in capacity patterns between similar hosts.
- Evaluate the top five network errors detected.
This real-time network data can then help you determine whether to resolve errors by reconfiguring your network, rebalancing the loads, or taking other preventative maintenance actions before needing to make a purchase decision.
For details about the data used to generate these charts, see NetworkSample.
</Collapser>
<Collapser
id="processes"
title="Processes UI"
>
Use the Processes tab to get information about processes running on your hosts, and to set alerts on process metrics. The Processes tab shows data such as CPU percentage, I/O bytes, and memory usage for individual processes or groupings of processes.
<img
title="Hosts Processes page"
alt="Hosts Processes page"
src={infrastructureHostNewProcesses}
/>
<figcaption>
<DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Processes**</DoNotTranslate>
</figcaption>
On the Processes tab, CPU percent is scoped to individual processes, rather than hosts. Because of this, the CPU percent metric does not take into account the resources of the entire system. Instead, it shows how much of a single CPU core each process is taking.
For details about the data used to generate these charts, see ProcessSample.
Here's an example of how to pinpoint an issue and set an alert:
- You notice on the Hosts page that a cluster has a CPU percentage much higher than normal.
- You go to the Processes page and filter down to that cluster's processes.
- You notice that several instances of the same process have excessive CPU percentage.
- After fixing the issue, you decide to create an alert condition that triggers when the CPU percentage for that process type exceeds a certain threshold.
</Collapser>
<Collapser
id="storage"
title="Storage UI"
>
<img
title="Hosts Storage page"
alt="Hosts Storage page"
src={infrastructureHostNewStorage}
/>
<figcaption>
<DoNotTranslate>**[one.newrelic.com > All capabilities](https://one.newrelic.com/all-capabilities) > Infrastructure > Hosts > Storage**</DoNotTranslate>
</figcaption>
The Storage tab allows you to monitor your resources' capacity and efficiency, including your devices' overall utilization, disk usage, or I/O operations. This can help you to:
- Examine unexpected patterns; for example, a cluster of mirrored machines that do not process I/O tasks uniformly.
- Monitor usage levels before disk space completely runs out.
- Set alert conditions to notify you about problems with one or more hosts; for example, long processing times for read or write operations, disk availability or utilization based on percentage or bytes, etc.
- Make informed decisions about where to redistribute hosts with higher or lower than normal processing requests.
- Use data to help plan for additions or changes to your infrastructure budget before an emergency arises.
For details about the data used to generate these charts, see StorageSample.
</Collapser>
<Collapser
id="containers"
title="Containers"
>
Use the Containers tab to get information about the containers running on your hosts, and to set alerts on container metrics. This tab shows data such as CPU percentage, I/O bytes, and memory usage for individual containers or groupings of containers. Use this information to:
- Identify containers that are experiencing performance issues.
- Create for conditions such as CPU percentage and memory usage.
</Collapser>
To learn more about infrastructure data, see Infrastructure data.