Alerts on restored agent's health #1826

tnesztler · 2018-10-22T14:09:01Z

Description of Issue

The next section assumes you're subscribed to the alerts topics (see here).
For an agent using the health subsystem (such as all historian agents based on the BaseHistorian class), if the health goes from GOOD to BAD (for example due to this case), a message is published as expected.
However, whenever the agent starts publishing again, the health is restored to GOOD but no message is sent stating so.

One use case is an agent publishing the status of selected agents to a Slack channel. It is impossible to know if the agent's status is still BAD or was restored.

Affected Version

5.1 and up

Screenshots

In this screenshot, data is published every 5 minutes.

Expected

Message being published to the alerts base topic or "subtopics" at each change of the health of an agent using the health subsystem.

Actual

Only degraded health is reported to the alerts topic.

Steps to Reproduce

Use an agent that is based on the BaseHistorian agent and subscribe to alerts.
Prevent the historian to publish data. Wait for its health to go from GOOD to BAD. A new message is published.
Let the historian publish again. The health should go from BAD to GOOD again after the data backlog have been processed. No message is being published once the health is restored.

The text was updated successfully, but these errors were encountered:

schandrika · 2018-12-26T23:19:34Z

Fixed as part of #1846. If status becomes bad after the initial setup phase, say a connection failure when trying to write device data to database, then once the connection is back up, database write will happen and status of agent will change to GOOD. If connection failure happens at the time of startup (agent init, startup) then alert is sent to user and user need to fix the issue and restart the agent.

schandrika self-assigned this Nov 21, 2018

schandrika closed this as completed Dec 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alerts on restored agent's health #1826

Alerts on restored agent's health #1826

tnesztler commented Oct 22, 2018 •

edited

schandrika commented Dec 26, 2018

Alerts on restored agent's health #1826

Alerts on restored agent's health #1826

Comments

tnesztler commented Oct 22, 2018 • edited

Description of Issue

Affected Version

Screenshots

Expected

Actual

Steps to Reproduce

schandrika commented Dec 26, 2018

tnesztler commented Oct 22, 2018 •

edited