Not all agents get disconnected in `test_shutdown_message` #5199

juliamagan · 2024-04-10T11:46:47Z

Description

During the system tests launched for 4.8.0 Beta 5 at wazuh/wazuh#22824, it has been found that not all agents go offline:

E       AssertionError: assert 33 == 40
E        +  where 33 = len(['Disconnected', 'Disconnected', 'Disconnected', 'Disconnected', 'Disconnected', 'Disconnected', ...])

This test has been modified recently, so we should check if the waiting time for the check is as expected, because if it is, even if no errors appear in the managers, it could indicate some kind of performance error. After all, after several executions, the error seems consistent.

The text was updated successfully, but these errors were encountered:

juliamagan · 2024-04-18T16:43:30Z

After talking to @TomasTurina, it was found that when the agent stops it sends HC_SHUTDOWN to the manager, which immediately shows the agent as Disconnected. However, reviewing the logs, it has been seen that the manager receives 50~52 shutdown messages when there are only 40 agents. We need to check if there are old messages or if some messages are being duplicated. Also, with thread.join() it waits for all the agents to be stopped, so all the agents should appear as Disconnected.

juliamagan · 2024-04-19T15:21:03Z

By monitoring the logs and the agent statuses, we have been able to see that the test started when there were agents that were not yet Active, which could affect the results. The necessary logic has been added to avoid this, but it is being tested to see how much time is needed for all the agents to be active.

juliamagan · 2024-04-22T07:41:41Z

On hold due to Beta 6 testing

juliamagan · 2024-04-29T10:32:44Z

With the proposed solution, the test passes without problem when launched individually, but when all tests in the environment are launched it fails. We are checking if the environment is dirty from the previous tests, but these tests take 1:40h, which makes it very slow to debug.

juliamagan · 2024-04-30T16:39:51Z

Finally, it was found that the environment was dirty and was not registered in the expected manager. It remains to upload the results of the complete test set to ensure that it does not fail.

juliamagan added level/task Task issue type/bug labels Apr 10, 2024

This was referenced Apr 10, 2024

Release 4.8.0 - Beta 5 - System tests wazuh/wazuh#22824

Closed

Release 4.8.0 - Beta 5 wazuh/wazuh#22777

Closed

juliamagan self-assigned this Apr 17, 2024

This was referenced Apr 23, 2024

Release 4.8.0 - Beta 6 - System tests wazuh/wazuh#23056

Closed

Fix shutdown messages system test #5298

Merged

juliamagan linked a pull request Apr 26, 2024 that will close this issue

Fix shutdown messages system test #5298

Merged

This was referenced May 2, 2024

An agent starting automatically for no apparent reason wazuh/wazuh#23221

Closed

Release 4.8.0 - RC 1 - System tests wazuh/wazuh#23297

Closed

juliamagan closed this as completed May 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not all agents get disconnected in `test_shutdown_message` #5199

Not all agents get disconnected in `test_shutdown_message` #5199

juliamagan commented Apr 10, 2024

juliamagan commented Apr 18, 2024

juliamagan commented Apr 19, 2024

juliamagan commented Apr 22, 2024

juliamagan commented Apr 29, 2024

juliamagan commented Apr 30, 2024

Not all agents get disconnected in test_shutdown_message #5199

Not all agents get disconnected in test_shutdown_message #5199

Comments

juliamagan commented Apr 10, 2024

Description

juliamagan commented Apr 18, 2024

juliamagan commented Apr 19, 2024

juliamagan commented Apr 22, 2024

juliamagan commented Apr 29, 2024

juliamagan commented Apr 30, 2024

Not all agents get disconnected in `test_shutdown_message` #5199

Not all agents get disconnected in `test_shutdown_message` #5199