The entire Elastic Fleet went Unhealthy after changing Fleet settings. Any ideas how I can make it work again? #15100
-
|
Version Installation Method Description Installation Type Location Hardware Specs CPU RAM Storage for / Storage for /nsm Network Traffic Collection Network Traffic Speeds Status Salt Status Logs I made a terrible mistake changing Fleet server hosts -> grid-default and Outputs -> grid-logstash & so-manager_elasticsearch hosts. And now my whole Elastic fleet: FleetServer-sec-on, sec-on (so-grid-nodes_general) + 18 elastic agents (endpoints initial) in Unhealthy state. Any ideas, how can I fix this? I brought all setting back, but it didn't help. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
Went ahead and created an issue #15101 To fix this you should be able to take the cert from your standalone and paste it back into the fleet output policy. From your manager run Go back into the logstash fleet output policy and paste that into the 'Client SSL certificate key' section. Save the policy and check your agents are coming back into healthy state. If this happens again you can repeat the cert steps and disable updates to the fleet output policy from happening automatically within SOC -> Administration -> config (hit options and toggle on advanced settings) set that to false and save Where did the need to modify the policy hosts directly come from? On a standalone you only have 1 host to receive the logs (the standalone itself) if you wanted to add an fqdn for the standalone the proper way would be https://docs.securityonion.net/en/2.4/elastic-fleet.html#custom-fqdn-url because the fleet certs need to be updated (they automatically update in the background for you & make sure the endpoints DNS can resolve the custom fqdn) |
Beta Was this translation helpful? Give feedback.
Went ahead and created an issue #15101
To fix this you should be able to take the cert from your standalone and paste it back into the fleet output policy. From your manager run
sudo cat /etc/pki/elasticfleet-logstash.keythen copy that entire output should look something likeGo back into the logstash fleet output policy and paste that into the 'Client SSL certificate key' section. Save the policy and check your agents are coming back into healthy state.
If this happens again you can repeat the cert steps and disable updates to the fleet output policy from happening automatically
within SOC -> Administration -> config (hit …