-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[alerting] add user facing doc on event log ILM policy #92736
Conversation
resolves elastic#82435 I added this to the alert/actions setting doc, as that seemed like the best place for now. Just provided a brief description, name of the policy, mentioned we create it but never modify it, provided the default values, and mentioned it could be updated by customers for their environment. Not sure we want to provide more info than that.
2a662bd
to
a19c0d9
Compare
Pinging @elastic/kibana-alerting-services (Team:Alerting Services) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The new text doesn't seem to fit in the settings doc. Is there another place it might go?
[[event-log-ilm]] | ||
==== Event log index lifecycle managment | ||
|
||
Alerts and actions log their activity in a set of "event log" indices. These indices are configured with an index lifecycle managment (ILM) policy which can be customized. The default policy will perform a rollover of the index when it reaches 50GB in size, or after 30 days. Indices over 90 days old will be deleted. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Alerts and actions log their activity in a set of "event log" indices. These indices are configured with an index lifecycle managment (ILM) policy which can be customized. The default policy will perform a rollover of the index when it reaches 50GB in size, or after 30 days. Indices over 90 days old will be deleted. | |
Alerts and actions log activity in a set of "event log" indices. These indices are configured with an index lifecycle management (ILM) policy, which you can customize. The default policy rolls over the index when it reaches 50GB, or after 30 days. Indices over 90 days old are deleted. |
|
||
Alerts and actions log their activity in a set of "event log" indices. These indices are configured with an index lifecycle managment (ILM) policy which can be customized. The default policy will perform a rollover of the index when it reaches 50GB in size, or after 30 days. Indices over 90 days old will be deleted. | ||
|
||
The name of the index policy is `kibana-event-log-policy`. It will be created when Kibana starts, if it doesn't exist, otherwise it will never be modified by Kibana. It can be customized for your environment. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The name of the index policy is `kibana-event-log-policy`. It will be created when Kibana starts, if it doesn't exist, otherwise it will never be modified by Kibana. It can be customized for your environment. | |
The name of the index policy is `kibana-event-log-policy`. {kib} creates the index policy on startup, if it doesn't already exist. The index policy can be customized for your environment, but {kib} never modifies the index policy after creating it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I didn't understand the part about "otherwise it will never be modified by Kibana" so my last sentence is a guess.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The idea is that we create the ILM policy if it doesn't already exist, with some default values. But we never modify it after creating it - there is no UI in Kibana to explicitly modify it (outside of the general ui for all ILM policies that Kibana has). We also never programmatically modify it.
So a customer can make modifications to the policy, to keep data for a shorter or longer period of time, etc. And not worry that alerting will somehow "overwrite" their settings while it's running, or when upgrading versions, etc. We just create it when needed, and then never directly access it (even for reading!) after that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So the wording "You can customize the index policy prior to startup, but not after." is exactly wrong! hehe
Customers should NOT "pre-create" the index policy, because then we won't create one, but we actually DO need to create it to set all the index patterns, aliases, etc. And customers may modify the ILM policy at any time after it's created. Note again, the policy is created at Kibana startup time; this was added in ~7.9 time frame, so anyone upgrading from 7.9 or higher will already have one. Brand new deployments will have one created when Kibana starts up.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
new suggested wording from @gchaps:
Kibana creates the index policy on startup, if it doesn't already exist. The index policy can be customized for your environment, but Kibana never modifies the index policy after creating it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I updated the suggestion.
|
||
The name of the index policy is `kibana-event-log-policy`. It will be created when Kibana starts, if it doesn't exist, otherwise it will never be modified by Kibana. It can be customized for your environment. | ||
|
||
The event log is current used to display historic data on alert and action activity, and for diagnostic purposes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The event log is current used to display historic data on alert and action activity, and for diagnostic purposes. | |
The event log displays historic data on alert and action activity, for diagnostic purposes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The wording on this is not quite right, as the event log doesn't "display" anything itself. It just contains the data that Kibana uses to display historic information in the UI.
I think we can also drop the "diagnostic purposes" part of this - my bad. I have some separate "how to use the event log for diagnostic purposes" info I'm going to be putting in the event log README.md for now (not quite ready for "production"). I was hoping to drop a hint that "there's some interesting stuff in this index!", but that's just teasing without more info.
Here's a new take:
Because the documents in the event log are used to display historic alert and action activity in {kib}, care should be taken when using the ILM delete phase. For example, if the rolled over indices are deleted after 2 weeks, no historic data will be shown in {kib} that is more than 2 weeks old.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice! Suggest changing "care should be taken when" to "take care when".
Yeah, it's certainly not a I considered putting it in "Production considerations" as well, and that would be fine. https://www.elastic.co/guide/en/kibana/7.12/alerting-production-considerations.html Open for any other thoughts though. |
I like the idea of moving the content to production considerations. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Just a question about header size
|
||
[float] | ||
[[event-log-ilm]] | ||
==== Event log index lifecycle managment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be a sibling to Deployment considerations
instead of a sub-heading?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I think you're right. Will change that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes LGTM. The location of these docs will probably have to change when other plugins start using the event log?
|
||
The name of the index policy is `kibana-event-log-policy`. {kib} creates the index policy on startup, if it doesn't already exist. The index policy can be customized for your environment, but {kib} never modifies the index policy after creating it. | ||
|
||
Because the documents in the event log are used to display historic alert and action activity in {kib}, take care when using the ILM delete phase. For example, if the rolled over indices are deleted after 2 weeks, no historic data will be shown in {kib} that is more than 2 weeks old. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure what it means take care when using the ILM delete phase
. Did you mean be careful when modifying the ILM delete phase
? or cautious
?
cc @gchaps
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
could be phrased better, for sure. Maybe something like:
Because Kibana uses the documents to display historic data, you should set the delete phase longer than you would like the historic data to be shown. For example, if you would like to see one month's worth of historic data, you should set the delete phase to at least one month.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That suggestion looks good to me 👍
Yes, I think it would go somewhere under the top-level Config section at the top, but not sure where. I originally put it in the alert settings page under Config, which wasn't great since it's not strictly "config" (and was still under alerting anyway). Still open to move it somewhere else. This seems like the best place for now, otherwise it's hard to imagine anyone finding it - though if we do move it, we could add a link to the new location in the alerting page I have it in now. |
Yeah, I'm good waiting until the event log isn't explicit for alerting before moving the docs 👍 |
This PR is so stale, the single file that had content added to it moved! from:
to
So the backport to 7.11 is going to require a tiny bit of extra work, that hardly seems worth it considering it's the 7.11 release, heh. Going to skip that for now; LMK if there's concern about backports - technically this goes all the way back to 7.8. |
resolves elastic#82435 Just provided a brief description, name of the policy, mentioned we create it but never modify it, provided the default values, and mentioned it could be updated by customers for their environment. Not sure we want to provide more info than that.
Even the backport to 7.12 had conflicts, as we've made alerts => rules type changes. Rather than screw that up in the backport, I'll just not backport there either. |
resolves #82435 Just provided a brief description, name of the policy, mentioned we create it but never modify it, provided the default values, and mentioned it could be updated by customers for their environment. Not sure we want to provide more info than that.
resolves #82435
doc preview: https://kibana_92736.docs-preview.app.elstc.co/guide/en/kibana/master/alerting-production-considerations.html
Summary
I added this to the alert/actions setting doc, as that seemed like the best
place for now. Just provided a brief description, name of the policy, mentioned
we create it but never modify it, provided the default values, and mentioned
it could be updated by customers for their environment. Not sure we want to
provide more info than that.
Checklist
Delete any items that are not applicable to this PR.
Unit or functional tests were updated or added to match the most common scenariosAny UI touched in this PR is usable by keyboard only (learn more about keyboard accessibility)Any UI touched in this PR does not create any new axe failures (run axe in browser: FF, Chrome)If a plugin configuration key changed, check if it needs to be allowlisted in the cloud and added to the docker listThis renders correctly on smaller devices using a responsive layout. (You can test this in your browser)This was checked for cross-browser compatibilityFor maintainers
This was checked for breaking API changes and was labeled appropriately