(Doc+) Flush out Data Tiers #107981

stefnestor · 2024-04-27T19:54:13Z

👋🏽 howdy, team!

I highly value the content on this Data Tiers page. Thanks for writing it! In my experience, some users may become slightly confused by its golden nuggets due to its brevity. This PR attempts to flush out common questions while remaining concise.

The main changes are in the first and second-to-last sections; however, I do attempt some heading restructuring to make the TOC idea-groupings more clear for easier scan-throughs.

The specific clarifications I'd like to push in order of appearance:

There's content tier (for "data category" > "content" as we've dubbed it on the higher page) and the data temperature tiers (for time series). That the temperature tiers group together is technically not stated so users end up asking about when they'd go hot>warm vs content>warm, etc. I suspect this confusion is only because users come straight to this page instead of starting at the hierarchy-parent page so have linked up.
Frozen being accessed/searched "rarely" should imply, well rarely. I wrote 1% in the PR [TIP] guideline section as a discussion starting point. Frequently we see users not understanding either that they actually have been or that they shouldn't have ≥25% of all searches hitting frozen tier. This comes up because of architecture bugs (e.g. frozen indices with future timestamps) but also just happenstance (e.g. 01605242 where of searches they hit majority hot, ~5% cold, but then again hit 75% frozen).
There's a slew of "how do I check that?", "how do I change that (at creation/later)?", "what if I set it null?" questions we get about _tier_preference so just extended the existing section already about it.

TIA! 🙏 cc: @dakrone @bytebilly

👋🏽 howdy, team! I highly value the content on this [Data Tiers](https://www.elastic.co/guide/en/elasticsearch/reference/current/data-tiers.html) page. Thanks for writing it! In my experience, some users may become slightly confused by its golden nuggets due to its brevity. This PR attempts to flush out common questions while remaining concise. The main changes are in the first and second-to-last sections; however, I do attempt some heading restructuring to make the TOC idea-groupings more clear for easier scan-throughs. The specific clarifications I'd like to push in order of appearance: - There's content tier (for "data category" > "content" as we've dubbed it on the higher page) and the data temperature tiers (for time series). That the temperature tiers group together is technically not stated so users end up asking about when they'd go hot>warm vs content>warm, etc. I suspect this confusion is only because users come straight to this page instead of starting at the hierarchy-parent page so have linked up. - (Main) Frozen being accessed/searched "rarely" should imply, well rarely. I wrote 1% in the PR `[TIP]` guideline section as a discussion starting point. Frequently we see users not understanding either that they actually have been or that they shouldn't have ≥25% of all searches hitting frozen tier. This comes up because of architecture bugs (e.g. frozen indices with future timestamps) but also just happenstance (e.g. 01605242 where of searches they hit majority hot, ~5% cold, but then again hit 75% frozen). - There's a slew of "how do I check that?", "how do I change that (at creation/later)?", "what if I set it null?" questions we get about `_tier_preference` so just extended the existing section already about it. TIA! 🙏

github-actions · 2024-04-27T19:54:26Z

Documentation preview:

✨ Changed pages

elasticsearchmachine · 2024-04-27T19:54:38Z

Pinging @elastic/es-docs (Team:Docs)

shainaraskas

🔥 you added so many great details in this PR!

I've reviewed and provided some feedback/edits from an organization and clarity POV. There are some nuances around tier hardware profiles that I didn't completely understand, so I apologize for any inaccuracies I injected with my edits and for any feedback that doesn't exactly align with your goals.

docs/reference/datatiers.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

docs/reference/datatiers.asciidoc

stefnestor · 2024-05-02T17:24:04Z

👋🏽 @shainaraskas , thanks for hanging out! Apologies for the delay, I work weekends so today's my Monday.

Your edits are also 🔥 , cheers! I accepted all grammar and most rewordings; I've left comments on what remains because I agree it matters to get these parts right to avoid confusion.

shainaraskas

just working through your comments on the index allocation section but thought I'd throw these comments your way :)

docs/reference/datatiers.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

shainaraskas

looking so good! left a couple of comments that are up to your preference.

I think we're basically ready to go, but I'm not sure why the tests are failing. looking into it now. 👍

edit: this looks like it's maybe the same error as your other PR, so I'm going to rebase this one too.

edit 2: after it's green and you check out my comments, feel free to merge (unless you're waiting on an engineering review).

docs/reference/datatiers.asciidoc

shainaraskas · 2024-05-03T16:14:06Z

we can also probably target 8.14.0, 8.13.3, and 8.13.4 with this so the docs are available asap.

docs/reference/datatiers.asciidoc

dakrone

I left some comments for this change.

I also have concerns that we give a false sense of specificity with giving hard recommendations for percentages in these docs. My preference would be to teach the reader to weigh the values of cost, performance, and configuration complexity rather than giving hard numbers that are likely to mislead a user. I'm curious what your thoughts about this are.

docs/reference/datatiers.asciidoc

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

stefnestor · 2024-07-11T16:19:05Z

I also have concerns that we give a false sense of specificity with giving hard recommendations for percentages in these docs.

From sub-thread, we're agreed to leave this out for now & consider in future doc/blog. Ready again for your review, @dakrone 🙏 & sorry for the delay.

dakrone

LGTM, thanks for iterating on this Stef!

I highly value the content on this [Data Tiers](https://www.elastic.co/guide/en/elasticsearch/reference/current/data-tiers.html) page. Thanks for writing it! In my experience, some users may become slightly confused by its golden nuggets due to its brevity. This PR attempts to flush out common questions while remaining concise. The main changes are in the first and second-to-last sections; however, I do attempt some heading restructuring to make the TOC idea-groupings more clear for easier scan-throughs. The specific clarifications I'd like to push in order of appearance: - There's content tier (for "data category" > "content" as we've dubbed it on the higher page) and the data temperature tiers (for time series). That the temperature tiers group together is technically not stated so users end up asking about when they'd go hot>warm vs content>warm, etc. I suspect this confusion is only because users come straight to this page instead of starting at the hierarchy-parent page so have linked up. - (Main) Frozen being accessed/searched "rarely" should imply, well rarely. I wrote 1% in the PR `[TIP]` guideline section as a discussion starting point. Frequently we see users not understanding either that they actually have been or that they shouldn't have ≥25% of all searches hitting frozen tier. This comes up because of architecture bugs (e.g. frozen indices with future timestamps) but also just happenstance (e.g. 01605242 where of searches they hit majority hot, ~5% cold, but then again hit 75% frozen). - There's a slew of "how do I check that?", "how do I change that (at creation/later)?", "what if I set it null?" questions we get about `_tier_preference` so just extended the existing section already about it. --------- Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

elasticsearchmachine · 2024-07-18T20:37:48Z

💚 Backport successful

Status	Branch	Result
✅	8.14
✅	8.13

I highly value the content on this [Data Tiers](https://www.elastic.co/guide/en/elasticsearch/reference/current/data-tiers.html) page. Thanks for writing it! In my experience, some users may become slightly confused by its golden nuggets due to its brevity. This PR attempts to flush out common questions while remaining concise. The main changes are in the first and second-to-last sections; however, I do attempt some heading restructuring to make the TOC idea-groupings more clear for easier scan-throughs. The specific clarifications I'd like to push in order of appearance: - There's content tier (for "data category" > "content" as we've dubbed it on the higher page) and the data temperature tiers (for time series). That the temperature tiers group together is technically not stated so users end up asking about when they'd go hot>warm vs content>warm, etc. I suspect this confusion is only because users come straight to this page instead of starting at the hierarchy-parent page so have linked up. - (Main) Frozen being accessed/searched "rarely" should imply, well rarely. I wrote 1% in the PR `[TIP]` guideline section as a discussion starting point. Frequently we see users not understanding either that they actually have been or that they shouldn't have ≥25% of all searches hitting frozen tier. This comes up because of architecture bugs (e.g. frozen indices with future timestamps) but also just happenstance (e.g. 01605242 where of searches they hit majority hot, ~5% cold, but then again hit 75% frozen). - There's a slew of "how do I check that?", "how do I change that (at creation/later)?", "what if I set it null?" questions we get about `_tier_preference` so just extended the existing section already about it. --------- Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

stefnestor added >enhancement >docs General docs changes Team:Data Management Meta label for data/management team Team:Docs Meta label for docs team Supportability Improve our (devs, SREs, support eng, users) ability to troubleshoot/self-service product better. labels Apr 27, 2024

This comment was marked as resolved.

Sign in to view

elasticsearchmachine added v8.15.0 external-contributor Pull request authored by a developer outside the Elasticsearch team labels Apr 27, 2024

elasticsearchmachine removed the Team:Data Management Meta label for data/management team label Apr 27, 2024

stefnestor added the Team:Data Management Meta label for data/management team label Apr 27, 2024

elasticsearchmachine removed the Team:Data Management Meta label for data/management team label Apr 27, 2024

shainaraskas self-requested a review April 29, 2024 15:33

shainaraskas reviewed Apr 29, 2024

View reviewed changes

stefnestor and others added 2 commits May 2, 2024 11:05

Grammar feedback

362201b

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

_tier_preference section feedback

9eed70d

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

stefnestor commented May 2, 2024

View reviewed changes

docs/reference/datatiers.asciidoc Outdated Show resolved Hide resolved

stefnestor commented May 2, 2024

View reviewed changes

docs/reference/datatiers.asciidoc Show resolved Hide resolved

shainaraskas reviewed May 2, 2024

View reviewed changes

Apply suggestions from code review

24035b3

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

shainaraskas approved these changes May 3, 2024

View reviewed changes

docs/reference/datatiers.asciidoc Outdated Show resolved Hide resolved

Merge branch 'main' into stefnestor-patch-7

955650a

shainaraskas reviewed May 3, 2024

View reviewed changes

docs/reference/datatiers.asciidoc Show resolved Hide resolved

shainaraskas added 2 commits May 3, 2024 13:50

Update docs/reference/datatiers.asciidoc

e6388c2

Update docs/reference/datatiers.asciidoc

b71b016

shainaraskas added the v8.14.0 label May 3, 2024

elasticsearchmachine added v8.13.5 and removed v8.13.4 labels May 7, 2024

dakrone requested changes May 9, 2024

View reviewed changes

Grammar feedback

c72d632

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

elasticsearchmachine added v8.14.2 and removed v8.14.1 labels Jun 12, 2024

feedback

807d5c4

shainaraskas added the auto-backport-and-merge label Jun 21, 2024

elasticsearchmachine added v8.14.3 v8.16.0 and removed v8.14.2 v8.15.0 labels Jun 27, 2024

elasticsearchmachine added v8.14.4 and removed v8.14.3 labels Jul 9, 2024

stefnestor added 2 commits July 11, 2024 10:11

feedback

656a7a6

Merge branch 'main' into stefnestor-patch-7

21528bd

dakrone approved these changes Jul 18, 2024

View reviewed changes

stefnestor merged commit 67a8e89 into main Jul 18, 2024
6 checks passed

stefnestor deleted the stefnestor-patch-7 branch July 18, 2024 20:35

stefnestor mentioned this pull request Jul 18, 2024

[8.14] (Doc+) Flush out Data Tiers (#107981) #111073

Merged

stefnestor mentioned this pull request Jul 18, 2024

[8.13] (Doc+) Flush out Data Tiers (#107981) #111074

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(Doc+) Flush out Data Tiers #107981

(Doc+) Flush out Data Tiers #107981

stefnestor commented Apr 27, 2024

github-actions bot commented Apr 27, 2024

This comment was marked as resolved.

elasticsearchmachine commented Apr 27, 2024

shainaraskas left a comment

stefnestor commented May 2, 2024

shainaraskas left a comment

shainaraskas left a comment •

edited

Loading

shainaraskas commented May 3, 2024

dakrone left a comment

stefnestor commented Jul 11, 2024

dakrone left a comment

elasticsearchmachine commented Jul 18, 2024

(Doc+) Flush out Data Tiers #107981

(Doc+) Flush out Data Tiers #107981

Conversation

stefnestor commented Apr 27, 2024

github-actions bot commented Apr 27, 2024

This comment was marked as resolved.

elasticsearchmachine commented Apr 27, 2024

shainaraskas left a comment

Choose a reason for hiding this comment

stefnestor commented May 2, 2024

shainaraskas left a comment

Choose a reason for hiding this comment

shainaraskas left a comment • edited Loading

Choose a reason for hiding this comment

shainaraskas commented May 3, 2024

dakrone left a comment

Choose a reason for hiding this comment

stefnestor commented Jul 11, 2024

dakrone left a comment

Choose a reason for hiding this comment

elasticsearchmachine commented Jul 18, 2024

💚 Backport successful

shainaraskas left a comment •

edited

Loading