
[Rhythm] Full RF1 read path #4857

Merged
mapno merged 13 commits into grafana:main from mapno:rhythm-rf1-read-path
Mar 24, 2025

Conversation

Contributor

@mapno mapno commented Mar 14, 2025

What this PR does:

Updates #4478 to the current state of main. Original text:

These are the changes needed to migrate the backend read path to RF1. It adds a new config option, rf1_after, a timestamp after which only blocks with rf == 1 are included in searches and trace-by-ID lookups, and an option to discontinue flushes to object storage from the ingesters.

How each API is handled:

Search: the frontend determines the blocks:

query_frontend:
  search:
    rf1_after: "2024-12-18T00:00:00Z"

Trace lookup: the querier determines the blocks:

querier:
  trace_by_id:
    rf1_after: "2024-12-18T00:00:00Z"

Tags: this is actually not filtering on replication factor, so no changes needed
Metrics: already limited to rf1 blocks
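
For illustration, here is a rough sketch of the block-selection predicate that the rf1_after cutoff implies. This is not the PR's actual code: the helper name and the exact BlockMeta field comparisons are assumptions, loosely based on the config above and the blockMetasForSearch diff further down.

import (
	"time"

	"github.com/grafana/tempo/tempodb/backend"
)

// rf1BlockFilter is a hypothetical helper: before the cutoff (or when the
// cutoff is unset) keep the ingester-flushed blocks; after it, keep only
// blocks written by the RF1 (block-builder) path.
func rf1BlockFilter(rf1After time.Time) func(m *backend.BlockMeta) bool {
	return func(m *backend.BlockMeta) bool {
		if rf1After.IsZero() || m.StartTime.Before(rf1After) {
			return m.ReplicationFactor == backend.DefaultReplicationFactor
		}
		return m.ReplicationFactor == 1
	}
}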

The migration path:

  • Enable Kafka ingest, block-builder, etc. It is now flushing RF1 blocks.
  • Grace period for monitoring and validation
  • Set rf1_after to move the read path over to the new RF1 blocks (recent searches/tags/trace lookups are still directed at ingesters)
  • Grace period for monitoring and validation
  • Disable flushes on ingesters

Note: Global config is non-trivial due to Tempo's modular setup, so this adds redundant config in multiple places. Not ideal.

Which issue(s) this PR fixes:
Fixes #

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@mapno mapno marked this pull request as ready for review March 17, 2025 14:00
@mapno mapno changed the title from WIP: [Rhythm] Full RF1 read path to [Rhythm] Full RF1 read path Mar 17, 2025
mapno added 2 commits March 17, 2025 18:01
Introduced a new `RF1After` field in the search request to allow filtering blocks by replication factor and time. Updated backend logic and tests to handle this functionality. Adjusted max search duration to enhance usability and clarity in search tests.

Signed-off-by: Mario <mariorvinas@gmail.com>
Signed-off-by: Mario <mariorvinas@gmail.com>

type TraceByIDConfig struct {
	QueryTimeout time.Duration `yaml:"query_timeout"`
	RF1After     time.Time     `yaml:"rf1_after"`
}
Contributor

Rethinking the separate settings for search and trace by id rf1. I don't think there is a case to enable them separately - but I'm remembering now this was done because we don't have a great way to share a config option between frontend and querier modules (frontend has knowledge of blocks during search, and querier has knowledge of blocks for trace by id). Can you think of a better approach?

Contributor Author

Now rf1After is passed via query params to the querier.
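
Roughly, that handoff could look like the sketch below; the parameter name, helper names, and RFC 3339 encoding are assumptions, not necessarily what the PR uses.

import (
	"net/http"
	"time"
)

// setRF1After (hypothetical name): the frontend attaches the cutoff to the
// querier request as a query parameter.
func setRF1After(req *http.Request, rf1After time.Time) {
	q := req.URL.Query()
	q.Set("rf1After", rf1After.Format(time.RFC3339))
	req.URL.RawQuery = q.Encode()
}

// parseRF1After (hypothetical name): the querier reads the cutoff back; an
// absent or unparsable value means the cutoff is not set.
func parseRF1After(r *http.Request) time.Time {
	if v := r.URL.Query().Get("rf1After"); v != "" {
		if t, err := time.Parse(time.RFC3339, v); err == nil {
			return t
		}
	}
	return time.Time{}
}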

mapno added 3 commits March 20, 2025 11:00
Added RF1After field to trace query validation logic and related structures. Updated request parsing, test cases, and query configuration to support RF1After. Removed deprecated RF1After config from querier module, ensuring runtime determination.

Signed-off-by: Mario <mariorvinas@gmail.com>
	endT := time.Unix(int64(end), 0)
-	blocks := blockMetasForSearch(s.reader.BlockMetas(tenantID), startT, endT, backend.DefaultReplicationFactor)
+	blocks := blockMetasForSearch(s.reader.BlockMetas(tenantID), startT, endT, func(m *backend.BlockMeta) bool {
+		return m.ReplicationFactor == backend.DefaultReplicationFactor
+	})
Contributor

Can we not use the same logic here as in the search sharder?

Contributor Author

Actually, we need to. Otherwise these requests will fail when we stop flushing ingester blocks. Great catch!
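
In other words, the trace-by-ID call site would reuse the sharder's rf1After-aware predicate rather than hard-coding backend.DefaultReplicationFactor. A sketch in the style of the diff above, where rf1BlockFilter is a hypothetical shared helper, not the PR's actual function name:

// Sketch only: share one rf1After-aware predicate between the search sharder
// and the trace-by-ID path.
blocks := blockMetasForSearch(s.reader.BlockMetas(tenantID), startT, endT, rf1BlockFilter(rf1After))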

mapno added 2 commits March 20, 2025 11:49
Signed-off-by: Mario <mariorvinas@gmail.com>

// validate start and end parameter
-	_, _, _, _, _, reqErr := api.ValidateAndSanitizeRequest(req)
+	_, _, _, _, _, _, reqErr := api.ValidateAndSanitizeRequest(req)
Contributor

This function signature is getting lengthy. Do we want to introduce a TraceByIDRequest object and use it as the output from api.ParseTraceID(req)?

Contributor Author

Yes, I thought of that as well. What didn't convince me is that it would also contain extra params, start and end, that are not part of TraceByIDRequest, and it requires extra changes.

I don't have a strong opinion, but initially favored introducing only the necessary changes.
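
For context, the suggested (but not adopted) request object would bundle the values that api.ValidateAndSanitizeRequest currently returns individually, roughly like the sketch below; the struct name, field names, and types are assumptions.

// traceByIDReq is a hypothetical bundling of the parsed parameters; the PR
// kept the individual return values instead.
type traceByIDReq struct {
	traceID              []byte
	blockStart, blockEnd string
	queryMode            string
	start, end           int64 // the extra time-range params mentioned above
	rf1After             time.Time
}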

@mapno mapno merged commit 4e29421 into grafana:main Mar 24, 2025
15 checks passed