Right now we send the same tags on replay segments over and over again. we store and index tags in clickhouse, which means we're indexing duplicates of the same data.
We should either use some kind of deduplication logic to not send the same tag twice on replay segments, or alternatively, only send tags on the first segment. I don't think the dedupe logic would be too difficult, could do in memory and that should remove a large % of the duplicated tags.
Right now we send the same tags on replay segments over and over again. we store and index tags in clickhouse, which means we're indexing duplicates of the same data.
We should either use some kind of deduplication logic to not send the same tag twice on replay segments, or alternatively, only send tags on the first segment. I don't think the dedupe logic would be too difficult, could do in memory and that should remove a large % of the duplicated tags.