Use the new symDB package #770

cyriltovena · 2023-06-15T13:35:36Z

Todos

* Create a v2 block
* Flush the new file
* Read from the file

* Introduce stacktrace partition

  This determines the partition of a particular profile by looking first at its metadata:

  * If there is a `Filename` on the main mapping, use `filepath.Base(Filename)`.
  * Failing that, take the externally supplied `service_name`.
  * Fall back to `unknown`.

  Take the underlying string value and hash it.

* After a chat with cyril we decided to no longer apply a modulo to the hash and to use the hash value directly. We did not want to risk two applications with very large stacktrace sets colliding in the same partition.
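The selection order described above can be sketched in Go as follows. This is only an illustration under stated assumptions, not the code from this PR: the `Profile` and `Mapping` types are simplified hypothetical stand-ins, the "main mapping" is assumed to be the first mapping in the profile, the function names are invented for the example, and FNV-1a from the standard library stands in for the hash function, which the description does not name.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"path/filepath"
)

// Mapping and Profile are simplified, hypothetical stand-ins for the pprof
// types; only the fields needed for partition selection are modeled.
type Mapping struct {
	Filename string
}

type Profile struct {
	// Assumption: the first mapping is treated as the main mapping.
	Mappings []Mapping
}

// partitionName returns the string that identifies the partition:
// the base name of the main mapping's Filename, then the externally
// supplied service_name label, then the literal "unknown".
func partitionName(p Profile, labels map[string]string) string {
	if len(p.Mappings) > 0 && p.Mappings[0].Filename != "" {
		return filepath.Base(p.Mappings[0].Filename)
	}
	if name := labels["service_name"]; name != "" {
		return name
	}
	return "unknown"
}

// stacktracePartition hashes the partition name and returns the full
// 64-bit value, with no modulo applied. FNV-1a is an assumption made
// for this sketch.
func stacktracePartition(p Profile, labels map[string]string) uint64 {
	h := fnv.New64a()
	h.Write([]byte(partitionName(p, labels)))
	return h.Sum64()
}

func main() {
	withMapping := Profile{Mappings: []Mapping{{Filename: "/usr/local/bin/my-app"}}}
	withLabel := map[string]string{"service_name": "checkout"}

	fmt.Println(partitionName(withMapping, withLabel)) // mapping wins over the label
	fmt.Println(partitionName(Profile{}, withLabel))   // falls back to service_name
	fmt.Println(partitionName(Profile{}, nil))         // falls back to "unknown"
	fmt.Println(stacktracePartition(withMapping, nil))
}
```

Because the partition is the hash of a name rather than the hash reduced modulo a small number, two profiles share a partition only when their names match or in the unlikely case that the full hash values collide.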

* Increase parquet writer PageBufferSize
* reduce by 2 page buffer size
* Introduce symdb
* Add chunk format description
* Add chunk format description
* Improve naming
* Implement stack trace appender
* Limit chunk by number of nodes
* Stacktrace ID is uint32
* Add in-memory stacktrace resolver
* Add writer
* Add writer
* Fix stacktrace resolver
* Single pass write
* Index file refactoring
* Fixes, improvements, notes
* Ignore empty stacktraces
* Fix chunk boundary check
* Fix tests
* Store chunk headers sorted
* Make chunk index explicit
* Add file reader
* Use group varint encoding
* Refine stacktrace tree
* Stacktrace tree race condition elimination
* Remove unused stacktracesResolve.do
* Better nil coalescence in stack trace appender
* Format imports
* Use the new symDB package (#770)
  * Ingest stacktraces in the new symdb
  * Setup read in memory read path
  * Fix up a comment placement
  * Start setting up the read path
  * Update to uint32
  * Introduce stacktrace partition (#775)
    * Introduce stacktrace partition

      This determines the partition of a particular profile by looking first at its metadata:

      * If there is a `Filename` on the main mapping, use `filepath.Base(Filename)`.
      * Failing that, take the externally supplied `service_name`.
      * Fall back to `unknown`.

      Take the underlying string value and hash it.

    * After a chat with cyril we decided to no longer apply a modulo to the hash and to use the hash value directly. We did not want to risk two applications with very large stacktrace sets colliding in the same partition.
  * Remove reconstructMeta from singleBlockQuerier
  * support multiple versions of stacktraces resolver
  * Integrate v2 reader for stacktraces in block reader
  * Fixes tests
  * Rewrite locations Ids
  * Rewrite test for counting uniq stacktraces
  * lint and fmt
  * Fixes more tests
  * Fixes leftover from todo

  ---------

  Co-authored-by: Christian Simon <simon@swine.de>
* Use prefixed bucket for symbols
* Initialize locationsIdsByStacktraceID
* Initialize locationsIdsByStacktraceID for pprof as well
* Fix chunk headers sort
* Inline node alloc
* Mapping filename extraction
* Tidy go.mod
* Fix TestHeadIngestStacktraces
* Use symdb.DefaultDirName
* Sort mappings on write
* Make column iterator to respect the context
* Fix unexpected EOF on stacktrace chunk unmarshal
* Fix symbols upload
* Fix symbols upload
* Release fetched data
* 3MB Page Buffer Size
* Sort stacktraces IDs as expected by the resolver

---------

Co-authored-by: Cyril Tovena <cyril.tovena@gmail.com>
Co-authored-by: Christian Simon <simon@swine.de>

cyriltovena and others added 21 commits June 15, 2023 09:52

Ingest stacktraces in the new symdb

5396ab6

Setup read in memory read path

0eb7228

Fix up a comment placement

3bc2814

Start setting up the read path

e8c6a51

Merge branch 'feat/symdb' into feat/symdb-write-path

b20af73

Update to uint32

1071c92

Merge branch 'feat/symdb' into feat/symdb-write-path

a040c36

Merge branch 'feat/symdb' into feat/symdb-write-path

7ee05ee

Remove reconstructMeta from singleBlockQuerier

664e6b5

support multiple versions of stacktraces resolver

8688fe7

Merge branch 'feat/symdb' into feat/symdb-write-path

40442cf

Merge branch 'feat/symdb' into feat/symdb-write-path

7ce4c38

Integrate v2 reader for stacktraces in block reader

39cdfbc

Fixes tests

98a5847

Rewrite locations Ids

5e2b85f

Rewrite test for counting uniq stacktraces

181f32e

lint and fmt

6d5e7c0

Merge branch 'feat/symdb' into feat/symdb-write-path

c9ffd76

Fixes more tests

d94f0b1

Fixes leftover from todo

84e2d60

cyriltovena merged commit e783311 into feat/symdb Jun 20, 2023
16 of 17 checks passed

cyriltovena deleted the feat/symdb-write-path branch June 20, 2023 15:36
