Emit total number of Rows scanned while addressing query #17931

Closed
ravisharm wants to merge 199 commits into apache:master from confluentinc:numrowsscanned

Conversation

@ravisharm

Fixes #17171.

Description

Emit the number of rows scanned while serving a query as a response header. This value can be used to quantify the cost of a Druid query and can greatly help in identifying costly queries.

This PR modifies the query engine class of the various query types to compute the number of scanned rows, which is then passed to the higher layers via the response context. Finally, the number of rows scanned is emitted as the header X-Num-Scanned-Rows.
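The accounting flow described above can be sketched as follows (a minimal illustration with hypothetical names — Druid's actual ResponseContext and per-query-type engine classes differ):

```python
# Sketch of the rows-scanned accounting described above (hypothetical
# names; Druid's real ResponseContext and engine classes differ).

class ResponseContext:
    """Accumulates per-segment counters while a query is processed."""
    def __init__(self):
        self.counters = {}

    def add(self, key, value):
        self.counters[key] = self.counters.get(key, 0) + value

def scan_segment(segment_rows, row_filter, context):
    """Each engine counts the rows it scans after applying the filters."""
    matched = [row for row in segment_rows if row_filter(row)]
    context.add("numScannedRows", len(matched))
    return matched

def build_response_headers(context):
    """Higher layers turn the accumulated count into a response header."""
    return {"X-Num-Scanned-Rows": str(context.counters.get("numScannedRows", 0))}

ctx = ResponseContext()
scan_segment([{"v": 1}, {"v": 5}, {"v": 9}], lambda r: r["v"] > 2, ctx)
scan_segment([{"v": 7}], lambda r: r["v"] > 2, ctx)
print(build_response_headers(ctx))  # {'X-Num-Scanned-Rows': '3'}
```

The key point is that each segment scan only increments a counter in the shared context; the header is built once, at the layer that writes the HTTP response.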

Release note

When responding to any Druid query, Druid now includes a new response header, X-Num-Scanned-Rows, that gives the number of rows scanned after applying all the filters in the query. The value of this header can be used as one of the signals to quantify the cost of a Druid query.

This PR has:

  • [x] been self-reviewed.
  • [ ] added documentation for new or modified features or behaviors.
  • [x] a release note entry in the PR description.
  • [x] been tested in a test Druid cluster.

kfaraz and others added 30 commits November 21, 2022 20:39
* Fix web-console snapshots

* Revert changes to package and package-lock.json
* Add sketch fetching framework

* Refactor code to support sequential merge

* Update worker sketch fetcher

* Refactor sketch fetcher

* Refactor sketch fetcher

* Add context parameter and threshold to trigger sequential merge

* Fix test

* Add integration test for non sequential merge

* Address review comments

* Address review comments

* Address review comments

* Resolve maxRetainedBytes

* Add new classes

* Renamed key statistics information class

* Rename fetchStatisticsSnapshotForTimeChunk function

* Address review comments

* Address review comments

* Update documentation and add comments

* Resolve build issues

* Resolve build issues

* Change worker APIs to async

* Address review comments

* Resolve build issues

* Add null time check

* Update integration tests

* Address review comments

* Add log messages and comments

* Resolve build issues

* Add unit tests

* Add unit tests

* Fix timing issue in tests
* Backport firehose PR 12981

* Update migrate-from-firehose-ingestion.md
* Suppress jackson-databind CVE-2022-42003 and CVE-2022-42004
(cherry picked from commit 1f4d892)
* Suppress CVEs
(cherry picked from commit ed55baa)
* Suppress vulnerabilities from druid-website package
(cherry picked from commit c0fb364)
* Add more suppressions for website package
(cherry picked from commit 9bba569)

Co-authored-by: Rohan Garg <7731512+rohangarg@users.noreply.github.com>
…e#13438)

* fixes BlockLayoutColumnarLongs close method to nullify internal buffer.

* fixes other BlockLayoutColumnar supplier close methods to nullify internal buffers.

* fix spotbugs

(cherry picked from commit b091b32)
apache#13421)

* we can read where we want to
we can leave your bounds behind
'cause if the memory is not there
we really don't care
and we'll crash this process of mine
* Update and document experimental features
(cherry picked from commit ccbf3ab)
* Updated
(cherry picked from commit d7b8fae)
* Update experimental-features.md
* Updated after review
(cherry picked from commit 975ae24)
* Updated
(cherry picked from commit eb8268e)
* Update materialized-view.md
(cherry picked from commit 53c3bde)
* Update experimental-features.md
(cherry picked from commit 77148f7)
* Update nested columns docs

* Update nested-columns.md
…13445)

Detects self-redirects, redirect loops, long redirect chains, and redirects to unknown servers.
Treat all of these cases as an unavailable service, retrying if the retry policy allows it.

Previously, some of these cases would lead to a prompt, unretryable error. This caused
clients contacting an Overlord during a leader change to fail with error messages like:

org.apache.druid.rpc.RpcException: Service [overlord] redirected too many times

Additionally, a slight refactor of callbacks in ServiceClientImpl improves readability of
the flow through onSuccess.

Co-authored-by: Gian Merlino <gianmerlino@gmail.com>
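The redirect checks described above can be illustrated with a small sketch (a hypothetical helper under assumed names — the real logic lives in Druid's ServiceClientImpl and its retry policy):

```python
# Sketch of the redirect handling described above: self-redirects,
# loops, long chains, and unknown servers are all surfaced as a
# retryable "service unavailable" condition (hypothetical names).

class ServiceUnavailableError(Exception):
    """Raised so the caller's retry policy can retry the request."""

MAX_REDIRECTS = 5

def follow_redirects(url, fetch, known_hosts):
    """fetch(url) returns (status, location_or_body)."""
    seen = set()
    for _ in range(MAX_REDIRECTS):
        if url in seen:
            raise ServiceUnavailableError(f"redirect loop via {url}")
        seen.add(url)
        host = url.split("/")[2]
        if host not in known_hosts:
            raise ServiceUnavailableError(f"redirect to unknown server {host}")
        status, value = fetch(url)
        if status != 302:
            return value
        if value == url:
            raise ServiceUnavailableError(f"self-redirect at {url}")
        url = value
    raise ServiceUnavailableError("redirect chain too long")
```

Because every bad-redirect case raises the same retryable error type, a leader change that briefly produces redirect loops is retried instead of failing the client immediately.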
…s to parse exception in MSQ (apache#13366) (apache#13454)

* initial commit

* fix test

* push the json changes

* reduce the area of the try..catch

* Trigger Build

* review
…ache#13459) (apache#13464)

* Fix an issue with WorkerSketchFetcher not terminating on shutdown

* Change threadpool name
* add ability to make inputFormat part of the example datasets (apache#13402)

* Web console: Index spec dialog (apache#13425)

* add index spec dialog

* add snapshot

* Web console: be more robust to aux queries failing and improve kill tasks (apache#13431)

* be more robust to aux queries failing

* feedback fixes

* remove empty block

* fix spelling

* remove killAllDataSources from the console

* don't render duration if aggregated (apache#13455)
* Update LDAP configuration docs

(cherry picked from commit e74bd89)

* Updated after review

(cherry picked from commit 882e0b2)

* Update auth-ldap.md

Updated.

(cherry picked from commit d4f0797)

* Update auth-ldap.md

(cherry picked from commit fbec7b2)

* Updated spelling file

(cherry picked from commit ef5316b)

* Update docs/operations/auth-ldap.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>
(cherry picked from commit 1a9b42a)

* Update docs/operations/auth-ldap.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>
(cherry picked from commit 1018d9a)

* Update docs/operations/auth-ldap.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>
(cherry picked from commit dd81b3f)

* Update auth-ldap.md

(cherry picked from commit f0655cf)
) (apache#13493)

In a cluster with a large number of streaming tasks (~1000), SegmentAllocateActions 
on the overlord can often take very long intervals of time to finish thus causing spikes 
in the `task/action/run/time`. This may result in lag building up while a task waits for a
segment to get allocated.

The root causes are:
- large number of metadata calls made to the segments and pending segments tables
- `giant` lock held in `TaskLockbox.tryLock()` to acquire task locks and allocate segments

Since the contention typically arises when several tasks of the same datasource try
to allocate segments for the same interval/granularity, the allocation run times can be
improved by batching the requests together.

Changes
- Add flags
   - `druid.indexer.tasklock.batchSegmentAllocation` (default `false`)
   - `druid.indexer.tasklock.batchAllocationMaxWaitTime` (in millis) (default `1000`)
- Add methods `canPerformAsync` and `performAsync` to `TaskAction`
- Submit each allocate action to a `SegmentAllocationQueue`, and add to correct batch
- Process batch after `batchAllocationMaxWaitTime`
- Acquire `giant` lock just once per batch in `TaskLockbox`
- Reduce metadata calls by batching statements together and updating query filters
- Except for batching, retain the whole behaviour (order of steps, retries, etc.)
- Respond to leadership changes and fail items in queue when not leader
- Emit batch and request level metrics
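Based on the flags listed above, enabling batched segment allocation on the Overlord might look like this (a runtime.properties sketch using the names as introduced in this change; values are illustrative, and a later change in this PR renames the wait-time flag):

```properties
# Overlord runtime.properties sketch (illustrative values)
druid.indexer.tasklock.batchSegmentAllocation=true
# Max time (in millis) a batch waits before its allocate actions run
druid.indexer.tasklock.batchAllocationMaxWaitTime=500
```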
…he#13495)

* Update docs for useBatchedSegmentSampler
* Update docs for round robin assignment
* Update to native ingestion doc

(cherry picked from commit aba83f2)

* Update native-batch.md

* Update native-batch.md
…verview.type=http (apache#13499) (apache#13515)

* fix issue with http server inventory view blocking data node http server shutdown with long polling

* adjust

* fix test inspections
…pache#13517)

Changes:
- Limit max batch size in `SegmentAllocationQueue` to 500
- Rename `batchAllocationMaxWaitTime` to `batchAllocationWaitTime` since the actual
wait time may exceed this configured value.
- Replace usage of `SegmentInsertAction` in `TaskToolbox` with `SegmentTransactionalInsertAction`
… (apache#13529)

* Remove stray reference to fix OOM while merging sketches

* Update future to add result from executor service

* Update tests and address review comments

* Address review comments

* Moved mock

* Close threadpool on teardown

* Remove worker task cancel
…ache#13537) (apache#13542)

The planner sets sqlInsertSegmentGranularity in its context when using
PARTITIONED BY, which sets it on every native query in the stack (as all
native queries for a SQL query typically have the same context).
QueryKit would interpret that as a request to configure bucketing for
all native queries. This isn't useful, as bucketing is only used for
the penultimate stage in INSERT / REPLACE.

So, this patch modifies QueryKit to only look at sqlInsertSegmentGranularity
on the outermost query.

As an additional change, this patch switches the static ObjectMapper to
use the processwide ObjectMapper for deserializing Granularities. Saves
an ObjectMapper instance, and ensures that if there are any special
serdes registered for Granularity, we'll pick them up.

(cherry picked from commit 5581488)

Co-authored-by: Gian Merlino <gianmerlino@gmail.com>
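The outermost-query check described above can be sketched like this (hypothetical names and a simplified query shape — not the actual QueryKit code):

```python
# Sketch: honor sqlInsertSegmentGranularity only on the outermost
# query, ignoring the context key on inner queries (hypothetical names).

def plan_query(query, is_outermost=True):
    """query: dict with a 'context' and an optional nested 'dataSource' query."""
    granularity = query.get("context", {}).get("sqlInsertSegmentGranularity")
    stage = {
        # Bucketing only matters for the stage that writes segments.
        "bucketBy": granularity if is_outermost else None,
    }
    inner = query.get("dataSource")
    if isinstance(inner, dict):
        stage["input"] = plan_query(inner, is_outermost=False)
    return stage
```

Since the SQL planner copies the same context onto every native query in the stack, checking the flag only at the top level is what prevents inner stages from being bucketed unnecessarily.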
* Web console: add arrayOfDoublesSketch and other small fixes (apache#13486)
* add padding and keywords
* add arrayOfDoubles
* Update docs/development/extensions-core/datasketches-tuple.md
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* Update docs/development/extensions-core/datasketches-tuple.md
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* Update docs/development/extensions-core/datasketches-tuple.md
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* Update docs/development/extensions-core/datasketches-tuple.md
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* Update docs/development/extensions-core/datasketches-tuple.md
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* partiton int
* fix docs
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
* Web console: improve compaction status display (apache#13523)
* improve compaction status display
* even more accurate
* fix snapshot
* MSQ: Improve TooManyBuckets error message, improve error docs. (apache#13525)
1) Edited the TooManyBuckets error message to mention PARTITIONED BY
   instead of segmentGranularity.
2) Added error-code-specific anchors in the docs.
3) Add information to various error codes in the docs about common
   causes and solutions.
* update error anchors (apache#13527)
* update snapshot
Co-authored-by: Charles Smith <techdocsmith@gmail.com>
Co-authored-by: Gian Merlino <gianmerlino@gmail.com>
…attening (apache#13519) (apache#13546)

* add protobuf flattener, direct to plain java conversion for faster flattening, nested column tests
hardikbajaj and others added 17 commits October 1, 2024 16:19
…pache#17185) (#233)

* Improve logging to include taskId in segment handoff notifier thread (apache#17185)

* Fix dependencies affecting cherry-pick
…sumed by all components include Brokers/Indexers/Historicals (#235)

* OBSDATA-5211 At Brokers emit a single metric that tells total CPU consumed by all components include Brokers/Indexers/Historicals (#74)

* Emit Cpu consumed as header

* Add comment

* Address PR comments

* Log message from exception

* OBSDATA-5211 Fix style check error (#75)

* Emit Cpu consumed as header

* Add comment

* Address PR comments

* Log message from exception

* fix style check error

* Empty commit (#76)

* Emit Cpu consumed as header

* Add comment

* Address PR comments

* Log message from exception

* fix style check error
* removing generated internal project.yml

* removing generated public project.yml

---------

Co-authored-by: ConfluentSemaphore <40306929+ConfluentSemaphore@users.noreply.github.com>
…sks (#255)

* Make maxRowsInMemory and maxBytesInMemory configurable for indexer tasks

* Fix tests
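A tuningConfig fragment exercising the two properties made configurable here might look like this (task type and values are illustrative only):

```json
{
  "tuningConfig": {
    "type": "index_parallel",
    "maxRowsInMemory": 150000,
    "maxBytesInMemory": 100000000
  }
}
```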
…#256)

* Emit rows scanned metric

* fix test

* address PR comments


Development

Successfully merging this pull request may close these issues.

Druid doesn’t emit any metric for rows scanned per query