[WIP] prototype for caching geometries #54476

otan · 2020-09-16T21:12:43Z

No description provided.

cockroach-teamcity · 2020-09-16T21:12:49Z

This change is

57817: colexec: add the disk-spilling to the hash aggregator r=yuzefovich a=yuzefovich **colexec: mechanical changes for the external hash aggregator** This commit performs several mechanical changes prompted by the follow up work on the external hash aggregator: - extract the arguments to `newSpillingQueue` into a struct - add `context.Context` as the first argument to `ExportBuffered` method - extract aggregator tests into global variables. It also fixes a couple of cosmetic issues with the memory account names for the external operators. Release note: None **colexec: add the disk-spilling to the hash aggregator** This commit introduces the external hash aggregator that uses the hash-based partitioner with the in-memory hash aggregator as the "main" strategy and the external sort + the ordered aggregator as the "fallback". This approach was benchmarked against simply using the external sort + the ordered aggregator, and on larger datasets the chosen approach is noticably faster. In order for the in-memory hash aggregator to be able to actually fallback we need to keep track of all of the input tuples since it is very hard to spill the intermediate results of computation. This required the usage of a spilling queue and enqueuing the copies of all input batches into it. The benchmarks show that the performance overhead of this is relatively small while the spilling queue doesn't have to spill to disk (on the order of 15-20% hit in micro-benchmarks), however, when the spilling queue needs to use the disk, the hit can be 2-3x in case when the hash aggregator itself doesn't have spill. One notable change is that because the ordered aggregator doesn't support filtering aggregation, we cannot support it in the external hash aggregator, and as a result the hash aggregation is currently not planned if filtering aggregation is requested. Another notable change is the addition of unwrapping datum when converting it to JSON using `AsJSON` - for some reason, the row engine was panicking on `TestAggregatorAgainstProcessor` test with `json_agg` function, yet I couldn't reproduce it outside of the unit test, still I believe this addition doesn't make things worse. Fixes: #42485. Release note (sql change): Hash aggregation can now spill to disk when it exhausts its memory limit when executed via the vectorized engine. 58288: builtins: Implement ST_GeneratePoints function r=otan a=mknycha This function generates pseudo-random points until the requested number are found within the input area. Release note (sql change): Implement geo builtin ST_GeneratePoints Resolves: #48942 Dependent on: #54476 Co-authored-by: Yahor Yuzefovich <yahor@cockroachlabs.com> Co-authored-by: Marcin Knychała <knychala.marcin@gmail.com>

prepared geometry

34430ce

otan changed the title ~~prototype for caching geometries~~ WIP: prototype for caching geometries Sep 16, 2020

otan mentioned this pull request Nov 10, 2020

spatial: cache binary predicate operations #56495

Open

otan mentioned this pull request Dec 10, 2020

geo/geomfn: implement ST_GeneratePoints({geometry,int4}) #48942

Closed

mknycha mentioned this pull request Dec 27, 2020

builtins: Implement ST_GeneratePoints function #58288

Merged

otan changed the title ~~WIP: prototype for caching geometries~~ [WIP] prototype for caching geometries Feb 4, 2021

andyyang890 mentioned this pull request Apr 9, 2021

geosprepared: add global spatial cache and use it for st_intersects #63364

Closed

tbg added the X-noremind Bots won't notify about PRs with X-noremind label May 6, 2021

otan closed this May 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] prototype for caching geometries #54476

[WIP] prototype for caching geometries #54476

otan commented Sep 16, 2020

cockroach-teamcity commented Sep 16, 2020

[WIP] prototype for caching geometries #54476

[WIP] prototype for caching geometries #54476

Conversation

otan commented Sep 16, 2020

cockroach-teamcity commented Sep 16, 2020