Harden search cache and add pg_stat_statements/pg_cron support by bitner · Pull Request #440 · stac-utils/pgstac

bitner · 2026-05-12T19:13:06Z

Summary

This branch tightens PgSTAC search cache behavior, improves concurrency safety around search stats, and adds first-class support for pg_stat_statements and pg_cron in the pgstac Docker image and test flow.

What changed

Reworked search cache handling to use a canonical where-clause hash and better concurrency controls when creating and touching cached searches.
Added search stats refresh support so updatestats flows through cached and uncached search paths and keeps numberMatched / context counts current.
Add columns and tooling to be able to save named queries, to pin queries so they won't be cleaned by TTL values, and to have garbage collection that can clean up not-recently used, named, or pinned queries.
Add garbage collection function for searches (this still must be called by something such as pg_cron).
Added pg_stat_statements and pg_cron to the pgstac image, enabled them through shared_preload_libraries, and initialized them during container bootstrap.
Added smoke tests for both extensions in scripts/container-scripts/test.
Updated pgtap/basic SQL coverage around search, token handling, readonly behavior, and related cache behavior.

Why

The goal of the search-cache work is to reduce collision risk, avoid stale or inconsistent stats under concurrent requests, and make the search implementation easier to maintain by keeping hashing and cache logic in the same SQL module. We also want to be able to clear out the cache for any searches that we don't need to actively be able to look up by name or hash (for example for titler-pgstac integration).

The extension work makes the standard test image closer to the production runtime and gives us direct verification that both extensions are loaded and usable, rather than assuming the container configuration is correct.

…ations

Co-authored-by: Pete Gadomski <pete.gadomski@gmail.com>

…ions - Update pgstac-migrate pyproject.toml to require pgpkg>=0.1.1 (includes routine body-change detection) - Regenerate migrations with pgpkg 0.1.1 which correctly includes search/search_query replacements - Suppress unsafe DROP FUNCTION statements for routines that exist in target schema - Fix PGTap test 116 to check column names in alphabetical order (migration adds columns at end) - Update test plan count from 229 to 248 (tests added for GC, context_count, statslastupdated) - Validate migration chain end-to-end with all tests passing - All precommit hooks passing (migrations, pgtap, pypgstac)

- expand pgstac-migrate README with full CLI/API/env var docs and troubleshooting - make psycopg[binary] mandatory in pgstac-migrate and pypgstac - make psycopg-pool mandatory in pypgstac - remove redundant psycopg optional/group wiring and update test script flags - remove pgstac-migrate upper bound in pypgstac dependency - update release workflow paths and uv setup/build step - refresh docs/changelog references for pgpkg>=0.1.1 - regenerate uv lockfiles

…ash-and-dead-code-rerun # Conflicts: # src/pgstac-migrate/pyproject.toml # src/pgstac-migrate/uv.lock

…d-code-rerun # Conflicts: # .github/instructions/scripts.instructions.md # .gitignore # AGENTS.md # CLAUDE.md # src/pgstac/migrations/pgstac--0.9.11--unreleased.sql

gadomski

Do we need to update the best-practices docs to tell folks to set up a cronjob to clean up searches?

bitner · 2026-05-13T14:04:37Z

Do we need to update the best-practices docs to tell folks to set up a cronjob to clean up searches?

At the end of this series of PRs, there's going to need to be a big clean up of docs including several "cron ready" functions/procedures that I'd rather document all together.

bitner and others added 18 commits May 5, 2026 17:00

feat: add pgstac-migrate compatibility layer

6677497

chore: switch pgpkg workflows to published packages

a5a2b5c

chore: clean up test warnings

3e853c6

update changelog

392c04c

Merge branch 'main' into pgpkgmigrations

2dc9775

add more tests

f31bcd2

pr1: switch search_wheres hashing to sha256 and stage unreleased migr…

88c039d

…ations

Update scripts/makemigration

5982900

Co-authored-by: Pete Gadomski <pete.gadomski@gmail.com>

Update .github/workflows/release.yml

85a299f

Co-authored-by: Pete Gadomski <pete.gadomski@gmail.com>

Merge remote-tracking branch 'origin/pgpkgmigrations' into v010-pr1-h…

adfdd4c

…ash-and-dead-code-rerun # Conflicts: # src/pgstac-migrate/pyproject.toml # src/pgstac-migrate/uv.lock

Merge remote-tracking branch 'origin/main' into v010-pr1-hash-and-dea…

7561525

…d-code-rerun # Conflicts: # .github/instructions/scripts.instructions.md # .gitignore # AGENTS.md # CLAUDE.md # src/pgstac/migrations/pgstac--0.9.11--unreleased.sql

Move pgstac_hash into search SQL

50d73bc

Refine unreleased changelog for search cache hardening

5932cd8

Enable pg_stat_statements and pg_cron in test image

d03f9be

Wire search_query updatestats into where_stats

2dff33e

Update unreleased changelog for search stats refresh

bba2f27

bitner marked this pull request as ready for review May 12, 2026 19:20

bitner requested review from gadomski and hrodmn May 12, 2026 19:20

bitner added 8 commits May 12, 2026 14:31

Move Rust crate under src

197587a

don't save _PLAN.md docs

999a74b

Merge branch 'main' into v010-pr1-hash-and-dead-code-rerun

dd5e7de

Merge branch 'rustac_cleanup' into v010-pr1-hash-and-dead-code-rerun

cd9b9e1

Document Rust crate move

dd4a621

Fix server extension smoke test db selection

a0d5c3c

Harden CI extension smoke tests and tighten changelog

9a253e3

Remove content_slim and regenerate SQL artifacts

584861c

gadomski approved these changes May 13, 2026

View reviewed changes

bitner merged commit 1b810ed into main May 13, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden search cache and add pg_stat_statements/pg_cron support#440

Harden search cache and add pg_stat_statements/pg_cron support#440
bitner merged 26 commits into
mainfrom
v010-pr1-hash-and-dead-code-rerun

bitner commented May 12, 2026 •

edited

Loading

Uh oh!

gadomski left a comment

Uh oh!

bitner commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bitner commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Why

Uh oh!

gadomski left a comment

Choose a reason for hiding this comment

Uh oh!

bitner commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bitner commented May 12, 2026 •

edited

Loading