Skip to content

Region aggregation for custom histogram.#121

Merged
norlandrhagen merged 2 commits intomainfrom
hist_update
Jul 28, 2025
Merged

Region aggregation for custom histogram.#121
norlandrhagen merged 2 commits intomainfrom
hist_update

Conversation

@norlandrhagen
Copy link
Collaborator

Updates the region aggregation pipeline step to allow for custom histogram summary stats.

The previous approach used duckdb's histogram func to create equally spaced bins for each risk time horizon score.

This new approach bins risk scores that are exactly 0 into a separate bin, then concatenates them with a histogram of scores greater then 0 to create a schema that looks like:

0
>0-10
10-20
20-30
30-40
40-50
50-60
60-70
70-80
80-90
90-100

@norlandrhagen norlandrhagen marked this pull request as ready for review July 28, 2025 16:43
@norlandrhagen norlandrhagen merged commit bb92344 into main Jul 28, 2025
2 checks passed
@norlandrhagen norlandrhagen deleted the hist_update branch July 28, 2025 17:05
andersy005 added a commit that referenced this pull request Aug 5, 2025
* origin/main: (51 commits)
  fix: update integration tests to include env-file and ensure pull request triggers (#130)
  feat: add OCRConfig for improved configuration management (#122)
  Bump prefix-dev/setup-pixi from 0.8.10 to 0.8.14 in the actions group (#125)
  refactor: revamp the CLI and add `LocalBatchManager` for local execution (#113)
  Region aggregation for custom histogram. (#121)
  Tract level aggregations (#119)
  bump icechunk min version (#118)
  add new script for subsetting and rechunking CONUS-404 data (#110)
  Valid rid check (#112)
  Work towards seperating deployment from core library (#111)
  Data flow diagram update (#109)
  County region aggregation (#104)
  Interp less zero (#106)
  revert to intel based vm types (#103)
  standardize tag casing from 'project=OCR' to 'Project=OCR' across multiple scripts (#102)
  Bump VM type from m5 to m8g (#101)
  alt duckdb install (#100)
  adds duckdb to spatial dockerfile (#99)
  merge pixi.toml with pyproject.toml (#97)
  remove empty building risk scores to filter out CA and MX buildings from CONUS (#95)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant