Add cached valid_region_ids.json and use it in ChunkingConfig #280

Merged
andersy005 merged 1 commit into main from cache-valid-region-ids on Oct 21, 2025
@andersy005 (Member)

  • ChunkingConfig.valid_region_ids now tries to load a cached JSON (ocr/data/valid_region_ids.json),
    logs progress in debug mode, and writes the computed list back to the cache if needed.
  • Add ocr/data/.gitkeep and the precomputed valid_region_ids.json to the package.
  • Update pyproject.toml to include data/*.json as package data.
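
A minimal sketch of the load-or-compute-then-write pattern described above, not the code from this PR. The class name `ChunkingConfigSketch`, the `debug` flag, the `_compute_valid_region_ids` helper, and the placeholder region ID format are all assumptions made for illustration; only the property name `valid_region_ids` and the idea of a JSON cache file come from the description.

```python
import json
import logging
import tempfile
from functools import cached_property
from pathlib import Path

logger = logging.getLogger(__name__)


class ChunkingConfigSketch:
    """Hypothetical stand-in for ChunkingConfig; only the caching logic is shown."""

    def __init__(self, cache_path: Path, debug: bool = False):
        self.cache_path = cache_path  # e.g. <package>/data/valid_region_ids.json
        self.debug = debug

    def _compute_valid_region_ids(self) -> list[str]:
        # Placeholder for the expensive computation; the ID format is made up.
        return [f"y{y}_x{x}" for y in range(2) for x in range(3)]

    @cached_property
    def valid_region_ids(self) -> list[str]:
        # 1. Try the cached JSON first.
        if self.cache_path.exists():
            try:
                ids = json.loads(self.cache_path.read_text())
                if self.debug:
                    logger.info("Loaded %d region IDs from %s", len(ids), self.cache_path)
                return ids
            except (OSError, json.JSONDecodeError):
                # Unreadable or corrupt cache: fall through and recompute.
                logger.warning("Ignoring unreadable cache at %s", self.cache_path)

        # 2. Cache miss: compute the list.
        if self.debug:
            logger.info("Computing valid region IDs (no usable cache)")
        ids = self._compute_valid_region_ids()

        # 3. Best-effort write-back; a read-only install should not fail here.
        try:
            self.cache_path.parent.mkdir(parents=True, exist_ok=True)
            self.cache_path.write_text(json.dumps(ids))
        except OSError:
            logger.warning("Could not write cache to %s", self.cache_path)
        return ids


# Usage: the first instance computes and writes, the second loads from disk.
with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "data" / "valid_region_ids.json"
    first = ChunkingConfigSketch(cache, debug=True).valid_region_ids
    second = ChunkingConfigSketch(cache).valid_region_ids
```

Treating the write-back as best effort is a design choice in this sketch: when the package is installed into a read-only location, the property still returns the computed list instead of raising.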

@andersy005 andersy005 added the enhancement New feature or request label Oct 21, 2025
@andersy005 andersy005 merged commit 52cd9a2 into main Oct 21, 2025
8 checks passed
@andersy005 andersy005 deleted the cache-valid-region-ids branch October 21, 2025 19:13
andersy005 added a commit that referenced this pull request Nov 4, 2025
* main: (46 commits)
  Change summary stats geoparquet filepaths from `output` to `intermediate` (#299)
  Update data downloads page (#300)
  Bump prefix-dev/setup-pixi from 0.9.1 to 0.9.2 in the actions group (#298)
  Update data download documentation (#293)
  migrate vector input datasets to unified ingestion and remove unused datasets (#297)
  Fix duplicate `avg_name` (#296)
  fix California and Tennessee region IDs in staging automatic deploy (#294)
  Add additional region IDs to QA PR automatic deploy (#292)
  create a unified infrastructure for ingesting and processing input datasets (#289)
  Combine county, tract and block PMTiles layers into a single regions.pmtiles layer (#291)
  Pyramid (#284)
  Use buffered slices to remove edge effects from neighborhood operations (#288)
  Bumps up RAM for `write-aggregated-region-analysis-files` job (#290)
  fix block dataset path construction in wind risk regional aggregation (#282)
  Adds a bbox struct for region pmtiles (#281)
  compute Dask-backed data before assert_equal/assert_all_close (#283)
  pipeline and configuration improvements (#279)
  Add cached valid_region_ids.json and use it in ChunkingConfig (#280)
  Combining wind-smeared data and Riley BP + smoothing (#278)
  update-docs: add first draft of all docs pages (#275)
  ...