Skip to content

Conversation

@emmanuelmathot
Copy link
Contributor

@emmanuelmathot emmanuelmathot commented Dec 2, 2025

Summary

Updated documentation to clearly explain the structural differences between the deprecated V0 converter (create_geozarr_dataset) and the current V1 converter (convert_s2_optimized) for Sentinel-2 data.

Key Changes

  • docs/converter.md: Added "V0 vs V1 Converter: Key Differences" section with ASCII diagrams showing:

    • V0: Creates pyramids within each resolution group (/r10m/0, /r10m/1, etc.)
    • V1: Consolidates pyramid by reusing native resolutions and adding coarser levels (/r10m, /r20m, /r60m, /r120m, /r360m, /r720m)
  • docs/architecture.md: Updated to reflect functional programming approach and xarray downsampling

  • docs/geozarr-minispec.md: Documented dual convention support (Zarr Multiscales + GeoZarr TMS)

  • docs/api-reference.md: Added S2 optimization functions documentation

Why This Matters

Addresses confusion about why V1 doesn't produce nested pyramid levels (r10m/0, r10m/1, etc.) like V0. V1's approach:

  • Eliminates data duplication (e.g., r10m/1 ≈ r20m/0)
  • More efficient storage
  • Simpler hierarchy
  • Better fit for Sentinel-2's multi-resolution nature

Migration Note

V0 is deprecated. Data-pipeline v1.0.0 will use V1 converter exclusively.

d-v-b and others added 30 commits November 20, 2025 21:05
- Added `initialize_crs_from_dataset` function to extract CRS from dataset metadata.
- Updated S2 optimization commands to include new CRS handling.
- Removed unused arguments related to geometry and meteorology groups.
- Added comprehensive tests for CRS initialization from various sources.
@emmanuelmathot emmanuelmathot changed the title update documentation for Sentinel-2 optimized conversion, detailing V1 approach and differences from V0 Update documentation for Sentinel-2 optimized conversion Dec 2, 2025
@d-v-b
Copy link
Contributor

d-v-b commented Dec 2, 2025

lgtm! maybe we should get #75 in first?

@lhoupert
Copy link
Contributor

lhoupert commented Dec 2, 2025

I think https://github.com/EOPF-Explorer/data-model/blob/main/docs/examples.md needs to be updated too

@emmanuelmathot emmanuelmathot marked this pull request as ready for review December 2, 2025 17:47
@emmanuelmathot emmanuelmathot merged commit 06df224 into main Dec 4, 2025
5 checks passed
@emmanuelmathot emmanuelmathot deleted the doc-pr-75 branch December 7, 2025 21:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants