Add resolution-based chunking to ABI L1b reader #2621

djhoese · 2023-10-30T17:54:32Z

Add resolution-based chunking to the ABI L1b reader and switch the default data type to 32-bit floats. This is similar to #2052 and #2584, but for the ABI L1b reader. This PR was a little more difficult than those because we use `xr.open_dataset` here, so we can't tell xarray what chunk size we want and have it stay aligned with the on-disk chunks. I cheat by hardcoding the on-disk chunk size. Without this we'd end up with misaligned resolution chunks, which defeats the purpose of this PR. I noticed that full disk files consistently have a chunk size of 226, while CONUS and M1/M2 files use 250.

Edit: CSPP Geo GRB is configured to produce on-disk chunks of 226 for all sectors.
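
The alignment constraint described above can be sketched in plain Python. This is only an illustration, not the reader's actual implementation: the function name `resolution_aligned_chunk`, the 4096-pixel target, and the 500 m / 2000 m resolution bounds are assumptions made for the example. Only the on-disk chunk size of 226 comes from the description.

```python
ON_DISK_CHUNK = 226  # on-disk chunk size quoted in the PR description


def resolution_aligned_chunk(
    resolution_m: int,
    target_high_res_chunk: int = 4096,  # assumed target, in pixels per side
    high_res_m: int = 500,              # assumed finest resolution
    low_res_m: int = 2000,              # assumed coarsest resolution
    on_disk_chunk: int = ON_DISK_CHUNK,
) -> int:
    """Return a per-dimension chunk size (in pixels) for one resolution."""
    if low_res_m % resolution_m or low_res_m % high_res_m:
        raise ValueError("resolutions must evenly divide the coarsest resolution")
    # Pixels at the finest resolution that span one coarse on-disk chunk.
    high_res_step = on_disk_chunk * (low_res_m // high_res_m)
    # Number of coarse on-disk chunks per in-memory chunk, at least one.
    n_disk_chunks = max(1, target_high_res_chunk // high_res_step)
    low_res_chunk = n_disk_chunks * on_disk_chunk
    # Scale up so every resolution's chunk covers the same geographic area.
    return low_res_chunk * (low_res_m // resolution_m)


chunks = {res: resolution_aligned_chunk(res) for res in (500, 1000, 2000)}
print(chunks)
```

With these assumed defaults every returned size is a whole multiple of 226, and one chunk at 500 m covers the same area as the matching chunk at 1 km and 2 km.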

Additionally, I changed the default processing to produce 32-bit floats instead of 64-bit floats since the extra precision for non-x/y variables should be unnecessary.
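
To give a rough sense of the trade-off, the sketch below compares the storage needed for one band at each precision and measures the rounding error of a single value stored as a 32-bit float. It uses only the standard library. The helper names are hypothetical, and the shape is the full-disk 500 m shape that appears in the test module discussed in the review.

```python
import struct

FULL_DISK_500M_SHAPE = (21696, 21696)  # full-disk 500 m shape from the tests


def array_nbytes(shape, fmt):
    """Bytes needed for a 2D array of `shape` with struct format `fmt`."""
    rows, cols = shape
    return rows * cols * struct.calcsize(fmt)


def float32_round_trip(value):
    """Store a Python float (64-bit) as a 32-bit float and read it back."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


nbytes_f32 = array_nbytes(FULL_DISK_500M_SHAPE, "<f")
nbytes_f64 = array_nbytes(FULL_DISK_500M_SHAPE, "<d")
relative_error = abs(float32_round_trip(0.1) - 0.1) / 0.1

print(f"float32: {nbytes_f32 / 2**30:.2f} GiB, float64: {nbytes_f64 / 2**30:.2f} GiB")
print(f"relative error of 0.1 stored as float32: {relative_error:.1e}")
```

The 32-bit array needs half the memory, and the relative rounding error stays below 2**-23.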

Lastly, I rewrote the L1b tests. The old ones didn't make it possible to test what I needed to test, and there is a lot less duplicate code now.

TODO:

Actually add tests for the dtype and chunk size changes

Closes #xxxx
Tests added
Fully documented
Add your name to AUTHORS.md if not there already

codecov · 2023-10-30T19:51:27Z

Codecov Report

Merging #2621 (006136e) into main (86c075a) will decrease coverage by 0.01%.
Report is 7 commits behind head on main.
The diff coverage is 97.48%.

@@            Coverage Diff             @@
##             main    #2621      +/-   ##
==========================================
- Coverage   95.18%   95.18%   -0.01%     
==========================================
  Files         354      354              
  Lines       51270    51316      +46     
==========================================
+ Hits        48803    48846      +43     
- Misses       2467     2470       +3

| Flag | Coverage Δ | |
|---|---|---|
| behaviourtests | `4.24% <1.00%> (+<0.01%)` | ⬆️ |
| unittests | `95.81% <98.96%> (+<0.01%)` | ⬆️ |

Flags with carried forward coverage won't be shown.

| Files | Coverage Δ | |
|---|---|---|
| satpy/readers/abi_base.py | `95.36% <100.00%> (+0.80%)` | ⬆️ |
| satpy/readers/abi_l1b.py | `98.87% <100.00%> (-0.03%)` | ⬇️ |
| satpy/tests/reader_tests/test_abi_l1b.py | `98.95% <98.85%> (-0.43%)` | ⬇️ |
| satpy/utils.py | `27.05% <40.00%> (+0.17%)` | ⬆️ |

... and 2 files with indirect coverage changes

coveralls · 2023-10-30T20:25:56Z

Pull Request Test Coverage Report for Build 6712031249

201 of 203 (99.01%) changed or added relevant lines in 4 files are covered.
1 unchanged line in 1 file lost coverage.
Overall coverage increased (+0.003%) to 95.762%

| Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
|---|---|---|---|
| satpy/tests/reader_tests/test_abi_l1b.py | 177 | 179 | 98.88% |

| Files with Coverage Reduction | New Missed Lines | % |
|---|---|---|
| satpy/readers/abi_base.py | 1 | 95.36% |

| Totals | |
|---|---|
| Change from base Build 6663415040: | 0.003% |
| Covered Lines: | 48966 |
| Relevant Lines: | 51133 |

💛 - Coveralls

djhoese · 2023-11-01T02:29:29Z

So the tests are passing here and the individual ABI L1b tests pass locally, but if I run all the Satpy tests, some of these new tests fail at the dask array chunking checks with unexpected chunk sizes. I'd still like to investigate but that shouldn't hold up a review on this. My guess is some poorly configured test in something else and/or one of my other edits/debugs in my other packages screwing things up.
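
For illustration, a chunk-size check of the kind mentioned above could look like the sketch below. It is not the code in the test module: `misaligned_chunks` is a hypothetical helper, and it takes a plain tuple of per-dimension chunk sizes in the layout dask exposes as `arr.chunks`, so it runs without dask installed.

```python
def misaligned_chunks(chunks, on_disk_chunk=226):
    """Return (dim, index, size) for every chunk that breaks alignment.

    `chunks` follows the dask convention: one tuple of chunk sizes per
    dimension. The last chunk in a dimension is not checked because it
    only holds whatever remains after the aligned chunks.
    """
    problems = []
    for dim, sizes in enumerate(chunks):
        for idx, size in enumerate(sizes[:-1]):
            if size % on_disk_chunk:
                problems.append((dim, idx, size))
    return problems


# Two chunk layouts for the same 5424 x 5424 array.
good = ((904,) * 6, (904,) * 6)            # every chunk is 4 * 226
bad = ((1000,) * 5 + (424,), (904,) * 6)   # rows ignore the on-disk chunks

print(misaligned_chunks(good))
print(misaligned_chunks(bad))
```

Reporting the offending chunks instead of asserting directly makes an "unexpected chunk sizes" failure easier to read.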

djhoese · 2023-11-01T02:50:58Z

Scratch that. PyCharm was using an old Python 3.10 environment that I haven't used for a long time. No idea what was in there, but my guess is that an old version of xarray that doesn't use `previous_chunks` was giving different final chunk results.
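
A quick way to rule out a stale environment like this is to print the installed versions from inside the interpreter the IDE is actually using. The sketch below relies only on the standard library. `installed_versions` is a hypothetical helper, and the package list is just the set of packages named in this thread.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


report = installed_versions(["xarray", "dask", "satpy"])
for name, version in report.items():
    print(f"{name}: {version or 'not installed'}")
```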

mraspaud

Looks good overall, and great job on the test refactoring/rewriting.

I have a couple of comments in-line, but one general thing I want to bring up is the use of the scene object in the tests. Is this really necessary? These tests are for the file handlers iiuc, so I suggest we stick to that API. Using the scene here makes these tests fragile and likely to break if the scene object changes or is deprecated in the future, while the ABI reader will still be valid.

satpy/readers/abi_l1b.py

mraspaud · 2023-11-01T19:42:30Z

satpy/tests/reader_tests/test_abi_l1b.py

+# RAD_SHAPE = {
+#     500: (21696, 21696),  # fldk - 500m
+# }


Do we need this dead code?

satpy/tests/reader_tests/test_abi_l1b.py

djhoese · 2023-11-01T20:33:38Z

> one general thing I want to bring up is the use of the scene object in the tests. Is this really necessary? These tests are for the file handlers iiuc, so I suggest we stick to that API. Using the scene here makes these tests fragile and likely to break if the scene object changes or is deprecated in the future, while the ABI reader will still be valid.

Agreed. I brought this up on Slack and Panu was the only one who commented. I don't like using the Scene in the tests either, but I also hate the number of functions/methods I have to call just to get a reader instance. I can change it; I'm just not sure when to use the utility functions in `readers/__init__.py` versus calling the class and class methods directly. I'd like it to load the YAML, and it seems dumb to do that manually, so maybe I should use the helper functions in `readers/__init__.py`.

mraspaud

LGTM, thanks for the heavy work on the tests and not using the scene interface anymore

djhoese added 11 commits October 27, 2023 12:36

Add initial hacky chunking and float32 handling to ABI L1b reader

f182d74

Use filetype info for ABI resolution-based chunking

878e5c6

Start refactoring ABI L1b tests

8536097

Remove unnecessary duplication in ABI L1b tests

55e5248

Use dask arrays in abi l1b tests

deac453

Switch some tests to on-disk files

e248324

Move more ABI L1b tests to on-disk files

07e841c

Switch all ABI L1b tests to on-disk files

b6411c7

Use more realistic sizes in ABI tests

f9efd96

Revert AreaDefinition import for easier test mocking

1e0e21f

More abi l1b test refactoring

514f5e1

djhoese added the enhancement, component:readers, and cleanup labels Oct 30, 2023

djhoese self-assigned this Oct 30, 2023

djhoese added 2 commits October 30, 2023 13:11

Undo forcing GRB fill to floating point

83609cc

Caused failure in GLM L2 DQF processing

Fix various inconsistencies in ABI L1b DataArrays

8b5c450

Add dask chunk size checks to ABI l1b tests

14f59c4

djhoese marked this pull request as ready for review October 31, 2023 20:22

djhoese requested a review from mraspaud as a code owner October 31, 2023 20:22

mraspaud reviewed Nov 1, 2023

Remove unnecessary float cast in satpy/readers/abi_l1b.py

edd0632

Switch abi l1b reader tests to use reader-level interfaces

006136e

mraspaud approved these changes Nov 2, 2023

mraspaud merged commit ee63577 into pytroll:main Nov 2, 2023
16 of 19 checks passed

djhoese deleted the feature-abi-res-chunking branch November 2, 2023 20:41

djhoese mentioned this pull request Nov 6, 2023

Fix ABI readers using wrong dtype for resolution-based chunks #2627

Merged

4 tasks

yukaribbba mentioned this pull request Jan 11, 2024

Resolution-based chunking for AMI L1b? #2716

Open
