Update SEVIRI native reader with 'time_parameters' metadata #1877

simonrp84 · 2021-11-09T12:55:36Z

Edit: The original purpose of this PR has changed. Originally it switched the native reader to use the observation time for the generic start and end time, but since its original creation a new "standard" has been described to use nominal time for the start/end time and put all time metadata in a special time_parameters subdictionary of .attrs.

Original Description:

The SEVIRI HRIT and NAT readers are currently inconsistent with their start/end time values:

from satpy import Scene
scn = Scene([my_nat_file], reader='seviri_l1b_native')
scn.load(['IR_108'])
print("NATIVE")
print(scn['IR_108'].attrs['start_time'])
print(scn['IR_108'].attrs['end_time'])

scn = Scene(my_hrit_files, reader='seviri_l1b_hrit')
scn.load(['IR_108'])
print("HRIT")
print(scn['IR_108'].attrs['start_time'])
print(scn['IR_108'].attrs['end_time'])

Gives:

NATIVE
2021-11-09 12:00:10.388905
2021-11-09 12:15:10.076995

HRIT 
2021-11-09 12:00:10.388000
2021-11-09 12:12:43.089000

This is because the HRIT reader selects:

['ImageProductionStats']['ActualScanningSummary']['ForwardScanStart'] and
['ImageProductionStats']['ActualScanningSummary']['ForwardScanEnd'] from the trailer / epilogue.

The NAT reader, though, selects:

['ImageAcquisition']['PlannedAcquisitionTime']['TrueRepeatCycleStart'] and
['ImageAcquisition']['PlannedAcquisitionTime']['PlannedRepeatCycleEnd'] from the header.

In this PR I update the NAT reader to select the same start/end time values as the HRIT reader uses. With the PR, the output of the above code snippet is:

NATIVE
2021-11-09 12:00:10.388000
2021-11-09 12:12:43.089000

HRIT 
2021-11-09 12:00:10.388000
2021-11-09 12:12:43.089000

mraspaud · 2021-11-09T12:58:09Z

Thanks for the quick PR! waiting for the expert opinions here, @sfinkens @ameraner ?

sjoro · 2021-11-09T13:21:22Z

a philosophical question: are the Satpy start_time and end_time meant to represent the repeat cycle times or the actual scanning times? as you can see the end times are then fundamentally different. i might be the one who selected the repeat cycle times for the native-reader back in the days...
in the latter case the trailer has, as indicated, the actual start and end times, though the planned times should be accurate to a fractions of a second (says the documentation) and could be used in case there is any inconvenience of reading these data from the trailer.

sfinkens · 2021-11-09T14:52:48Z

Looks good to me, thanks for fixing this! They should indeed be consistent.

Regarding philosophy: I'd vote for the actual scanning times as provided by the HRIT reader and also ahi_hsd or abi_l1b for example. That is more consistent with swath data which don't have the concept of repeat cycles. But of course it would be nice to have the nominal timestamps in the attributes as well. In ahi_hsd they're called scheduled time

Also pinging @ninahakansson : I think this doesn't break our workflow, because the end time is computed by level1c4pps here

mraspaud · 2021-11-10T07:39:11Z

Not in the scope of this PR, but to react to @sjoro 's question, I would say that different users might have different needs. I suppose a solution would be to have multiple attributes in this case, eg start_scan_time, start_nominal_time, etc
But if we want to be consistent between polar and geo satellites, I would say that the default start_time and end_time should indeed refer to scanning times.

mraspaud · 2021-11-10T07:41:19Z

@simonrp84 looks like you broke the seviri_native tests :)

pnuu · 2021-11-10T07:50:11Z

But if we want to be consistent between polar and geo satellites, I would say that the default start_time and end_time should indeed refer to scanning times.

The GEO users (the people looking at the images) expect the timestamps to match the nominal times, which means start_time should stay as it is and the (filename) timestamps should align with the nominal repeat cycle.

mraspaud · 2021-11-10T08:37:47Z

The GEO users (the people looking at the images) expect the timestamps to match the nominal times, which means start_time should stay as it is and the (filename) timestamps should align with the nominal repeat cycle.

So we are lucky, because the scanning time in HRIT that we have used all the time as start_time is pretty much in sync with the nominal time :) Maybe a more sustainable solution would be to add new attributes with nominal times and use that in our filenamepatterns (in trollflow2) ?

sjoro · 2021-11-10T11:48:23Z

i'm ok with start_time and end_time being the actual scanning times, feels more consistent with LEO data/instruments. but as @mraspaud suggested, i feel like we'd need to have additional attributes like start_nominal_time, end_nominal_time. would be useful for plotting purposes.

most likely even needed in SIFT... currently we just take the start_time and as you can guess, the dataset marker on the GUI timeline is not placed exactly at the nominal repeat cycle start time, but a bit forward, e.g. 10 seconds. we've had a few users asking about this.

another issue could be (i'm inventing this use case) that a user clicks at 12:14 on the SIFT time line, the end time for SEVIRI data was 12:12.48. at 12:14 there's no valid data to be displayed. here i', imagining that some start and end times could be used from the Satpy readers to give timeframe when that dataset is "valid" and should be displayed. apologies for mixing SIFT in the mixture here, hehe. :)

simonrp84 · 2021-11-10T13:30:24Z

The GEO users (the people looking at the images) expect the timestamps to match the nominal times, which means start_time should stay as it is and the (filename) timestamps should align with the nominal repeat cycle.

Presumably most of the GEO users are using HRIT data, and they seem OK with the actual rather than nominal times in that reader.

I do agree that it'd be useful to provide also the nominal times, so like @sjoro suggested I will add start_nominal_time and end_nominal_time to this PR.

satpy/tests/reader_tests/test_seviri_l1b_native.py

ameraner · 2021-11-10T14:24:41Z

I also think that having the scan end time as end_time is the most technically correct and consistent option (it would also match the end time contained in the native filenames), but I share the concerns of having time gaps between the validity periods of repeat cycles... could be problematic for monitoring/LEO-GEO matching/plotting tools etc.

So I agree that adding extra attrs with the nominal repeat cycle time is useful, and would suggest to make this difference clear in the reader documentation.

PS: we would need to do the same on the NetCDF reader (these lines compute the time) and use the forward_scan_start_day, forward_scan_start_msec, forward_scan_end_day, forward_scan_end_msec global attributes values.

codecov · 2021-11-10T14:34:36Z

Codecov Report

Merging #1877 (e5daa33) into main (92130df) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #1877   +/-   ##
=======================================
  Coverage   93.95%   93.96%           
=======================================
  Files         285      285           
  Lines       43754    43834   +80     
=======================================
+ Hits        41108    41187   +79     
- Misses       2646     2647    +1

Flag	Coverage Δ
behaviourtests	`4.73% <0.00%> (+<0.01%)`	⬆️
unittests	`94.63% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
...y/tests/reader_tests/test_seviri_l1b_hrit_setup.py	`100.00% <ø> (ø)`
satpy/readers/seviri_l1b_hrit.py	`90.42% <100.00%> (+0.27%)`	⬆️
satpy/readers/seviri_l1b_native.py	`86.25% <100.00%> (+0.86%)`	⬆️
satpy/tests/reader_tests/test_seviri_l1b_native.py	`100.00% <100.00%> (ø)`
satpy/composites/__init__.py	`90.26% <0.00%> (-0.01%)`	⬇️
satpy/tests/test_composites.py	`100.00% <0.00%> (ø)`

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

sjoro

LGTM

coveralls · 2021-11-10T15:03:16Z

Coverage increased (+0.008%) to 94.578% when pulling e5daa33 on simonrp84:sev_time_fix into 92130df on pytroll:main.

simonrp84 · 2021-11-10T21:07:52Z

Ok, I have now added nominal_start_time and nominal_end_time attributes to both the HRIT and NAT readers plus (hopefully!) updated the tests correctly with these.

you can access the attributes like so:

files = glob(f'{idir}/*.nat')
scn_nat = Scene(files, reader='seviri_l1b_native')
scn_nat.load(['IR_108], upper_right_corner='NE')
print(scn_nat['IR_108'].attrs['start_time'], scn_nat['IR_108'].attrs['end_time'])
print(scn_nat['IR_108'].attrs['nominal_start_time'], scn_nat['IR_108'].attrs['nominal_end_time'])

These will display minute values of 00, 15, 30 and 45 for nominal_end_time, while end_time will now display 12, 27, 42, 57.

I haven't updated the netcdf reader (as @ameraner suggested) simply as I'm not familiar with that reader and don't have much time to make changes in unfamiliar code. Hopefully someone else can do that!

sfinkens

Nice work, thanks a lot! I have a couple of questions/comments:

Is there a reason why the HRIT reader doesn't have properties for nominal times?
Regarding naming consistency: Should scheduled_time in the AHI HSD reader be deprecated and renamed to nominal_start_time?
Please shortly explain the nominal time attributes in the metadata chapter in the docs

satpy/readers/seviri_l1b_native.py

satpy/tests/reader_tests/test_seviri_l1b_native.py

simonrp84 · 2021-11-11T16:34:46Z

Is there a reason why the HRIT reader doesn't have properties for nominal times?

Because I couldn't find a way to access them as a user! But you raise a good point. Have updated for this now.

Regarding naming consistency: Should scheduled_time in the AHI HSD reader be deprecated and renamed to nominal_start_time?

Probably! Should we have that as a separate PR?

Please shortly explain the nominal time attributes in the metadata chapter in the docs

Will do, may take some time as I'll need to figure out how to edit the docs.

sfinkens · 2021-11-11T17:22:14Z

👍 Separate PR for AHI sounds good!

simonrp84 · 2021-11-12T15:38:21Z

Ok, I've now updated the documentation and have also added a couple of small extra tests test that exceptions are raised if a bad grid origin or earth model is found in the data.

sfinkens

Thanks! Looks good to me now! Also thanks for adding those extra tests. Just one final nitpick, then good to merge from my point of view!

sfinkens · 2021-11-16T21:42:31Z

satpy/tests/reader_tests/test_seviri_l1b_native.py

@@ -584,7 +601,7 @@ def create_test_header(earth_model, dataset_id, is_full_disk, is_rapid_scan):
                    reference_grid: {
                        'ColumnDirGridStep': column_dir_grid_step,
                        'LineDirGridStep': line_dir_grid_step,
-                        'GridOrigin': 2,  # south-east corner
+                        'GridOrigin': gridorigin,  # south-east corner


This comment is now outdated. Could you please explain in the docstring that the default value of 2 represents the south-east corner and then remove this comment?

sfinkens · 2021-11-16T21:46:29Z

Can't tell why coveralls thinks the coverage dropped...

…d NAT readers, swap the `start_time` and `end_time` in the NAT reader to match those used in the HRIT reader.

djhoese · 2022-08-03T13:45:39Z

In the semi-recent AHI work I did we defined a new time_parameters dictionary in the DataArray.attrs. I wonder if we should update this PR (how much do you want to hate me?) to use that dictionary instead of the new attributes in the root level of the .attrs dictionary.

https://satpy.readthedocs.io/en/latest/readers.html#time-metadata

sfinkens · 2022-08-03T14:29:02Z

https://satpy.readthedocs.io/en/latest/readers.html#time-metadata

Oh, this looks very nice! But maybe this can be done in a separate PR?

djhoese · 2022-08-03T14:32:48Z

I'd be OK with that. I'm a little worried people are going to get used to the new attributes here, but I'm not a user of the SEVIRI native reader so maybe that's OK.

djhoese · 2022-08-03T14:34:46Z

@sfinkens @simonrp84 What if I did it in this PR?

Edit: I noticed that CodeScene doesn't like the quality of this PR so I thought I'd clean that up and add the time_parameter changes.

sfinkens · 2022-08-03T14:35:10Z

Go ahead :)

simonrp84 · 2022-08-03T17:20:09Z

Feel free, I won't have time until tomorrow evening but can do it then if you're busy today.

djhoese · 2022-08-03T18:43:47Z

Step 1 was to fix the code quality issues. That is done now. Now I'll try to rework the time_parameters.

djhoese · 2022-08-03T18:47:44Z

@sfinkens @simonrp84 I have an...issue. The decision in this PR was that start_time/end_time should be the observation time, not the nominal time. It was suggested in the related AHI PRs that start_time/end_time should typically be the nominal time. In this way times are typically consistent between bands/files which improves caching for things like sensor angles calculated during rayleigh correction. I suppose if I changed this in the Native reader I'd also have to do this for the HRIT reader.... 😢

djhoese · 2022-08-03T19:06:56Z

At this time I have not changed what start_time/end_time are, but would like to change them to nominal times. As-is, I'd be OK merging this PR, making a satpy release, and having this discussion another time. But, if we can have the discussion now then it may make things easier in the future and not have to worry about users using different versions of satpy.

simonrp84 · 2022-08-03T20:00:19Z

This seems reasonable to me, but if we also change the HRIT file to use the planned start and end times (rather than the current use of observation start and end times) then the reader is no longer tolerant to the situation where the prologue file is missing for a timeslot: As the plannet start/end times are in the prologue whilst the observation start/end times are in the epilogue.

Would like the opinion of the met center people on this - is losing the tolerance to a missing prologue acceptable?

If we keep this PR for the native reader and deal with HRIT separately then this is not a problem, however.

…observation times.

djhoese · 2022-08-03T20:47:34Z

That is an interesting problem...however is that how the reader is actually configured to work? I mean, does the satpy reader allow for no prologue file. On mobile but I didn't think the code had a try/except around getting the nominal times so it would have failed with no prologue anyway.

pnuu · 2022-08-04T05:20:43Z

As the plannet start/end times are in the prologue whilst the observation start/end times are in the epilogue.

I thought we just use the datetime from the filenames for HRIT 🤔

Also, I at least have the segment gatherer configured so that the PRO file is always required, whether we need it or not.

mraspaud · 2022-08-04T06:26:30Z

I think atm the hrit reader won't work if any of épi or pro files are missing. We probably could change that, at the cost of having degraded metadata when these files are missing.

Regarding start and end time. We have seen that using real acquisition times can slow down the reading significantly. So I would suggest to use metadata/nominal values for these as default for all readers, and further make the acquisition time per line available as a coordinate for users that require more precise values.
The idea behind this is that the nominal times are largely sufficient for quick sorting or browsing, and that's what they are needed for in 90% of the cases. For other cases, looking at the time coordinate is OK i think.

sfinkens · 2022-08-04T07:28:11Z

@mraspaud Good point with the performance. I'm a bit torn regarding the other time attributes. On the one hand I like your idea of the scanline acquisition being the only reference. Especially if a user selects a subset of scanlines, there's no mechanism to update the attributes accordingly. On the other hand I'm not sure whether all file formats provide acquisition timestamps. So I feel like it would make sense to keep the time_parameters as described in the docs, as a compromise...

djhoese · 2022-08-04T13:32:40Z

So once I fix the test that @simonrp84 didn't update (test driven development!), it sounds like we're all OK with this being merged even if it undermines the original idea/purpose of this PR?

djhoese · 2022-08-04T13:38:19Z

Oh no. The satpos property in the native reader uses self.start_time to get the satellite position. With this change, the satellite_actual_altitude changes by ~1.5 meters which makes the tests fail. Should this use the observation time or the nominal time? If it uses the observation time then every file will have a slightly different position and may not cache well. If it uses the nominal then it is technically less accurate but more cache-able. Actually, I think I round the satellite altitude in the angle generation anyway so the caching might not be a problem. I'll change it to use observation time.

…servation time

simonrp84 · 2022-08-04T13:53:14Z

Good spot, I missed that - using the nominal times will be slightly less accurate, but no less so than the existing assumption that using the nominal times instead of actual times is OK for angle generation.

djhoese · 2022-08-04T16:47:50Z

I've modified the title and description to describe what this PR actually (currently) does. If there are no other comments today I will probably merge this as-is and we can plan on bringing the HRIT reader up to date in a future PR.

sfinkens · 2022-08-09T14:24:52Z

Thanks @djhoese !

simonrp84 requested review from mraspaud and sfinkens as code owners November 9, 2021 12:55

mraspaud requested review from ameraner and sjoro November 9, 2021 12:57

simonrp84 requested a review from djhoese as a code owner November 10, 2021 14:20

stickler-ci reviewed Nov 10, 2021

View reviewed changes

satpy/tests/reader_tests/test_seviri_l1b_native.py Outdated Show resolved Hide resolved

sjoro approved these changes Nov 10, 2021

View reviewed changes

sfinkens reviewed Nov 11, 2021

View reviewed changes

satpy/readers/seviri_l1b_native.py Outdated Show resolved Hide resolved

satpy/readers/seviri_l1b_native.py Outdated Show resolved Hide resolved

satpy/tests/reader_tests/test_seviri_l1b_native.py Outdated Show resolved Hide resolved

sfinkens reviewed Nov 16, 2021

View reviewed changes

simonrp84 closed this Aug 2, 2022

simonrp84 force-pushed the sev_time_fix branch from f24888b to 92130df Compare August 2, 2022 20:29

Add nominal_start_time and nominal_end_time to the SEVIRI HRIT an…

34836cc

…d NAT readers, swap the `start_time` and `end_time` in the NAT reader to match those used in the HRIT reader.

simonrp84 reopened this Aug 2, 2022

djhoese added bug component:readers labels Aug 3, 2022

Refactor SEVIRI L1b Native tests

7271d74

Add 'time_parameters' to SEVIRI native reader

9f3fed9

Re-update the SEV native reader to use the nominal times rather than …

a064b15

…observation times.

Fix seviri native satellite position using nominal time instead of ob…

e5daa33

…servation time

djhoese changed the title ~~Update SEVIRI native reader to select the correct scene start/end times.~~ Update SEVIRI native reader with 'time_parameters' metadata Aug 4, 2022

djhoese merged commit 700ad69 into pytroll:main Aug 5, 2022

strandgren mentioned this pull request Mar 13, 2023

Inconsistent behavior of time attributes in EUM L1 GEO readers #2409

Closed

simonrp84 deleted the sev_time_fix branch August 22, 2023 09:54

Update SEVIRI native reader with 'time_parameters' metadata #1877

Update SEVIRI native reader with 'time_parameters' metadata #1877

Conversation

simonrp84 commented Nov 9, 2021 • edited by djhoese

Original Description:

mraspaud commented Nov 9, 2021

sjoro commented Nov 9, 2021 • edited

sfinkens commented Nov 9, 2021 • edited

mraspaud commented Nov 10, 2021

mraspaud commented Nov 10, 2021

pnuu commented Nov 10, 2021

mraspaud commented Nov 10, 2021

sjoro commented Nov 10, 2021 • edited

simonrp84 commented Nov 10, 2021

ameraner commented Nov 10, 2021

codecov bot commented Nov 10, 2021 • edited

Codecov Report

sjoro left a comment

Choose a reason for hiding this comment

coveralls commented Nov 10, 2021 • edited

simonrp84 commented Nov 10, 2021

sfinkens left a comment

Choose a reason for hiding this comment

simonrp84 commented Nov 11, 2021 • edited

sfinkens commented Nov 11, 2021

simonrp84 commented Nov 12, 2021

sfinkens left a comment

Choose a reason for hiding this comment

sfinkens Nov 16, 2021 • edited

Choose a reason for hiding this comment

sfinkens commented Nov 16, 2021

djhoese commented Aug 3, 2022

sfinkens commented Aug 3, 2022

djhoese commented Aug 3, 2022

djhoese commented Aug 3, 2022 • edited

sfinkens commented Aug 3, 2022

simonrp84 commented Aug 3, 2022

djhoese commented Aug 3, 2022

djhoese commented Aug 3, 2022

djhoese commented Aug 3, 2022

simonrp84 commented Aug 3, 2022

djhoese commented Aug 3, 2022

pnuu commented Aug 4, 2022

mraspaud commented Aug 4, 2022 • edited

sfinkens commented Aug 4, 2022

djhoese commented Aug 4, 2022

djhoese commented Aug 4, 2022

simonrp84 commented Aug 4, 2022

djhoese commented Aug 4, 2022

sfinkens commented Aug 9, 2022

simonrp84 commented Nov 9, 2021 •

edited by djhoese

sjoro commented Nov 9, 2021 •

edited

sfinkens commented Nov 9, 2021 •

edited

sjoro commented Nov 10, 2021 •

edited

codecov bot commented Nov 10, 2021 •

edited

coveralls commented Nov 10, 2021 •

edited

simonrp84 commented Nov 11, 2021 •

edited

sfinkens Nov 16, 2021 •

edited

djhoese commented Aug 3, 2022 •

edited

mraspaud commented Aug 4, 2022 •

edited