Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GML 100-m ozonesondes #159

Merged
merged 51 commits into from
Apr 12, 2024
Merged

GML 100-m ozonesondes #159

merged 51 commits into from
Apr 12, 2024

Conversation

zmoon
Copy link
Member

@zmoon zmoon commented Feb 6, 2024

Initial reader for GML ozonesondes, currently only supporting the 100-m files.

Thanks to Owen Cooper (CSL), Bryan Johnson (GML), and Irena for helping with interpretation.

xref: NOAA-CSL/MELODIES-MONET#199

@zmoon
Copy link
Member Author

zmoon commented Feb 6, 2024

@bbakernoaa @rschwant I do have some questions about the variables (in the col_info defn near the bottom of the file), as I couldn't find a data readme or such.

@rschwant
Copy link
Contributor

rschwant commented Feb 7, 2024

Let me ask others at NOAA CSL who have used these files a lot whether they have a readme for this dataset. I'll copy you on an email. They are not on GitHub.

@bbakernoaa
Copy link
Member

bbakernoaa commented Feb 7, 2024 via email

more strict, but should be faster, and luckily the two cases
seem to cover all
besides this one and the bad data line one, all others load now
(tested on Hopper)
@zmoon
Copy link
Member Author

zmoon commented Feb 8, 2024

Now all of the 11k or so files load except two.

Some questions:

  • for selection by site in add_data() and discover_files(), do we like place or station or site or something else?
  • do we like the data variable names (mostly given in COL_INFO_L100 definition) and long names?

@bbakernoaa @rschwant @iri01 would you be able to take a look?

found a few cases of 99.9
based on email from Bryan Johnson (NOAA GML)
for when Dask checks for consistency
@zmoon zmoon marked this pull request as ready for review February 14, 2024 18:47
@zmoon
Copy link
Member Author

zmoon commented Feb 14, 2024

The altitude diffs are mostly 0.1 (100 m; 99.67%), but not all. For combined files, e.g. for MM, may want to stick to data on those cleaner altitudes (3.5, 3.6, 3.7, ...), maybe storing the surface level data in separate variables. Although, the less-clean altitudes are mostly in the lower troposphere.

From Bryan:

The first line is the actual elevation then the 100 meter intervals begin for the rest of the flight. So the spacing will only be different from surface to first interval.

Also, for combined files, note that there are some cases of same launch time/place (e.g. one station launching two at the same time). The flight number can be used to differentiate them.

otherwise can't differentiate launches at same time and place
@zmoon
Copy link
Member Author

zmoon commented Feb 22, 2024

"University of Rhode Island" should maybe be normalized to "Narragansett, Rhode Island".

And perhaps "Mauna Loa Observatory, Hawaii" to "Hilo, Hawaii"? Seems like it only has 2 soundings, in 2002, and the launch loc isn't that far away.

@iri01
Copy link

iri01 commented Feb 27, 2024

This looks great!
Would it be possible to include humidity in the variables, provided it is a reliable measurement?

On your earlier questions:

  • for selection by site in add_data() and discover_files(), do we like place or station or site or something else?

I like station or, if used in other search criteria, either place or location

  • do we like the data variable names (mostly given in COL_INFO_L100 definition) and long names?

I like:
"o3_mpa" for ozone partial pressure

ColInfo("o3_press", "ozone partial pressure", "mPa", "99.90"),

"o3_ppm" for ozone mixing ratio in ppmv

ColInfo("o3", "ozone mixing ratio", "ppmv", "99.999"),

"o3_col" for ozone partial column

ColInfo("o3_int", "integrated ozone below", "atm-cm", "99.9990")

"o3_du" for ozone total column in DU

fancier sounding

not using 'siteid' since one folder's data can have multiple
unique 'station' values (akin to 'siteid')
@zmoon
Copy link
Member Author

zmoon commented Apr 11, 2024

@bbakernoaa I changed place arg to location (not siteid since a location folder's data may have multiple unique values of 'station'). But then I changed 'station' to 'siteid' in the add_data() output for MONETIO-ness.

@bbakernoaa
Copy link
Member

bbakernoaa commented Apr 11, 2024 via email

@zmoon zmoon merged commit 9f246d9 into noaa-oar-arl:develop Apr 12, 2024
7 checks passed
@zmoon zmoon deleted the gml-ozonesonde branch April 12, 2024 00:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants