
Dataset Submission: HakaiPruthMooringProvisional #17

Closed · 11 of 12 tasks
raytula opened this issue Mar 2, 2021 · 24 comments

@raytula
Contributor

raytula commented Mar 2, 2021

Hakai Dataset Submission

Below are all the steps involved in the initial submission of a dataset.

A more detailed written and visual description of each step is available here and here, respectively.

Submission steps

Initial Submission (Data Administrator)

  • Original Data Submission
  • CIOOS Metadata Form completed

ERDDAP Dataset Creation (Data Integrator)

  • Dataset Transformation (Format label)
    • 🟢 Format Compatible
    • 🟡 Format Minor Revisions
    • 🟠 Format Major Revisions
    • 🔴 Format Incompatible/Missing Information
  • Near Real-time Data Integration
  • QARTOD Integration
  • ERDDAP Integration
  • ERDDAP Dataset Documentation
  • ERDDAP Test Locally
  • Add Dataset to Development Branch

Dataset Review (Data Administrator)

  • Metadata Record
  • CKAN (dev)
  • Dataset Development Branch Revision (Reviewer Label)
    • 🟢 Reviewer Approved
    • 🟡 Reviewer Minor Revisions
    • 🟠 Reviewer Major Revisions

Dataset Completion (Data Integrator)

  • Merge Development Dataset to Production Branch
  • COMPLETED
@raytula raytula self-assigned this Mar 2, 2021
@raytula
Contributor Author

raytula commented Mar 2, 2021

The real-time data from this node is available through a database view.

Here you can view the daily data.

                          View "sn.PruthMooring:5minuteSamples"
             Column              |           Type           | Collation | Nullable | Default 
---------------------------------+--------------------------+-----------+----------+---------
 measurementTime                 | timestamp with time zone |           |          | 
 PruthMooring:WaterTemp_0m_QL    | smallint                 |           |          | 
 PruthMooring:WaterTemp_0m_QC    | character varying        |           |          | 
 PruthMooring:WaterTemp_0m_UQL   | integer                  |           |          | 
 PruthMooring:WaterTemp_0m_Med   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Avg   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Min   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Max   | double precision         |           |          | 
 PruthMooring:WaterTemp_0m_Std   | double precision         |           |          | 
...
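For reference, pulling one depth's series out of this wide view means naming each per-depth column explicitly. An illustrative query only (identifiers as in the psql description above; names containing ':' must be double-quoted in PostgreSQL):

-- 0 m temperature average and aggregated QC flag for recent data
select "measurementTime",
       "PruthMooring:WaterTemp_0m_Avg",
       "PruthMooring:WaterTemp_0m_UQL"
from sn."PruthMooring:5minuteSamples"
where "measurementTime" > '2021-01-01'
order by "measurementTime";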

@raytula
Contributor Author

raytula commented Mar 2, 2021

@JessyBarrette now that the automated QC configuration has been improved a bit, I'm looking at how to create an ERDDAP dataset for this.

I'm currently thinking that we should share the five-minute data, as we did for many of the other datasets mapped from the sensor network database.

However, I'm not sure how to handle the measurements at different depths properly. That is, can/should we continue to define a set of dataset fields for each measurement (i.e. five-minute average or median value, UQL level, secondary QC flag) and assign a depth value to each measurement, or will we need to handle the measurement at each depth as a separate ERDDAP record? Not sure if I'm explaining this right. Maybe we should discuss.

@raytula raytula closed this as completed Mar 2, 2021
@JessyBarrette JessyBarrette reopened this Mar 2, 2021
@JessyBarrette
Collaborator

dataset or datasetS
That is a good question. My first intuition would be to follow what we've done with other datasets and create one ERDDAP dataset, perhaps even a single dataset covering both sites (though that may still be up for debate with the CIOOS Metadata group). Combining sites may not be the simplest approach, though, if we're interfacing ERDDAP with a view of the database.

Sample Rate
Yes, I think it would be best to just provide the finest sampling rate available. However, I believe the tidbits themselves are sampled every 10 minutes, with some interpolation (or something similar) up to the 5-minute rate. The derived statistics likely aren't very meaningful in this specific case, since they are based on just a single value every 10 minutes.

Sort variables
I think it would be best to stack each temperature and its associated stats and flags into single variables, and create a depth variable that gives each record's specific depth. That will definitely make the data easier to handle through ERDDAP, like any other dataset. Ideally it would also be good to capture the serial numbers of the tidbits; I'm not sure whether those are stored in the database. All of this may be a big issue to deal with when working directly from the database; it would be easy to handle by generating NetCDFs, but that may just be because I'm more familiar with the NetCDF ERDDAP datasets.

@raytula
Contributor Author

raytula commented Mar 3, 2021

Thanks @JessyBarrette I was thinking of one ERDDAP dataset.

Re: "I think it would be best to stack each temperature and its associated stats and flags into single variables, and create a depth variable that gives each record's specific depth. That will definitely make the data easier to handle through ERDDAP, like any other dataset."

That is the key thing I was wondering about. @n-a-t-e, do you know if/how we can transform the results returned from the database to do this? Specifically, the database will return separate columns for the measurements at different depths, and we would like the ERDDAP dataset to break them out into (time, depth, measurement, QC) columns instead, assuming I understand @JessyBarrette correctly. A representative database view is PruthMooring:5minuteSamples.

@n-a-t-e
Member

n-a-t-e commented Mar 3, 2021

We would have to sort this out at the database level, creating views with one depth column instead of a column for each depth, and then connect that to ERDDAP. Should be easy enough to do

@raytula
Contributor Author

raytula commented Mar 3, 2021

We would have to sort this out at the database level, creating views with one depth column instead of a column for each depth, and then connect that to ERDDAP. Should be easy enough to do

Cool. I'm a database view novice and have not done that type of thing before. Can you take a crack at defining such a database view? I'm thinking it would be helpful to have a workflow for defining custom views that ERDDAP uses and that are automatically (re)created whenever the underlying tables are (re)created.

@JessyBarrette
Collaborator

JessyBarrette commented Mar 3, 2021 via email

@raytula
Contributor Author

raytula commented Mar 3, 2021

Ideally, you would also need to generate site, lat, long, and source file name variables. QU5 has been at the same site the whole time, while Pruth moved to another location after one year. I can get the official locations for you.


Having the database view populate and return the latitude and longitude may be a good approach, assuming it could populate those fields with different values based on date/time. In a few cases (e.g. the KC Buoy) the lat/long are stored in the SN database and can be passed through to ERDDAP. In many other cases the lat/long are fixed and could be added into the view as such (i.e. most terrestrial sensor nodes that don't move).

@JessyBarrette
Collaborator

JessyBarrette commented Mar 3, 2021 via email

@n-a-t-e
Member

n-a-t-e commented Mar 3, 2021

As @JessyBarrette is saying, we can set the lat/long to fixed values right in datasets.xml when appropriate. So it could be in the view but wouldn't have to be
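For illustration only (not the actual configuration for this dataset): I believe ERDDAP treats a dataVariable whose sourceName starts with "=" as a fixed value, so a static position could look roughly like the fragment below in datasets.xml; the coordinates here are example values only.

<!-- hypothetical fragment: fixed-value latitude/longitude set directly in datasets.xml -->
<dataVariable>
    <sourceName>=51.65635</sourceName>
    <destinationName>latitude</destinationName>
    <dataType>double</dataType>
</dataVariable>
<dataVariable>
    <sourceName>=-128.0914833</sourceName>
    <destinationName>longitude</destinationName>
    <dataType>double</dataType>
</dataVariable>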

@raytula
Contributor Author

raytula commented Mar 3, 2021

As @JessyBarrette is saying, we can set the lat/long to fixed values right in datasets.xml when appropriate. So it could be in the view but wouldn't have to be

Can that be done based on time (i.e. like with the Pruth Mooring, where the mooring was relocated after the first year)? If so, cool.

@JessyBarrette
Collaborator

JessyBarrette commented Mar 3, 2021 via email

@n-a-t-e
Member

n-a-t-e commented Mar 3, 2021

@JessyBarrette oh right, forgot that you are combining sites.

I created a view sn_unpivot."PruthMooring:5minuteSamples" with columns: measurementTime, depth, watertemp_ql, watertemp_qc, watertemp_uql, watertemp_med, watertemp_avg, watertemp_min, watertemp_max, watertemp_std. We can create a test ERDDAP dataset based on this and then set it up to auto-recreate as needed.
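For reference, such an unpivoted view can be written as a UNION ALL over the per-depth column groups. A minimal sketch, assuming the other depths follow the same "..._10m_..." naming pattern as the 0 m columns shown earlier (the actual definition in the database may differ):

create or replace view sn_unpivot."PruthMooring:5minuteSamples" as
select "measurementTime",
       0 as depth,
       "PruthMooring:WaterTemp_0m_QL"  as watertemp_ql,
       "PruthMooring:WaterTemp_0m_QC"  as watertemp_qc,
       "PruthMooring:WaterTemp_0m_UQL" as watertemp_uql,
       "PruthMooring:WaterTemp_0m_Med" as watertemp_med,
       "PruthMooring:WaterTemp_0m_Avg" as watertemp_avg,
       "PruthMooring:WaterTemp_0m_Min" as watertemp_min,
       "PruthMooring:WaterTemp_0m_Max" as watertemp_max,
       "PruthMooring:WaterTemp_0m_Std" as watertemp_std
from sn."PruthMooring:5minuteSamples"
union all
select "measurementTime",
       10 as depth,
       "PruthMooring:WaterTemp_10m_QL",
       "PruthMooring:WaterTemp_10m_QC",
       "PruthMooring:WaterTemp_10m_UQL",
       "PruthMooring:WaterTemp_10m_Med",
       "PruthMooring:WaterTemp_10m_Avg",
       "PruthMooring:WaterTemp_10m_Min",
       "PruthMooring:WaterTemp_10m_Max",
       "PruthMooring:WaterTemp_10m_Std"
from sn."PruthMooring:5minuteSamples";
-- ...and so on, one select per instrument depth; output column names come from the first select.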

@raytula
Contributor Author

raytula commented Mar 3, 2021

Nice. I tried out the following queries and they worked well.

hakai=> select * from sn_unpivot."PruthMooring:5minuteSamples" where "measurementTime">'2020-06-01';
hakai=> select * from sn_unpivot."PruthMooring:5minuteSamples" where "measurementTime">'2020-06-01' and depth=10;

As a general approach, perhaps we should put the database view definitions in this repo and automatically run them when the sensor network database changes, like you are doing now. I can imagine a bunch of situations where explicitly defined database views will work better than auto-generated ones. I'm not sure about performance/tuning, but I imagine you can do things to address any performance issues.
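On the performance point: if querying a plain view ever turns out to be too slow for ERDDAP, one option is a materialized copy that the same workflow refreshes after each rebuild. A sketch only; the "PruthMooring:5minuteSamplesMaterialized" name is made up:

-- hypothetical materialized copy of the unpivoted view, refreshed after each rebuild
create materialized view if not exists sn_unpivot."PruthMooring:5minuteSamplesMaterialized" as
  select * from sn_unpivot."PruthMooring:5minuteSamples";

refresh materialized view sn_unpivot."PruthMooring:5minuteSamplesMaterialized";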

@JessyBarrette
Collaborator

Here are the mooring locations over time:

Site  | Start time              | End time                | Latitude        | Longitude       | Latitude (decimal °N) | Longitude (decimal °E)
------|-------------------------|-------------------------|-----------------|-----------------|-----------------------|-----------------------
Pruth | 2017-12-03 00:00:00 PST | 2018-11-15 15:00:00 PST | 51° 39.3810' N  | 128° 05.4890' W | 51.65635              | -128.0914833
Pruth | 2018-11-15 15:00:00 PST | ongoing                 | 51° 39.126' N   | 128° 05.122' W  | 51.6521               | -128.0853667
QU5M  | 2018-10-05 00:00:00 PST | ongoing                 | 50° 07.20082' N | 125° 12.7317' W | 50.12001367           | -125.212195
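If the positions were to come from the view rather than datasets.xml, a time-based CASE over the relocation date above is one way to do it. An illustrative sketch only, with PST written as an explicit UTC-8 offset:

-- lat/long switch at the Pruth relocation time (2018-11-15 15:00 PST)
select "measurementTime",
       case when "measurementTime" < timestamptz '2018-11-15 15:00:00-08'
            then 51.65635 else 51.6521 end as latitude,
       case when "measurementTime" < timestamptz '2018-11-15 15:00:00-08'
            then -128.0914833 else -128.0853667 end as longitude,
       depth,
       watertemp_avg
from sn_unpivot."PruthMooring:5minuteSamples";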

@JessyBarrette
Collaborator

@raytula Just to make things clear regarding those two Tidbits Mooring datasets: should we group them together as one dataset, or keep the two sites separate?

@raytula
Contributor Author

raytula commented Mar 8, 2021

@raytula Just to make things clear regarding those two Tidbits Mooring datasets: should we group them together as one dataset, or keep the two sites separate?

Hmm. Not sure. I would be inclined to keep them separate, so let's start with that. Nate will be putting the related database views in this repo and working with you to define the datasets.xml fragments to get those views integrated into ERDDAP.

If we were to combine them, would it make sense to bundle it with the Provisional River's Inlet mooring data?

@JessyBarrette
Collaborator

JessyBarrette commented Mar 8, 2021 via email

@JessyBarrette
Collaborator

@raytula
Contributor Author

raytula commented Mar 17, 2021

Looks good to me.

@JessyBarrette
Collaborator

The metadata form was updated to reflect the provisional aspect

@raytula
Contributor Author

raytula commented Mar 17, 2021

Yeah, nice to see that graph. I kind of feel like ERDDAP is, in part, a much more functional version of the Hakai sensor network tool, though neither is super easy to use without spending some time exploring, or without someone preparing some great examples.

[image]

@JessyBarrette
Collaborator

This dataset is now available online. We can close this issue!
