-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataset Submission: HakaiCTDResearch #8
Comments
Hi @finnshort |
Hi @raytula, the QC feature is still live/working- I just gave it a once over on the development server and I don't see anything that's changed in the CTD schema in the past couple years that has affected it. I do recommend checking the results to make sure they are coming through as expected (as with any data changes). I know @fostermh and @jodiew have been working on implementing changes to the CTD flags so maybe they can comment on whether they anticipate this affecting the QC flagging feature at all. The comments can still be updated through the CTD QC excel sheets. If Jen deletes an existing comment and adds her own then the old one will be removed (not recommended). It would be better if she adds her own comment after the existing one so that both are saved. Let me know if that answers your questions! |
I was under the impression the old QC workflow was scraped. level 1 and 2 QC flags are being added to support the workflow of Jessy populating them automatically from a script. We have not moved on to anything else as we are waiting for a clearly defined workflow. To date, this has been a problem and I'm hesitant to endorse any development without it. |
@fostermh @finnshort @jodiew @raytula Sorry for the late reply on this. I think we have now a clearly defined workflow. Just to make things clear for everyone: Changes will be done on the database side to reflect the following items: Database Changes
Workflow
Let me know if you have any questions! |
@jenjax2 @finnshort mentionned that we need to have your permission to get access to the CTD qc log. This log will be use to generate Research Ready NetCDF files which will then be uploaded to the Hakai CTD Research Dataset. |
@JessyBarrette @finnshort you have my permission to get access to the CTD qc log. When you say Research Ready NetCDF files, do you mean these NETCDF files will be created after I qc the data? |
That's right Jenn, once you have a set of data QCed, a simple script will read your QC log and generate static NetCDF files with the profiles and variables you QCed as AV and the QARTOD flags is GOOD ==1. Those NetCDF files once uploaded to our server would never be changed (unless we want to), other variables could potentially be added though. This is probably the only way I think we can make sure that the reviewed data remain constant overtime even though the whole database as an example gets rebuild for whatever reason. |
@raytula @jenjax2 @trollpete A first draft of the research dataset is now available on the development ERDDAP: For the moment, I kept all the variables that will be available either on the provisional or research dataset. We will likely remove the flag columns from the research dataset since exclusively data flagged as GOOD is kept here. Please, review data and metadata. Once we all agree on the variables and associated attributes, this will be carried over to the provisional dataset. The example, regroup all the dataset and associated variables which were flagged as AV by the reviewer and associated with QARTOD Flag =1 [GOOD]. I will make a little jupyter notebook to review the conflicting results between the reviewer and QARTOD flags in the next few days. |
I forgot to mention you can also review the generated NetCDF files which are used behind ERDDAP and accessible here: The objective is to have a long-term file format that regroups all the information for each profile within a single file. This may evolve a bit over the near future, but hopefully we'll get something pretty stable soonish. |
I switched this dataset from a view to a table that is recreated nightly. This is to avoid issues with rebuilds in the EIMS system, now it has no connection to EIMS tables (eg they could be dropped without affecting erddap). Since CTD data doesn't come in that quick anyways I figured only updating nightly won't bother people. See 5bcb6d9 |
The ERDDAP links @JessyBarrette provided are not currently working. Is that perhaps due to the delayed EIMS rebuild today? |
@raytula sorry that was my bad the dataset is back online. FYI Goose ERDDAP restart every 15mins and this dataset is among the last ones to appears when the servers refresh. |
goose ERDDAP should only restart if there are changes to |
@jenjax2 @raytula The research CTD profile dataset is now available on goose erddap: Since we're only presenting data flagged as 1 and reviewed. We may want to omit any of the flag columns. |
This dataset is using netcdf files generated for each profiles are available here: https://goose.hakai.org/erddap/files/HakaiWaterPropertiesInstrumentProfileResearch/ Grouped by work_area/station |
Looks good to me. Yes, no need for flag columns from the research datasets. |
FYI. The link was working yesterday, but not today (I think)
|
@raytula thanks! I made a mistake last night while trying to remove the flags from the dataset.xml. Should be back on in the next 15min or so |
Great. Thanks @JessyBarrette |
Thanks @JessyBarrette ! |
This dataset is now available online. We can close that issue! |
A DOI was added to this metadata record https://doi.org/10.21966/6cz5-6d70 This will also get added to the ERDDAP datasets |
Hakai Dataset Submission
Below are listed all the different steps related to the initial submission of a dataset.
A more detailed written and visual description of every step is available respectively
here and here.
Submission steps
Initial Submission (Data Administrator)
ERDDAP Dataset Creation (Data Integrator)
Dataset Review (Data Administrator)
Dataset Completion (Data Integrator)
The text was updated successfully, but these errors were encountered: