Recipe testing and comparison for release 2.7.0 #2881

valeriupredoi · 2022-10-25T13:48:19Z

Sister and logical evolution of #2852 - I am commencing testing and comparison of recipes and recipes results in order to release 2.7.0 at the end of this week (hopefully). System parameters below, work done on DKRZ/Levante: submit files in /home/b/b382109/submit, output in /scratch/b/b382109/esmvaltool_output

System and settings

`conda`/`mamba`

(base) mamba --version
mamba 0.27.0
conda 22.9.0

Git branch and state

Date: 25 October 2022 14:22 BST

(base) git status
On branch release_270stable
Your branch is up to date with 'origin/release_270stable'.

nothing to commit, working tree clean

Environment

On Levante:

mamba env create -n tool270Test -f environment.yml
conda activate tool270Test

Environment file

ToolEnv270Test.yml

Extraneous file movements

I moved the autoassess-specific files to /home/b/b382109/autoassess_files - run was succesful for AA recipes then 👍

Ad-hoc hacks (code changes)

/home/b/b382109/ESMValTool/esmvaltool/diag_scripts/land_carbon_cycle/diag_global_turnover.py l.278 change .outline_patch with .spines["geo"] as suggested by @zklaus in Recipe recipe_carvalhais14nat.yml fails at plotting in diagnostic #2886 (comment) (cheers, dude!) - this will have to be PR-ed

Mods to config user file

Added DKRZ downloaded data pool as:

  CMIP6:
    - /work/bd0854/DATA/ESMValTool2/CMIP6_DKRZ
    - /work/bd0854/DATA/ESMValTool2/download/CMIP6
  CMIP5:
    - /work/bd0854/DATA/ESMValTool2/CMIP5_DKRZ
    - /work/bd0854/b309141/additional_CMIP5
    - /work/bd0854/DATA/ESMValTool2/download/cmip5/output1
    - /work/bd0854/DATA/ESMValTool2/download/cmip5

as @schlunma and @remi-kazeroni have suggested 🍺

Recipe runs

Recipe runs results (as of final on 27 October 2022) are listed in #2881 (comment) (with very many thanks to @remi-kazeroni for running the impossible to run ones!) and are as follows:

122(121)*/127 successfully run recipes
0(1)*/127 failed with Diagnostic error, but fixed and rerun, but not yet PR-ed with the fix
2/127 that are missing data (for reals)
3/127 that have various issues (not missing data and not DiagnosticError)

(*) means not counting/counting the one that had a DiagnosticError but was fixed but not PR-ed

Running the comparison

Login and access to the DKRZ esmvaltool VM

Results from recipe runs are stored on the VM; login with:

ssh youraccount@esmvaltool.dkrz.de

Get and install miniconda on VM

E.g. scp Miniconda3-py39_4.12.0-Linux-x86_64.sh b382109@esmvaltool.dkrz.de:~ from a file already on Levante.

Setting up the input files

If you wrote recipe runs output to Levante /scratch partition be aware that
the data will be removed after two weeks, so you will have to move the output data
to the /work partition, via e.g. a nohup job:

nohup cp -r /scratch/b/b382109/esmvaltool_output/* /work/bd0854/b382109/v270

/work is visible by the VM so you can run the compare tool straight on the VM.

NOTE do not store final release results on the VM including /preproc/ dirs, the total
size for all the recipes output, including /preproc/ dirs is in the 4.5TB ballpark,
much too high for the VM storage capacity

Running compare tool at VM

run date: 28 October 2022 (1st run)
conda env: tool270Compare
ESMValTool branch: release270stable
prerquisite: pip install imagehash

Input/output/run

current: /work/bd0854/b382109/v270 (contains preproc/ dirs too, 122 recipes)
reference: /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4 (does not contain preproc/ dirs)
cmd: nohup python ESMValTool/esmvaltool/utils/testing/regression/compare.py /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4 /work/bd0854/b382109/v270 > compare270output.txt

Sanity check, as outputted by compare.py

Comparing recipe run(s) in:
/work/bd0854/b382109/v270
to reference in /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4

First pass result

Running the compare.py results in a few recipes not-OK (NOK) wrt plots differing from previous release v2.6.0, summary in #2881 (comment)

Detailed plots inspection

Plots that differ for the 34 recipes that have them different is happening in #2881 (comment)

The text was updated successfully, but these errors were encountered:

valeriupredoi · 2022-10-25T14:21:57Z

@sloosvel I am in dire pain after realizing blithering DKRZ's SLURM emails me for every recipe 😵‍💫

valeriupredoi · 2022-10-25T14:54:37Z

@sloosvel what's these jobs up to?

(tool270Test) squeue -u b382109
             JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
           2378977   compute recipe_z  b382109 PD       0:00      1 (AssocMaxJobsLimit)
           2378976   compute recipe_w  b382109 PD       0:00      1 (AssocMaxJobsLimit)
           2378975   compute recipe_w  b382109 PD       0:00      1 (AssocMaxJobsLimit)
           2378974   compute recipe_w  b382109 PD       0:00      1 (AssocMaxJobsLimit)

sloosvel · 2022-10-25T14:57:37Z

@sloosvel I am in dire pain after realizing blithering DKRZ's SLURM emails me for every recipe face_with_spiral_eyes

You can comment that if it's not useful to you, to me it was!

@sloosvel what's these jobs up to?

I think there is a limit in number of jobs an account can run simultaneously in levante. They will be pending until other jobs finish I guess

remi-kazeroni · 2022-10-25T14:59:29Z

@sloosvel what's these jobs up to?

On Levante, a user can't have more than 20 Slurm jobs running at a time. As soon as a job is finished, the next one should start

valeriupredoi · 2022-10-25T14:59:50Z

They will be pending until other jobs finish I guess

Cheers! More emails then 🤦‍♂️ 🤣

valeriupredoi · 2022-10-26T11:18:45Z

OK guys - first (and only) sbatch session over on Levante (I have one stray recipe still running, it's a zombie though) and this is how it looks:

Recipe running session 2022-10-26 13:13:41.568698

Succesfully run recipes

122 out of 127 final

recipe_anav13jclim.yml by @remi-kazeroni
recipe_albedolandcover.yml
recipe_arctic_ocean.yml
recipe_autoassess_landsurface_permafrost.yml
recipe_autoassess_landsurface_soilmoisture.yml
recipe_autoassess_landsurface_surfrad.yml
recipe_autoassess_radiation_rms_Amon_all.yml
recipe_autoassess_radiation_rms_Amon_obs.yml
recipe_autoassess_stratosphere.yml
recipe_bock20jgr_fig_1-4.yml by @remi-kazeroni
recipe_bock20jgr_fig_6-7.yml
recipe_bock20jgr_fig_8-10.yml
recipe_capacity_factor.yml
recipe_carvalhais14nat.yml
recipe_climwip_brunner2019_med.yml by @remi-kazeroni
recipe_climwip_brunner20esd.yml
recipe_climwip_test_basic.yml
recipe_climwip_test_performance_sigma.yml
recipe_clouds_bias.yml
recipe_clouds_ipcc.yml
recipe_cmug_h2o.yml
recipe_collins13ipcc.yml by @remi-kazeroni
recipe_combined_indices.yml
recipe_concatenate_exps.yml
recipe_consecdrydays.yml
recipe_correlation.yml
recipe_cox18nature.yml
recipe_cvdp.yml
recipe_daily_era5.yml
recipe_deangelis15nat.yml
recipe_deangelis15nat_fig1_fast.yml
recipe_decadal.yml
recipe_diurnal_temperature_index.yml
recipe_eady_growth_rate.yml
recipe_ecs.yml
recipe_ecs_constraints.yml
recipe_ecs_scatter.yml
recipe_ensclus.yml
recipe_era5-land.yml
recipe_esacci_lst.yml
recipe_esacci_oc.yml
recipe_extract_shape.yml
recipe_extreme_events.yml
recipe_extreme_index.yml
recipe_eyring06jgr.yml
recipe_eyring13jgr_12.yml
recipe_gier2020bg.yml
recipe_globwat.yml
recipe_heatwaves_coldwaves.yml
recipe_hydro_forcing.yml
recipe_hyint.yml
recipe_hyint_extreme_events.yml
recipe_hype.yml
recipe_impact.yml by @remi-kazeroni
recipe_ipccwg1ar6ch3_atmosphere.yml
recipe_julia.yml
recipe_kcs.yml
recipe_landcover.yml
recipe_lauer13jclim.yml
recipe_li17natcc.yml
recipe_lisflood.yml
recipe_marrmot.yml
recipe_martin18grl.yml
recipe_meehl20sciadv.yml
recipe_miles_block.yml
recipe_miles_eof.yml
recipe_miles_regimes.yml
recipe_modes_of_variability.yml
recipe_monitor.yml
recipe_monitor_with_refs.yml
recipe_mpqb_xch4.yml
recipe_multimodel_products.yml
recipe_my_personal_diagnostic.yml
recipe_ncl.yml
recipe_ocean_Landschuetzer2016.yml
recipe_ocean_amoc.yml
recipe_ocean_bgc.yml
recipe_ocean_example.yml
recipe_ocean_ice_extent.yml
recipe_ocean_multimap.yml
recipe_ocean_quadmap.yml
recipe_ocean_scalar_fields.yml
recipe_pcrglobwb.yml
recipe_preprocessor_derive_test.yml
recipe_preprocessor_test.yml
recipe_psyplot.yml
recipe_pv_capacity_factor.yml
recipe_python.yml
recipe_quantilebias.yml
recipe_perfmetrics_CMIP5.yml by @remi-kazeroni
recipe_perfmetrics_CMIP5_4cds.yml by @remi-kazeroni
recipe_r.yml
recipe_radiation_budget.yml
recipe_rainfarm.yml
recipe_runoff_et.yml
recipe_russell18jgr.yml
recipe_schlund20esd.yml
recipe_schlund20jgr_gpp_abs_rcp85.yml
recipe_schlund20jgr_gpp_change_1pct.yml
recipe_schlund20jgr_gpp_change_rcp85.yml
recipe_sea_surface_salinity.yml
recipe_seaice.yml by @remi-kazeroni
recipe_seaice_drift.yml
recipe_seaice_feedback.yml
recipe_shapeselect.yml
recipe_smpi.yml
recipe_smpi_4cds.yml
recipe_snowalbedo.yml
recipe_spei.yml
recipe_tcr.yml
recipe_tebaldi21esd.yml
recipe_thermodyn_diagtool.yml
recipe_toymodel.yml
recipe_validation.yml
recipe_validation_CMIP6.yml
recipe_variable_groups.yml
recipe_wenzel14jgr.yml
recipe_wenzel16jclim.yml
recipe_wenzel16nat.yml
recipe_wflow.yml
recipe_williams09climdyn_CREM.yml
recipe_zmnam.yml

Recipes that failed with DiagnosticError

0 out of 127 (1 fixed, not PR-ed yet)

recipe_carvalhais14nat.yml - Recipe recipe_carvalhais14nat.yml fails at plotting in diagnostic #2886 - @zklaus suggestion from Recipe recipe_carvalhais14nat.yml fails at plotting in diagnostic #2886 (comment) fixes the problem!

Recipes that failed of Missing Data

2 out of 127 final

recipe_check_obs.yml - comment by @remi-kazeroni - It was missing one variable for MERRA2. I don't know why but I fixed that. If you rerun it, you will encounter some missing derived ERA5 data. See Variable information from recipes overwritten by cmor table info when deriving custom variables ESMValCore#1388, we never took the time to fix that
recipe_climate_change_hotspot.yml

Recipes that failed of other reasons

3 out of 127 final

recipe_autoassess_radiation_rms_cfMon_all.yml - clisccp bugger handled by @alistairsellar OBS Tier1/ISCCP/clisccp_ISCCP_L3_V1.0_* on esmeval/JASMIN is slightly dodgy according to esmvaltool's CMOR checks ESMValCore#1238
recipe_perfmetrics_land_CMIP5.yml (run by @remi-kazeroni ) known issue recipe_perfmetrics_land_CMIP5 fails in multi_model_statistics step #2594
recipe_flato13ipcc.yml - comment by @remi-kazeroni and confirmed by @katjaweigel - I think this is not runnable at the moment but should be fixed by @katjaweigel in Splitting of flato13ipcc.yml into separate recipes and adding recipes for regional Figures #2156 (good luck, Katja!)

Obsolete/resolved issues comment:

The Julia ones are totally my bad - forgot to install Julia after installing esmvaltool, the autoassess ones are either of the old bug that @alistairsellar is fixing now, or they need aux data that is only on JASMIN, the ones of Missing Data are bothering me badly - since I have turned on auto downloads but they are still missing data, what do you guys recommend doing about those? @sloosvel @remi-kazeroni @bouweandela ? I will post detailed postmortems for the ones that have failed for odd reasons below 👍

valeriupredoi · 2022-10-26T11:38:13Z

Postmortem of failed recipes OTHER THAN Missing Data

Recipes that failed with DiagnosticError

0 out of 127 (1 fixed, not yet PR-ed)

recipe_carvalhais14nat.yml - Recipe recipe_carvalhais14nat.yml fails at plotting in diagnostic #2886 - @zklaus suggestion from Recipe recipe_carvalhais14nat.yml fails at plotting in diagnostic #2886 (comment) fixes the problem!

Recipes that failed of other reasons or are still running

1 out of 127

recipe_autoassess_radiation_rms_cfMon_all.yml - clisccp bugger handled by @alistairsellar OBS Tier1/ISCCP/clisccp_ISCCP_L3_V1.0_* on esmeval/JASMIN is slightly dodgy according to esmvaltool's CMOR checks ESMValCore#1238

remi-kazeroni · 2022-10-26T12:54:45Z

Hi @valeriupredoi, great job with the testing! I forgot to mention but we have a central pool of downloaded data on Levante at /work/bd0854/DATA/ESMValTool2/download/CMIP6, /work/bd0854/DATA/ESMValTool2/download/cmip5/output1, and /work/bd0854/DATA/ESMValTool2/download/cmip5/output1. Maybe you could add those to your path on top of your download directory? This should help solving the time limit issues (lots of fx files searched on ESGF and/or downloaded I guess).

remi-kazeroni · 2022-10-26T13:00:12Z

recipe_smpi.yml - too slow Elapsed time : 04:00:19 (Timelimit=04:00:00)

For this one, I would recommend using:

#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --constraint=512G

valeriupredoi · 2022-10-26T13:04:44Z

Indeed, cheers @remi-kazeroni - smpi is a memory gobbler - I restarted it on SLURM and promptly got kicked out coz mem limit (this time around I think all data has been downloaded, hence it went to intensive processing). I'll resubmit with mem reqs. What do you recommend about those that really-really are missing data?

valeriupredoi · 2022-10-26T13:09:10Z

recipe_smpi.yml - too slow Elapsed time : 04:00:19 (Timelimit=04:00:00)

For this one, I would recommend using:
#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --constraint=512G

even with 512G still fails out of MEM 😮

valeriupredoi · 2022-10-26T13:13:42Z

oh crap, forgot to change the partition 😶‍🌫️

remi-kazeroni · 2022-10-26T13:14:00Z

recipe_smpi.yml - too slow Elapsed time : 04:00:19 (Timelimit=04:00:00)

For this one, I would recommend using:
#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --constraint=512G
even with 512G still fails out of MEM 😮

You can try with 1024G then! But that's the highest available

valeriupredoi · 2022-10-26T13:15:22Z

recipe_smpi.yml - too slow Elapsed time : 04:00:19 (Timelimit=04:00:00)

For this one, I would recommend using:
#SBATCH --partition=compute
#SBATCH --time=08:00:00
#SBATCH --constraint=512G
even with 512G still fails out of MEM open_mouth
You can try with 1024G then! But that's the highest available

totally user-side - forgot to change the partition to compute - cheers, dude! 🍺

sloosvel · 2022-10-26T13:16:54Z

I never managed to run the smpi recipes, @remi-kazeroni did it for me in the last release. Maybe the batch script settings for this recipe can be changed in #2883

valeriupredoi · 2022-10-26T13:21:48Z

with correct SLURM settings as recommended by @remi-kazeroni (:beer:) those smpi monsters are happily plodding along now - yes, we should change the settings for sure. @sloosvel how did you fix the runs for those recipes that really-really dont have data, like I found in #2881 (comment)

remi-kazeroni · 2022-10-26T13:44:29Z

I don't have a definitive answer for the really-really missing data cases. As said in this comment, you could try to rerun the recipes adding these paths to you config file. But that data pool is 2 releases old. One could argue that we should delete it and re-download everything as /work/bd0854/DATA/ESMValTool2/download/ may contain data retracted from ESGF...

Taking a closer look at some of these (currently) 13 cases:

recipe_check_obs.yml (my favourite one!)-> It was missing one variable for MERRA2. I don't know why but I fixed that. If you rerun it, you will encounter some missing derived ERA5 data. See Variable information from recipes overwritten by cmor table info when deriving custom variables ESMValCore#1388, we never took the time to fix that
recipe_anav13jclim.yml -> I think this is a special case that needs cmip5/output2 data. You could retry with /work/bd0854/DATA/ESMValTool2/download/cmip5/output2
recipe_climate_change_hotspot.yml -> Maybe @sloosvel and @pepcos could say more. I thought this issue was recently fixed...
recipe_flato13ipcc.yml -> I think this is not runnable at the moment but should be fixed by @katjaweigel in Splitting of flato13ipcc.yml into separate recipes and adding recipes for regional Figures #2156.
recipe_meehl20sciadv.yml and recipe_schlund20esd.yml -> Maybe the recipe maintainer @schlunma could take a look 🍺
recipe_perfmetrics_*yml -> looks like the same datasets are missing...

sloosvel · 2022-10-26T13:45:23Z

I think for recipe_climate_change_hotspot.ym, I ended up running it on jasmin

valeriupredoi · 2022-10-26T14:01:12Z

Hi @remi-kazeroni @sloosvel awesome, thanks a lot! Here's the thing(s):

recipe_anav13jclim.yml - this is not optimal if "special" cmip5 data is needed, that is not available on ESGF - I would add this recipe to the list of those we have to see what to do about it wrt obsolete data
recipe_climate_change_hotspot.yml - same as above, unless there is a serious reason why it's not working, having to have preferred sites where recipes run is against our core principle of reproducibility of results

I'll have a closer look at the meeh and schnlund ones, and will ping @schlunma asap

katjaweigel · 2022-10-26T14:07:51Z

Yes, the version of recipe_flato13ipcc.yml currently in #2156 is running. The cost is to remove/comment out data sets, which do not work on Levante (and to fix a wrong time period for one model). There was already some discussion on how to deal with such cases, and if I remember right @axel-lauer , who is maintainer of the original recipe_flato13ipcc.yml did not agree on removing data sets? It should also be noted, that the option --skip_nonexistent does not work for all diagnostics in recipe_flato13ipcc.yml, because in several data sets from e.g. two different experiments are needed and it does not work, if only one is there. Therefore I was going to ask, which version of recipe_flato13ipcc.yml should be in the end in #2156 in this issue. (Unfortunately I'm also not completely ready with some issues in recipe_flato13ipcc_figures_938_941.yml I hope to finish them soon).

schlunma · 2022-10-26T14:20:33Z

V, can adapt the permission to /scratch/b/b382109/esmvaltool_output so I can have a look at the logs?

valeriupredoi · 2022-10-26T14:57:14Z

/scratch/b/b382109/esmvaltool_output

@schlunma Manu, they are here /home/b/b382109/manu_logs

valeriupredoi · 2022-10-26T14:59:05Z

Yes, the version of recipe_flato13ipcc.yml currently in #2156 is running. The cost is to remove/comment out data sets, which do not work on Levante (and to fix a wrong time period for one model). There was already some discussion on how to deal with such cases, and if I remember right @axel-lauer , who is maintainer of the original recipe_flato13ipcc.yml did not agree on removing data sets? It should also be noted, that the option --skip_nonexistent does not work for all diagnostics in recipe_flato13ipcc.yml, because in several data sets from e.g. two different experiments are needed and it does not work, if only one is there. Therefore I was going to ask, which version of recipe_flato13ipcc.yml should be in the end in #2156 in this issue. (Unfortunately I'm also not completely ready with some issues in recipe_flato13ipcc_figures_938_941.yml I hope to finish them soon).

@katjaweigel many thanks for your clarification! I will consider this recipe at-risk for now, and will not faff about it until you guys fix it - not the first and not the last time we include not really fully working recipes in a release 😁

schlunma · 2022-10-26T15:03:48Z

cd: permission denied: /home/b/b382109/manu_logs 😢

sloosvel · 2022-10-28T07:57:50Z

Hi @valeriupredoi please let me know if want to schedule a call, I have to say that I am quite confused by all your issues. I did not ran into any of that.

valeriupredoi · 2022-10-28T08:09:43Z

Hi @sloosvel - many thanks, am back on track now, no need for a call just yet, maybe if you could keep an eye on this issue if I ask for some help, that'd be awesome 🍺 👍

valeriupredoi · 2022-10-28T08:50:09Z

OK comparison tool is now plodding along nicely - I have also added the instructions in the issue description - we can use that description to hatch us a nice doc entry - the next RM should not go through the Gates of VM Purgatory like I did yesterday 👍

valeriupredoi · 2022-10-28T09:07:09Z

Comparison results

Run command and output stored

location: DKRZ VM
current: /work/bd0854/b382109/v270 (contains preproc/ dirs too, 122 recipes)
reference: /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4 (does not contain preproc/ dirs)
cmd: nohup python ESMValTool/esmvaltool/utils/testing/regression/compare.py /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4 /work/bd0854/b382109/v270 > compare270output.txt
examination from /home/b/b382109/compare270output.txt

Per recipe result

Legend:

OK: plots identical (even if some work nc are different)
NOK: plots differ

122 out of 127 final

recipe_anav13jclim.yml by @remi-kazeroni NOK
recipe_albedolandcover.yml NOK
recipe_arctic_ocean.yml OK
recipe_autoassess_landsurface_permafrost.yml OK
recipe_autoassess_landsurface_soilmoisture.yml NOK
recipe_autoassess_landsurface_surfrad.yml OK
recipe_autoassess_radiation_rms_Amon_all.yml OK
recipe_autoassess_radiation_rms_Amon_obs.yml OK
recipe_autoassess_stratosphere.yml OK
recipe_bock20jgr_fig_1-4.yml by @remi-kazeroni NOK
recipe_bock20jgr_fig_6-7.yml NOK
recipe_bock20jgr_fig_8-10.yml OK
recipe_capacity_factor.yml NOK
recipe_carvalhais14nat.yml all plot files are now pdf (from png) - need look NOK
recipe_climwip_brunner2019_med.yml by @remi-kazeroni OK
recipe_climwip_brunner20esd.yml OK
recipe_climwip_test_basic.yml OK
recipe_climwip_test_performance_sigma.yml OK
recipe_clouds_bias.yml OK
recipe_clouds_ipcc.yml OK
recipe_cmug_h2o.yml OK
recipe_collins13ipcc.yml by @remi-kazeroni NOK
recipe_combined_indices.yml OK
recipe_concatenate_exps.yml OK
recipe_consecdrydays.yml OK
recipe_correlation.yml OK
recipe_cox18nature.yml OK
recipe_cvdp.yml OK
recipe_daily_era5.yml OK
recipe_deangelis15nat.yml OK
recipe_deangelis15nat_fig1_fast.yml OK
recipe_decadal.yml OK
recipe_diurnal_temperature_index.yml OK
recipe_eady_growth_rate.yml OK
recipe_ecs.yml OK
recipe_ecs_constraints.yml NOK
recipe_ecs_scatter.yml OK
recipe_ensclus.yml OK
recipe_era5-land.yml OK
recipe_esacci_lst.yml OK
recipe_esacci_oc.yml OK
recipe_extract_shape.yml OK
recipe_extreme_events.yml missing 2 plots, plots differ too NOK
recipe_extreme_index.yml missing 1 plots, plots differ too NOK
recipe_eyring06jgr.yml OK
recipe_eyring13jgr_12.yml OK
recipe_gier2020bg.yml 1 plots differ NOK
recipe_globwat.yml OK
recipe_heatwaves_coldwaves.yml OK
recipe_hydro_forcing.yml OK
recipe_hyint.yml NOK
recipe_hyint_extreme_events.yml NOK
recipe_hype.yml OK
recipe_impact.yml by @remi-kazeroni no reference found, unable to check
recipe_ipccwg1ar6ch3_atmosphere.yml no reference run found, unable to check
recipe_julia.yml OK
recipe_kcs.yml NOK
recipe_landcover.yml OK
recipe_lauer13jclim.yml OK
recipe_li17natcc.yml NOK
recipe_lisflood.yml OK
recipe_marrmot.yml OK
recipe_martin18grl.yml NOK
recipe_meehl20sciadv.yml all plots went from png to pdf NOK
recipe_miles_block.yml NOK
recipe_miles_eof.yml NOK
recipe_miles_regimes.yml OK
recipe_modes_of_variability.yml NOK
recipe_monitor.yml NOK
recipe_monitor_with_refs.yml NOK
recipe_mpqb_xch4.yml OK
recipe_multimodel_products.yml NOK
recipe_my_personal_diagnostic.yml OK
recipe_ncl.yml OK
recipe_ocean_Landschuetzer2016.yml OK
recipe_ocean_amoc.yml OK
recipe_ocean_bgc.yml NOK
recipe_ocean_example.yml NOK
recipe_ocean_ice_extent.yml NOK
recipe_ocean_multimap.yml OK
recipe_ocean_quadmap.yml OK
recipe_ocean_scalar_fields.yml OK
recipe_pcrglobwb.yml OK
recipe_preprocessor_derive_test.yml OK
recipe_preprocessor_test.yml OK
recipe_psyplot.yml OK
recipe_pv_capacity_factor.yml OK
recipe_python.yml OK
recipe_quantilebias.yml OK
recipe_perfmetrics_CMIP5.yml by @remi-kazeroni OK
recipe_perfmetrics_CMIP5_4cds.yml by @remi-kazeroni NOK
recipe_r.yml OK
recipe_radiation_budget.yml OK
recipe_rainfarm.yml OK
recipe_runoff_et.yml OK
recipe_russell18jgr.yml OK
recipe_schlund20esd.yml all plots changed from png to pdf NOK
recipe_schlund20jgr_gpp_abs_rcp85.yml NOK
recipe_schlund20jgr_gpp_change_1pct.yml OK
recipe_schlund20jgr_gpp_change_rcp85.yml NOK
recipe_sea_surface_salinity.yml OK
recipe_seaice.yml by @remi-kazeroni OK
recipe_seaice_drift.yml OK
recipe_seaice_feedback.yml OK
recipe_shapeselect.yml OK
recipe_smpi.yml OK
recipe_smpi_4cds.yml OK
recipe_snowalbedo.yml OK
recipe_spei.yml NOK
recipe_tcr.yml OK
recipe_tebaldi21esd.yml no reference run found, unable to check
recipe_thermodyn_diagtool.yml OK
recipe_toymodel.yml NOK
recipe_validation.yml OK
recipe_validation_CMIP6.yml OK
recipe_variable_groups.yml OK
recipe_wenzel14jgr.yml OK
recipe_wenzel16jclim.yml OK
recipe_wenzel16nat.yml NOK
recipe_wflow.yml OK
recipe_williams09climdyn_CREM.yml OK
recipe_zmnam.yml OK

Result

We need to look at plots for 34 recipes; we're good to go for 85 recipes; 3 have no reference in 2.6.0

bouweandela · 2022-10-28T10:03:08Z

And as mentioned in my previous comment, anav13 is a special case since data from output2 cannot be read with the default DRS (our fault, not CMIPs!). You could also try:

CMIP5: /work/bd0854/DATA/ESMValTool2/download/cmip5/output2

@schlunma Why are the project and product facets hardcoded in the path? This should work fine if the DRS starts with {project.lower}/{product}/.. (i.e. the one called ESGF in config-developer.yml) and the rootpath is set to /work/bd0854/DATA/ESMValTool2/download.

bouweandela · 2022-10-28T10:08:07Z

Oh wait, is that because you're trying to combine data downloaded with the ESGF DRS with the DKRZ DRS and we don't support per path DRS settings? ESMValGroup/ESMValCore#129

schlunma · 2022-10-28T10:20:44Z

Oh wait, is that because you're trying to combine data downloaded with the ESGF DRS with the DKRZ DRS and we don't support per path DRS settings? ESMValGroup/ESMValCore#129

Exactly. We found that using the full paths with project and output for the downloaded data is currently the cleanest way to include the DKRZ and ESGF rootpaths.

bouweandela · 2022-10-28T10:34:37Z

Actually, the cleanest way to make it work is just to set download_dir: /work/bd0854/DATA/ESMValTool2/download/ (provided you have write access there) and run with offline: false and only use the 'official' rootpaths for the CMIP projects. This works because the tool will always check if it already has a file before downloading it. In this case you're lucky that the ESGF DRS is similar to the DKRZ one, so your approach works too. Anyway, I'll see if I can do something about ESMValGroup/ESMValCore#129 for the next release.

schlunma · 2022-10-28T10:40:31Z

We tried that too, but if I remember correctly there was a problem with this. Could be that it was an issue because at the beginning downloading was very slow (so there was no chance that slow recipes would run), which should be fixed now. I will give it another try sometime in the future.

It would be really great if you could do something about ESMValGroup/ESMValCore#129 🚀

valeriupredoi · 2022-10-28T13:28:00Z

alright folks, as we have seen in #2881 (comment) we need to look at some recipes that have not had the same plots as in v2.6.0, these are 34 party poopers:

To quickly identify differing plots please have a look at this log https://esmvaltool.dkrz.de/shared/esmvaltool/compare270output_trimmed.txt

We can have a look at them in the run list for v2.7.0 https://esmvaltool.dkrz.de/shared/esmvaltool/v2.7.0/debug.html vs the v2.6.0 one https://esmvaltool.dkrz.de/shared/esmvaltool/v2.6.0/debug.html - I will start having me a look but by all means, @ESMValGroup/esmvaltool-developmentteam I could really use a hand here, especially since you (as recipe maintainer/developer) you know these things well, they're all beetles and bugs on coloured paper to me 😁

bettina-gier · 2022-10-28T13:30:20Z

Is there a log file where we can see the differences? My recipe has a lot of plots and if just one of them differs as the list says it'd be easier to just look at that one =D

valeriupredoi · 2022-10-28T13:32:51Z

Is there a log file where we can see the differences? My recipe has a lot of plots and if just one of them differs as the list says it'd be easier to just look at that one =D

logfile coming right away - I'll post it in the comment above 🍺

valeriupredoi · 2022-10-28T13:43:20Z

before I post the log (currently curating it) to not lose you, Tina, here's the only bitty plot that differs for your recipe:
recipe_gier2020bg.yml: results differ from reference run
Reference run: /mnt/esmvaltool_disk2/shared/esmvaltool/v2.6.0rc4/recipe_gier2020bg_20220712_100159
Current run: /work/bd0854/b382109/v270/recipe_gier2020bg_20221025_142445
Differing files:

plots/cmip6_ensemble_analysis/main_ensemble/xco2_esm-hist_global_2003-2014_barplot_SA_obs.png

bettina-gier · 2022-10-28T13:48:28Z

You can pass that recipe, that's just a different sorting for the diff ensemble members in the histogram and looks diff cause they're not labeled. Cheers for the special extract for me ;)

valeriupredoi · 2022-10-28T13:51:31Z

You can pass that recipe, that's just a different sorting for the diff ensemble members in the histogram and looks diff cause they're not labeled. Cheers for the special extract for me ;)

@bettina-gier - legend, many thanks! 🍺

sloosvel · 2022-10-28T15:33:05Z

@valeriupredoi do you mind if I run recipe_climate_change_hotspot in jasmin, so that I can at least upload the results for this version?

valeriupredoi · 2022-10-28T15:40:43Z

@valeriupredoi do you mind if I run recipe_climate_change_hotspot in jasmin, so that I can at least upload the results for this version?

@sloosvel go for it! Cheers 🍺 - but am planning on releasing tonight, it looks promising. If you run it then upload results to the v2.7.0 that'd be awesome, and the release is not affecting that 👍 It'd be great if you was around to approve the last PR thereby changing the version number, in an hour or so, no probs if you not will ask @bouweandela

TomasTorsvik · 2022-10-28T15:48:44Z

@valeriupredoi for recipe_ocean_example, the only obvious difference seems to be the transect plots, Diag_Transect_1, Diag_Transect_2 and Diag_Transect_3. These plots are empty in v2.6.0 and non-empty in v2.7.0 (see bugfix #2858).

Diag_Transect_1 picks up the mask data 1.e20, but this is probably a separate issue.

valeriupredoi · 2022-10-28T15:52:02Z

@TomasTorsvik brilliant, many thanks for looking! And a positive difference too, thanks to your PR 🍺

Diag_Transect_1 picks up the mask data 1.e20, but this is probably a separate issue.

Would you be OK to open an issue about this, please? And tag @ledm so we can fix that in 2.8. Many thanks! 🍺

TomasTorsvik · 2022-10-28T16:03:02Z

@valeriupredoi the same applies for recipe_ocean_bgc, the v2.7.0 have plots for Diag_Transect_No_Data and Diag_Transect_vs_Woa that are empty in v2.6.0. The other plots look OK to me.

valeriupredoi · 2022-10-28T16:06:35Z

Fantastic, cheers @TomasTorsvik 🍺

TomasTorsvik · 2022-10-28T16:14:49Z

@valeriupredoi I'm not sure, but it seems the difference in recipe_ocean_ice_extent can be connected with land/coastline boarders. See e.g.

v2.6.0 :
https://esmvaltool.dkrz.de/shared/esmvaltool/v2.6.0/recipe_ocean_ice_extent_20220712_100640/plots/diag_ice_SHS/Global_seaice_timeseries/diag_CMIP5_HadGEM2-CC_OImon_historical_r1i1p1_sic_timeseries_SHS_ice_extent_diag_ice_SHS_1989_2004_ortho_map_Fractionalcover_2003DJF_0.png

v2.7.0 :
https://esmvaltool.dkrz.de/shared/esmvaltool/v2.7.0/recipe_ocean_ice_extent_20221025_143917/plots/diag_ice_SHS/Global_seaice_timeseries/diag_CMIP5_HadGEM2-CC_OImon_historical_r1i1p1_sic_timeseries_SHS_ice_extent_diag_ice_SHS_1989_2004_ortho_map_Fractionalcover_2003DJF_0.png

valeriupredoi · 2022-10-28T16:27:00Z

@valeriupredoi I'm not sure, but it seems the difference in recipe_ocean_ice_extent can be connected with land/coastline boarders. See e.g.

v2.6.0 : https://esmvaltool.dkrz.de/shared/esmvaltool/v2.6.0/recipe_ocean_ice_extent_20220712_100640/plots/diag_ice_SHS/Global_seaice_timeseries/diag_CMIP5_HadGEM2-CC_OImon_historical_r1i1p1_sic_timeseries_SHS_ice_extent_diag_ice_SHS_1989_2004_ortho_map_Fractionalcover_2003DJF_0.png

v2.7.0 : https://esmvaltool.dkrz.de/shared/esmvaltool/v2.7.0/recipe_ocean_ice_extent_20221025_143917/plots/diag_ice_SHS/Global_seaice_timeseries/diag_CMIP5_HadGEM2-CC_OImon_historical_r1i1p1_sic_timeseries_SHS_ice_extent_diag_ice_SHS_1989_2004_ortho_map_Fractionalcover_2003DJF_0.png

Eagle-eyed, man 🦅 - coastline contours are more pronounced in 2.7 - cartopy change most probably, but they look the same to me!

valeriupredoi · 2022-10-28T16:47:24Z

OK this concludes the release testing marathon! Good news is there are not many bad apples among the recipes, bad news is there are a couple - see #2881 (comment) - we found a couple MAGICs project R recipes that look dubious and opened at least one issue about #2890 - but since these recipes are unmaintained, developers who wrote them have in the meantime left the institutes they're listed under etc I am not going to hold the release for some Da Vinci Code-style tracking down; we need to think what we do with such recipes.

Oh and the ocean recipes by @ledm need some TLC but he's told me this for a while now, we should get together one time and fix them, no major bugs, but old crap that needs updating.

I declare this Tool ready for release! Many thanks to all who helped during this testing process @sloosvel @remi-kazeroni @schlunma @bettina-gier @TomasTorsvik and @bouweandela of course 😁 🍻

valeriupredoi · 2022-10-28T18:25:18Z

it's out and about! 🍺 https://pypi.org/project/ESMValTool/2.7.0/

bouweandela · 2022-11-01T10:37:34Z

You can pass that recipe, that's just a different sorting for the diff ensemble members in the histogram and looks diff cause they're not labeled. Cheers for the special extract for me ;)

@bettina-gier Would it be possible to sort the ensemble members in a way that is stable between runs? With the upcoming more regular recipe testing that @ehogan et al are working on, as described in #2723, issues like this will keep popping up.

valeriupredoi added testing release labels Oct 25, 2022

valeriupredoi added this to the v2.7.0 milestone Oct 25, 2022

valeriupredoi assigned valeriupredoi and sloosvel Oct 25, 2022

valeriupredoi pinned this issue Oct 25, 2022

schlunma mentioned this issue Oct 28, 2022

Recipes schlund20jgr_*.yml give non-deterministic results #2889

Closed

valeriupredoi closed this as completed Oct 28, 2022

valeriupredoi unpinned this issue Oct 28, 2022

valeriupredoi mentioned this issue Oct 29, 2022

Release documentation is lacking key steps #2895

Closed

Recipe testing and comparison for release 2.7.0 #2881

Recipe testing and comparison for release 2.7.0 #2881

Comments

valeriupredoi commented Oct 25, 2022 • edited

System and settings

conda/mamba

Git branch and state

Environment

Environment file

Extraneous file movements

Ad-hoc hacks (code changes)

Mods to config user file

Recipe runs

Running the comparison

Login and access to the DKRZ esmvaltool VM

Get and install miniconda on VM

Setting up the input files

Running compare tool at VM

Input/output/run

First pass result

Detailed plots inspection

valeriupredoi commented Oct 25, 2022

valeriupredoi commented Oct 25, 2022

sloosvel commented Oct 25, 2022

remi-kazeroni commented Oct 25, 2022

valeriupredoi commented Oct 25, 2022

valeriupredoi commented Oct 26, 2022 • edited

Recipe running session 2022-10-26 13:13:41.568698

Succesfully run recipes

Recipes that failed with DiagnosticError

Recipes that failed of Missing Data

Recipes that failed of other reasons

valeriupredoi commented Oct 26, 2022 • edited

Postmortem of failed recipes OTHER THAN Missing Data

Recipes that failed with DiagnosticError

Recipes that failed of other reasons or are still running

remi-kazeroni commented Oct 26, 2022

remi-kazeroni commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

remi-kazeroni commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

sloosvel commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

remi-kazeroni commented Oct 26, 2022

sloosvel commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

katjaweigel commented Oct 26, 2022 • edited

schlunma commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022

valeriupredoi commented Oct 26, 2022 • edited

schlunma commented Oct 26, 2022

sloosvel commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022 • edited

Comparison results

Run command and output stored

Per recipe result

Result

bouweandela commented Oct 28, 2022 • edited

bouweandela commented Oct 28, 2022

schlunma commented Oct 28, 2022

bouweandela commented Oct 28, 2022

schlunma commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022 • edited

bettina-gier commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

bettina-gier commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

sloosvel commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

TomasTorsvik commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022 • edited

TomasTorsvik commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

TomasTorsvik commented Oct 28, 2022

valeriupredoi commented Oct 28, 2022

valeriupredoi commented Oct 25, 2022 •

edited

`conda`/`mamba`

valeriupredoi commented Oct 26, 2022 •

edited

valeriupredoi commented Oct 26, 2022 •

edited

katjaweigel commented Oct 26, 2022 •

edited

valeriupredoi commented Oct 26, 2022 •

edited

valeriupredoi commented Oct 28, 2022 •

edited

bouweandela commented Oct 28, 2022 •

edited

valeriupredoi commented Oct 28, 2022 •

edited

valeriupredoi commented Oct 28, 2022 •

edited

bouweandela commented Nov 1, 2022 •

edited