
Adding Solverstatisticscheck #36

Open
wants to merge 5 commits into base: develop

Conversation

CusiniM
Contributor

@CusiniM CusiniM commented Sep 12, 2023

In this PR I have added a new check called performancecheck (feel free to suggest a different name).

To summarize:

  1. It parses the log to extract information about the number of time steps (cycles), attempts, configuration iterations, nonlinear iterations, and linear iterations.
  2. It writes this information to a file similar to what timehistory may provide for us in the future.
  3. It compares this file against a baseline file, using two different tolerances for nonlinear and linear iterations.
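A rough sketch of steps 1 and 3 in Python (the log format, field names, and tolerance scheme below are illustrative assumptions, not the PR's actual implementation):

```python
import re

# Hypothetical GEOS-like log excerpt; the real format may differ.
log = """\
0 : Time: 0.0e+00 s, dt: 1.0e+00 s, Cycle: 0
Attempt: 0, ConfigurationIter: 0, NewtonIter: 3
Last LinSolve(iter,res) = ( 12, 1.0e-08 )
1 : Time: 1.0e+00 s, dt: 1.0e+00 s, Cycle: 1
Attempt: 0, ConfigurationIter: 0, NewtonIter: 4
Last LinSolve(iter,res) = ( 15, 1.0e-08 )
"""

def extract_statistics(text):
    """Collect per-cycle nonlinear and linear iteration counts from a log."""
    return {
        "cycles": len(re.findall(r"Cycle: (\d+)", text)),
        "newton": [int(n) for n in re.findall(r"NewtonIter: (\d+)", text)],
        "linear": [int(n) for n in re.findall(r"LinSolve\(iter,res\) = \( (\d+),", text)],
    }

def compare(current, baseline, newton_tol=0.0, linear_tol=0.1):
    """Step-by-step comparison, with a looser relative tolerance for linear iterations."""
    if current["cycles"] != baseline["cycles"]:
        return False  # differing step counts: the open question in this PR
    ok = all(abs(a - b) <= newton_tol * max(b, 1)
             for a, b in zip(current["newton"], baseline["newton"]))
    return ok and all(abs(a - b) <= linear_tol * max(b, 1)
                      for a, b in zip(current["linear"], baseline["linear"]))
```

Step 2 (writing the extracted dictionary to an HDF5 file) is omitted here for brevity.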

I could add some plots of the number of iterations if we think they could be useful.

I still see 2 main issues:

  • It is hard to know what to do when the number of time steps differs. Even though it is probably rare, this can happen across platforms just because of an extra iteration. Maybe we could have the check compare the totals first and then do the time-step by time-step comparison.
  • I think that a mechanism to run the larger tests that use iterative solvers only upon request will soon become necessary.

@castelletto1, @francoishamon @victorapm we should also list the MGR strategies we want to test and pick a case for each one of them so that I can start adding tests.

For now, I have added:

  • PoroElastic_Mandel_smoke_fim_mgr.xml (mgr strategy SinglephasePoromechanics)
  • PoroElastic_staircase_co2_3d_mgr.xml (mgr strategy MultiphasePoromechanicsWithWells)
  • Sneddon_embeddedFrac_benchmark (mgr strategy SolidMechanicsEmbeddedFractures)

@paveltomin
Contributor

it is hard to know what to do if the number of time steps is different
How is the time history currently compared in that case? If the data lengths are not the same, the comparison fails, right?
We could think about some interpolation here and computing a norm between the interpolated curves, if that is possible.
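The interpolation-plus-norm idea could look something like the following sketch (all function and variable names are made up for illustration):

```python
import numpy as np

def iteration_curve_distance(times_a, iters_a, times_b, iters_b):
    """Relative L2 distance between two iteration-count histories after
    interpolating both onto a common physical-time grid, so that runs with
    different numbers of time steps can still be compared."""
    t = np.union1d(times_a, times_b)          # merged time axis
    a = np.interp(t, times_a, iters_a)        # resample run A
    b = np.interp(t, times_b, iters_b)        # resample run B
    denom = np.linalg.norm(b)
    diff = np.linalg.norm(a - b)
    return diff / denom if denom > 0 else diff

# Identical constant curves sampled at different resolutions compare as equal:
# iteration_curve_distance([0, 1, 2], [4, 4, 4], [0, 2], [4, 4])  -> 0.0
```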

@paveltomin
Contributor

Please have a look at GEOS-DEV/GEOS#2680

@CusiniM
Contributor Author

CusiniM commented Sep 15, 2023

it is hard to know what to do if the number of time steps is different. How is the time history currently compared in that case? If the data lengths are not the same, the comparison fails, right? We could think about some interpolation here and computing a norm between the interpolated curves, if that is possible.

For the curve_check, that's what happens: the solution is interpolated. The restart checks will fail if the timestepping is different (and they should). I feel that interpolating the number of iterations does not make much sense; it won't give us much more information than comparing the total number of iterations or the average number per time step. We could decide to only check that on all machines apart from the one used to create the baseline, where we require a one-to-one match.
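The aggregate fallback described here (totals or per-step averages instead of a step-by-step match) might be sketched as follows; the function name and tolerance are illustrative:

```python
def totals_match(current_iters, baseline_iters, rel_tol=0.05):
    """Compare total and average iteration counts with a relative tolerance,
    so that runs whose time-step counts differ can still be checked."""
    cur_total, base_total = sum(current_iters), sum(baseline_iters)
    cur_avg = cur_total / len(current_iters)
    base_avg = base_total / len(baseline_iters)
    total_ok = abs(cur_total - base_total) <= rel_tol * base_total
    avg_ok = abs(cur_avg - base_avg) <= rel_tol * base_avg
    return total_ok and avg_ok
```

On the baseline-producing machine one could instead require an exact, element-wise match, as suggested above.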

@CusiniM
Contributor Author

CusiniM commented Nov 8, 2023

Can someone review this PR so that we can move forward with it?

@CusiniM
Contributor Author

CusiniM commented Nov 8, 2023

Please have a look at GEOS-DEV/GEOS#2680

I don't think I will add checks for sequential strategies for now. The goal is to add some coverage for MGR strategies which are only used for fully coupled.

@wrtobin
Contributor

wrtobin commented Nov 8, 2023

Conceptually this is a good start.

I worry that we might want to use a different term than performance, as I tend to think of performance checks in terms of timings/flops/etc, and this is more about convergence characteristics. I don't think performance is wrong, I just wonder if there is a more applicable term that doesn't carry the same connotation.

Generically, what we're doing with this is checking for any divergence from a baseline that is not strictly about problem correctness.

Also, longer-term I think we would want to make the tool more generic: extract arbitrary metrics using regexes (or just Python functions that consume strings and return metric info, if we don't want to be strongly tied to regexes) to construct plots we do curve-checking against.

@victorapm

How about verification?

@wrtobin
Contributor

wrtobin commented Nov 8, 2023

How about verification?

Classically, yeah: validation == correctness of the solution, verification == correctness of execution.

@CusiniM CusiniM force-pushed the feature/cusini/performancecheck branch from 73a6165 to 7a434bf on November 8, 2023 at 18:41
@CusiniM
Contributor Author

CusiniM commented Nov 8, 2023

Conceptually this is a good start.

I worry that we might want to use a different term than performance, as I tend to think of performance checks in terms of timings/flops/etc, and this is more about convergence characteristics. I don't think performance is wrong, I just wonder if there is a more applicable term that doesn't carry the same connotation.

Generically, what we're doing with this is checking for any divergence from a baseline that is not strictly about problem correctness.

Maybe, since at the moment that's the only thing it is doing, we can simply call it SolverStatisticsCheck? I feel people already struggle to understand how our testing works, so the less abstract a name we pick the better.

Also, longer-term I think we would want to make the tool more generic to extract arbitrary metrics using regex's (or just python functions which consume strings and return metric info -- if we don't want to be strongly tied to regexes) to construct plots we do curve-checking against.

To be honest, I really hope that eventually we can output this data through time-history and avoid parsing the log entirely. I only did it this way for now because it felt like we needed a quick solution.

@victorapm

the less abstract of a name we pick the better

Good point! SolverStatisticsCheck sounds accurate here

@CusiniM CusiniM changed the title Adding Performancecheck Adding Solverstatisticscheck Nov 8, 2023
errors:
"""
# Define regular expressions
cycle_pattern = r"\d+\s*:\s*Time: [\d.e+-]+ s, dt: [\d.e+-]+ s, Cycle: (\d+)"

With the new changes in log timestamp format, this would miss anything that is in minutes, years, etc. Maybe switch s, to \s*,?
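One unit-tolerant variant of the cycle pattern (an illustration only; the exact replacement adopted in the PR may differ) would accept any unit token after the numeric value instead of hard-coding "s":

```python
import re

# Illustrative pattern: \w* matches whatever unit follows the number
# (s, min, years, ...). The log lines below are assumed examples.
cycle_pattern = r"\d+\s*:\s*Time: [\d.e+-]+ \w*,? dt: [\d.e+-]+ \w*,? Cycle: (\d+)"

line_s   = "0 : Time: 1.0e+00 s, dt: 1.0e+00 s, Cycle: 3"
line_min = "0 : Time: 2.5 min, dt: 0.5 min, Cycle: 7"

# Both unit variants now yield the cycle number:
# re.search(cycle_pattern, line_min).group(1)  -> "7"
```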


# Create an HDF5 file for storing the data
output_fileName = os.path.join(os.path.dirname(fname), 'extracted_solverStat_data.h5')
with h5py.File(output_fileName, 'w') as hdf5_file:

Since you already have hdf5_wrapper as a dependency, you could switch this to:

with hdf5_wrapper.hdf5_wrapper(output_fileName, 'w') as hdf5_file:
...

And then write to the object as if it were a simple python dictionary:

hdf5_file['some_key'] = {'some': ['value']}


def solver_statistics_check_parser():
"""
Build the curve check parser

Build the solver statistics parser

@paveltomin
Contributor

I recently saw this:

********************************************************************************
Error: /Problem/Solvers/linearElasticity/SolverStatistics/numSuccessfulNonlinearIterations/__values__
	Scalar values of types int32 and int32 differ: 91, 58.
********************************************************************************
********************************************************************************
Error: /Problem/Solvers/linearElasticity/SolverStatistics/numSuccessfulLinearIterations/__values__
	Scalar values of types int32 and int32 differ: 91, 58.
********************************************************************************

in the restartcheck outputs, and I am wondering whether that mechanism could be extended instead of parsing the log?
