Complex ANK+NK #130

anilyil · 2021-03-09T09:14:01Z

Purpose

This PR improves the complex ANK and NK solvers, as well as improving how complex residuals and printouts are handled. With @joanibal, we tried to figure out how the complex step method worked for implicit solvers. We concluded that the path of convergence is not important, but in the end, complex residuals must be converged to obtain accurate derivatives. This means that during a complex-step evaluation, we dont need to re-set flow, and just call the NK/ANK solvers repeatedly so that complex residuals converge. This PR has a few changes that makes this possible.

Here is a list of what we did with this PR:

The residual printout is fixed in complex mode. The code used to print the square of the residual, and the imaginary part of the printed value is equivalent to an interesting derivative (derivative of the solver update wrt complex-perturbed DV), but it was not printing the true complex residual. Because we need to converge the complex residuals, we print these directly now. (interestingly, complex residuals go to zero also when the derivative of the solver update wrt complex-perturbed DV also goes to zero, but the residual norm is easier to understand and is always positive)
The funcional printout is also improved. The output now looks like this:

#-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#  Grid  | Iter | Iter |  Iter  |   CFL   | Step | Lin  |    Wall    |            Res rho              |                    C_drag                       |            totalRes             |
#  level |      | Tot  |  Type  |         |      | Res  | Clock (s)  |                                 |                                                 |                                 |
#-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      1       0      0     None     ----    ----   ----  0.39389E-02   0.14551813E-14 +0.11287700E-129i  0.4710284646449747E-01 +0.1156509033894532E-123i  0.45644342E-12 +0.84458180E-128i
      1       1     21      *NK     ----    1.00  0.000  0.70712E+00   0.13697775E-14 +0.15267370E-132i  0.4710284646449758E-01 +0.1156509263873306E-123i  0.33574505E-12 +0.77222624E-131i

We print all 16 digits of real and complex parts for the functionals because the real part is the value and the complex part is the derivative. However, we do not print all digits for residuals since their magnitude is more important anyways. I have left about 8 digits on both real and complex parts of the residuals for debugging purposes. The nonlinear iteration header is also fixed so that things are aligned. I also explicitly give 3 digits to the exponent because we are now interested in doing smaller complex-steps that underflow and do not affect real parts when the real values are zero.

The complex build process is slightly improved. Complexify.f90 is moved to src_cs. We now recompile only the changed files in complex mode. We used to do this for fortran modules but still would copy over other include files which caused long build times. This is fixed.
The absolute tolerance (atol) in the NK solver is set separately for complex mode. For the real mode, we set this atol to be 0.01 lower than the real L2 convergence target. This makes sure that the NK solver solves the linear system well enough to put us across the finish line, while not doing any extra work. This approach does not work with complex step. Because derivatives lag, complex residuals must be converged, even after the real ones hit machine zero. Furthermore, if we want to converge the complex system without re-setting the flow, then the real residuals will hang around machine zero and we again need the linear solver to actually solve the system and not quit early. Same fix is also applied to the ANK solver.
ANK solver is modified to handle complex mode. This involved several changes to the iteration algorithm as well as the line searches. I am still not 100% on some details, but it works, and the parts I dont understand go to zero as the CFL number is increased, so it should be fine. We only really want to avoid having the complex part diverge. As long as a relatively stable and low value of complex state is maintained, the complex system needs to be converged after the real part converges anyways due to the lag, so again, it should be fine.
We added a nonlinear solver test for complex ANK/NK (but see the unfinished item no 2 below).

Here is a list that we did not do yet, and should be done in a future PR:

The complex build process needs to be cleaned up a bit more. This is not terribly bad, I just did not want to delay this PR for that because the complex build does work now. This will be a future PR.
We should add better ANK and NK solver tests. There are a few features that we want to maintain, and the simple test we added here can be modified to include more tests. Depending on feedback, we can do this in this PR, or in a future PR.
The complex residual computation squares the real and complex parts separately to compute the two separate residuals. This will underflow with a complex step of 1e-200 and the residuals will just print zero. We ideally want to fix this square=>squareroot process so that the values do not underflow.

Type of change

What types of change is it?
Select the appropriate type(s) that describe this PR

Bugfix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (non-backwards-compatible fix or feature)
Code style update (formatting, renaming)
Refactoring (no functional changes, no API changes)
Documentation update
Maintenance update
Other (please describe)

Testing

We added complex-step tests for the ANK and NK solvers.

Checklist

Put an x in the boxes that apply.

I have run flake8 and black to make sure the code adheres to PEP-8 and is consistently formatted
I have run unit and regression tests which pass locally with my changes
I have added new tests that prove my fix is effective or that my feature works
I have added necessary documentation

…ed like the real part

…for complex only

…put is printed

anilyil · 2021-03-09T09:16:47Z

So I still have a few TODOs before this is ready to merge. The fixes only work for decoupled ANK. I will need to fix the coupled ANK and turbANK until the PR is ready. Also, I want to add a test for multiple modes of ANK/NK in both real and complex modes. Finally, I think we can also update the existing complex tests to use ANK/NK with this PR.

I wanted to create the PR to track these TODOs, and also have azure test the changes. Any feedback is welcome.

@nwu63 I plan on fixing the complex compile in a separate PR. I have most of it laid out mentally, just need to figure out a few details. Once I create that PR, someone else can apply the similar changes to idwarp (I believe its the only other code we have with a similar complex build process?)

anilyil · 2021-03-09T13:36:51Z

The tests fail because of the complex ank/nk test. That wasnt tagged as cmplx_ as it should be. I will take care of it soon. We will need to update input files as well

src/NKSolver/NKSolvers.F90

anilyil · 2021-03-10T08:33:52Z

I have updated the input files and added one real and one complex test for solver combinations. These should pass now. Then I will add more combinations with different options.

anilyil · 2021-03-10T12:39:05Z

This one is good to go for the review. I have added solver combination tests that switch between smoother, ANK, SANK, CSANK, and NK. I test all equation types with these, also with the 2 turbulence solvers that can be used in ANK. I still test the turbulence solvers with Euler and laminar NS because in the past, we had a few issues with these cases seg-faulting. With the current test, we just expect equation modes that does not have any turbulence model to just work regardless of the turbulence options specified. I test both real and complex versions with these.

The test itself uses a simple 2^3 block originally made by @joanibal. I have modified the mesh slightly so there is a wall, and rest of the BCs are farfield. The flow also comes in at an angle to the wall so that Euler residuals are not zero with free stream.

I have also modified existing complex tests to use ANK for the solution. @sseraj can you please check if I did it correctly? For the euler tests at least, I had to set the second order switch to get it to converge; it would oscillate a bit otherwise. Now it looks it is converging okay, but I dont really have a reference on how fast it should be.

The only remaining item that I wanted to possibly do with this PR is the "unfinished" item 3 I listed above. A complex step size of around 1e-160 and below will underflow with the residual norm computation and the residual monitor will print zero for complex residuals. We can maybe include that in the "complex magic" PR that does a few fancy things. We can also update the step sizes in complex tests in that PR so that they use the underflow version. I will create an issue about this so we remember what is going on.

sseraj

Overall, the code changes look good from what I understand. Some clarification questions and comments below

src/NKSolver/NKSolvers.F90

src_cs/build/Makefile1

sseraj · 2021-03-14T22:11:54Z

src/NKSolver/NKSolvers.F90

@@ -3043,15 +3061,15 @@ subroutine physicalityCheckANK(lambdaP)
    real(kind=realType), pointer :: wvec_pointer(:)
    real(kind=realType), pointer :: dvec_pointer(:)
    real(kind=alwaysRealType) :: lambdaL ! L is for local
-    real(kind=realType) :: ratio
-
+    real(kind=alwaysRealType) :: tmp ! to receive the global step


I would prefer a name like lambdaP_tmp so it's easier to keep track of what's happening.

or even step_size if that is what it is

changed all instances of tmp in the two physicality checks to lambdaP_recv

This works for me

anilyil · 2021-03-15T10:18:43Z

Thanks for the review @sseraj, I will go through your comments and make the suggested changes. @joanibal can you review this anytime soon? I would rather address your comments together with addressing @sseraj's comments.

joanibal · 2021-03-16T15:55:17Z

src/NKSolver/NKSolvers.F90

@@ -3043,15 +3061,15 @@ subroutine physicalityCheckANK(lambdaP)
    real(kind=realType), pointer :: wvec_pointer(:)
    real(kind=realType), pointer :: dvec_pointer(:)
    real(kind=alwaysRealType) :: lambdaL ! L is for local
-    real(kind=realType) :: ratio
-
+    real(kind=alwaysRealType) :: tmp ! to receive the global step


or even step_size if that is what it is

joanibal · 2021-03-16T15:56:29Z

src/NKSolver/NKSolvers.F90


                      ! check density
+#ifndef USE_COMPLEX
+                      ! to have the real mode sliiiightly more efficient, just do stuff with real numbers


I don't get this comment. In real mode aren't you always working with real numbers?

Yeah, so in the real mode, I dont need to check if the variable is real. I will rephrase the comment

src/NKSolver/NKSolvers.F90

joanibal · 2021-03-16T16:00:59Z

src/NKSolver/NKSolvers.F90

+    lambdaP = tmp
+#else
+    ! finally, as a safety check, purge the complex part of lambda
+    lambdaP = cmplx(tmp, 0.0_realType)


So lambdaP is always real?
lambdaL is always real

Shouldn't all lambda's always be real

if all lambdas are always real, then you could probably get rid of your tmp variables

I needed the tmp because the MPI_MIN operation in the MPI_Allreduce is not defined for complex numbers. It just fails. So I have to use an always real type to do the allreduce.

I actually had a comment right above as well:

! mpi allreduce is not defined for complex numbers with the min operation ! so we will use the lambdaP_recv variable to receive

oh I just realized, lambdaP is not alwaysReal, it is just realType. As a result, I needed that additional always real variable to receive the MPI allreduce's output

src/solver/solvers.F90

tests/reg_tests/test_solver_combos.py

joanibal · 2021-03-16T16:38:14Z

Added a few suggestions in my review

joanibal · 2021-03-16T16:53:07Z

also, to be clear.

Does this resolve the issue of the slow convergence of the imaginary part after the real part has converged?

anilyil · 2021-03-16T18:14:15Z

So slow is a bit relative term here and I am not sure what you have in mind.

After this PR, the complex parts still lag a bit. But, even the small amount of lag is okay for matching 10 digits of accuracy. So for most practical cases, we can just converge the real part tightly (1e-15) and the complex convergence you get there is good enough to get a 10 digit match with the adjoint. You can do more iterations to further nail down the complex residuals, and the accuracy increases (I mean difference between CS and adjoint results get smaller as expected).

The changes to the atol in NK type solvers also mean that even when the complex residuals lag and the real part converged, you can just do one more call to the solver and that makes significant progress in the complex part. This is because before this PR, the linear solver would quit early because it would hit the atol limit. Now because atol is so much lower, it does more work solving the linear system even when real part is converged.

I have also tested this w/o resetting the real flow. With these cases, the real residual starts from machine zero whereas complex residuals start on the order of the complex step. Here, the complex convergence depends on the linear solver tolerance. With a linear solver tolerance of 1e-6, complex residuals converge after like 2-3 calls to the CFD solver. This also supports the theory we have been developing.

Does this answer your last question @joanibal?

…solvers

joanibal · 2021-03-16T18:33:18Z

Yes it does.

Thank you

anilyil · 2021-03-16T18:54:11Z

I addressed the comments in the fortran code. Now I will go over the tests again. I left all of the conversations as unresolved, please make sure that I addressed your comments @joanibal and @sseraj

anilyil · 2021-03-16T20:05:02Z

@sseraj is there a particular reason why we have 1e-16 convergence on the rans complex cases? It just hits machine zero and bounces around. Tests pass with an L2 convergence of 1e-15 as well, so I am thinking of just modifying that. Any objections here?

…nt with the original code now

sseraj · 2021-03-16T20:25:59Z

@sseraj is there a particular reason why we have 1e-16 convergence on the rans complex cases? It just hits machine zero and bounces around. Tests pass with an L2 convergence of 1e-15 as well, so I am thinking of just modifying that. Any objections here?

No particular reason that I can think of. If all the tests pass with 1e-15, then that should be fine.

anilyil · 2021-03-16T20:27:04Z

With my recent changes to the tests, I believe I addressed every comment from both of you. Please make sure if all is good. If the tests also pass, this is good to go for me.

joanibal

See comment about setting ANK_physLSTOl and other variables as always real times.

src/solver/solvers.F90

tests/reg_tests/test_solver_combos.py

src/NKSolver/NKSolvers.F90

…if solvers stall

anilyil · 2021-03-18T10:03:45Z

I updated the input files. The cube mesh is now a 4x4x4 block and the Euler tests also converge 12 orders of magnitude with 2 procs like the rest of the cases. Are there any other changes to the tests you want @joanibal ? I think now the last issue is to figure out the passing of alwaysRealType stuff from python to fortran?

anilyil · 2021-03-19T08:58:58Z

I think I addressed all of the comments with the tests and the code. The only thing I did not address yet is passing alwaysRealType options to Fortran with f2py. I created an issue for this #133 and I think this should be a separate PR because it touches more places in the code. @joanibal and @sseraj are you happy with the current state? Do you want any other changes?

sseraj

Thanks for creating the issue. This is good to go for me.

anilyil and others added 10 commits February 9, 2021 16:32

modified complex makefile so it checks if any of the files are modifi…

c1da4a0

…ed like the real part

added cube test

5849f3a

added alpha perturb

69965f4

hard-coded fixes for complex ank. the code works now, but hard coded …

bc285ad

…for complex only

Merge remote-tracking branch 'mdolab/master' into cmplx_solvers

8cf15c7

fixed how complex residuals are computed and also how all complex out…

b12b663

…put is printed

moved the complexify.f90 file to src_cs

3824b5b

ank checkpoint. decoupled ank works in complex mode

fa35144

Merge remote-tracking branch 'mdolab/master' into cmplx_solvers

9b4d38c

set atol properly for complex. cleaned up todos

fdfff9e

anilyil requested a review from a team as a code owner March 9, 2021 09:14

anilyil requested review from sseraj and Xiaosong2105 March 9, 2021 09:14

anilyil marked this pull request as draft March 9, 2021 09:14

anilyil requested a review from joanibal March 9, 2021 09:14

sseraj reviewed Mar 9, 2021

View reviewed changes

src/NKSolver/NKSolvers.F90 Outdated Show resolved Hide resolved

anilyil added 3 commits March 10, 2021 10:01

removed step and cfl from first none iteration print

b0a42b7

removed the old test and adding the new tests for solver combinations

4e290e0

fixed realtype capitalization

a8131f5

anilyil added 2 commits March 10, 2021 14:07

minor fix to nk solvers. added solver combo tests

c3a766a

fixed remaining ank routines

49852cf

anilyil marked this pull request as ready for review March 10, 2021 11:43

updated complex adjoint tests to use ank. also tweaked the tuning a bit

38de99b

typos

5a72a56

anilyil requested a review from sseraj March 10, 2021 16:00

Merge branch 'master' into cmplx_solvers

a17d510

sseraj reviewed Mar 14, 2021

View reviewed changes

joanibal requested changes Mar 16, 2021

View reviewed changes

Merge branch 'cmplx_solvers' of github.com:anilyil/adflow into cmplx_…

85b9440

…solvers

addressing the code comments

465ce33

anilyil added 2 commits March 16, 2021 23:18

updated tests

6013a78

reverted some other changes in complex tests. should be more consiste…

1adddc6

…nt with the original code now

anilyil requested review from sseraj and joanibal March 16, 2021 20:31

anilyil and others added 2 commits March 17, 2021 12:12

updated comments on atol explanation

dcaf568

minor edits to comments

3339da4

joanibal requested changes Mar 17, 2021

View reviewed changes

src/solver/solvers.F90 Show resolved Hide resolved

tests/reg_tests/test_solver_combos.py Outdated Show resolved Hide resolved

src/NKSolver/NKSolvers.F90 Show resolved Hide resolved

src/NKSolver/NKSolvers.F90 Show resolved Hide resolved

anilyil added 2 commits March 18, 2021 12:19

updated test for euler cases. also reduced ncycles to catch failures …

46ecee3

…if solvers stall

formatting

cd8ae44

sseraj approved these changes Mar 19, 2021

View reviewed changes

ewu63 requested a review from joanibal March 19, 2021 17:08

joanibal approved these changes Mar 19, 2021

View reviewed changes

joanibal merged commit 6c6443b into mdolab:master Mar 19, 2021

joanibal deleted the cmplx_solvers branch March 19, 2021 21:34

sseraj mentioned this pull request Mar 20, 2021

Zipper mesh bug #4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complex ANK+NK #130

Complex ANK+NK #130

anilyil commented Mar 9, 2021 •

edited

Loading

anilyil commented Mar 9, 2021

anilyil commented Mar 9, 2021

anilyil commented Mar 10, 2021

anilyil commented Mar 10, 2021 •

edited

Loading

sseraj left a comment

sseraj Mar 14, 2021

joanibal Mar 16, 2021

anilyil Mar 16, 2021

sseraj Mar 16, 2021

anilyil commented Mar 15, 2021

joanibal Mar 16, 2021

joanibal Mar 16, 2021

anilyil Mar 16, 2021

joanibal Mar 16, 2021

joanibal Mar 16, 2021

joanibal Mar 16, 2021

anilyil Mar 16, 2021

anilyil Mar 16, 2021

anilyil Mar 16, 2021

joanibal commented Mar 16, 2021

joanibal commented Mar 16, 2021 •

edited

Loading

anilyil commented Mar 16, 2021

joanibal commented Mar 16, 2021

anilyil commented Mar 16, 2021

anilyil commented Mar 16, 2021

sseraj commented Mar 16, 2021

anilyil commented Mar 16, 2021

joanibal left a comment

anilyil commented Mar 18, 2021

anilyil commented Mar 19, 2021

sseraj left a comment

Complex ANK+NK #130

Complex ANK+NK #130

Conversation

anilyil commented Mar 9, 2021 • edited Loading

Purpose

Type of change

Testing

Checklist

anilyil commented Mar 9, 2021

anilyil commented Mar 9, 2021

anilyil commented Mar 10, 2021

anilyil commented Mar 10, 2021 • edited Loading

sseraj left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anilyil commented Mar 15, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

joanibal commented Mar 16, 2021

joanibal commented Mar 16, 2021 • edited Loading

anilyil commented Mar 16, 2021

joanibal commented Mar 16, 2021

anilyil commented Mar 16, 2021

anilyil commented Mar 16, 2021

sseraj commented Mar 16, 2021

anilyil commented Mar 16, 2021

joanibal left a comment

Choose a reason for hiding this comment

anilyil commented Mar 18, 2021

anilyil commented Mar 19, 2021

sseraj left a comment

Choose a reason for hiding this comment

anilyil commented Mar 9, 2021 •

edited

Loading

anilyil commented Mar 10, 2021 •

edited

Loading

joanibal commented Mar 16, 2021 •

edited

Loading