
add Deep Galerkin method #802

Merged
merged 26 commits into SciML:master on Mar 4, 2024
Conversation

ayushinav
Contributor

@ayushinav ayushinav commented Feb 8, 2024

Checklist

  • Appropriate tests were added
  • Any code changes were done in a way that does not break public API
  • All documentation related to code changes were updated
  • The new code follows the
    contributor guidelines, in particular the SciML Style Guide and
    COLPRAC.
  • Any new documentation only uses public API

Additional context

Adding the Deep Galerkin method described here; fixes #220. Not sure how to add further doc updates.

Review comments (outdated, resolved) on src/deep_galerkin.jl, test/other_algs_test.jl, and test/runtests.jl.
@sathvikbhagavan
Member

@ayushinav can you add the test group in https://github.com/SciML/NeuralPDE.jl/blob/master/.github/workflows/CI.yml#L27 to actually run the tests, and @ChrisRackauckas can you enable GHA workflows in the PR? (I guess once @ayushinav pushes his latest changes, especially adding the test group.)

@ayushinav
Contributor Author

@ChrisRackauckas and @sathvikbhagavan, I guess it successfully ran the tests. Please let me know if anything else needs to be added for this.

@ChrisRackauckas
Member

This is missing docs but I think it's good to go.

@ayushinav
Contributor Author

This is missing docs but I think it's good to go.

I thought the docs referred to the ? docstrings in the REPL, but I realize it might be something in the tutorials/examples?

@sathvikbhagavan
Member

I thought the docs referred to the ? docstrings in the REPL, but I realize it might be something in the tutorials/examples?

Yes, we should add a tutorial for it.

@ayushinav
Contributor Author

@ayushinav, one of the DGM tests is failing. Can you look into that?

Hopefully, the last commit will fix it; the test passes with a better margin on my system.

@ayushinav
Contributor Author

@ChrisRackauckas looks like it's all good

@ChrisRackauckas
Member

2 hour and 46 minute test suite: is going that far required for convergence?

@ayushinav
Contributor Author

2 hour and 46 minute test suite: is going that far required for convergence?

We have 3 examples for testing. Of all the configurations I could reasonably try, these seemed to work best. I could increase the tolerance threshold to pass the tests, but that doesn't seem like the best idea to me.

@ayushinav
Contributor Author

Solving PDEs using the Deep Galerkin Method

Overview

The Deep Galerkin Method is a meshless deep learning algorithm for solving high-dimensional PDEs. It approximates the solution of a PDE with a neural network. The loss function of the network is defined in a similar spirit to PINNs, composed of the PDE loss and the boundary condition loss.
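
Schematically (following the original DGM paper rather than any code in this PR), for a PDE $\partial_t u + \mathcal{L} u = 0$ with boundary data $g$ and initial data $u_0$, the network $f(t, x; \theta)$ is trained by minimizing squared residuals over the interior, the boundary, and the initial time slice:

$$ J(f) = \left\| \partial_t f + \mathcal{L} f \right\|^2_{[0,T] \times \Omega} + \left\| f - g \right\|^2_{[0,T] \times \partial \Omega} + \left\| f(0, \cdot) - u_0 \right\|^2_{\Omega} $$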

Since the cost functions can be computationally intensive to calculate, the algorithm instead randomly samples points and trains on them, similar to stochastic gradient descent.
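
In NeuralPDE.jl, this choice is expressed through the training strategy passed to the discretizer. As a minimal illustration (the same call is used in the full example further down):

using NeuralPDE

# quasi-random sampling of 4000 collocation points (same settings as the example below)
strategy = QuasiRandomTraining(4_000, minibatch = 500)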

Algorithm

The authors of DGM suggest a network composed of LSTM-type layers that work well for most parabolic and quasi-parabolic PDEs.

$$
\begin{align*}
S^1 &= \sigma_1(W^1 \vec{x} + b^1); \\
Z^l &= \sigma_1(U^{z,l} \vec{x} + W^{z,l} S^l + b^{z,l}); \quad l = 1, \ldots, L; \\
G^l &= \sigma_1(U^{g,l} \vec{x} + W^{g,l} S^l + b^{g,l}); \quad l = 1, \ldots, L; \\
R^l &= \sigma_1(U^{r,l} \vec{x} + W^{r,l} S^l + b^{r,l}); \quad l = 1, \ldots, L; \\
H^l &= \sigma_2(U^{h,l} \vec{x} + W^{h,l}(S^l \cdot R^l) + b^{h,l}); \quad l = 1, \ldots, L; \\
S^{l+1} &= (1 - G^l) \cdot H^l + Z^l \cdot S^l; \quad l = 1, \ldots, L; \\
f(t, x; \theta) &= \sigma_{out}(W S^{L+1} + b),
\end{align*}
$$

where $\vec{x}$ is the concatenated vector $(t, x)$ and $L$ is the number of LSTM-type layers in the network.
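
As a plain-Julia sketch of these updates (for illustration only, not the implementation added in this PR; it assumes $\sigma_1 = \sigma_2 = \tanh$, a linear output, and randomly initialized dense weights):

struct DGMLayer
    Uz; Wz; bz
    Ug; Wg; bg
    Ur; Wr; br
    Uh; Wh; bh
end

function DGMLayer(d::Int, m::Int)
    U() = randn(m, d) ./ sqrt(d)   # weights acting on the input x⃗ = (t, x)
    W() = randn(m, m) ./ sqrt(m)   # weights acting on the hidden state S
    b() = zeros(m)
    DGMLayer(U(), W(), b(), U(), W(), b(), U(), W(), b(), U(), W(), b())
end

# one LSTM-type update: compute S^{l+1} from S^l and the input x⃗
function (layer::DGMLayer)(x, S; σ1 = tanh, σ2 = tanh)
    Z = σ1.(layer.Uz * x .+ layer.Wz * S .+ layer.bz)
    G = σ1.(layer.Ug * x .+ layer.Wg * S .+ layer.bg)
    R = σ1.(layer.Ur * x .+ layer.Wr * S .+ layer.br)
    H = σ2.(layer.Uh * x .+ layer.Wh * (S .* R) .+ layer.bh)
    return (1 .- G) .* H .+ Z .* S
end

# full network: input layer, L LSTM-type layers, then a linear output layer
function dgm_forward(x, W1, b1, layers, Wout, bout; σ1 = tanh, σout = identity)
    S = σ1.(W1 * x .+ b1)           # S¹ = σ1(W¹ x⃗ + b¹)
    for layer in layers
        S = layer(x, S)             # S^{l+1}
    end
    return σout.(Wout * S .+ bout)  # f(t, x; θ) = σ_out(W S^{L+1} + b)
end

For the Burgers' example below, d = 2 inputs (t, x), m = 50, and L = 5 would be a plausible reading of the DeepGalerkin(2, 1, 50, 5, tanh, tanh, identity, strategy) call used there, but the exact argument meanings are defined by the implementation in this PR, not by this sketch.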

Example

Let's try to solve the following Burgers' equation using the Deep Galerkin Method for $\alpha = 0.05$ and compare our solution with the finite difference method:

$$ \partial_t u(t, x) + u(t, x) \partial_x u(t, x) - \alpha \partial_{xx} u(t, x) = 0 $$

defined over

$$ t \in [0, 1], x \in [-1, 1] $$

with boundary conditions

$$\begin{align*} u(0, x) &= -\sin(\pi x), \\ u(t, -1) &= 0, \\ u(t, 1) &= 0. \end{align*}$$

Copy-Pasteable code

using NeuralPDE
using ModelingToolkit, Optimization, OptimizationOptimisers
import Lux: tanh, identity
using Distributions
import ModelingToolkit: Interval, infimum, supremum
using MethodOfLines, OrdinaryDiffEq

@parameters x t
@variables u(..)

Dt = Differential(t)
Dx = Differential(x)
Dxx = Dx^2
α = 0.05;
# Burgers' equation
eq = Dt(u(t, x)) + u(t, x) * Dx(u(t, x)) - α * Dxx(u(t, x)) ~ 0

# boundary conditions
bcs = [
    u(0.0, x) ~ -sin(π * x),
    u(t, -1.0) ~ 0.0,
    u(t, 1.0) ~ 0.0
]

domains = [t ∈ Interval(0.0, 1.0), x ∈ Interval(-1.0, 1.0)]

# MethodOfLines, for FD solution
dx = 0.01
order = 2
discretization = MOLFiniteDifference([x => dx], t, saveat = 0.01)
@named pde_system = PDESystem(eq, bcs, domains, [t, x], [u(t, x)])
prob = discretize(pde_system, discretization)
sol = solve(prob, Tsit5())
ts = sol[t]
xs = sol[x]

u_MOL = sol[u(t,x)]

# NeuralPDE, using Deep Galerkin Method
strategy = QuasiRandomTraining(4_000, minibatch = 500);
# 2 inputs (t, x) and 1 output u(t, x); the remaining arguments set the network
# size, activations, output activation, and the training strategy
discretization = DeepGalerkin(2, 1, 50, 5, tanh, tanh, identity, strategy);
@named pde_system = PDESystem(eq, bcs, domains, [t, x], [u(t, x)]);
prob = discretize(pde_system, discretization);
global iter = 0;
callback = function (p, l)
    global iter += 1;
    if iter%20 == 0
        println("$iter => $l")
    end
    return false
end

res = Optimization.solve(prob, Adam(0.01); callback = callback, maxiters = 300);
phi = discretization.phi;

u_predict = [first(phi([t, x], res.minimizer)) for t in ts, x in xs]

diff_u = abs.(u_predict .- u_MOL);

using Plots
p1 = plot(ts, xs, u_MOL', linetype = :contourf, title = "FD");
p2 = plot(ts, xs, u_predict', linetype = :contourf, title = "predict");
p3 = plot(ts, xs, diff_u', linetype = :contourf, title = "error");
plot(p1, p2, p3)

[figure: contour plots of the FD solution, the DGM prediction, and the error]

@ChrisRackauckas
Member

Since the cost functions can be computationally intensive to calculate, the algorithm instead randomly samples points and trains on them, similar to stochastic gradient descent.

That's not entirely true. With quadrature it's not random. Delete that and I think this is good to go.

@ayushinav
Contributor Author

ayushinav commented Mar 4, 2024

Since the cost functions can be computationally intensive to calculate, the algorithm instead randomly samples points and trains on them, similar to stochastic gradient descent.

That's not entirely true. With quadrature it's not random. Delete that and I think this is good to go.

I agree that it's not entirely true for quadrature in general, and in NeuralPDE.jl we have the capability for different quadrature strategies as well, but for DGM, that's what they say. Here's the snippet from the paper, Section 2:
[screenshot from the DGM paper, Section 2, describing the random sampling of points]

Maybe I understood something wrong but wanted to clarify before going ahead.

@ChrisRackauckas
Member

That's true in their paper but not with the implementation you have in the library.

@ayushinav
Contributor Author

That's true in their paper but not with the implementation you have in the library.

I see. I thought using QuasiRandomTraining does the random sampling they do, but agree a user won't necessarily do that.
I'll push the changes for now and try to understand what's going on. Thanks!

@ChrisRackauckas
Member

Then say, "In this instance we will demonstrate training using Quasi-Random Sampling, a technique that ...."

@ayushinav
Contributor Author

Then say, "In this instance we will demonstrate training using Quasi-Random Sampling, a technique that ...."

Got it. That helps!

@ayushinav ayushinav reopened this Mar 4, 2024
@ayushinav ayushinav closed this Mar 4, 2024
@ayushinav ayushinav reopened this Mar 4, 2024
@ChrisRackauckas
Member

Let's merge, but please follow up with some test cuts. I think we can cut the quasi-random points from 4000 😅

@ChrisRackauckas ChrisRackauckas merged commit a20efb6 into SciML:master Mar 4, 2024
15 of 21 checks passed