Interpolation example
DanielVandH committed May 20, 2023
1 parent 4cab4d3 commit abd8031
Showing 19 changed files with 367 additions and 27 deletions.
8 changes: 4 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
@@ -35,12 +35,12 @@ sibson_1_vals = itp(_x, _y; method=Sibson(1))
nearest = itp(_x, _y; method=Nearest())

## Plot
function plot_2d(i, j, title, vals, xg, yg, x, y, show_scatter=true)
function plot_2d(fig, i, j, title, vals, xg, yg, x, y, show_scatter=true)
ax = Axis(fig[i, j], xlabel="x", ylabel="y", width=600, height=600, title=title, titlealign=:left)
contourf!(ax, xg, yg, reshape(vals, (length(xg), length(yg))), color=vals, colormap=:viridis, levels=-1:0.05:0, extendlow=:auto, extendhigh=:auto)
show_scatter && scatter!(ax, x, y, color=:red, markersize=14)
end
function plot_3d(i, j, title, vals, xg, yg)
function plot_3d(fig, i, j, title, vals, xg, yg)
ax = Axis3(fig[i, j], xlabel="x", ylabel="y", width=600, height=600, title=title, titlealign=:left)
surface!(ax, xg, yg, reshape(vals, (length(xg), length(yg))), color=vals, colormap=:viridis, levels=-1:0.05:0, extendlow=:auto, extendhigh=:auto)
end
@@ -49,8 +49,8 @@ all_vals = (sibson_vals, triangle_vals, laplace_vals, sibson_1_vals, nearest, ex
titles = ("(a): Sibson", "(b): Triangle", "(c): Laplace", "(d): Sibson-1", "(e): Nearest", "(f): Exact")
fig = Figure(fontsize=36)
for (i, (vals, title)) in enumerate(zip(all_vals, titles))
plot_2d(1, i, title, vals, xg, yg, x, y, !(vals === exact))
plot_3d(2, i, " ", vals, xg, yg)
plot_2d(fig, 1, i, title, vals, xg, yg, x, y, !(vals === exact))
plot_3d(fig, 2, i, " ", vals, xg, yg)
end
resize_to_layout!(fig)
fig
5 changes: 5 additions & 0 deletions docs/make.jl
@@ -19,6 +19,11 @@ makedocs(;
"Mathematical Details" => [
"Interpolation" => "interpolation_math.md",
"Differentiation" => "differentiation_math.md"
],
"Examples" => [
"Interpolation" => "interpolation.md",
"Differentiation" => "differentiation.md",
"Switzerland Elevation Data" => "swiss.md"
]
],
)
9 changes: 9 additions & 0 deletions docs/src/differentiation.md
@@ -0,0 +1,9 @@
```@meta
CurrentModule = NaturalNeighbours
```

# Differentiation Example

The purpose of this example is to explore derivative generation. For this, it is important to note that we are thinking of _generating_ derivatives rather than _estimating_ them: Following [Alfeld (1989)](https://doi.org/10.1016/B978-0-12-460515-2.50005-6), derivative generation only seeks to find derivatives that best fit our assumptions of the data, i.e. that give a most satisfactory interpolant, rather than trying to find exact derivative values. The complete quote for this by [Alfeld (1989)](https://doi.org/10.1016/B978-0-12-460515-2.50005-6) is below:

> It seems inevitable that in order to obtain an interpolant that is both local and smooth one has to supply derivative data. Typically, such data are not part of the interpolation problem and have to be made up from existing functional data. This process is usually referred to as derivative estimation, but this is probably a misnomer. The objective is not to estimate existing but unknown values of derivatives. Instead, it is to generate values that will yield a satisfactory interpolant. Even if an underlying primitive function did exist it might be preferable to use derivative values that differ from the exact ones. (For example, a maximum error might be decreased by using the "wrong" derivative values.) Therefore, I prefer the term derivative generation rather than derivative estimation.
2 changes: 2 additions & 0 deletions docs/src/differentiation_math.md
@@ -2,6 +2,8 @@
CurrentModule = NaturalNeighbours
```

# Differentiation

In this section, we give some of the mathematical detail used for implementing derivative generation, following this [thesis](https://kluedo.ub.rptu.de/frontdoor/deliver/index/docId/2104/file/diss.bobach.natural.neighbor.20090615.pdf). The discussion that follows is primarily sourced from Chapter 6 of the linked thesis. While it is possible to generate derivatives of arbitrary order, our discussion here will be limited to gradient and Hessian generation. These ideas are implemented by the `generate_gradients` and `generate_derivatives` functions, which you should use via the `differentiate` function.

# Generation at Data Sites
Binary file removed docs/src/figures/example_1.png
Binary file removed docs/src/figures/example_2.png
Binary file added docs/src/figures/sibson_vs_sibson1.png
Binary file added docs/src/figures/sibson_vs_sibson1_errors.png
230 changes: 230 additions & 0 deletions docs/src/interpolation.md
@@ -0,0 +1,230 @@
```@meta
CurrentModule = NaturalNeighbours
```

# Interpolation Example

Let us give an example of how interpolation can be performed. We consider the function

```math
f(x, y) = \frac{1}{9}\left[\tanh\left(9y - 9x\right) + 1\right].
```

First, let us generate some data.

```julia
using NaturalNeighbours
using CairoMakie
using StableRNGs

f = (x, y) -> (1 / 9) * (tanh(9 * y - 9 * x) + 1)

rng = StableRNG(123)
x = rand(rng, 100)
y = rand(rng, 100)
z = f.(x, y)
```

We can now construct our interpolant. To use the Sibson-1 interpolant, we need to have gradient information, so we specify `derivatives=true` to make sure these get generated at the data sites.

```julia
itp = interpolate(x, y, z; derivatives=true)
```

This `itp` is now callable. Let's generate a grid to evaluate `itp` at.

```julia
xg = LinRange(0, 1, 100)
yg = LinRange(0, 1, 100)
_x = vec([x for x in xg, _ in yg])
_y = vec([y for _ in xg, y in yg])
```
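As an aside, the flattened grids built above can equivalently be expressed with `repeat` from Base. Since `vec` flattens column-major, the x-coordinates cycle fastest; this sketch is independent of the interpolant itself:

```julia
# Flattened evaluation grids: comprehension + vec versus repeat.
xg = LinRange(0, 1, 3)
yg = LinRange(0, 1, 2)
_x = vec([x for x in xg, _ in yg])
_y = vec([y for _ in xg, y in yg])
_x == repeat(collect(xg), outer=length(yg))  # true: x cycles fastest
_y == repeat(collect(yg), inner=length(xg))  # true: each y repeated per x
```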

We use vectors for this evaluation rather than evaluating with, say, `[itp(x, y) for x in xg, y in yg]`, since `itp` will use multithreading when given vectors. Consider the following times (including the time to construct the vectors in the vector case):

```julia
using BenchmarkTools
function testf1(itp, xg, yg, parallel)
_x = vec([x for x in xg, _ in yg])
_y = vec([y for _ in xg, y in yg])
return itp(_x, _y; parallel=parallel)
end
testf2(itp, xg, yg) = [itp(x, y) for x in xg, y in yg]
b1 = @benchmark $testf1($itp, $xg, $yg, $true)
b2 = @benchmark $testf2($itp, $xg, $yg)
b3 = @benchmark $testf1($itp, $xg, $yg, $false)
```
```julia-repl
julia> b1
BenchmarkTools.Trial: 2310 samples with 1 evaluation.
Range (min … max): 1.333 ms … 165.550 ms ┊ GC (min … max): 0.00% … 98.28%
Time (median): 1.781 ms ┊ GC (median): 0.00%
Time (mean ± σ): 2.155 ms ± 3.446 ms ┊ GC (mean ± σ): 3.27% ± 2.04%
▄▆█▄▁
▂▂▄▃▅▆█████▆▅▃▃▂▂▂▂▂▂▂▃▂▂▃▃▃▃▃▃▄▃▄▄▄▄▅▅▄▄▄▄▄▃▃▃▄▃▂▂▂▂▁▂▁▁▁▁ ▃
1.33 ms Histogram: frequency by time 3.33 ms <
Memory estimate: 254.33 KiB, allocs estimate: 146.
julia> b2
BenchmarkTools.Trial: 257 samples with 1 evaluation.
Range (min … max): 14.790 ms … 27.120 ms ┊ GC (min … max): 0.00% … 0.00%
Time (median): 18.136 ms ┊ GC (median): 0.00%
Time (mean ± σ): 19.531 ms ± 4.177 ms ┊ GC (mean ± σ): 0.00% ± 0.00%
▅█
▆██▄▄▄▄▂▅▁▁▄▃▄▅▃▃▃▃▁▃▃▁▃▄▃▃▃▂▂▄▂▃▃▂▄▂▄▄▃▂▄▄▃▃▄▃▄▄▃▄▃▃▄▄▂▄▅▄ ▃
14.8 ms Histogram: frequency by time 26.7 ms <
Memory estimate: 78.17 KiB, allocs estimate: 2.
julia> b3
BenchmarkTools.Trial: 267 samples with 1 evaluation.
Range (min … max): 14.986 ms … 27.264 ms ┊ GC (min … max): 0.00% … 0.00%
Time (median): 17.354 ms ┊ GC (median): 0.00%
Time (mean ± σ): 18.710 ms ± 3.750 ms ┊ GC (mean ± σ): 0.00% ± 0.00%
▄█
▄██▇▄▃▃▄▃▃▃▃▃▃▃▄▄▂▃▃▃▃▃▃▃▃▃▃▃▃▂▁▃▃▃▃▃▃▃▃▃▂▃▃▂▃▃▂▂▁▂▃▃▂▃▂▃▄▃ ▃
15 ms Histogram: frequency by time 26.7 ms <
Memory estimate: 234.67 KiB, allocs estimate: 10.
```

Note that `itp(_x, _y)` takes about 1.3 ms, while the latter two approaches both take around 15 ms. This is impressive given that we are evaluating $100^2$ points, and is a big advantage of local interpolation: it makes parallel evaluation cheap and simple. The effect is even clearer with more points:
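The reason parallel evaluation is so natural here is that each query point is evaluated independently, so the points can simply be split across threads. A minimal sketch of this pattern follows; `pointwise` is a stand-in for a single-point evaluation, not the package's internals (the package handles this for you when given vectors):

```julia
# Sketch: split independent point evaluations across threads.
function parallel_eval(pointwise, xs, ys)
    out = Vector{Float64}(undef, length(xs))
    Threads.@threads for i in eachindex(xs, ys)
        out[i] = pointwise(xs[i], ys[i])  # iterations share no state
    end
    return out
end

parallel_eval((x, y) -> x + y, [1.0, 2.0], [3.0, 4.0])  # [4.0, 6.0]
```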

```julia
xg = LinRange(0, 1, 1000)
yg = LinRange(0, 1, 1000)
b1 = @benchmark $testf1($itp, $xg, $yg, $true)
b2 = @benchmark $testf2($itp, $xg, $yg)
b3 = @benchmark $testf1($itp, $xg, $yg, $false)
```
```julia-repl
julia> b1
BenchmarkTools.Trial: 27 samples with 1 evaluation.
Range (min … max): 132.766 ms … 354.348 ms ┊ GC (min … max): 0.00% … 0.00%
Time (median): 144.794 ms ┊ GC (median): 0.00%
Time (mean ± σ): 188.429 ms ± 79.396 ms ┊ GC (mean ± σ): 0.37% ± 2.38%
▁█▃▁
████▄▄▄▁▁▁▁▁▁▄▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▄▁▁▁▁▄▄▁▁▁▁▁▁▁▁▄▁▁▁▁▁▁▁▄▇ ▁
133 ms Histogram: frequency by time 354 ms <
Memory estimate: 22.91 MiB, allocs estimate: 157.
julia> b2
BenchmarkTools.Trial: 2 samples with 1 evaluation.
Range (min … max): 2.564 s … 2.794 s ┊ GC (min … max): 0.00% … 0.00%
Time (median): 2.679 s ┊ GC (median): 0.00%
Time (mean ± σ): 2.679 s ± 162.574 ms ┊ GC (mean ± σ): 0.00% ± 0.00%
█ █
█▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁█ ▁
2.56 s Histogram: frequency by time 2.79 s <
Memory estimate: 7.63 MiB, allocs estimate: 2.
julia> b3
BenchmarkTools.Trial: 2 samples with 1 evaluation.
Range (min … max): 2.557 s … 2.624 s ┊ GC (min … max): 0.00% … 0.00%
Time (median): 2.590 s ┊ GC (median): 0.00%
Time (mean ± σ): 2.590 s ± 46.978 ms ┊ GC (mean ± σ): 0.00% ± 0.00%
█ █
█▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁█ ▁
2.56 s Histogram: frequency by time 2.62 s <
Memory estimate: 22.89 MiB, allocs estimate: 10.
```

Now let's continue with the example. We compare Sibson-0 to Sibson-1 (going back to the original definitions of `xg` and `yg` with $100^2$ points):

```julia
sib_vals = itp(_x, _y)
sib1_vals = itp(_x, _y; method=Sibson(1))
```

Now we can plot.

```julia
function plot_itp(fig, x, y, vals, title, i, show_data_sites, itp, xd=nothing, yd=nothing, show_3d=true, levels=-0.1:0.05:0.3)
ax = Axis(fig[1, i], xlabel="x", ylabel="y", width=600, height=600, title=title, titlealign=:left)
c = contourf!(ax, x, y, vals, color=vals, colormap=:viridis, levels=levels, extendhigh=:auto)
show_data_sites && scatter!(ax, xd, yd, color=:red, markersize=9)
tri = itp.triangulation
ch_idx = get_convex_hull_indices(tri)
lines!(ax, [get_point(tri, i) for i in ch_idx], color=:white, linewidth=4, linestyle=:dash)
if show_3d
ax = Axis3(fig[2, i], xlabel="x", ylabel="y", zlabel=L"z", width=600, height=600, title=" ", titlealign=:left, azimuth=0.49)
surface!(ax, x, y, vals, color=vals, colormap=:viridis, colorrange=(-0.1, 0.3))
zlims!(ax, 0, 0.25)
end
return c
end
```

```@raw html
<figure>
<img src='../figs/sibson_vs_sibson1.png' alt='Sibson and Sibson-1 Interpolation'><br>
</figure>
```

The red points in (c) show the data used for interpolating. The results are fairly similar, although Sibson-1 is bumpier in the flat (zero) region of the function while being smoother along the rising front.

Note that we are extrapolating in some parts of this function, where extrapolation refers to evaluating outside of the convex hull of the data sites. This convex hull is shown in white above. If you want to avoid extrapolating entirely, use `project=false`, which replaces any extrapolated values with `Inf`.
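If you do use `project=false` and want to discard the extrapolated values afterwards, one simple approach is to mask out the non-finite entries. A sketch with made-up values:

```julia
# Sketch: drop extrapolated values, which come back as Inf with project=false.
# The values here are made up for illustration.
vals = [0.11, Inf, 0.07, Inf, 0.22]
mask = isfinite.(vals)      # true only at points inside the convex hull
finite_vals = vals[mask]    # [0.11, 0.07, 0.22]
```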

```julia
sib_vals = itp(_x, _y; project=false)
sib1_vals = itp(_x, _y; method=Sibson(1), project=false)
fig = Figure(fontsize=36)
plot_itp(fig, _x, _y, sib_vals, "(a): Sibson", 1, false, itp, x, y)
plot_itp(fig, _x, _y, sib1_vals, "(b): Sibson-1", 2, false, itp, x, y)
plot_itp(fig, _x, _y, f.(_x, _y), "(c): Exact", 3, true, itp, x, y)
resize_to_layout!(fig)
fig
```

```@raw html
<figure>
<img src='../figs/sibson_vs_sibson1_project_false.png' alt='Sibson and Sibson-1 Interpolation without Extrapolation'><br>
</figure>
```

To get a better comparison of the two interpolants, let's plot the errors at each point, including extrapolation.

```julia
sib_vals = itp(_x, _y)
sib1_vals = itp(_x, _y; method=Sibson(1))
sib_errs = @. 100abs(sib_vals - f(_x, _y))
sib1_errs = @. 100abs(sib1_vals - f(_x, _y))

fig = Figure(fontsize=36)
plot_itp(fig, _x, _y, sib_errs, "(a): Sibson", 1, true, itp, x, y, false, 0:0.5:3)
c = plot_itp(fig, _x, _y, sib1_errs, "(b): Sibson-1", 2, true, itp, x, y, false, 0:0.5:3)
Colorbar(fig[1, 3], c)
resize_to_layout!(fig)
fig
```

```@raw html
<figure>
<img src='../figs/sibson_vs_sibson1_errors.png' alt='Sibson and Sibson-1 Interpolation Errors'><br>
</figure>
```

We see that the Sibson-1 interpolant has less error overall. To summarise these errors into a single scalar, we can use the relative root mean square error, defined by

```math
\varepsilon_{\text{rrmse}}(\boldsymbol y, \hat{\boldsymbol y}) = 100\sqrt{\frac{\sum_i (y_i - \hat y_i)^2}{\sum_i \hat y_i^2}}.
```
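This definition translates directly into a small helper; the name `rrmse` is ours for illustration, not part of the package (`y` are the interpolated values, `yhat` the exact ones):

```julia
# Relative root-mean-square error, as defined above (in percent).
rrmse(y, yhat) = 100 * sqrt(sum(abs2, y .- yhat) / sum(abs2, yhat))

rrmse([1.0, 2.0], [1.0, 2.0])  # 0.0
```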

```julia-repl
julia> esib0 = 100sqrt(sum((sib_vals .- f.(_x, _y)).^2) / sum(f.(_x, _y).^2))
8.17183532625033
julia> esib1 = 100sqrt(sum((sib1_vals .- f.(_x, _y)).^2) / sum(f.(_x, _y).^2))
6.889270820893804
```

16 changes: 16 additions & 0 deletions docs/src/interpolation_math.md
@@ -2,6 +2,8 @@
CurrentModule = NaturalNeighbours
```

# Interpolation

In this section, we give some of the mathematical background behind natural neighbour interpolation, and other interpolation methods provided from this package. The discussion here will be limited, but you can see this [thesis](https://kluedo.ub.rptu.de/frontdoor/deliver/index/docId/2104/file/diss.bobach.natural.neighbor.20090615.pdf) or this [Wikipedia article](https://en.wikipedia.org/wiki/Natural_neighbor_interpolation) for more information. The discussion that follows is primarily sourced from the linked thesis. These ideas are implemented by the `interpolate` function.

# Voronoi Tessellation
@@ -224,6 +226,20 @@ An example of what this interpolant looks like is given below.
</figure>
```

# Regions of Influence

The _region of influence_ for the natural neighbour coordinates associated with a point $\boldsymbol x_i$ is the interior of the union of all circumcircles coming from the triangles of the underlying triangulation that pass through $\boldsymbol x_i$. We can visualise this for the coordinates defined above. (This definition of the region of influence does not necessarily generalise to the triangle and nearest neighbour coordinates, but we still compare them.)

We take a set of data sites in $[-1, 1]^2$ such that all function values are zero except for $z_1 = 1$ at $\boldsymbol x_1 = \boldsymbol 0$. Using this setup, we obtain the following results (see also Figure 3.6 of Bobach's thesis linked previously):

```@raw html
<figure>
<img src='../figs/influence.png' alt='Region of Influence'><br>
</figure>
```

We can indeed see the effect of the region of influence about this single point $\boldsymbol x_1$. Note also that $f^{\text{SIB}1}$ is much smoother than the others.

# Extrapolation

An important consideration is extrapolation. Currently, all the methods above assume that the query point $\boldsymbol x_0$ is in $\mathcal C(\boldsymbol X)$, and the interpolation near the boundary of $\mathcal C(\boldsymbol X)$ often has some weird effects. There are many approaches available for extrapolation, such as with [ghost points](https://doi.org/10.1016/j.cad.2008.08.007), although these are not implemented in this package (yet!).
2 changes: 1 addition & 1 deletion src/differentiation/differentiate.jl
@@ -21,7 +21,7 @@ The available keyword arguments are:
- `method=default_diff_method(∂)`: Default method for evaluating the interpolant. `default_diff_method(∂)` returns `Direct()` if the underlying interpolant has no gradients, and `Iterative()` otherwise. The method must be an [`AbstractDifferentiator`](@ref).
- `interpolant_method=Sibson()`: The method used for evaluating the interpolant to estimate `zᵢ` for the latter three methods. See [`AbstractInterpolator`](@ref) for the available methods.
- `rng=Random.default_rng()`: The random number generator used for estimating `zᵢ` for the latter three methods, or for constructing the natural coordinates.
- `project=false`: Whether to project any extrapolated points onto the boundary of the convex hull of the data sites and perform two-point interpolation, or to simply replace any extrapolated values with `NaN`, when evaluating the interpolant in the latter three methods.
- `project=false`: Whether to project any extrapolated points onto the boundary of the convex hull of the data sites and perform two-point interpolation, or to simply replace any extrapolated values with `Inf`, when evaluating the interpolant in the latter three methods.
- `use_cubic_terms=true`: If estimating second order derivatives, whether to use cubic terms. Only relevant for `method == Direct()`.
- `alpha=0.1`: If estimating second order derivatives, the weighting parameter used for estimating the second order derivatives. Only relevant for `method == Iterative()`.
- `use_sibson_weight=true`: Whether to weight the residuals in the associated least squares problems by the associated Sibson coordinates. Only relevant for `method == Iterative()` if `order == 2`.
2 changes: 1 addition & 1 deletion src/interpolation/extrapolation.jl
@@ -24,7 +24,7 @@ function two_point_interpolate!(coordinates::AbstractVector{F}, envelope, tri, i
else
resize!(coordinates, 1)
resize!(envelope, 1)
coordinates[1] = F(NaN)
coordinates[1] = F(Inf)
envelope[1] = i
end
return NaturalCoordinates(coordinates, envelope, r, tri)
2 changes: 1 addition & 1 deletion src/interpolation/interpolate.jl
@@ -27,7 +27,7 @@ In each method, `method` defines the method used for evaluating the interpolant,
method, `parallel` is ignored, but for the latter two methods it defines whether to use multithreading or not for evaluating the interpolant at
all the points. The `kwargs...` argument is passed into `add_point!` from DelaunayTriangulation.jl, e.g. you could pass some `rng`. Lastly,
the `project` argument determines whether extrapolation is performed by projecting any exterior points onto the boundary of the convex hull
of the data sites and performing two-point interpolation, or to simply replace any extrapolated values with `NaN`.
of the data sites and performing two-point interpolation, or to simply replace any extrapolated values with `Inf`.
"""
interpolate(tri::Triangulation, z; gradient=nothing, hessian=nothing, kwargs...) = NaturalNeighboursInterpolant(tri, z, gradient, hessian; kwargs...)
