
<p align="center">
    <img src="https://github.com/GeostatsGuy/GeostatsPy/blob/master/TCG_color_logo.png?raw=true" width="220" height="240" />

</p>

## Interactive Simple Kriging Behavior Demonstration


### Michael Pyrcz, Associate Professor, University of Texas at Austin 

##### [Twitter](https://twitter.com/geostatsguy) | [GitHub](https://github.com/GeostatsGuy) | [Website](http://michaelpyrcz.com) | [GoogleScholar](https://scholar.google.com/citations?user=QVZ20eQAAAAJ&hl=en&oi=ao) | [Book](https://www.amazon.com/Geostatistical-Reservoir-Modeling-Michael-Pyrcz/dp/0199731446) | [YouTube](https://www.youtube.com/channel/UCLqEr-xV-ceHdXXXrTId5ig)  | [LinkedIn](https://www.linkedin.com/in/michael-pyrcz-61a648a1)


### The Interactive Workflow

Here's a simple workflow for demonstrating the behavior of simple kriging, showing how spatial continuity controls the impact of:
* data closeness
* data redundancy

See my YouTube channel for more.

#### Spatial Estimation

Consider the case of making an estimate at some unsampled location, $z(\bf{u}_0)$, where $z$ is the property of interest (e.g. porosity) and $\bf{u}_0$ is a location vector describing the unsampled location.

How would you do this given data, $z(\bf{u}_1)$, $z(\bf{u}_2)$, and $z(\bf{u}_3)$?

It would be natural to use a set of linear weights to formulate the estimator given the available data.

\begin{equation}
z^{*}(\bf{u}) = \sum^{n}_{\alpha = 1} \lambda_{\alpha} z(\bf{u}_{\alpha})
\end{equation}

We could add an unbiasedness constraint that forces the weights to sum to one. Instead, we will assign the remainder of the weight (one minus the sum of weights) to the global average; therefore, if we have no informative data we will estimate with the global average of the property of interest.

\begin{equation}
z^{*}(\bf{u}) = \sum^{n}_{\alpha = 1} \lambda_{\alpha} z(\bf{u}_{\alpha}) + \left(1-\sum^{n}_{\alpha = 1} \lambda_{\alpha} \right) \overline{z}
\end{equation}

We will make a stationarity assumption, so let's assume that we are working with residuals, $y$. 

\begin{equation}
y^{*}(\bf{u}) = z^{*}(\bf{u}) - \overline{z}(\bf{u})
\end{equation}

If we substitute this form into our estimator the estimator simplifies, since the mean of the residual is zero.

\begin{equation}
y^{*}(\bf{u}) = \sum^{n}_{\alpha = 1} \lambda_{\alpha} y(\bf{u}_{\alpha})
\end{equation}

while satisfying the unbiasedness constraint.
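
To make the two forms of the estimator concrete, here is a minimal sketch with made-up numbers. The data values, weights and global mean are hypothetical and only for illustration; the weights are not kriging weights yet. It shows that assigning the remaining weight to the global mean gives the same estimate as working with residuals and adding the mean back.

```python
import numpy as np

# hypothetical data values, weights and global mean (illustrative only)
z = np.array([0.12, 0.18, 0.15])          # data values z(u_alpha), e.g. porosity
lam = np.array([0.5, 0.2, 0.1])           # weights lambda_alpha, they sum to 0.8
z_bar = 0.14                              # global mean

# estimator with the remaining weight assigned to the global mean
z_est = np.sum(lam * z) + (1.0 - np.sum(lam)) * z_bar

# same estimator written with residuals, y = z - z_bar
y = z - z_bar
y_est = np.sum(lam * y)
z_est_from_residual = y_est + z_bar

print(z_est, z_est_from_residual)
```

If all the weights go to zero (no informative data), both forms return the global mean.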

#### Kriging

Now the next question is what weights should we use?  

We could use equal weighting, $\lambda = \frac{1}{n}$, and the estimator would be the average of the local data applied for the spatial estimate. This would not be very informative, since it ignores where the data are located relative to the estimate.

We could assign weights considering the spatial context of the data and the estimate:

* **spatial continuity** as quantified by the variogram (and covariance function)
* **redundancy** the degree of spatial continuity between all of the available data with themselves
* **closeness** the degree of spatial continuity between the available data and the estimation location

The kriging approach accomplishes this, calculating the best linear unbiased weights for the local data to estimate at the unknown location.  The derivation of the kriging system and the resulting linear set of equations is available in the lecture notes.  Furthermore kriging provides a measure of the accuracy of the estimate!  This is the kriging estimation variance (sometimes just called the kriging variance).

\begin{equation}
\sigma^{2}_{E}(\bf{u}_0) = C(0) - \sum^{n}_{\alpha = 1} \lambda_{\alpha} C(\bf{u}_0 - \bf{u}_{\alpha})
\end{equation}

What is 'best' about this estimate? Kriging estimates are best in that they minimize the above estimation variance. 
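
As a minimal sketch of these calculations, the simple kriging system can be set up and solved directly with NumPy for a small configuration. The helper `spherical_cov`, the isotropic spherical covariance with a range of 400 m and a sill of 1.0, and the data locations are all assumptions for illustration; this is not the GeostatsPy implementation.

```python
import numpy as np

def spherical_cov(h, sill=1.0, a=400.0):
    """Isotropic spherical covariance, C(h) = sill - gamma(h), with range a."""
    hr = np.minimum(np.asarray(h, dtype=float) / a, 1.0)
    return sill * (1.0 - (1.5 * hr - 0.5 * hr**3))

# hypothetical configuration: two data and one unknown location
xy = np.array([[545.0, 500.0], [200.0, 500.0]])   # data locations u_1, u_2
u0 = np.array([500.0, 500.0])                     # unknown location u_0

# covariance between all data pairs (redundancy) and between data and unknown (closeness)
C = spherical_cov(np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2))
c0 = spherical_cov(np.linalg.norm(xy - u0, axis=1))

lam = np.linalg.solve(C, c0)                      # simple kriging weights
var = float(spherical_cov(0.0) - lam @ c0)        # estimation variance, C(0) - sum(lam * c0)

print('weights:', np.round(lam, 3), 'estimation variance:', round(var, 3))
```

The left-hand side matrix `C` carries the redundancy information and the right-hand side vector `c0` carries the closeness information, so the closer datum receives the larger weight.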

#### Properties of Kriging

Here are some important properties of kriging:

* **Exact interpolator** - kriging reproduces the data values at the data locations
* **Kriging variance** can be calculated before getting the sample information, as the kriging estimation variance is not dependent on the values of the data nor the kriging estimate, i.e. the kriging estimator is homoscedastic. 
* **Spatial context** - in addition to spatial continuity, closeness and redundancy, kriging accounts for the configuration of the data and the structural continuity of the variable being estimated.
* **Scale** - kriging may be generalized to account for the support volume of the data and estimate. We will cover this later.
* **Multivariate** - kriging may be generalized to account for multiple secondary data in the spatial estimate with the cokriging system. We will cover this later.
* **Smoothing effect** of kriging can be forecast. We will use this to build stochastic simulations later.
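
The first two properties are easy to check numerically. The following is a sketch under assumed inputs (hypothetical data locations and values, and an isotropic spherical covariance with a 400 m range, none of which come from the workflow below). Estimating exactly at a data location should return a weight of one on that datum, the data value as the estimate and zero estimation variance; the data values never enter the weights or the variance.

```python
import numpy as np

def spherical_cov(h, sill=1.0, a=400.0):
    """Isotropic spherical covariance, C(h) = sill - gamma(h), with range a."""
    hr = np.minimum(np.asarray(h, dtype=float) / a, 1.0)
    return sill * (1.0 - (1.5 * hr - 0.5 * hr**3))

def sk_weights(xy, u0):
    """Solve the simple kriging system; return the weights and the estimation variance."""
    C = spherical_cov(np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2))
    c0 = spherical_cov(np.linalg.norm(xy - u0, axis=1))
    lam = np.linalg.solve(C, c0)
    return lam, float(spherical_cov(0.0) - lam @ c0)

xy = np.array([[300.0, 400.0], [600.0, 650.0], [450.0, 200.0]])   # hypothetical data locations
z = np.array([1.0, 2.0, 3.0])                                      # hypothetical data values
z_bar = 2.0                                                        # hypothetical global mean

# estimate exactly at the first data location
lam, var = sk_weights(xy, xy[0])
est = lam @ z + (1.0 - lam.sum()) * z_bar

print('weights:', np.round(lam, 3), 'estimate:', round(float(est), 3), 'variance:', round(var, 3))
```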

#### Spatial Continuity 

**Spatial Continuity** is the correlation between values over distance.

* No spatial continuity – no correlation between values over distance, random values at each location in space regardless of separation distance.

* Homogeneous phenomena have perfect spatial continuity; since all values are the same (or very similar), they are correlated.

We need a statistic to quantify spatial continuity! A convenient method is the Semivariogram.

#### The Semivariogram

A function of difference over distance.

* Half the expected (average) squared difference between values separated by a lag distance vector (distance and direction), $\bf{h}$:

\begin{equation}
\gamma(\bf{h}) = \frac{1}{2 N(\bf{h})} \sum^{N(\bf{h})}_{\alpha=1} (z(\bf{u}_\alpha) - z(\bf{u}_\alpha + \bf{h}))^2  
\end{equation}

where $z(\bf{u}_\alpha)$ and $z(\bf{u}_\alpha + \bf{h})$ are the spatial sample values at tail and head locations of the lag vector respectively.

* Calculated over a suite of lag distances to obtain a continuous function.

* The $\frac{1}{2}$ term converts a variogram into a semivariogram, but in practice the term variogram is used instead of semivariogram.
* We prefer the semivariogram because it relates directly to the covariance function, $C_x(\bf{h})$ and univariate variance, $\sigma^2_x$:

\begin{equation}
C_x(\bf{h}) = \sigma^2_x - \gamma(\bf{h})
\end{equation}
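
Here is a minimal sketch of the experimental semivariogram for regularly spaced 1D data, followed by the conversion to covariance. The function name `semivariogram_1d` and the synthetic series (a moving average of random noise, which is correlated over about 21 samples) are assumptions for illustration; for irregular 2D data a lag tolerance search would be needed instead.

```python
import numpy as np

def semivariogram_1d(z, max_lag):
    """Experimental semivariogram for regularly spaced 1D data, lags in sample spacings."""
    z = np.asarray(z, dtype=float)
    lags = np.arange(1, max_lag + 1)
    # z[:-k] are the tail values and z[k:] are the head values for lag k
    gamma = np.array([0.5 * np.mean((z[:-k] - z[k:])**2) for k in lags])
    return lags, gamma

rng = np.random.default_rng(73073)
noise = rng.normal(size=1020)
z = np.convolve(noise, np.ones(21) / 21.0, mode='valid')   # correlated series, 1000 samples

lags, gamma = semivariogram_1d(z, 40)
cov = np.var(z) - gamma                                     # C(h) = variance - gamma(h)

print(np.round(gamma[[0, 9, 19, 39]], 4))
```

The semivariogram rises with lag distance and levels off near the variance once the values are no longer correlated, while the covariance does the opposite.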

Note the correlogram is related to the covariance function as:

\begin{equation}
\rho_x(\bf{h}) = \frac{C_x(\bf{h})}{\sigma^2_x}
\end{equation}

The correlogram provides a function of the $\bf{h}-\bf{h}$ scatter plot correlation vs. lag offset $\bf{h}$.  

\begin{equation}
-1.0 \le \rho_x(\bf{h}) \le 1.0
\end{equation}
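
The scatter plot interpretation can be sketched directly: pair each value with the value one lag away and calculate the correlation coefficient of the pairs. The function name `hscatter_correlation` and the synthetic moving-average series are assumptions for illustration only.

```python
import numpy as np

def hscatter_correlation(z, k):
    """Correlation coefficient of the tail vs. head pairs separated by k sample spacings."""
    return np.corrcoef(z[:-k], z[k:])[0, 1]

rng = np.random.default_rng(73073)
noise = rng.normal(size=5020)
z = np.convolve(noise, np.ones(21) / 21.0, mode='valid')   # correlated series, 5000 samples

lags = np.arange(1, 41)
rho = np.array([hscatter_correlation(z, k) for k in lags])  # experimental correlogram

print(np.round(rho[[0, 9, 19, 39]], 3))
```

The correlation is high for short lags and decays toward zero beyond the correlation length of the series, always within the bounds above.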

#### Objective 

In the PGE 383: Stochastic Subsurface Modeling class I want to provide hands-on experience with building subsurface modeling workflows. Python provides an excellent vehicle to accomplish this. I have coded a package called GeostatsPy with GSLIB: Geostatistical Library (Deutsch and Journel, 1998) functionality that provides basic building blocks for building subsurface modeling workflows. 

The objective is to remove the hurdles of subsurface modeling workflow construction by providing building blocks and sufficient examples. This is not a coding class per se, but we need the ability to 'script' workflows working with numerical methods.    

#### Getting Started

Here are the steps to get set up in Python with the GeostatsPy package:

1. Install Anaconda 3 on your machine (https://www.anaconda.com/download/). 
2. From Anaconda Navigator (within Anaconda3 group), go to the environment tab, click on base (root) green arrow and open a terminal. 
3. In the terminal type: pip install geostatspy. 
4. Open Jupyter and in the top block get started by copying and pasting the code block below from this Jupyter Notebook to start using the geostatspy functionality. 

You will need to copy the data file to your working directory. It is available here:

* Tabular data - sample_data.csv at https://git.io/fh4gm.

There are examples below with these functions. You can go here to see a list of the available functions, https://git.io/fh4eX, other example workflows and source code. 

#### Load the required libraries

The following code loads the required libraries.

In [1]:
import geostatspy.GSLIB as GSLIB                       # GSLIB utilities, visualization and wrapper
import geostatspy.geostats as geostats                 # GSLIB methods converted to Python    

We will also need some standard packages. These should have been installed with Anaconda 3.

In [2]:
%matplotlib inline
import os                                               # to set current working directory 
import sys                                              # suppress output to screen for interactive variogram modeling
import io
import numpy as np                                      # arrays and matrix math
import pandas as pd                                     # DataFrames
import matplotlib.pyplot as plt                         # plotting
from matplotlib.pyplot import cm                        # color maps
from matplotlib.patches import Ellipse                  # plot an ellipse
import math                                             # sqrt operator
from scipy.stats import norm
from ipywidgets import interactive                      # widgets and interactivity
from ipywidgets import widgets                            
from ipywidgets import Layout
from ipywidgets import Label
from ipywidgets import VBox, HBox
plt.rc('axes', axisbelow=True)                          # grid behind plotting elements

If you get a package import error, you may have to first install some of these packages. This can usually be accomplished by opening up a command window on Windows and then typing 'python -m pip install [package-name]'. More assistance is available with the respective package docs.  

#### Simple, Simple Kriging Function

Let's write a fast Python function to take data points and unknown locations and provide the:

* **simple kriging estimate**

* **simple kriging variance / estimation variance**

* **simple kriging weights**

This provides a fast method for small datasets, with fewer parameters (no search parameters) and the ability to see the simple kriging weights.

In [3]:
def simple_simple_krige(df,xcol,ycol,vcol,dfl,xlcol,ylcol,vario,skmean):
# load the variogram
    nst = vario['nst']; pmx = 9999.9
    cc = np.zeros(nst); aa = np.zeros(nst); it = np.zeros(nst)
    ang = np.zeros(nst); anis = np.zeros(nst)
    nug = vario['nug']; sill = nug 
    cc[0] = vario['cc1']; sill = sill + cc[0]
    it[0] = vario['it1']; ang[0] = vario['azi1']; 
    aa[0] = vario['hmaj1']; anis[0] = vario['hmin1']/vario['hmaj1'];
    if nst == 2:
        cc[1] = vario['cc2']; sill = sill + cc[1]
        it[1] = vario['it2']; ang[1] = vario['azi2']; 
        aa[1] = vario['hmaj2']; anis[1] = vario['hmin2']/vario['hmaj2'];    

# set up the required matrices
    rotmat, maxcov = geostats.setup_rotmat(nug,nst,it,cc,ang,pmx)    
    ndata = len(df); a = np.zeros([ndata,ndata]); r = np.zeros(ndata); s = np.zeros(ndata); rr = np.zeros(ndata)
    nest = len(dfl)

    est = np.zeros(nest); var = np.full(nest,sill); weights = np.zeros([nest,ndata])

# Make and solve the kriging matrix, calculate the kriging estimate and variance 
    for iest in range(0,nest):
        for idata in range(0,ndata):
            for jdata in range(0,ndata):
                a[idata,jdata] = geostats.cova2(df[xcol].values[idata],df[ycol].values[idata],df[xcol].values[jdata],df[ycol].values[jdata],
                                        nst,nug,pmx,cc,aa,it,ang,anis,rotmat,maxcov)
            r[idata] = geostats.cova2(df[xcol].values[idata],df[ycol].values[idata],dfl[xlcol].values[iest],dfl[ylcol].values[iest],
                                        nst,nug,pmx,cc,aa,it,ang,anis,rotmat,maxcov)
            rr[idata] = r[idata]
        
        s = geostats.ksol_numpy(ndata,a,r)    
        sumw = 0.0
        for idata in range(0,ndata):                          
            sumw = sumw + s[idata]
            weights[iest,idata] = s[idata]
            est[iest] = est[iest] + s[idata]*df[vcol].values[idata]
            var[iest] = var[iest] - s[idata]*rr[idata]
        est[iest] = est[iest] + (1.0-sumw)*skmean
    return est,var,weights 

def calculate_new_point(prev_x,prev_y, offset, azimuth):
    azimuth_rad = math.radians(azimuth)
    x = prev_x + offset*math.sin(azimuth_rad)
    y = prev_y + offset*math.cos(azimuth_rad)    
    return x,y
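
A quick check of the azimuth convention used by `calculate_new_point` may help when reading the dashboards below. This sketch repeats the function so that it runs on its own; the test coordinates are arbitrary. Azimuth is measured clockwise from north (the positive $Y$ direction), so 0 degrees moves north and 90 degrees moves east.

```python
import math

def calculate_new_point(prev_x, prev_y, offset, azimuth):
    # azimuth in degrees, clockwise from north (+Y)
    azimuth_rad = math.radians(azimuth)
    x = prev_x + offset * math.sin(azimuth_rad)
    y = prev_y + offset * math.cos(azimuth_rad)
    return x, y

x_n, y_n = calculate_new_point(500.0, 500.0, 100.0, 0.0)     # 100 m to the north
x_e, y_e = calculate_new_point(500.0, 500.0, 100.0, 90.0)    # 100 m to the east

print(round(x_n, 6), round(y_n, 6), round(x_e, 6), round(y_e, 6))
```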

### Interactive Simple Kriging Closeness Method

The following code includes:

* dashboard with variogram model and the distance of data location 1 from the unknown location
* data locations with point scaled by weights and kriging weights over all distances from 0 to 600
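
Before building the dashboard, the closeness effect can be sketched without widgets or GeostatsPy. This is a simplified stand-in, assuming an isotropic spherical covariance with a 400 m range and no nugget effect; the helper names `spherical_cov` and `closeness_weights` are hypothetical. The configuration mirrors the dashboard: the unknown location at (500, 500), datum 2 fixed at (200, 500) and datum 1 moved east of the unknown location.

```python
import numpy as np

def spherical_cov(h, sill=1.0, a=400.0):
    """Isotropic spherical covariance, C(h) = sill - gamma(h), with range a."""
    hr = np.minimum(np.asarray(h, dtype=float) / a, 1.0)
    return sill * (1.0 - (1.5 * hr - 0.5 * hr**3))

def closeness_weights(distance):
    """Simple kriging weights with datum 1 at `distance` east of the unknown location."""
    u0 = np.array([500.0, 500.0])
    xy = np.array([[500.0 + distance, 500.0], [200.0, 500.0]])
    C = spherical_cov(np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2))
    c0 = spherical_cov(np.linalg.norm(xy - u0, axis=1))
    return np.linalg.solve(C, c0)

distances = np.linspace(0.01, 600.0, 100)
wts = np.array([closeness_weights(d) for d in distances])
wt1, wt2 = wts[:, 0], wts[:, 1]

print('weight 1 at nearest and farthest distance:', round(wt1[0], 3), round(wt1[-1], 3))
```

The weight on datum 1 is close to one when it sits next to the unknown location and falls to zero once the distance exceeds the variogram range.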

In [6]:
import warnings; warnings.simplefilter('ignore')

# dashboard widgets: variogram model parameters and the distance of data location 1 from the unknown location
style = {'description_width': 'initial'}
l = widgets.Text(value='                                              Simple Kriging, Michael Pyrcz, Associate Professor, The University of Texas at Austin',layout=Layout(width='950px', height='30px'))
nug = widgets.FloatSlider(min = 0, max = 1.0, value = 0.0, step = 0.1, description = '$c_{nug}$',orientation='vertical',
                          readout_format='.1f',layout=Layout(width='25px', height='218px'))
nug.style.handle_color = 'gray'
it1 = widgets.Dropdown(options=['Sph', 'Exp', 'Gaus'],value='Sph',
    description='$it_1$',disabled=False,layout=Layout(width='120px', height='30px'), style=style,continuous_update=False)

azi = widgets.FloatSlider(min=0, max = 360, value = 0, step = 22.5, description = '$Azi$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
azi.style.handle_color = 'gray'
hmaj1 = widgets.FloatSlider(min=0.01, max = 10000.0, value = 400.0, step = 25.0, description = '$a_{maj}$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
hmaj1.style.handle_color = 'gray'
hmin1 = widgets.FloatSlider(min = 0.01, max = 10000.0, value = 400.0, step = 25.0, description = '$a_{min}$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
hmin1.style.handle_color = 'gray'
uikvar = widgets.HBox([azi,hmaj1,hmin1],)                   # basic widget formatting
uikvar2 = widgets.VBox([it1,uikvar],)                       # basic widget formatting
uikvar3 = widgets.HBox([nug,uikvar2],)                      # basic widget formatting

distance = widgets.FloatSlider(min=0.01, max = 359, value = 45.0, step = 25.0, description = r'distance',orientation='horizontal',
                         layout=Layout(width='600px', height='30px'),readout_format = '.0f',style=style,continuous_update=False)
distance.style.handle_color = 'blue'

uidata = widgets.VBox([distance],) 

uipars = widgets.HBox([uikvar3,uidata],) 

uik_closeness = widgets.VBox([l,uipars],)

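def convert_type(it):
    # map the dropdown label to the GSLIB variogram type code (1 = spherical, 2 = exponential, 3 = Gaussian)
    if not isinstance(it, str):                 # already a type code, safe to call twice
        return it
    if it in ('Sph', 'Spherical'):
        return 1
    elif it in ('Exp', 'Exponential'):
        return 2
    else:
        return 3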

def calc_krige_weights_distance(nug,it1,azi,hmaj1,hmin1,distance,):
    text_trap = io.StringIO()
    sys.stdout = text_trap
    it1 = convert_type(it1)
    nst = 1; xlag = 10; nlag = int(hmaj1/xlag); c1 = 1.0-nug
    vario = GSLIB.make_variogram(nug,nst,it1,c1,azi,hmaj1,hmin1) # make model object
    index_maj,h_maj,gam_maj,cov_maj,ro_maj = geostats.vmodel(nlag,xlag,azi,vario)   # project the model in the major azimuth
    index_min,h_min,gam_min,cov_min,ro_min = geostats.vmodel(nlag,xlag,azi+90.0,vario) # project the model in the minor azimuth
    
    xu = 500.0; yu = 500.0
    x2 = 200; y2 = 500.0
    x1,y1 = calculate_new_point(xu,yu,distance,90)
    x = [x1,x2]; y = [y1,y2]; value = [1.0,2.0]
    df = pd.DataFrame({'X':x,'Y':y,'Value':value})

    xl = [xu,0,1]; yl = [yu,0,1]; value1 = [0,0,0]
    dfl = pd.DataFrame({'X':xl,'Y':yl, 'Value':value1})
    vmean = 2.0
    sk_est, sk_var, sk_weights =  simple_simple_krige(df,'X','Y','Value',dfl,'X','Y',vario,skmean=vmean)
    return sk_weights[0]
    
def f_make_krige_closeness(nug,it1,azi,hmaj1,hmin1,distance,): # function to take parameters, make sample and plot
    text_trap = io.StringIO()
    sys.stdout = text_trap
    it1 = convert_type(it1)
    nst = 1; xlag = 10; nlag = int(hmaj1/xlag); c1 = 1.0-nug
    vario = GSLIB.make_variogram(nug,nst,it1,c1,azi,hmaj1,hmin1) # make model object
    index_maj,h_maj,gam_maj,cov_maj,ro_maj = geostats.vmodel(nlag,xlag,azi,vario)   # project the model in the major azimuth
    index_min,h_min,gam_min,cov_min,ro_min = geostats.vmodel(nlag,xlag,azi+90.0,vario) # project the model in the minor azimuth
    
    xu = 500.0; yu = 500.0
    x2 = 200; y2 = 500.0
    x1,y1 = calculate_new_point(xu,yu,distance,90)

    clist = ['blue','red']
    x = [x1,x2]; y = [y1,y2]; value = [1.0,2.0]
    df = pd.DataFrame({'X':x,'Y':y,'Value':value})

    xl = [xu,0,1]; yl = [yu,0,1]; value1 = [0,0,0]
    dfl = pd.DataFrame({'X':xl,'Y':yl, 'Value':value1})
    vmean = 2.0
    sk_est, sk_var, sk_weights =  simple_simple_krige(df,'X','Y','Value',dfl,'X','Y',vario,skmean=vmean)
    
    plt.subplot(1,2,1)
        
    ax = plt.gca()
    plt.xlabel('X (m)'); plt.ylabel('Y (m)')
    plt.title('Simple Kriging - Data and Unknown Locations')
    plt.xlim([0,1000])
    plt.ylim([0,1000])
    for i, txt in enumerate(np.round(sk_weights[0],2)):
        plt.annotate(r'$\lambda$' + ' ' + '$=$' + str(txt), (x[i]+20, y[i]+20),color = clist[i])
        plt.annotate(i+1, (x[i]+46, y[i]+15),fontsize=6,color = clist[i])
#     for i, txt in enumerate(value):
#         plt.annotate('$z(\mathbf{u}$' + ' ' + '$ )$  = ' + str(txt), (x[i]+20, y[i]-40),color = clist[i])
#         plt.annotate(i+1, (x[i]+80, y[i]-45),fontsize=6,color = clist[i])
    plt.annotate(r'$\sum \lambda_{\alpha} = $'+ str(np.round(np.sum(sk_weights[0]),2)), (20, 20))
#     plt.annotate('$z^*(\mathbf{u}_0)$ = '+ str(np.round(sk_est[0],2)), (xu+20, yu-40))
    plt.annotate('?', (xu-20, yu + 30))

    ax = plt.gca()
    # shade and outline the variogram range ellipse centered on the unknown location
    ellipse = Ellipse((xu, yu),width=hmin1*2.0,height=hmaj1*2.0,angle = 360-azi,facecolor='gray',alpha = 0.1,edgecolor='black',zorder=10)
    ax.add_patch(ellipse)
    ellipse = Ellipse((xu, yu),width=hmin1*2.0,height=hmaj1*2.0,angle = 360-azi,facecolor='none',alpha = 1.0,edgecolor='black',zorder=10)
    ax.add_patch(ellipse)
   
    if sk_weights[0,0] > 0.01:
        plt.scatter(x1,y1,color = 'blue', edgecolors = 'black', s = sk_weights[0,0]*1000, alpha = 0.3,zorder=100,label=r'$\bf{u}_1$')
    else:
        plt.scatter(x1,y1,color = 'blue', edgecolors = 'black', marker = 'x', alpha = 0.3,zorder=100,label=r'$\bf{u}_1$')
    if sk_weights[0,1] > 0.01:    
        plt.scatter(x2,y2,color = 'red', edgecolors = 'black', s = sk_weights[0,1]*1000,alpha = 0.3,zorder=100,label=r'$\bf{u}_2$')
    else:
        plt.scatter(x2,y2,color = 'red', edgecolors = 'black', marker = 'x',alpha = 0.3,zorder=100,label=r'$\bf{u}_2$')

    scatter = plt.scatter(xu,yu,color = 'black', edgecolors = 'black', marker = 'x',s=40,alpha = 0.3,zorder=100,label=r'$\bf{u}_0$')
    plt.legend(loc = 'upper right')
    nbin = 100
    ndistance = np.linspace(0.01,600,nbin)
    wt1 = np.zeros(nbin); wt2 = np.zeros(nbin)
    for iangle, cdistance in enumerate(ndistance):
        wt1[iangle],wt2[iangle] = calc_krige_weights_distance(nug,it1,azi,hmaj1,hmin1,cdistance,)

    plt.subplot(1,2,2)
    plt.plot(ndistance,wt1,color='blue',alpha=0.6,lw=2,zorder=1,label=r'$\lambda_1$')
    plt.plot(ndistance,wt2,color='red',alpha=0.6,lw=2,zorder=1,label=r'$\lambda_2$')
    plt.plot(ndistance,wt1+wt2,color='purple',alpha=0.6,lw=2,zorder=1,label=r'$\lambda_1+\lambda_2$')
    plt.scatter(distance,sk_weights[0,0],color='blue',s=100,edgecolor='black',alpha=0.3,zorder=10)
    plt.scatter(distance,sk_weights[0,1],color='red',s=100,edgecolor='black',alpha=0.3)
    plt.xlabel('Distance (m)'); plt.ylabel('Kriging Weight'); plt.title('Closeness Impact on Weights')
    plt.vlines(distance,0.0,1.2,color='black',zorder=1)
    plt.ylim([0.0,1.2]); plt.xlim([0,600]); plt.grid()
    plt.legend(loc='upper right')
    
    plt.subplots_adjust(left=0.0, bottom=0.0, right=2.0, top=1.2, wspace=0.3, hspace=0.3)
    plt.show()
    
# connect the function to make the samples and plot to the widgets    
interactive_plot_closeness = widgets.interactive_output(f_make_krige_closeness, {'nug':nug, 'it1':it1, 'azi':azi, 'hmaj1':hmaj1, 'hmin1':hmin1, 
                                                      'distance':distance})
interactive_plot_closeness.clear_output(wait = True)               # reduce flickering by delaying plot updating

### Interactive Simple Kriging Closeness Demonstration

* select the variogram model and the data closeness to the unknown location

### The Inputs

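Select the variogram model and the data locations:

* $c_{nug}$: nugget effect contribution to the sill, with the structure contribution $C_1 = 1.0 - c_{nug}$

* $it_1$: variogram model type (spherical, exponential or Gaussian)

* $Azi$: azimuth of the major direction of continuity

* $hmaj1$ / $hmin1$: range in the major and minor direction

* $distance$: offset of data location $\bf{u}_1$ from the unknown location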
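
To see what the variogram inputs control, here is a sketch of the three model types with a nugget effect and a sill of 1.0. The function `variogram_model` is hypothetical, and the forms assume the usual GSLIB conventions (spherical reaches the sill at the range; exponential and Gaussian reach about 95% of the sill at the range). The type codes 1, 2 and 3 match the ones returned by `convert_type` in this notebook.

```python
import numpy as np

def variogram_model(h, it, nug=0.0, a=400.0):
    """Semivariogram with sill 1.0: nugget plus one structure with contribution 1 - nug."""
    h = np.asarray(h, dtype=float)
    hr = h / a
    if it == 1:                                   # spherical
        hc = np.minimum(hr, 1.0)
        struct = 1.5 * hc - 0.5 * hc**3
    elif it == 2:                                 # exponential
        struct = 1.0 - np.exp(-3.0 * hr)
    else:                                         # Gaussian
        struct = 1.0 - np.exp(-3.0 * hr**2)
    # the nugget effect is a jump just past h = 0, gamma(0) is zero by definition
    return np.where(h > 0.0, nug + (1.0 - nug) * struct, 0.0)

h = np.linspace(0.0, 600.0, 61)                   # lag distances every 10 m
gam_sph = variogram_model(h, 1, nug=0.2)
gam_exp = variogram_model(h, 2)
gam_gau = variogram_model(h, 3)

print(np.round(gam_sph[[0, 1, 40]], 3), np.round(gam_exp[40], 3), np.round(gam_gau[40], 3))
```

A larger nugget effect lowers the covariance at all nonzero distances, so all data become less informative and more weight shifts to the global mean.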

In [7]:
display(uik_closeness, interactive_plot_closeness)                            # display the interactive plot

### Interactive Simple Kriging Redundancy Method

The following code includes:

* dashboard with variogram model, data locations via subtended angle and radius 
* data locations with point scaled by weights and kriging weights over all subtended angles from 0 to 360
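
The redundancy effect can also be sketched without widgets or GeostatsPy. This is a simplified stand-in, assuming an isotropic spherical covariance with a 600 m range and no nugget effect; the helper names `spherical_cov` and `redundancy_weights` are hypothetical. As in the dashboard, all three data sit on a circle around the unknown location at (500, 500): datum 3 to the east, and data 1 and 2 separated by the subtended angle, centered on the west.

```python
import math
import numpy as np

def spherical_cov(h, sill=1.0, a=600.0):
    """Isotropic spherical covariance, C(h) = sill - gamma(h), with range a."""
    hr = np.minimum(np.asarray(h, dtype=float) / a, 1.0)
    return sill * (1.0 - (1.5 * hr - 0.5 * hr**3))

def redundancy_weights(angle, radius=200.0):
    """Simple kriging weights for data 1 and 2 separated by `angle` degrees, datum 3 east."""
    u0 = np.array([500.0, 500.0])
    az1 = math.radians(270.0 - angle / 2.0)       # azimuths clockwise from north
    az2 = math.radians(270.0 + angle / 2.0)
    xy = np.array([[500.0 + radius * math.sin(az1), 500.0 + radius * math.cos(az1)],
                   [500.0 + radius * math.sin(az2), 500.0 + radius * math.cos(az2)],
                   [500.0 + radius, 500.0]])
    C = spherical_cov(np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2))
    c0 = spherical_cov(np.linalg.norm(xy - u0, axis=1))
    return np.linalg.solve(C, c0)

angles = np.linspace(1.0, 359.0, 180)
wts = np.array([redundancy_weights(ang) for ang in angles])

w_small = redundancy_weights(1.0)                 # data 1 and 2 almost on top of each other
w_wide = redundancy_weights(180.0)                # data 1 and 2 on opposite sides
print('lambda_1 + lambda_2:', round(w_small[:2].sum(), 3), round(w_wide[:2].sum(), 3))
```

When data 1 and 2 nearly coincide they carry almost the same information and share the weight that a single datum would receive; as they spread apart their combined weight increases.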

In [6]:
import warnings; warnings.simplefilter('ignore')

# dashboard widgets: variogram model parameters and the data configuration (subtended angle and radius)
style = {'description_width': 'initial'}
l = widgets.Text(value='                                              Simple Kriging, Michael Pyrcz, Associate Professor, The University of Texas at Austin',layout=Layout(width='950px', height='30px'))
nug = widgets.FloatSlider(min = 0, max = 1.0, value = 0.0, step = 0.1, description = '$c_{nug}$',orientation='vertical',
                          readout_format='.1f',layout=Layout(width='25px', height='218px'))
nug.style.handle_color = 'gray'
it1 = widgets.Dropdown(options=['Sph', 'Exp', 'Gaus'],value='Sph',
    description='$it_1$',disabled=False,layout=Layout(width='120px', height='30px'), style=style,continuous_update=False)

azi = widgets.FloatSlider(min=0, max = 360, value = 0, step = 22.5, description = '$Azi$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
azi.style.handle_color = 'gray'
hmaj1 = widgets.FloatSlider(min=0.01, max = 10000.0, value = 600.0, step = 25.0, description = '$a_{maj}$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
hmaj1.style.handle_color = 'gray'
hmin1 = widgets.FloatSlider(min = 0.01, max = 10000.0, value = 600.0, step = 25.0, description = '$a_{min}$',
                        orientation='vertical',readout_format='.0f',layout=Layout(width='40px', height='178px'),continuous_update=False)
hmin1.style.handle_color = 'gray'
uikvar = widgets.HBox([azi,hmaj1,hmin1],)                   # basic widget formatting
uikvar2 = widgets.VBox([it1,uikvar],)                       # basic widget formatting
uikvar3 = widgets.HBox([nug,uikvar2],)                      # basic widget formatting

angle = widgets.FloatSlider(min=0.01, max = 359, value = 45.0, step = 1.0, description = r'$\alpha$',orientation='horizontal',
                         layout=Layout(width='600px', height='30px'),readout_format = '.0f',style=style,continuous_update=False)
angle.style.handle_color = 'blue'

radius = widgets.FloatSlider(min=0.0, max = 1000.0, value = 200.0, step = 1.0, description = '$r$',orientation='horizontal',
                         layout=Layout(width='600px', height='30px',margin='0 0 0 10px'),readout_format = '.0f',style=style,continuous_update=False)
radius.style.handle_color = 'blue'

uidata = widgets.VBox([angle,radius],) 

uipars = widgets.HBox([uikvar3,uidata],) 

uik_redundancy = widgets.VBox([l,uipars],)

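def convert_type(it):
    # map the dropdown label to the GSLIB variogram type code (1 = spherical, 2 = exponential, 3 = Gaussian)
    if not isinstance(it, str):                 # already a type code, safe to call twice
        return it
    if it in ('Sph', 'Spherical'):
        return 1
    elif it in ('Exp', 'Exponential'):
        return 2
    else:
        return 3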

def calc_krige_weights(nug,it1,azi,hmaj1,hmin1,angle,radius,):
    text_trap = io.StringIO()
    sys.stdout = text_trap
    it1 = convert_type(it1)
    nst = 1; xlag = 10; nlag = int(hmaj1/xlag); c1 = 1.0-nug
    vario = GSLIB.make_variogram(nug,nst,it1,c1,azi,hmaj1,hmin1) # make model object
    index_maj,h_maj,gam_maj,cov_maj,ro_maj = geostats.vmodel(nlag,xlag,azi,vario)   # project the model in the major azimuth
    index_min,h_min,gam_min,cov_min,ro_min = geostats.vmodel(nlag,xlag,azi+90.0,vario) # project the model in the minor azimuth
    
    xu = 500.0; yu = 500.0
    x3 = 500.0 + radius; y3 = 500.0
    x1,y1 = calculate_new_point(xu,yu,radius,270-angle/2)
    x2,y2 = calculate_new_point(xu,yu,radius,270+angle/2) 
    x = [x1,x2,x3]; y = [y1,y2,y3]; value = [1,2,3]
    df = pd.DataFrame({'X':x,'Y':y,'Value':value})

    xl = [xu,0,1]; yl = [yu,0,1]; value1 = [0,0,0]
    dfl = pd.DataFrame({'X':xl,'Y':yl, 'Value':value1})
    vmean = 2.0
    sk_est, sk_var, sk_weights =  simple_simple_krige(df,'X','Y','Value',dfl,'X','Y',vario,skmean=vmean)
    return sk_weights[0]
    
def f_make_krige_redudancy(nug,it1,azi,hmaj1,hmin1,angle,radius,): # function to take parameters, make sample and plot
    text_trap = io.StringIO()
    sys.stdout = text_trap
    it1 = convert_type(it1)
    nst = 1; xlag = 10; nlag = int(hmaj1/xlag); c1 = 1.0-nug
    vario = GSLIB.make_variogram(nug,nst,it1,c1,azi,hmaj1,hmin1) # make model object
    index_maj,h_maj,gam_maj,cov_maj,ro_maj = geostats.vmodel(nlag,xlag,azi,vario)   # project the model in the major azimuth
    index_min,h_min,gam_min,cov_min,ro_min = geostats.vmodel(nlag,xlag,azi+90.0,vario) # project the model in the minor azimuth
    
    xu = 500.0; yu = 500.0
    x3 = 500.0 + radius; y3 = 500.0
    x1,y1 = calculate_new_point(xu,yu,radius,270-angle/2)
    x2,y2 = calculate_new_point(xu,yu,radius,270+angle/2)
 
    clist = ['blue','red','green']
    x = [x1,x2,x3]; y = [y1,y2,y3]; value = [1,2,3]
    df = pd.DataFrame({'X':x,'Y':y,'Value':value})

    xl = [xu,0,1]; yl = [yu,0,1]; value1 = [0,0,0]
    dfl = pd.DataFrame({'X':xl,'Y':yl, 'Value':value1})
    vmean = 2.0
    sk_est, sk_var, sk_weights =  simple_simple_krige(df,'X','Y','Value',dfl,'X','Y',vario,skmean=vmean)
    
    plt.subplot(1,2,1)
        
    ax = plt.gca()
    plt.xlabel('X (m)'); plt.ylabel('Y (m)')
    plt.title('Simple Kriging - Data and Unknown Locations')
    plt.xlim([0,1000])
    plt.ylim([0,1000])
    for i, txt in enumerate(np.round(sk_weights[0],2)):
        plt.annotate(r'$\lambda$' + ' ' + '$=$' + str(txt), (x[i]+20, y[i]+20),color = clist[i])
        plt.annotate(i+1, (x[i]+46, y[i]+15),fontsize=6,color = clist[i])
#     for i, txt in enumerate(value):
#         plt.annotate('$z(\mathbf{u}$' + ' ' + '$ )$  = ' + str(txt), (x[i]+20, y[i]-40),color = clist[i])
#         plt.annotate(i+1, (x[i]+80, y[i]-45),fontsize=6,color = clist[i])
    plt.annotate(r'$\sum \lambda_{\alpha} = $'+ str(np.round(np.sum(sk_weights[0]),2)), (xu+20, yu+20))
#     plt.annotate('$z^*(\mathbf{u}_0)$ = '+ str(np.round(sk_est[0],2)), (xu+20, yu-40))
    plt.annotate('?', (xu-20, yu + 30))

    ax = plt.gca()
    ellipse = Ellipse((xu, yu),width=radius*2,height=radius*2,angle = 360-azi,facecolor='none',alpha = 1.0,edgecolor='black',ls='--',zorder=1)
    ax.add_patch(ellipse)
    ellipse = Ellipse((xu, yu),width=hmin1*2.0,height=hmaj1*2.0,angle = 360-azi,facecolor='gray',alpha = 0.1,edgecolor='black',zorder=10)
    ax.add_patch(ellipse)
    ellipse = Ellipse((xu, yu),width=hmin1*2.0,height=hmaj1*2.0,angle = 360-azi,facecolor='none',alpha = 1.0,edgecolor='black',zorder=10)
    ax.add_patch(ellipse)
   
    if sk_weights[0,0] > 0.01:
        plt.scatter(x1,y1,color = 'blue', edgecolors = 'black', s = sk_weights[0,0]*1000, alpha = 0.3,zorder=100,label=r'$\bf{u}_1$')
    else:
        plt.scatter(x1,y1,color = 'blue', edgecolors = 'black', marker = 'x', alpha = 0.3,zorder=100,label=r'$\bf{u}_1$')
    if sk_weights[0,1] > 0.01:    
        plt.scatter(x2,y2,color = 'red', edgecolors = 'black', s = sk_weights[0,1]*1000,alpha = 0.3,zorder=100,label=r'$\bf{u}_2$')
    else:
        plt.scatter(x2,y2,color = 'red', edgecolors = 'black', marker = 'x',alpha = 0.3,zorder=100,label=r'$\bf{u}_2$')
    if sk_weights[0,2] > 0.01:
        plt.scatter(x3,y3,color = 'green', edgecolors = 'black', s = sk_weights[0,2]*1000, alpha = 0.3,zorder=100,label=r'$\bf{u}_3$')
    else:
        plt.scatter(x3,y3,color = 'green', edgecolors = 'black', marker = 'x', alpha = 0.3,zorder=100,label=r'$\bf{u}_3$')
       
    scatter = plt.scatter(xu,yu,color = 'black', edgecolors = 'black', marker = 'x',s = 40,alpha = 0.3,zorder=100,label=r'$\bf{u}_0$')
    plt.legend(loc = 'upper right')
    nbin = 100
    nangle = np.linspace(0.01,359,nbin)
    wt1 = np.zeros(nbin); wt2 = np.zeros(nbin); wt3 = np.zeros(nbin) 
    for iangle, cangle in enumerate(nangle):
        wt1[iangle],wt2[iangle],wt3[iangle] = calc_krige_weights(nug,it1,azi,hmaj1,hmin1,cangle,radius)

    plt.subplot(1,2,2)
    plt.plot(nangle,wt1,color='blue',alpha=0.6,ls='--',label=r'$\lambda_1$')
    plt.plot(nangle,wt2,color='red',alpha=0.6,ls='-.',label=r'$\lambda_2$')
    plt.plot(nangle,wt3,color='green',alpha=0.6,lw=2,label=r'$\lambda_3$')
    plt.plot(nangle,wt1+wt2,color='purple',alpha=0.6,lw=2,label=r'$\lambda_1+\lambda_2$')
    plt.scatter(angle,sk_weights[0,0],color='blue',s=100,edgecolor='black',alpha=0.3)
    plt.scatter(angle,sk_weights[0,1],color='red',s=100,edgecolor='black',alpha=0.3)
    plt.scatter(angle,sk_weights[0,2],color='green',s=100,edgecolor='black',alpha=0.3)
    plt.xlabel('Subtended Angle'); plt.ylabel('Kriging Weight'); plt.title('Redundancy Impact on Weights')
    plt.vlines(angle,0.1,1.0,color='black',zorder=1)
    plt.ylim([0.1,1.0]); plt.xlim([0,360]); plt.grid()
    plt.legend(loc='upper right')
    
#     plt.hist(samples,histtype = 'stepfilled',cumulative = True, bins = np.linspace(vmin,vmax,200),alpha=0.8,color="darkorange",edgecolor='black',density=True)
#     plt.xlim([vmin,vmax]); plt.ylim([0,1.0])
#     plt.title('Kriging Uncertainty Model at Unknown Location')
#     plt.xlabel('Value'); plt.ylabel('Frequency')
    
#     ax = plt.gca()
#     ax.annotate(r'$z^*(\mathbf{u}_0)$ = ' + str(np.round(sk_est[0],2)), (0.05*(vmax-vmin)+vmin, 0.9))
#     ax.annotate(r'$\sigma_z^2(\mathbf{u}_0)$ = ' + str(np.round(sk_var[0],2)), (0.05*(vmax-vmin)+vmin, 0.83))
#     ax.annotate(r'$P10_z(\mathbf{u}_0)$ = ' + str(np.round(np.percentile(samples,10),2)), (0.05*(vmax-vmin)+vmin, 0.76))
#     ax.annotate(r'$P90_z(\mathbf{u}_0)$ = ' + str(np.round(np.percentile(samples,90),2)), (0.05*(vmax-vmin)+vmin, 0.69))
    plt.subplots_adjust(left=0.0, bottom=0.0, right=2.0, top=1.2, wspace=0.3, hspace=0.3)
    plt.show()
    
# connect the function to make the samples and plot to the widgets    
interactive_plot_redundancy = widgets.interactive_output(f_make_krige_redudancy, {'nug':nug, 'it1':it1, 'azi':azi, 'hmaj1':hmaj1, 'hmin1':hmin1, 
                                                      'angle':angle, 'radius':radius})
interactive_plot_redundancy.clear_output(wait = True)               # reduce flickering by delaying plot updating

### Interactive Simple Kriging Redundancy Demonstration

* select the variogram model, then the radius and subtended angle of the data

### The Inputs

Select the variogram model and the data locations:

* $c_{nug}$: nugget effect contribution to the sill; the remaining contribution is $C_1 = 1.0 - c_{nug}$

* $hmaj1$ / $hmin1$: ranges in the major and minor directions

* $\alpha$, $r$: subtended angle and radius that set the spatial data locations
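
As a minimal sketch of how these inputs could define a spatial continuity model, the following assumes a single spherical structure with a unit sill and an azimuth measured clockwise from the y axis (consistent with the range ellipse drawn in the plot above). The function names `aniso_lag` and `covariance` are hypothetical and are not part of GeostatsPy, and the workflow's own `it1` input may select a different structure type.

```python
import numpy as np

def aniso_lag(dx, dy, azi_deg, hmaj, hmin):
    """Dimensionless lag distance; 1.0 means 'at the range' in that direction."""
    azi = np.deg2rad(azi_deg)
    # Project the lag onto the major and minor directions
    # (assumption: azimuth is measured clockwise from the +y axis).
    h_major = dx * np.sin(azi) + dy * np.cos(azi)
    h_minor = dx * np.cos(azi) - dy * np.sin(azi)
    return np.sqrt((h_major / hmaj) ** 2 + (h_minor / hmin) ** 2)

def covariance(dx, dy, c_nug, azi_deg, hmaj, hmin):
    """Covariance for a nugget plus one spherical structure, total sill = 1.0."""
    h = aniso_lag(dx, dy, azi_deg, hmaj, hmin)
    c1 = 1.0 - c_nug                                   # structure contribution
    sph = np.where(h < 1.0, 1.0 - 1.5 * h + 0.5 * h**3, 0.0)
    # The nugget only contributes to the covariance at zero lag.
    return np.where(h == 0.0, 1.0, c1 * sph)

# Same 50 unit lag, along the major (y) and then the minor (x) direction
print(covariance(0.0, 50.0, c_nug=0.2, azi_deg=0.0, hmaj=100.0, hmin=50.0))
print(covariance(50.0, 0.0, c_nug=0.2, azi_deg=0.0, hmaj=100.0, hmin=50.0))
```

With these settings the lag along the major direction is only half the range, so some correlation remains, while the same lag along the minor direction reaches the range and the covariance drops to zero.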

In [7]:
display(uik_redundancy, interactive_plot_redundancy)                            # display the interactive plot

#### Comments

This was an interactive demonstration of simple kriging behavior, showing the impact of data closeness and redundancy on kriging weights for spatial data analytics. Much more could be done. I have other demonstrations on the basics of working with DataFrames, ndarrays, univariate statistics, plotting data, declustering, data transformations and many other workflows available at https://github.com/GeostatsGuy/PythonNumericalDemos and https://github.com/GeostatsGuy/GeostatsPy. 
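
For readers without the widgets, the redundancy effect can be reproduced with a small, self-contained sketch. It is not the notebook's `calc_krige_weights`; it assumes an isotropic spherical covariance with unit sill, residuals with zero mean, and a hypothetical layout in which data 1 and 2 straddle the x axis at the subtended angle while data 3 sits on the opposite side of the unknown location. All function names here are illustrative.

```python
import numpy as np

def sph_cov(h, c_nug=0.0, a=300.0):
    """Isotropic spherical covariance with unit sill (assumed model)."""
    h = np.asarray(h, dtype=float) / a
    c = (1.0 - c_nug) * np.where(h < 1.0, 1.0 - 1.5 * h + 0.5 * h**3, 0.0)
    return np.where(h == 0.0, 1.0, c)

def layout(angle_deg, radius):
    """Hypothetical layout around an unknown location at the origin."""
    half = np.deg2rad(angle_deg) / 2.0
    return radius * np.array([[np.cos(half),  np.sin(half)],   # data 1
                              [np.cos(half), -np.sin(half)],   # data 2
                              [-1.0, 0.0]])                     # data 3

def sk_weights(data_xy, u0=(0.0, 0.0), **cov_kwargs):
    """Solve the simple kriging system C @ lambda = c0 for the weights."""
    data_xy = np.asarray(data_xy, dtype=float)
    # Data-to-data and data-to-unknown separation distances
    d_dd = np.linalg.norm(data_xy[:, None, :] - data_xy[None, :, :], axis=2)
    d_d0 = np.linalg.norm(data_xy - np.asarray(u0, dtype=float), axis=1)
    return np.linalg.solve(sph_cov(d_dd, **cov_kwargs), sph_cov(d_d0, **cov_kwargs))

for angle in (10.0, 60.0, 120.0):
    w = sk_weights(layout(angle, radius=100.0))
    print(f"angle={angle:5.1f}  weights={np.round(w, 3)}  sum={w.sum():.3f}")
```

All three data are equally close to the unknown location, so any difference in weight comes from redundancy alone. At a small subtended angle, data 1 and 2 each receive less weight than data 3, and at 120 degrees the three data are evenly spaced and the weights are equal by symmetry. Whatever weight is left over, one minus the sum, goes to the global mean.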
  
#### The Author:

### Michael Pyrcz, Associate Professor, University of Texas at Austin 
*Novel Data Analytics, Geostatistics and Machine Learning Subsurface Solutions*

With over 17 years of experience in subsurface consulting, research and development, Michael has returned to academia driven by his passion for teaching and enthusiasm for enhancing engineers' and geoscientists' impact in subsurface resource development. 

For more about Michael check out these links:

#### [Twitter](https://twitter.com/geostatsguy) | [GitHub](https://github.com/GeostatsGuy) | [Website](http://michaelpyrcz.com) | [GoogleScholar](https://scholar.google.com/citations?user=QVZ20eQAAAAJ&hl=en&oi=ao) | [Book](https://www.amazon.com/Geostatistical-Reservoir-Modeling-Michael-Pyrcz/dp/0199731446) | [YouTube](https://www.youtube.com/channel/UCLqEr-xV-ceHdXXXrTId5ig)  | [LinkedIn](https://www.linkedin.com/in/michael-pyrcz-61a648a1)

#### Want to Work Together?

I hope this content is helpful to those who want to learn more about subsurface modeling, data analytics and machine learning. Students and working professionals are welcome to participate.

* Want to invite me to visit your company for training, mentoring, project review, workflow design and / or consulting? I'd be happy to drop by and work with you! 

* Interested in partnering, supporting my graduate student research or my Subsurface Data Analytics and Machine Learning consortium (co-PIs including Profs. Foster, Torres-Verdin and van Oort)? My research combines data analytics, stochastic modeling and machine learning theory with practice to develop novel methods and workflows to add value. We are solving challenging subsurface problems!

* I can be reached at mpyrcz@austin.utexas.edu.

I'm always happy to discuss,

*Michael*

Michael Pyrcz, Ph.D., P.Eng., Associate Professor, The Hildebrand Department of Petroleum and Geosystems Engineering, Bureau of Economic Geology, The Jackson School of Geosciences, The University of Texas at Austin
