# Example of DOV search methods for interpretations (geotechnische codering)

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/DOV-Vlaanderen/pydov/master?filepath=docs%2Fnotebooks%2Fsearch_geotechnische_codering.ipynb)

## Use cases explained below
* Get 'geotechnische codering' in a bounding box
* Get 'geotechnische codering' with specific properties within a distance from a point
* Get 'geotechnische codering' in a bounding box with specific properties
* Get 'geotechnische codering' based on fields not available in the standard output dataframe
* Get 'geotechnische codering' data, returning fields not available in the standard output dataframe

In [1]:
%matplotlib inline
import inspect, sys

In [2]:
# check pydov path
import pydov

## Get information about the datatype 'Geotechnische codering'

In [3]:
from pydov.search.interpretaties import GeotechnischeCoderingSearch
itp = GeotechnischeCoderingSearch()

A description is provided for the 'Geotechnische codering' datatype:

In [4]:
itp.get_description()

'Een geotechnische codering van een boring is een codering opgesteld vanuit geotechnisch oogpunt, rekening houdend met informatie uit de lithologie, laboproeven en bijhorende sondering(en).'

The different fields that are available for objects of the 'Geotechnische codering' datatype can be requested with the get_fields() method:

In [5]:
fields = itp.get_fields()

# print available fields
for f in fields.values():
    print(f['name'])

pkey_interpretatie
Type_proef
Proefnummer
pkey_boring
x
y
start_interpretatie_mtaw
diepte_tot_m
gemeente
Auteurs
Datum
Opdrachten
betrouwbaarheid_interpretatie
Geldig_van
Geldig_tot
eerste_invoer
geom
diepte_laag_van
diepte_laag_tot
hoofdnaam1_grondsoort
hoofdnaam2_grondsoort
bijmenging1_plaatselijk
bijmenging1_hoeveelheid
bijmenging1_grondsoort
bijmenging2_plaatselijk
bijmenging2_hoeveelheid
bijmenging2_grondsoort
bijmenging3_plaatselijk
bijmenging3_hoeveelheid
bijmenging3_grondsoort


You can get more information about a field by requesting it from the fields dictionary:

* *name*: name of the field
* *definition*: definition of this field
* *cost*: currently this is either 1 or 10, depending on the datasource of the field. It is an indication of the expected time it will take to retrieve this field in the output dataframe.
* *notnull*: whether the field is mandatory or not
* *type*: datatype of the values of this field
* *codelist*: optionally, a codelist that describes the possible values of this field
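
Because each entry is a plain dictionary, these keys can be used to organise the fields yourself, for instance to see which ones are cheap to retrieve. The sketch below groups field names by their `cost`. `fields_by_cost` is a hypothetical helper, and `example_fields` is a hand-written stand-in for a few entries of `itp.get_fields()`; its cost values are assumptions for illustration only, not actual DOV metadata.

```python
def fields_by_cost(fields):
    """Group field names by the value of their 'cost' key."""
    grouped = {}
    for field in fields.values():
        grouped.setdefault(field['cost'], []).append(field['name'])
    return grouped


# Hand-written stand-in for part of itp.get_fields().
# The cost values below are illustrative assumptions.
example_fields = {
    'pkey_interpretatie': {'name': 'pkey_interpretatie', 'cost': 1},
    'gemeente': {'name': 'gemeente', 'cost': 1},
    'diepte_laag_van': {'name': 'diepte_laag_van', 'cost': 10},
}

grouped = fields_by_cost(example_fields)
for cost in sorted(grouped):
    print(cost, grouped[cost])
```

With a live search object, the same helper could be called as `fields_by_cost(itp.get_fields())`.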

Alternatively, you can list all the fields and their details by inspecting the `get_fields()` output or the search instance itself in a notebook:

In [6]:
itp

## Example use cases

### Get 'Geotechnische codering' in a bounding box

Get data for all the 'Geotechnische codering' interpretations that are geographically located within the bounds of the specified box.

The coordinates are in the Belgian Lambert72 (EPSG:31370) coordinate system and are given in the order of lower left x, lower left y, upper right x, upper right y.
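
If you start from a single point rather than from explicit corners, the four values can be derived in the required order. The sketch below assumes a square box; `box_around` is a hypothetical helper and not part of pydov. Lambert72 coordinates are expressed in metres, so `half_size` is in metres as well.

```python
def box_around(x, y, half_size):
    """Return (lower left x, lower left y, upper right x, upper right y)
    for a square of side 2 * half_size centred on (x, y)."""
    if half_size <= 0:
        raise ValueError("half_size must be positive")
    return (x - half_size, y - half_size, x + half_size, y + half_size)


# A 1 m x 1 m box centred on (108281.5, 197850.5)
bounds = box_around(108281.5, 197850.5, 0.5)
print(bounds)
```

The resulting tuple can be unpacked into the box constructor, for example `Box(*bounds)`.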

In [7]:
from pydov.util.location import Within, Box

df = itp.search(location=Within(Box(108281, 197850, 108282, 197851)))
df.head()

[000/001] .
[000/001] .


Unnamed: 0,pkey_interpretatie,pkey_boring,betrouwbaarheid_interpretatie,x,y,start_interpretatie_mtaw,diepte_laag_van,diepte_laag_tot,hoofdnaam1_grondsoort,hoofdnaam2_grondsoort,bijmenging1_plaatselijk,bijmenging1_hoeveelheid,bijmenging1_grondsoort,bijmenging2_plaatselijk,bijmenging2_hoeveelheid,bijmenging2_grondsoort,bijmenging3_plaatselijk,bijmenging3_hoeveelheid,bijmenging3_grondsoort
0,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2011...,goed,108281.2,197850.2,7.99,0.0,0.5,FZ,,False,N,LE,False,N,PL,,,
1,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2011...,goed,108281.2,197850.2,7.99,0.5,1.0,LE,,False,V,XZ,,,,,,
2,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2011...,goed,108281.2,197850.2,7.99,1.0,3.0,FZ,,False,W,LE,,,,,,
3,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2011...,goed,108281.2,197850.2,7.99,3.0,4.5,FZ,,,,,,,,,,
4,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2011...,goed,108281.2,197850.2,7.99,4.5,5.0,FZ,,,,,,,,,,


The dataframe contains one 'Geotechnische codering' interpretation where ten layers ('laag') were identified. The available data are flattened to represent unique attributes per row of the dataframe.

Using the *pkey_interpretatie* field one can request the details of this interpretation in a web browser:

In [8]:
for pkey_interpretatie in set(df.pkey_interpretatie):
    print(pkey_interpretatie)

https://www.dov.vlaanderen.be/data/interpretatie/2011-172244
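
Since the dataframe holds one row per layer, per-interpretation summaries can be obtained by grouping on *pkey_interpretatie*. The sketch below rebuilds only the five rows shown by `df.head()` above as a small stand-in dataframe, so it runs without a DOV request. The column names `thickness`, `n_layers` and `total_thickness` are hypothetical names introduced for this example.

```python
import pandas as pd

pkey = 'https://www.dov.vlaanderen.be/data/interpretatie/2011-172244'

# Stand-in for the first five rows of the search result shown above
layers = pd.DataFrame({
    'pkey_interpretatie': [pkey] * 5,
    'diepte_laag_van': [0.0, 0.5, 1.0, 3.0, 4.5],
    'diepte_laag_tot': [0.5, 1.0, 3.0, 4.5, 5.0],
    'hoofdnaam1_grondsoort': ['FZ', 'LE', 'FZ', 'FZ', 'FZ'],
})

# Thickness of each layer in metres
layers['thickness'] = layers['diepte_laag_tot'] - layers['diepte_laag_van']

# Number of layers and summed thickness per interpretation
summary = layers.groupby('pkey_interpretatie').agg(
    n_layers=('thickness', 'size'),
    total_thickness=('thickness', 'sum'),
)
print(summary)
```

On a real search result, the same two steps can be applied directly to `df`.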


### Get 'Geotechnische codering' with specific properties within a distance from a point

Besides querying interpretations based on their geographic location within a bounding box, we can also search for interpretations matching a specific set of properties. For this we can build a query using a combination of the 'Geotechnische codering' fields and operators provided by the WFS protocol.

A list of possible operators can be found below:

In [9]:
[i for i,j in inspect.getmembers(sys.modules['owslib.fes2'], inspect.isclass) if 'Property' in i]

['PropertyIsBetween',
 'PropertyIsEqualTo',
 'PropertyIsGreaterThan',
 'PropertyIsGreaterThanOrEqualTo',
 'PropertyIsLessThan',
 'PropertyIsLessThanOrEqualTo',
 'PropertyIsLike',
 'PropertyIsNotEqualTo',
 'PropertyIsNull',
 'SortProperty']

In this example we build a query using the *PropertyIsGreaterThan* and *PropertyIsEqualTo* operators to find all interpretations that are deeper than 20 m, have a reliability ('Betrouwbaarheid') of 'goed', and lie within 1 km of a defined point:

In [10]:
from owslib.fes2 import And, PropertyIsGreaterThan, PropertyIsEqualTo
from pydov.util.location import WithinDistance, Point

query = And([PropertyIsEqualTo(propertyname='Betrouwbaarheid',
                              literal='goed'),
            PropertyIsGreaterThan(propertyname='diepte_tot_m',
                                 literal='20'),
           ])
            
df = itp.search(query=query, 
                location=WithinDistance(Point(153145, 206930), 1000))

df.head()

[000/001] .
[000/003] ...


Unnamed: 0,pkey_interpretatie,pkey_boring,betrouwbaarheid_interpretatie,x,y,start_interpretatie_mtaw,diepte_laag_van,diepte_laag_tot,hoofdnaam1_grondsoort,hoofdnaam2_grondsoort,bijmenging1_plaatselijk,bijmenging1_hoeveelheid,bijmenging1_grondsoort,bijmenging2_plaatselijk,bijmenging2_hoeveelheid,bijmenging2_grondsoort,bijmenging3_plaatselijk,bijmenging3_hoeveelheid,bijmenging3_grondsoort
0,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1971...,goed,153993.0,206978.0,14.8,0.0,2.0,FZ,,False,N,LE,,,,,,
1,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1971...,goed,153993.0,206978.0,14.8,2.0,3.0,FZ,LE,,,,,,,,,
2,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1971...,goed,153993.0,206978.0,14.8,3.0,3.75,FZ,,False,N,LE,False,N,SX,False,N,SF
3,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1971...,goed,153993.0,206978.0,14.8,3.75,4.25,FZ,,False,N,GL,,,,,,
4,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1971...,goed,153993.0,206978.0,14.8,4.25,13.0,FZ,,False,N,GL,,,,,,
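
Because Lambert72 coordinates are expressed in metres, the distance condition can be sanity-checked with a plain Euclidean distance. The sketch below does this for the search point and the x/y values of the rows shown above; `planar_distance` is a hypothetical helper, and this is only a client-side check, not a description of how the WFS service evaluates the filter.

```python
import math


def planar_distance(x1, y1, x2, y2):
    """Euclidean distance between two points in a projected CRS (metres)."""
    return math.hypot(x2 - x1, y2 - y1)


search_point = (153145, 206930)       # point used in WithinDistance
result_point = (153993.0, 206978.0)   # x, y of the rows shown above

distance = planar_distance(*search_point, *result_point)
print(round(distance, 1), distance <= 1000)
```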


Once again we can use the *pkey_interpretatie* as a permanent link to the information of these interpretations:

In [11]:
for pkey_interpretatie in set(df.pkey_interpretatie):
    print(pkey_interpretatie)

https://www.dov.vlaanderen.be/data/interpretatie/2012-180862
https://www.dov.vlaanderen.be/data/interpretatie/2012-180861
https://www.dov.vlaanderen.be/data/interpretatie/2012-180863


### Get 'Geotechnische codering' in a bounding box based on specific properties

We can combine a query on attributes with a query on geographic location to get the interpretations within a bounding box that have specific properties.

The following example requests the interpretations of boreholes only, within the given bounding box.

(Note that the datatype of the *literal* parameter should be a string, regardless of the datatype of this field in the output dataframe.)
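
When the value to compare against is held in a Python variable, it has to be converted to a string first. A minimal sketch, assuming that numbers are passed in their default string form and dates in ISO format (`YYYY-MM-DD`), as in the queries in this notebook; `to_literal` is a hypothetical helper and not part of pydov or owslib.

```python
from datetime import date


def to_literal(value):
    """Convert a number, date or string to the string used as `literal`."""
    if isinstance(value, date):
        # ISO format, e.g. 2010-01-01
        return value.isoformat()
    return str(value)


print(to_literal(20))
print(to_literal(date(2010, 1, 1)))
print(to_literal('Boring'))
```

The result can then be passed on, for example `PropertyIsGreaterThan(propertyname='diepte_tot_m', literal=to_literal(20))`.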

In [12]:
from owslib.fes2 import PropertyIsEqualTo

query = PropertyIsEqualTo(
            propertyname='Type_proef',
            literal='Boring')

df = itp.search(
    location=Within(Box(153145, 206930, 154145, 207930)),
    query=query
    )

df.head()

[000/001] .
[000/021] c....c........c......


Unnamed: 0,pkey_interpretatie,pkey_boring,betrouwbaarheid_interpretatie,x,y,start_interpretatie_mtaw,diepte_laag_van,diepte_laag_tot,hoofdnaam1_grondsoort,hoofdnaam2_grondsoort,bijmenging1_plaatselijk,bijmenging1_hoeveelheid,bijmenging1_grondsoort,bijmenging2_plaatselijk,bijmenging2_hoeveelheid,bijmenging2_grondsoort,bijmenging3_plaatselijk,bijmenging3_hoeveelheid,bijmenging3_grondsoort
0,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1970...,goed,153846.0,207874.0,15.36,0.0,0.75,LE,,False,N,FZ,,,,,,
1,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1970...,goed,153846.0,207874.0,15.36,0.75,2.2,FZ,,False,V,LE,,,,,,
2,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1970...,goed,153846.0,207874.0,15.36,2.2,2.8,FZ,,False,N,KL,False,N,GL,,,
3,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1970...,goed,153846.0,207874.0,15.36,2.8,4.3,FZ,,False,W,KL,False,N,GL,,,
4,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1970...,goed,153846.0,207874.0,15.36,4.3,5.25,FZ,,False,V,GL,,,,,,


We can look at one of the interpretations in a web browser using its *pkey_interpretatie*:

In [13]:
for pkey_interpretatie in set(df.pkey_interpretatie):
    print(pkey_interpretatie)

https://www.dov.vlaanderen.be/data/interpretatie/2012-180853
https://www.dov.vlaanderen.be/data/interpretatie/2012-180866
https://www.dov.vlaanderen.be/data/interpretatie/2013-182358
https://www.dov.vlaanderen.be/data/interpretatie/2013-182282
https://www.dov.vlaanderen.be/data/interpretatie/2013-182276
https://www.dov.vlaanderen.be/data/interpretatie/2012-180851
https://www.dov.vlaanderen.be/data/interpretatie/2024-385644
https://www.dov.vlaanderen.be/data/interpretatie/2013-182359
https://www.dov.vlaanderen.be/data/interpretatie/2012-180863
https://www.dov.vlaanderen.be/data/interpretatie/2012-180852
https://www.dov.vlaanderen.be/data/interpretatie/2013-182279
https://www.dov.vlaanderen.be/data/interpretatie/2012-180862
https://www.dov.vlaanderen.be/data/interpretatie/2012-180861
https://www.dov.vlaanderen.be/data/interpretatie/2012-180855
https://www.dov.vlaanderen.be/data/interpretatie/2012-180864
https://www.dov.vlaanderen.be/data/interpretatie/2013-182360
https://www.dov.vlaander

### Get 'Geotechnische codering' based on fields not available in the standard output dataframe

To keep the output dataframe size acceptable, not all available WFS fields are included in the standard output. However, one can use this information to select interpretations as illustrated below.

For example, make a selection of the interpretations in the municipality of Antwerp, dated before 1/1/2010:

*Remark: mind that the municipality attribute is merely an attribute defined by the person entering the data. It can be correct, empty, outdated or wrong.*

In [14]:
from owslib.fes2 import And, PropertyIsEqualTo, PropertyIsLessThan

query = And([PropertyIsEqualTo(propertyname='gemeente',
                               literal='Antwerpen'),
             PropertyIsLessThan(propertyname='Datum', 
                                 literal='2010-01-01')]
            )
df = itp.search(query=query,
                return_fields=('pkey_interpretatie', 'Datum'))
df.head()

[000/001] .


Unnamed: 0,pkey_interpretatie,Datum
0,https://www.dov.vlaanderen.be/data/interpretat...,2003-03-31
1,https://www.dov.vlaanderen.be/data/interpretat...,2005-03-02
2,https://www.dov.vlaanderen.be/data/interpretat...,2003-04-01
3,https://www.dov.vlaanderen.be/data/interpretat...,2007-03-22
4,https://www.dov.vlaanderen.be/data/interpretat...,2007-03-26


### Get 'Geotechnische codering' data, returning fields not available in the standard output dataframe

As noted in the previous example, the default output dataframe does not include every available field, to keep its size limited. However, you can request any available field by including it in the *return_fields* parameter of the search:
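
Field names are case-sensitive and mix styles (compare `gemeente` with `Datum`), so it can help to check the requested names before searching. The sketch below compares a tuple of requested names with the names printed by `get_fields()` earlier in this notebook; `unknown_fields` is a hypothetical helper and not part of pydov.

```python
# Field names as printed by get_fields() earlier in this notebook
AVAILABLE = [
    'pkey_interpretatie', 'Type_proef', 'Proefnummer', 'pkey_boring',
    'x', 'y', 'start_interpretatie_mtaw', 'diepte_tot_m', 'gemeente',
    'Auteurs', 'Datum', 'Opdrachten', 'betrouwbaarheid_interpretatie',
    'Geldig_van', 'Geldig_tot', 'eerste_invoer', 'geom',
    'diepte_laag_van', 'diepte_laag_tot', 'hoofdnaam1_grondsoort',
    'hoofdnaam2_grondsoort', 'bijmenging1_plaatselijk',
    'bijmenging1_hoeveelheid', 'bijmenging1_grondsoort',
    'bijmenging2_plaatselijk', 'bijmenging2_hoeveelheid',
    'bijmenging2_grondsoort', 'bijmenging3_plaatselijk',
    'bijmenging3_hoeveelheid', 'bijmenging3_grondsoort',
]


def unknown_fields(requested, available):
    """Return the requested names that are not in the available names."""
    available = set(available)
    return [name for name in requested if name not in available]


print(unknown_fields(('pkey_interpretatie', 'Datum'), AVAILABLE))
print(unknown_fields(('pkey_interpretatie', 'datum'), AVAILABLE))
```

With a live search object, the list of available names can be built as `[f['name'] for f in itp.get_fields().values()]`.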

In [15]:
query = PropertyIsEqualTo(
            propertyname='gemeente',
            literal='Leuven')

df = itp.search(query=query,
                return_fields=('pkey_interpretatie', 'pkey_boring',
                               'x', 'y', 'start_interpretatie_mtaw', 'gemeente', 'Auteurs', 'Proefnummer'))

df.head()

[000/001] .


Unnamed: 0,pkey_interpretatie,pkey_boring,x,y,start_interpretatie_mtaw,gemeente,Auteurs,Proefnummer
0,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2010...,174125.3,177188.2,16.76,Leuven,"Luyten, Marc - VO - Afdeling Geotechniek",GEO-10/093-B2
1,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1961...,174330.0,174683.0,27.62,Leuven,"Vergauwen, Ilse - VO - Afdeling Geotechniek",GEO-61/3124-d
2,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2010...,174209.8,177162.4,16.05,Leuven,"Luyten, Marc - VO - Afdeling Geotechniek",GEO-10/093-B4
3,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/2005...,173884.96,177091.37,17.67,Leuven,"Luyten, Marc - MVG - Afdeling Geotechniek",GEO-05/043-B2
4,https://www.dov.vlaanderen.be/data/interpretat...,https://www.dov.vlaanderen.be/data/boring/1961...,174374.0,174550.0,29.3,Leuven,"Vergauwen, Ilse - VO - Afdeling Geotechniek",GEO-61/3124-aBIS


## Visualize results

Using Folium, we can display the results of our search on a map.

In [16]:
# import the necessary modules (not included in the requirements of pydov!)
import folium
from folium.plugins import MarkerCluster
from pyproj import Transformer

In [17]:
# convert the Lambert72 coordinates to lat/lon for folium
# build the transformer once and reuse it for every point
transformer = Transformer.from_crs("epsg:31370", "epsg:4326", always_xy=True)

def convert_latlon(x1, y1):
    # with always_xy=True the result is returned as (lon, lat)
    x2, y2 = transformer.transform(x1, y1)
    return x2, y2

df['lon'], df['lat'] = zip(*map(convert_latlon, df['x'], df['y']))
# convert to list
loclist = df[['lat', 'lon']].values.tolist()

In [18]:
# initialize the Folium map on the centre of the selected locations, play with the zoom until ok
fmap = folium.Map(location=[df['lat'].mean(), df['lon'].mean()], zoom_start=12)
marker_cluster = MarkerCluster().add_to(fmap)
# add one marker per location, labelled with its 'Proefnummer'
for loc, label in zip(loclist, df['Proefnummer']):
    folium.Marker(loc, popup=label).add_to(marker_cluster)
fmap
