Skip to content

Simple algorithms to display ignorance maps of raw distributional data accessed from species observation databases

License

Notifications You must be signed in to change notification settings

AlejandroRuete/IgnoranceMaps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Simple algorithms to display ignorance maps of raw data accessed from species observation databases

The Swedish LifeWatch implementation

Here you can download an interactive HTML application compiled in R that accompanies the text: Ruete A. 2015. Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps. Biodiversity Data Journal 3:e5361.

This interactive application exemplifies the use of three different algorithms to map the ignorance (i.e. bias and lack of sampling effort) found in the observations accessed through the Swedish LifeWatch biodiversity analysis portal www.swedishlifewatch.se.

To run it you will need to install R http://www.r-project.org/ and install the following packages: shiny, raster, rgdal, and maptools.

   install.packages(c("shiny", "raster", "rgdal", "maptools"))

NOTE: there are some known issues when installing the package "rgdal" on Linux. Please, refer to this blog http://robinlovelace.net/r/2013/11/26/installing-rgdal-on-ubuntu.html or this blog https://philmikejones.wordpress.com/2014/07/14/installing-rgdal-in-r-on-linux/ to solve the issue.

Execute the following script in R to run the interactive application.

   require(shiny)
   shiny::runGitHub(repo="IgnoranceMaps", username="alejandroruete", subdir="SLWapp")

Alternatively, download the files and run the following scripts.

   runApp("~/SLWapp") # where ~ indicates the path of the folder.
   runApp("~/SLWapp", display.mode = "showcase") # Use this command to see the R code

Running the Application

Examples are provided for seven reference taxonomic groups (i.e. Amphibians, Birds, Butterflies, Land Mammals, Harvestmen, Dragonflies, and Vascular Plants) as groups with different sampling effort intensity and extension as well as different number of amateur observers reporting observations to the database. For each group we show examples of a common (or widely distributed) and a rare (or locally distributed) species. Note that in many cases common species are not reported as enthusiastically as more interesting species are (in Sweden, a moose is not as frequently reported as hedgehogs are). Therefore, one can expect that for common species the observations are very scattered and they do not cover the whole expected range of the species.

Tabs (Maps, Data plots, and Read me)

You are now reading under the “Read me” tab. The data is shown the form of “Maps” and “Data plots”. Four maps of Sweden with a resolution of 10 x 10 km are plotted under the tab “Maps” displaying:

  1. the number of observations per pixel for the reference taxonomic group selected,

  2. the ignorance scores per pixel,

  3. an estimate of pseudo-absence (henceforth ps.absences) for the selected target species. Ps.absences are estimated with the same algorithms used to produce the ignorance maps. Be aware that the results of the algorithms for a single species have a different interpretation than for a reference target group. For individual target species the lack of observations could represent a true absence or lack of observers. To separate true absences from the lack of observers we could weight ps.absence estimates with the ignorance map as it is shown in the next map. Also, be aware that especially for common species or species that are not likely to be reported as frequently as they are seen, these estimates of ps.absences are not reliable because the raw data is not reliable.
    Disclaimer: the pseudo-absence map is only an exercise to explore the use of the ignorance map.

  4. the presence map (P = 1- ps.absences; i.e. scaled observations). This is an example of the use of ignorance maps to weight and mask different estimates for target species. In this case, the black to white scale represents the scaled observations or the knowledge we have on the presence of the species. On top, a transparent layer shows areas where a minimum certainty for ps.absences or presence is achieved after multiplying 1-ignorance (i.e. certainty) by the ps.absences (red) or presence (green). Then, areas where there is low ignorance but the target species has not been found are shown in red, while areas where the target species has been found but a minimum certainty is required are shown in green.

The outline in those maps is a 10km buffer around the Swedish land surface.

Under the “Data plots” tab you find:

  1. a density plot of the number of observations per grid cell,

  2. a species discovery plot and

  3. curves of the transformations of the number of observation into ignorance scores (note that this later plot is reactive to the value set for O0.5).

Options

On the left panel you find options for the Reference taxonomic group and Target species to be displayed. First, you can opt to use the raw observations as a measure of sampling effort or the Observation Index, a sampling effort relative to the number of species sampled in a particular grid cell. On each subpanel you can set the algorithm can be set and O0.5 (the number of observations that are enough to reduce the ignorance score to 0.5) for the Half-ignorance algorithm. For target species, a Step algorithm is added where any observation number below O0.5 sets the ignorance score to 0. Else the Step algorithm is equal to the Half-ignorance algorithm.

Authors and Contributors

Application developed by Alejandro Ruete in Dec 2014. DOI: dx.doi.org/10.5281/zenodo.17593

Licence GNU v.3