# Data Gathering and Processing: Predictors

This notebook gives an overview of the environmental predictors relevant to amphibian populations in Scotland. Understanding these predictors is essential for assessing habitat connectivity and suitability for various amphibian species, especially in the context of urbanization and habitat fragmentation.

## Table of Contents
1. [Data Overview](#Data-Overview)
2. [Slope](#1.-Slope)
3. [Vegetation Height](#2.-Vegetation-Height)
4. [Distance to Water](#3.-Distance-to-Water)
5. [NDVI (Normalized Difference Vegetation Index)](#4.-NDVI-(Normalized-Difference-Vegetation-Index))
6. [Runoff Coefficient](#5.-Runoff-Coefficient)
7. [Grasslands in Surrounding 250m](#6.-Grasslands-in-Surrounding-250m)
8. [Distance to Forest](#7.-Distance-to-Forest)
9. [Urban Density](#8.-Urban-Density)
10. [Traffic Intensity](#9.-Traffic-Intensity)
11. [Pollution Levels](#10.-Pollution-Levels)
12. [Soil Moisture Variability](#11.-Soil-Moisture-Variability)
13. [Drought Risk](#12.-Drought-Risk)

## Data Overview

The data presented in this notebook include various environmental predictors that influence the life cycles and habitats of amphibians. Each predictor was selected for its biological significance and relevance to amphibian ecology. The following table, adapted from [Donati et al. (2022)](https://www.sciencedirect.com/science/article/pii/S0301479722008271), summarizes the key predictors, their descriptions, biological interpretations, sources, and selection criteria.

| Predictor Code     | Predictor Description                     | Biological Interpretation                                                                                  | Source                                             | Selection                                      |
|--------------------|------------------------------------------|------------------------------------------------------------------------------------------------------------|---------------------------------------------------|------------------------------------------------|
| Forest Dist.       | Nearest distance to forest               | Important for providing shelter and breeding sites for amphibians.                                        | Local woodland datasets or forest inventory data  | Proximity to forests influences habitat choice |
| Water Dist.        | Nearest distance to water                | Essential for breeding, feeding, and aquatic habitat availability for amphibians.                          | Open Water; OS Open Rivers; hydrological data     | Affects reproductive success                    |
| Soil Hum. Var.     | Soil moisture variability                 | Influences amphibian survival and affects physiological processes.                                         | Soil moisture datasets or interpolated data       | Critical for both aquatic and terrestrial habitats |
| NDVI Med           | NDVI median (2016-2019, April-October) | Indicates vegetation health and density, impacting habitat quality.                                        | Sentinel-2 NDVI data                              | Reflects primary productivity                   |
| NDVI SD            | NDVI standard deviation (2016-2019)     | Variability in vegetation health can indicate habitat stability and resilience.                            | Sentinel-2 NDVI data                              | Important for assessing habitat heterogeneity   |
| Road Dist.         | Nearest distance to road                 | Roads can fragment habitats and pose barriers to amphibian movement.                                       | Transport datasets or OS Open Roads               | Assessing impacts on connectivity                |
| Runoff Coefficient | Runoff coefficient                        | Indicates potential runoff and its effects on aquatic habitats, influencing survival and reproduction.     | Hydrological models or datasets                   | Impacts habitat quality                         |
| Slope              | Terrain slope                            | Steeper slopes may obstruct movement and access to breeding sites for certain species.                     | ESRI Living Atlas; OS                             | Influences movement patterns                    |
| Traffic Volume     | Average daily traffic volume              | Traffic can pose direct risks to amphibians through road mortality and habitat degradation.                | Scottish Transport Statistics                      | Assessing urban impacts                         |
| Urbanization       | Urbanization proxy (density of buildings)| Urbanization can lead to habitat loss, fragmentation, and increased mortality risks.                       | Scotland Land Use Database                        | Important for habitat connectivity               |
| Veg. Height        | Vegetation height model median           | Taller vegetation can provide better shelter and foraging opportunities for amphibians.                    | Living Atlas Canopy; vegetation height            | Affects habitat suitability                     |
| Grassland Density   | Grassland density in 250x250m           | Influences habitat availability and diversity of niches for amphibians.                                    | Land cover datasets                                | Relevant for terrestrial habitats               |

## Connection to Amphibian Populations

Amphibians are sensitive to environmental changes, making it crucial to understand how various factors influence their populations and habitat availability. The predictors outlined in the table play a significant role in shaping amphibian distributions and behaviors. By analyzing these predictors, we can identify areas of suitable habitat, assess the impacts of urbanization, and develop strategies for conservation and habitat enhancement.

In the following sections of this notebook, we explore the data in more detail, examine its sources, and visualize the relationships between these predictors.

# Environmental Predictors for Amphibian Movement and Habitat Suitability

## 1. Slope
### **Impact on Amphibians**
Slope can influence amphibian movement and habitat selection by affecting moisture retention and temperature regulation. Steeper slopes may hinder movement and increase the risk of desiccation.

### **Data Processing**
The slope data for this study was derived from the Terrain layer available in the ArcGIS Living Atlas of the World. This dataset has a variable native resolution ranging from 300 metres to 1 metre across the study area. To ensure consistency and relevance for ensemble modelling, the data was resampled to 30 metres before slope calculation.

### Workflow
1. **Resampling Terrain to 30 Metres**:  
   The Terrain raster was first resampled to 30-metre resolution using bilinear interpolation. Bilinear interpolation was selected because it preserves the continuous nature of elevation data while limiting artefacts. It produces smooth transitions between elevation values, which is critical for calculating accurate slope values (Kennedy & Leigh, 1998; Smith et al., 2019).

2. **Slope Calculation**:  
   Following resampling, the slope was computed using the terrain analysis tools in ArcGIS Pro. Calculating slope on the resampled raster ensures that the slope output aligns directly with the desired resolution for subsequent analyses, reducing computational inconsistencies and errors in derived gradient values (Evans, 1972; Wilson & Gallant, 2000).

3. **Export for Analysis**:  
   The processed slope raster was exported at 30-metre resolution, ready for integration into the species distribution modelling workflow.
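
The slope step can be illustrated with a minimal NumPy sketch. It assumes a square 30-metre grid held in memory and uses central differences via `np.gradient`; the Slope tool in ArcGIS Pro applies its own neighbourhood-based estimator, so values from this sketch may differ slightly from the exported raster. The function name `slope_degrees` and the synthetic elevation array are illustrative only.

```python
import numpy as np

def slope_degrees(dem, cell_size=30.0):
    """Slope in degrees from a 2-D elevation array with square cells."""
    # np.gradient returns the derivative along rows (y) first, then columns (x)
    dz_dy, dz_dx = np.gradient(np.asarray(dem, dtype=float), cell_size)
    # Slope is the arctangent of the gradient magnitude (rise over run)
    return np.degrees(np.arctan(np.hypot(dz_dx, dz_dy)))

# Synthetic tilted plane: elevation rises 30 m for every 30 m cell eastwards
dem = np.tile(np.arange(5) * 30.0, (4, 1))
slope = slope_degrees(dem)
print(slope[0, 0])
```

A rise equal to the run corresponds to a 45 degree slope, which makes the synthetic plane a convenient sanity check before applying the same logic to real elevation data.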

### **Justification for Methodology**
- **Resampling Terrain First**:  
   Resampling the elevation raster before calculating slope helps maintain the fidelity of elevation gradients over varying terrain. Computing slope directly on the bilinearly resampled 30-metre raster avoids the artefacts that could arise from resampling a slope raster after it has been computed (Wilson & Gallant, 2000).

- **Selection of 30-Metre Resolution**:  
   A 30-metre resolution was chosen to balance computational efficiency with ecological relevance. Amphibian habitat analysis and Blue-Green Infrastructure (BGI) planning benefit from spatial detail without unnecessary computational overhead. Higher resolutions, such as 10 metres, would increase computational demands significantly without proportionate gains in modelling accuracy for the scale of the study area (~16,624 km²) (Riley et al., 1999; Guisan & Thuiller, 2005).

- **Use of Bilinear Interpolation**:  
   Bilinear interpolation is suitable for continuous data like elevation, as it preserves the smooth transitions between cell values, which are critical for accurate slope derivation (Kennedy & Leigh, 1998). Alternative methods, such as nearest neighbour, may introduce abrupt changes in slope values, reducing ecological validity.

### References
- Evans, I. S. (1972). "General geomorphometry, derivatives of altitude, and descriptive statistics." *Spatial Analysis in Geomorphology*, 17-90.
- Guisan, A., & Thuiller, W. (2005). "Predicting species distribution: offering more than simple habitat models." *Ecology Letters*, 8(9), 993–1009.
- Kennedy, M., & Leigh, M. (1998). *The Global Positioning System and GIS: An Introduction*. CRC Press.
- Riley, S. J., et al. (1999). "A terrain ruggedness index that quantifies topographic heterogeneity." *Intermountain Journal of Sciences*, 5(1-4), 23-27.
- Smith, M. J., et al. (2019). "Interpolating elevation data for geomorphological applications." *Journal of Geographical Systems*, 21(4), 545–567.
- Wilson, J. P., & Gallant, J. C. (2000). *Terrain Analysis: Principles and Applications*. John Wiley & Sons.

______________________

## 2. Vegetation Height
### **Impact on Amphibians**
Vegetation height can influence the microclimate and shelter availability for amphibians. Taller vegetation may provide more cover from predators and environmental extremes.

### **Data Source and Description**
The vegetation height data used in this study was sourced from the **2020 Global Vegetation Height Map** developed by the EcoVision Lab at ETH Zurich. This dataset provides global canopy height estimates at a **10m spatial resolution**, derived from LiDAR data collected by the Global Ecosystem Dynamics Investigation (GEDI) on board the International Space Station and Sentinel-2 imagery. The dataset was generated using a deep convolutional neural network trained with LiDAR observations as ground truth data, achieving an accuracy of **±5m**. The data is designed to support applications in biodiversity monitoring, ecosystem function analysis, and sustainable land-use planning.

### **Data Processing**
### Workflow
1. **Clipping to Study Area**:
The global vegetation height data was clipped to the extent of the study area to focus on Central Scotland. This ensures that only the relevant geographical extent is included in the analysis, reducing computational overhead and maintaining ecological relevance.

2. **Resampling to 30m Resolution**:
The original 10m resolution data was resampled to a **30m resolution** to match the spatial scale of other environmental predictors in the study. Consistency in resolution is critical for ensuring accurate and unbiased inputs for species distribution modelling (SDM).

3. **Resampling Method**:
**Bilinear interpolation** was used as the resampling method. This approach is well-suited for continuous data like vegetation height, as it calculates the value of each new cell based on a weighted average of the four nearest cells. Bilinear interpolation smooths transitions between neighbouring pixels, preserving gradual changes in vegetation height while avoiding artefacts introduced by simpler methods like nearest neighbour resampling (Chen et al., 2007; Hijmans et al., 2005).
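
The weighted average of the four nearest cells can be sketched as follows. This is a simplified illustration, not the ArcGIS Pro Resample tool: it assumes the source and target grids share the same origin, that the array has at least two rows and two columns, and the function name `bilinear_resample` is hypothetical.

```python
import numpy as np

def bilinear_resample(src, src_cell, dst_cell):
    """Resample a 2-D array onto a grid of dst_cell spacing using bilinear interpolation."""
    src = np.asarray(src, dtype=float)
    rows, cols = src.shape
    out_rows = int(rows * src_cell / dst_cell)
    out_cols = int(cols * src_cell / dst_cell)
    # Target cell centres in source pixel coordinates (centre of pixel 0 is 0.0)
    y = np.clip((np.arange(out_rows) + 0.5) * dst_cell / src_cell - 0.5, 0, rows - 1)
    x = np.clip((np.arange(out_cols) + 0.5) * dst_cell / src_cell - 0.5, 0, cols - 1)
    # Index of the upper-left neighbour, kept one short of the edge
    y0 = np.clip(np.floor(y).astype(int), 0, rows - 2)
    x0 = np.clip(np.floor(x).astype(int), 0, cols - 2)
    wy = (y - y0)[:, None]  # vertical weight of the lower neighbours
    wx = (x - x0)[None, :]  # horizontal weight of the right-hand neighbours
    top = src[np.ix_(y0, x0)] * (1 - wx) + src[np.ix_(y0, x0 + 1)] * wx
    bottom = src[np.ix_(y0 + 1, x0)] * (1 - wx) + src[np.ix_(y0 + 1, x0 + 1)] * wx
    return top * (1 - wy) + bottom * wy

# Synthetic 10 m height surface that increases linearly across the grid
r, c = np.indices((6, 6))
height_10m = 3.0 * r + 2.0 * c
height_30m = bilinear_resample(height_10m, src_cell=10.0, dst_cell=30.0)
print(height_30m)
```

Because bilinear interpolation reproduces linear surfaces exactly, the synthetic ramp gives an easy way to confirm that the weights are applied correctly.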

### Justification for Resampling Method
- **Suitability for Continuous Variables**:  
  Vegetation height represents a continuous variable where abrupt transitions between pixels are not ecologically meaningful. Bilinear interpolation reduces artificial boundaries and ensures smoother transitions, aligning with standard practices for processing environmental data (Chen et al., 2007).

- **Consistency with Predictor Integration**:  
  Ensuring a uniform resolution across all predictors avoids artefacts during model development and enhances comparability among variables (Dormann et al., 2013).

#### References
- Chen, X., Vierling, L., & Deering, D. (2007). "A simple and effective method for detecting specular reflection in airborne LiDAR intensity data." *Remote Sensing of Environment*, 109(2), 273-282. https://doi.org/10.1016/j.rse.2007.01.002  
- Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., & Jarvis, A. (2005). "Very high resolution interpolated climate surfaces for global land areas." *International Journal of Climatology: A Journal of the Royal Meteorological Society*, 25(15), 1965-1978. https://doi.org/10.1002/joc.1276  
- Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, G., ... & Münkemüller, T. (2013). "Collinearity: a review of methods to deal with it and a simulation study evaluating their performance." *Ecography*, 36(1), 27-46. https://doi.org/10.1111/j.1600-0587.2012.07348.x

___________


## 3. Distance to Water

### **Impact on Amphibians**
Proximity to water bodies is critical for amphibian survival as it provides breeding sites and essential moisture. Increased distance may limit access to these resources.

### **Data Source and Context**
The **distance to water predictor layer** was developed to incorporate the proximity of habitats to water bodies into the species distribution modelling (SDM). Water availability is a critical environmental factor for amphibians, influencing their habitat suitability. The predictor was derived from the **Global Surface Water Occurrence 1984–2021 dataset** (Pekel et al., 2016), provided by the European Commission Joint Research Centre. This dataset identifies surface water dynamics globally over a 37-year period and was accessed in raster format.

### **Data Preparation and Processing**
1. **Resampling the Water Bodies Raster**: The original dataset was resampled from its native resolution to **30 m** to maintain consistency with other environmental predictors. The **nearest neighbour resampling method** was employed to preserve the categorical integrity of the water body classification, ensuring that water and non-water cells remained distinct.

2. **Reclassification for Euclidean Distance Calculation**: The resampled raster was reclassified to create a binary dataset:
* Cells representing water bodies were assigned a value of `1`.
* Non-water cells were set to `NoData`.
* This binary format was necessary for the **Euclidean Distance** tool to accurately compute distances from water body cells.

3. **Calculating Euclidean Distance**: The reclassified raster was used as the input for the **Euclidean Distance** tool in ArcGIS Pro. This process calculated the straight-line distance from each cell to the nearest water body, resulting in a continuous raster layer representing proximity to water.
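
The reclassification and distance steps can be sketched in NumPy on a toy grid. The threshold used to label a cell as water (`occurrence > 0`) is an assumption for illustration, since the text does not state one, and the brute-force function `euclidean_distance_to` is a hypothetical helper suited only to small arrays.

```python
import numpy as np

def euclidean_distance_to(mask, cell_size=30.0):
    """Straight-line distance from every cell centre to the nearest True cell in mask."""
    targets = np.argwhere(mask)  # (n, 2) row/column indices of source cells
    if targets.size == 0:
        raise ValueError("mask contains no source cells")
    rows, cols = np.indices(mask.shape)
    cells = np.stack([rows.ravel(), cols.ravel()], axis=1)  # (m, 2)
    # Offsets between every cell and every source cell, then the minimum distance
    diff = cells[:, None, :] - targets[None, :, :]
    nearest = np.sqrt((diff ** 2).sum(axis=2)).min(axis=1)
    return nearest.reshape(mask.shape) * cell_size

# Toy surface water occurrence raster (percent); values are made up
occurrence = np.array([[0, 0, 0, 0],
                       [0, 80, 0, 0],
                       [0, 0, 0, 0]])
water = occurrence > 0  # assumed rule: any recorded occurrence counts as water
dist_to_water = euclidean_distance_to(water)
print(dist_to_water)
```

This pairwise approach needs memory proportional to the number of cells times the number of water cells. For full-size rasters, a distance transform such as `scipy.ndimage.distance_transform_edt` (applied to the inverted mask with `sampling=30`) is one possible alternative.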

### Output
The resulting distance to water raster was exported in **GeoTIFF format** for integration into the SDM. This predictor layer provides valuable spatial information on habitat accessibility to water bodies, aiding in the identification of suitable areas for amphibian conservation and planning.

#### References
- Pekel, J.-F., Cottam, A., Gorelick, N., & Belward, A. S. (2016). High-resolution mapping of global surface water and its long-term changes. *Nature*, 540, 418–422. https://doi.org/10.1038/nature20584
- ArcGIS Pro Documentation: Euclidean Distance Tool.

_____________

## 4. NDVI (Normalized Difference Vegetation Index)
### **Impact on Amphibians**
NDVI serves as an indicator of vegetation health and density, which can affect habitat quality and availability of food resources for amphibians.

For the species distribution modelling, NDVI data were summarised as two layers: the median and the standard deviation of NDVI over the study period.

### **Data Source and Preprocessing**
Sentinel-2 satellite imagery from the Copernicus programme was used as the source dataset, covering the period from April 2019 to October 2024. Sentinel-2 Level-2A data provide surface reflectance values at high spatial resolution (10 m for the visible and near-infrared bands used here). The collection was filtered to include only images with less than 10% cloud cover and clipped to the study area.

A cloud masking algorithm was applied using the Sentinel-2 Scene Classification Layer (SCL) to remove cloud, shadow, and snow pixels, ensuring the integrity of the data. This step minimized noise and preserved only reliable reflectance values for NDVI calculations.

### **NDVI Calculation and Metrics**
NDVI was calculated for each image in the filtered collection using the formula:

$$ \text{NDVI} = \frac{\text{NIR} - \text{Red}}{\text{NIR} + \text{Red}} $$

where the Near Infrared (NIR) band corresponds to Band 8, and the Red band corresponds to Band 4. 

The following two metrics were derived from the NDVI collection:
1. **NDVI Median:** The median NDVI value was computed to represent the central tendency of vegetation greenness during the study period, reflecting typical vegetation conditions.
2. **NDVI Standard Deviation:** The standard deviation of NDVI values was calculated to capture temporal variability in vegetation, highlighting areas with dynamic vegetation patterns.
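
A minimal NumPy sketch of the formula and the two metrics is given below. The reflectance values are invented, `NaN` stands in for pixels removed by the cloud mask, and the population standard deviation (`ddof=0`) is an assumption, since the text does not say which estimator was used.

```python
import numpy as np

def ndvi(nir, red):
    """NDVI = (NIR - Red) / (NIR + Red); cells where NIR + Red == 0 become NaN."""
    nir = np.asarray(nir, dtype=float)
    red = np.asarray(red, dtype=float)
    total = nir + red
    out = np.full(total.shape, np.nan)
    np.divide(nir - red, total, out=out, where=total != 0)
    return out

# Three hypothetical acquisitions of a 1 x 2 pixel scene (time, row, column)
nir = np.array([[[0.50, 0.40]], [[0.60, np.nan]], [[0.40, 0.30]]])  # Band 8
red = np.array([[[0.10, 0.20]], [[0.20, np.nan]], [[0.10, 0.10]]])  # Band 4

stack = ndvi(nir, red)
ndvi_median = np.nanmedian(stack, axis=0)  # typical greenness per pixel
ndvi_sd = np.nanstd(stack, axis=0)         # temporal variability per pixel
print(ndvi_median, ndvi_sd)
```

Using the NaN-aware reducers means a masked acquisition simply drops out of that pixel's statistics instead of invalidating the whole time series.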

### **Output and Use**
Both NDVI metrics were exported as raster layers with a spatial resolution of 30m, using bilinear resampling to align with the resolution of other predictors in the model. The rasters were re-projected to the British National Grid (OSGB 1936), ensuring compatibility with the modelling environment.

These layers will serve as key predictors in the ensemble modelling framework, providing insights into the role of vegetation distribution and variability in shaping amphibian habitat suitability. This step is critical for linking vegetation dynamics to ecological processes relevant to Blue-Green Infrastructure (BGI) planning in Central Scotland.


----
## 5. Runoff Coefficient

### Impact on Amphibians
The runoff coefficient (C) is a critical measure of the imperviousness of land cover, influencing hydrological processes such as infiltration, surface runoff, and water retention. Amphibians are highly dependent on water availability for breeding and foraging, making the runoff coefficient an important predictor in species distribution modelling. High imperviousness can reduce habitat suitability by increasing surface runoff, which leads to faster water drainage, habitat desiccation, and loss of breeding sites.

### Data Processing

#### 1. **Data Acquisition**
The runoff coefficient values were derived from reputable hydrological studies and tailored to match the land cover classes available in the ESRI Land Cover dataset. These coefficients quantify the proportion of precipitation that becomes surface runoff for each land cover type.

#### 2. **Resampling**
The ESRI Land Cover dataset, initially at 10 m resolution, was resampled to 30 m to align with the spatial resolution of other environmental predictors in the model. The resampling used the **nearest neighbour method** to preserve discrete land cover classifications.

#### 3. **Runoff Coefficient Assignment**
Runoff coefficients were assigned to each of the land cover classes in the dataset using a reclassification approach. Specific values were selected based on a review of literature and hydrological data, as outlined in the table below:

| **Land Cover Class**   | **Runoff Coefficient (C)** | **Rationale**                                                                                                                                                                                                                   |
|------------------------|----------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| **Water**              | 1.00                       | All precipitation contributes to runoff over water bodies. *(USDA-NRCS, 2007)*                                                                                                            |
| **Crops**              | 0.40                       | Representative value for agricultural fields, assuming typical soil and crop management conditions. *(Boyd et al., 2003; USDA-NRCS, 2007)*                                                                                       |
| **Snow/Ice**           | 0.60                       | Mid-value reflecting varying conditions from melting snow and ice. *(Horton, 1933; Kane et al., 1991)*                                                                                                                           |
| **Trees**              | 0.15                       | Typical coefficient for forested areas with dense canopy and organic matter. *(Dunne & Leopold, 1978; USDA-NRCS, 2007)*                                                                                                         |
| **Built Areas**        | 0.85                       | Average coefficient for impervious urban surfaces like asphalt and rooftops. *(Ellis, 2006; Chow et al., 1988)*                                                                                                                  |
| **Flooded Vegetation** | 0.80                       | Saturated areas with vegetation have high runoff due to limited infiltration. *(Boyd et al., 2003; Chow et al., 1988)*                                                                                                          |
| **Bare Ground**        | 0.60                       | Typical coefficient for bare, compacted soils with poor infiltration. *(Chow et al., 1988; USDA-NRCS, 2007)*                                                                                                                     |
| **Rangeland**          | 0.30                       | Representative value for sparsely vegetated grasslands. *(Dunne & Leopold, 1978; USDA-NRCS, 2007)*                                                                                                                              |

#### 4. **Raster Layer Creation**
Using ArcGIS Pro:
- The reclassified runoff coefficient values were applied to the resampled land cover raster to create the runoff coefficient layer.
- Each land cover type in the raster was assigned its corresponding runoff coefficient.
- The final runoff coefficient raster was exported as a GeoTIFF for use in modelling.
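
The assignment of coefficients to land cover classes amounts to a lookup table applied cell by cell. The sketch below uses the coefficients from the table above; the integer class codes are assumptions and should be checked against the legend of the land cover raster actually used. The function name `runoff_raster` is hypothetical.

```python
import numpy as np

# Runoff coefficients taken from the table above
RUNOFF_C = {
    "Water": 1.00, "Crops": 0.40, "Snow/Ice": 0.60, "Trees": 0.15,
    "Built Areas": 0.85, "Flooded Vegetation": 0.80,
    "Bare Ground": 0.60, "Rangeland": 0.30,
}

# Assumed integer codes for each class; verify against the dataset legend
CLASS_CODES = {
    1: "Water", 2: "Trees", 4: "Flooded Vegetation", 5: "Crops",
    7: "Built Areas", 8: "Bare Ground", 9: "Snow/Ice", 11: "Rangeland",
}

def runoff_raster(land_cover):
    """Map integer land cover codes to runoff coefficients; unknown codes become NaN."""
    lut = np.full(max(CLASS_CODES) + 1, np.nan)
    for code, name in CLASS_CODES.items():
        lut[code] = RUNOFF_C[name]
    lc = np.asarray(land_cover)
    out = np.full(lc.shape, np.nan)
    valid = (lc >= 0) & (lc < lut.size)
    out[valid] = lut[lc[valid]]
    return out

land_cover = np.array([[1, 2], [7, 10]])  # code 10 has no coefficient here
runoff = runoff_raster(land_cover)
print(runoff)
```

Leaving unmapped codes as `NaN` rather than zero keeps missing classes visible in the output instead of silently treating them as fully permeable.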

### References
- Boyd, M. J., Bufill, M. C., & Knee, R. M. (2003). Pervious and impervious runoff simulation using kinematic wave theory. *Hydrological Processes, 17*(12), 2463-2481.
- Chow, V. T., Maidment, D. R., & Mays, L. W. (1988). *Applied Hydrology.* McGraw-Hill.
- Dunne, T., & Leopold, L. B. (1978). *Water in Environmental Planning.* W. H. Freeman.
- Ellis, J. B. (2006). Sediment processes in urban surface water systems. *Water and Environment Journal, 20*(3), 136-144.
- Horton, R. E. (1933). The role of infiltration in the hydrologic cycle. *Transactions of the American Geophysical Union, 14*, 446-460.
- Kane, D. L., Hinzman, L. D., & Zarling, J. P. (1991). Thermal response of permafrost to climate warming: Two examples. *Cold Regions Science and Technology, 19*(2), 111-122.
- USDA-NRCS (2007). National Engineering Handbook, Part 630 Hydrology, Chapter 10. United States Department of Agriculture.

This runoff coefficient layer will contribute to identifying areas prone to rapid water drainage or retention, helping to assess habitat suitability for amphibians.


---

## 6. Grasslands in Surrounding 250m

### Impact on Amphibians
Grassland habitats play a critical role in supporting amphibian populations by providing essential resources such as foraging opportunities, shelter, and movement corridors. Amphibians use different habitat types during various stages of their life cycle. While aquatic habitats are crucial for breeding, terrestrial habitats, including grasslands, are equally important for their post-breeding dispersal and foraging activities (Beebee & Griffiths, 2005; Cushman, 2006).

The presence of grasslands in the vicinity of breeding sites enhances habitat connectivity, reduces desiccation risks, and increases food availability, thereby facilitating amphibian movement across fragmented landscapes (Semlitsch & Bodie, 2003). Furthermore, grasslands with high vegetation cover can mitigate predation risks and provide microclimatic conditions favourable for amphibians, which are highly sensitive to moisture and temperature variations (Hamer & McDonnell, 2008).

By analysing the **density of grasslands** within a **250m neighbourhood radius** around each cell, this predictor layer captures the potential terrestrial habitat quality surrounding amphibian habitats and facilitates an assessment of landscape permeability for amphibian movement. The 250m distance is based on ecological studies indicating that many amphibian species exhibit limited terrestrial dispersal distances, typically ranging between 100m and 500m from breeding sites (Jehle et al., 2011). This makes grassland density within a 250m radius a relevant predictor for species distribution models.

### Summary of the Process to Create the Predictor Layer

#### **1. Data Acquisition**
- The **EUNIS NatureScot Land Cover Map 2022** was used as the primary data source for land cover classification in central Scotland.
- This raster dataset provides detailed land cover classes, including multiple types of grasslands.

#### **2. Grassland Reclassification**
- Relevant grassland types were identified from the EUNIS classification:
  - Alpine and subalpine grasslands
  - Dry grasslands
  - Mesic grasslands
  - Seasonally wet and wet grasslands
- The grassland types were reclassified into a binary raster:
  - **1** for grassland presence.
  - **0** for non-grassland areas.

#### **3. Grassland Density Calculation**
- The **Focal Statistics Tool** in ArcGIS Pro was used to calculate grassland density within a **250m neighbourhood radius** around each cell.
  - Neighbourhood type: **Circle**.
  - Neighbourhood radius: **250m**.
  - Statistic type: **Mean** (this calculates the proportion of grasslands within the neighbourhood).
- The resulting raster (`Grassland_Density.tif`) contains continuous values ranging from **0** (no grasslands in the neighbourhood) to **1** (100% grassland coverage in the neighbourhood).
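
The focal mean over a circular neighbourhood can be sketched in NumPy as below. It assumes a 30 m cell size for the binary grassland raster, includes a cell when its centre lies within 250 m, and averages only over cells inside the raster at the edges. These are choices of this sketch and may not match the Focal Statistics settings exactly; `circular_focal_mean` is a hypothetical name.

```python
import numpy as np

def circular_focal_mean(binary, radius_m=250.0, cell_size=30.0):
    """Proportion of 1-cells within a circular neighbourhood around each cell."""
    binary = np.asarray(binary, dtype=float)
    rows, cols = binary.shape
    r = int(radius_m // cell_size)  # search radius in whole cells
    padded = np.pad(binary, r, constant_values=0.0)
    inside = np.pad(np.ones_like(binary), r, constant_values=0.0)
    total = np.zeros_like(binary)
    count = np.zeros_like(binary)
    for dy in range(-r, r + 1):
        for dx in range(-r, r + 1):
            # Skip offsets whose cell centre lies outside the circle
            if (dy * cell_size) ** 2 + (dx * cell_size) ** 2 > radius_m ** 2:
                continue
            window = (slice(r + dy, r + dy + rows), slice(r + dx, r + dx + cols))
            total += padded[window]   # grassland cells seen at this offset
            count += inside[window]   # cells that actually exist at this offset
    return total / count

# Toy raster with a single grassland cell in the centre
grass = np.zeros((21, 21))
grass[10, 10] = 1
density = circular_focal_mean(grass)
print(density[10, 10], density[10, 18], density[10, 19])
```

With a binary input, the mean is the share of neighbourhood cells that are grassland, which is why the output stays between 0 and 1.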

#### **4. Output Generation**
The final output is a continuous raster layer representing grassland density within a 250m neighbourhood radius around each cell. This raster layer is ready to be used as a predictor in the species distribution model.

### **References**
- Beebee, T. J. C., & Griffiths, R. A. (2005). *The amphibian decline crisis: A watershed for conservation biology?* Biological Conservation, 125(3), 271-285. https://doi.org/10.1016/j.biocon.2005.04.009

- Cushman, S. A. (2006). *Effects of habitat loss and fragmentation on amphibians: A review and prospectus.* Biological Conservation, 128(2), 231-240. https://doi.org/10.1016/j.biocon.2005.09.031

- Hamer, A. J., & McDonnell, M. J. (2008). *Amphibian ecology and conservation in the urbanising world: A review.* Biological Conservation, 141(10), 2432-2449. https://doi.org/10.1016/j.biocon.2008.07.020

- Jehle, R., Thiesmeier, B., & Foster, J. (2011). *The crested newt: A dwindling pond dweller.* Laurenti-Verlag.

- Semlitsch, R. D., & Bodie, J. R. (2003). *Biological criteria for buffer zones around wetlands and riparian habitats for amphibians and reptiles.* Conservation Biology, 17(5), 1219-1228. https://doi.org/10.1046/j.1523-1739.2003.02177.x


---

## 7. Distance to Forest
### Impact on Amphibians
Forested areas serve as critical corridors for amphibian movement by providing shelter, moisture retention, and protection from predators. These factors are particularly important during post-breeding migrations and dispersal phases, as amphibians are highly sensitive to desiccation and environmental fluctuations (Beebee & Griffiths, 2005; Cushman, 2006). The proximity to forested areas can influence amphibian habitat selection and survival, making **distance to the nearest forest** a key predictor in species distribution modelling (SDM) (Semlitsch & Bodie, 2003; Hartel et al., 2010).

By calculating the distance to forest for each raster cell, we obtain a continuous predictor layer that captures the potential influence of forest proximity on amphibian occurrence.

### Summary of the Process to Create the Predictor Layer

#### **1. Data Acquisition**
- The **EUNIS NatureScot Land Cover Map 2022** was used as the primary data source for identifying forested areas in central Scotland.
- The following EUNIS classes were identified as forest:
  - **G1**: Broadleaved deciduous woodland
  - **G4**: Mixed deciduous and coniferous woodland
  - **G3**: Highly artificial coniferous plantations
  - **G5**: Lines of trees, small woodlands, recently felled woodland, early-stage woodland, and coppice

#### **2. Reclassifying Woodland Areas**
- The woodland areas were reclassified using the **Reclassify Tool** in ArcGIS Pro:
  - **1** for forested areas (G1, G4, G3, G5).
  - **0** for non-forested areas.
- The resulting binary raster (`Woodland_Binary.tif`) was saved for further processing.

#### **3. Extracting Woodland Areas**
- To ensure accurate distance calculations, the **Extract by Attributes Tool** was used to extract only the woodland cells where the value = 1.
- This step resulted in a raster (`Extracted_Woodland.tif`) containing only forested areas, with all non-forested areas set to **NoData**.
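
The reclassification and extraction steps can be illustrated with a short NumPy sketch. The integer codes standing in for the EUNIS classes are hypothetical, since the raster's actual value table is not given here, and the function names are illustrative.

```python
import numpy as np

# Hypothetical integer codes standing in for EUNIS classes in the raster
EUNIS_CODES = {"G1": 41, "G3": 43, "G4": 44, "G5": 45, "E2": 22}
FOREST_CLASSES = ("G1", "G3", "G4", "G5")

def woodland_binary(land_cover):
    """Reclassify: 1 where the cell belongs to a forest class, 0 elsewhere."""
    forest_values = [EUNIS_CODES[c] for c in FOREST_CLASSES]
    return np.isin(land_cover, forest_values).astype(np.uint8)

def extract_woodland(binary):
    """Keep cells equal to 1; set all other cells to NaN (NoData)."""
    return np.where(binary == 1, 1.0, np.nan)

land_cover = np.array([[41, 22],
                       [45, 43]])
binary = woodland_binary(land_cover)
extracted = extract_woodland(binary)
print(binary)
print(extracted)
```

Setting non-forest cells to NoData rather than 0 matters because a distance tool treats every valued cell as a source, so zeros left in place would count as forest.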

#### **4. Calculating Distance to Woodland**
- The **Euclidean Distance Tool** in ArcGIS Pro was used to calculate the shortest distance from each cell to the nearest woodland:
  - **Input Raster**: `Extracted_Woodland.tif`
  - **Output Distance Units**: Metres.
  - **Output Raster**: `Distance_to_Woodland.tif`
- The resulting continuous raster represents the distance to the nearest forest for each cell.

### Output
- The final output is a continuous raster layer (`Distance_to_Woodland.tif`) where:
  - Cells within forested areas have a distance of **0**.
  - Cells outside forested areas have positive values representing the shortest distance (in metres) to the nearest forest.
- This layer is ready to be used as a continuous predictor in species distribution modelling.

### Rationale
Forests are known to influence amphibian distribution by acting as dispersal corridors and providing essential microhabitats (Semlitsch & Bodie, 2003). Calculating **Euclidean distance** from each cell to the nearest forest allows us to quantify the accessibility of these critical habitats for amphibians across the landscape. Euclidean distance has been widely used in ecological modelling to assess proximity-based habitat effects (Cushman, 2006; Hartel et al., 2010).

### **References**
- Beebee, T. J. C., & Griffiths, R. A. (2005). *The amphibian decline crisis: A watershed for conservation biology?* Biological Conservation, 125(3), 271-285. https://doi.org/10.1016/j.biocon.2005.04.009

- Cushman, S. A. (2006). *Effects of habitat loss and fragmentation on amphibians: A review and prospectus.* Biological Conservation, 128(2), 231-240. https://doi.org/10.1016/j.biocon.2005.09.031

- Hartel, T., Schweiger, O., Öllerer, K., Cogălniceanu, D., & Arntzen, J. W. (2010). *Amphibian distribution in a traditionally managed rural landscape of Eastern Europe: Probing the effect of landscape composition.* Biological Conservation, 143(5), 1118-1124. https://doi.org/10.1016/j.biocon.2010.02.006

- Semlitsch, R. D., & Bodie, J. R. (2003). *Biological criteria for buffer zones around wetlands and riparian habitats for amphibians and reptiles.* Conservation Biology, 17(5), 1219-1228. https://doi.org/10.1046/j.1523-1739.2003.02177.x

---

## 8. Urban Density
### Impact on Amphibians
Urbanisation significantly alters natural landscapes, affecting amphibian populations by fragmenting habitats, increasing impermeable surfaces, and creating barriers to movement (Cushman, 2006; Hamer & McDonnell, 2008). Built-up areas reduce habitat permeability and increase desiccation risks, which is critical for amphibians that rely on moist environments for survival and dispersal. Therefore, quantifying **average building density within a 250m neighbourhood radius** is essential to assess the impact of urbanisation on amphibian habitat suitability and movement patterns.

By calculating building density using Kernel Density, we obtain a continuous predictor layer that captures urbanisation pressure across the study area.

### Summary of the Process to Create the Predictor Layer

#### **1. Data Acquisition**
- Building footprint polygons were downloaded from **OpenStreetMap (OSM)** using **Geofabrik’s data portal**, covering the study area in central Scotland.
- The dataset provides detailed polygon representations of buildings, necessary for calculating urban density at a fine spatial scale.

#### **2. Kernel Density Calculation**
- The **Kernel Density Tool** in ArcGIS Pro was used to calculate the density of buildings within a **250m neighbourhood radius**:
  - **Input Features**: Building footprint polygon layer.
  - **Population Field**: A constant value of **1** was used to ensure equal weighting for all buildings.
  - **Output Cell Size**: Set to **30m** to match the resolution of other predictors.
  - **Search Radius**: Set to **250m** based on ecological studies indicating typical amphibian dispersal distances (Jehle et al., 2011; Cushman, 2006).
  - **Output Cell Values**: The **Densities** option was selected, resulting in a continuous raster with values representing the **density of buildings per unit area**.
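
To make the kernel density step more concrete, the sketch below evaluates a quartic kernel at a few cell centres. It rests on several assumptions that are not stated in the notes above: each building is reduced to a single centroid point, the kernel has the quartic form commonly described for GIS kernel density tools, and the coordinates are hypothetical values in metres. The exact scaling and area units used by ArcGIS Pro may differ.

```python
import math

def kernel_density(points, cell_centres, radius=250.0, population=1.0):
    """Quartic-kernel density (features per square metre) at each cell centre."""
    densities = []
    for cx, cy in cell_centres:
        total = 0.0
        for px, py in points:
            d = math.hypot(px - cx, py - cy)
            if d < radius:
                # Points closer to the cell centre contribute more.
                total += (3.0 / math.pi) * population * (1.0 - (d / radius) ** 2) ** 2
        densities.append(total / radius ** 2)
    return densities

buildings = [(0.0, 0.0), (100.0, 0.0), (600.0, 0.0)]  # hypothetical centroids
cells = [(0.0, 0.0), (300.0, 0.0), (1000.0, 0.0)]     # hypothetical cell centres
density = kernel_density(buildings, cells)
```

The first cell has two buildings inside its 250m radius and gets the highest value, the second has one building near the edge of its radius, and the third has none and evaluates to 0.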

#### **3. Normalisation**
- The output raster values represented building density but ranged up to **2719.97**.
- To ensure the raster was scaled consistently with other predictors (i.e., within a **0 to 1 range**), it was normalised by dividing each cell value by the **maximum observed density**:

  $$
  \text{Normalised Density} = \frac{\text{Building Density}}{2719.97}
  $$

- The resulting raster had values between **0** (no buildings) and **1** (maximum observed building density).
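
The normalisation formula above amounts to a single division per cell. The following sketch applies it to a small nested list standing in for the raster; apart from the maximum of 2719.97 reported above, the cell values and the function name `normalise_by_max` are hypothetical.

```python
def normalise_by_max(raster):
    """Divide every cell by the maximum cell value, giving a 0-1 range."""
    max_value = max(max(row) for row in raster)
    if max_value <= 0:
        raise ValueError("maximum density must be positive")
    return [[value / max_value for value in row] for row in raster]

# Hypothetical density values; only the maximum comes from the text.
density = [
    [0.0, 680.0],
    [1359.985, 2719.97],
]
normalised = normalise_by_max(density)
```

Dividing by the maximum keeps 0 as "no buildings", which a min-max rescaling would also do here only because the minimum density is 0.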

### Output
- The final output is a **30m resolution GeoTIFF raster** with values ranging from **0 to 1**, representing normalised building density within a **250m neighbourhood radius**.
- Cells with values close to **1** indicate high urban density, while values close to **0** indicate low or no urban density.
- This layer is ready to be used as a continuous predictor in species distribution modelling.

### Rationale
Urban density is a critical factor influencing amphibian habitat suitability by reducing connectivity and altering local environmental conditions (Hamer & McDonnell, 2008). Amphibians rely on terrestrial habitats for post-breeding dispersal and foraging, making urbanisation an important consideration in habitat modelling. By calculating the **average building density within a 250m radius**, this predictor layer captures the influence of built-up areas on amphibian movement and survival. The **250m neighbourhood radius** was selected based on ecological studies indicating amphibian dispersal distances of **100m to 500m** (Jehle et al., 2011; Cushman, 2006).

### **References**
- Cushman, S. A. (2006). *Effects of habitat loss and fragmentation on amphibians: A review and prospectus.* Biological Conservation, 128(2), 231-240. https://doi.org/10.1016/j.biocon.2005.09.031  
- Hamer, A. J., & McDonnell, M. J. (2008). *Amphibian ecology and conservation in the urbanising world: A review.* Biological Conservation, 141(10), 2432-2449. https://doi.org/10.1016/j.biocon.2008.07.020  
- Jehle, R., Thiesmeier, B., & Foster, J. (2011). *The crested newt: A dwindling pond dweller.* Laurenti-Verlag.

---

## 9. Traffic Intensity
### Importance for Amphibians
Road traffic poses significant threats to amphibians by causing habitat fragmentation, direct mortality from vehicle collisions, and disruption of movement patterns (Cushman, 2006; Hamer & McDonnell, 2008). Amphibians are particularly vulnerable to roads due to their low mobility and reliance on both aquatic and terrestrial habitats. The **traffic intensity predictor layer** quantifies the cumulative impact of road traffic within a defined radius, enabling the inclusion of road-related pressures in species distribution models (SDMs).

Studies suggest that road-effect zones for amphibians extend up to **1,000–1,500 metres** depending on species and landscape characteristics (Carr & Fahrig, 2002; Hartel et al., 2010). This predictor helps assess habitat suitability and connectivity by incorporating the influence of traffic intensity.

### Summary of the Process to Create the Predictor Layer

#### **1. Data Acquisition**
- **Road Layer**: OpenStreetMap (OSM) roads data was used as the base road network.
- **Traffic Data**: Annual average daily traffic (AADT) counts from the UK Department for Transport (DfT) were spatially joined to the road layer.

#### **2. Assigning Traffic Counts to Road Segments**
- A **spatial join** was performed to assign traffic counts to road segments based on proximity to traffic counters.
  - **Join Operation**: Closest, with a search radius of **50m**.
- Null values in the traffic count field were populated using **mean or median traffic counts** by road type to ensure no road segments were left without data.
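
The join-and-fill logic of this step can be sketched in plain Python. This is a simplified illustration under stated assumptions: road segments are represented by a single midpoint rather than their full geometry, the `road_type`, `x` and `y` fields and all values are hypothetical, and the median is used for filling (the notes above mention either the mean or the median). Only the `all_motor_vehicles` field name and the 50m search radius come from the text.

```python
import math
from statistics import median

def assign_counts(segments, counters, search_radius=50.0):
    """Attach the closest counter's count to each segment within the search radius."""
    for seg in segments:
        best = None
        for counter in counters:
            d = math.hypot(seg["x"] - counter["x"], seg["y"] - counter["y"])
            if d <= search_radius and (best is None or d < best[0]):
                best = (d, counter["all_motor_vehicles"])
        seg["all_motor_vehicles"] = best[1] if best else None
    return segments

def fill_missing_by_road_type(segments):
    """Replace missing counts with the median count of the same road type."""
    by_type = {}
    for seg in segments:
        if seg["all_motor_vehicles"] is not None:
            by_type.setdefault(seg["road_type"], []).append(seg["all_motor_vehicles"])
    for seg in segments:
        if seg["all_motor_vehicles"] is None and seg["road_type"] in by_type:
            seg["all_motor_vehicles"] = median(by_type[seg["road_type"]])
    return segments

counters = [
    {"x": 10.0, "y": 0.0, "all_motor_vehicles": 12000},
    {"x": 1020.0, "y": 0.0, "all_motor_vehicles": 8000},
    {"x": 0.0, "y": 530.0, "all_motor_vehicles": 900},
]
segments = [
    {"road_type": "primary", "x": 0.0, "y": 0.0},
    {"road_type": "primary", "x": 500.0, "y": 0.0},
    {"road_type": "primary", "x": 1000.0, "y": 0.0},
    {"road_type": "residential", "x": 0.0, "y": 500.0},
    {"road_type": "residential", "x": 300.0, "y": 500.0},
]
segments = fill_missing_by_road_type(assign_counts(segments, counters))
```

Segments with no counter within 50m start out with no count and then inherit the median of their road type, so every segment ends up with a value before the density step.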

#### **3. Line Density Analysis**
- The **Line Density Tool** in ArcGIS Pro was used to calculate the cumulative traffic intensity:
  - **Input Features**: Road layer with traffic counts (`all_motor_vehicles`).
  - **Population Field**: `all_motor_vehicles`.
  - **Search Radius**: **1,000m** (based on ecological studies indicating the road-effect zone for amphibians).
  - **Output Cell Size**: **30m**, consistent with other predictor layers.
- The resulting raster (`Traffic_Intensity.tif`) represents the cumulative traffic intensity within a 1,000m radius of each cell.
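
As a rough illustration of what a traffic-weighted line density means for a single cell, the sketch below sums the length of each road segment that falls inside the 1,000m circle, weights it by the traffic count, and divides by the circle area. This is an assumption about the general form of the calculation, not a reproduction of the ArcGIS Pro tool, whose unit scaling may differ; the road coordinates and counts are hypothetical.

```python
import math

def length_inside_circle(a, b, centre, radius):
    """Length of the straight segment a-b that falls inside a circle."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    fx, fy = a[0] - centre[0], a[1] - centre[1]
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0:
        return 0.0
    # Solve |a + t*(b - a) - centre|^2 = radius^2 for t.
    qb = 2 * (fx * dx + fy * dy)
    qc = fx * fx + fy * fy - radius * radius
    disc = qb * qb - 4 * seg_len_sq * qc
    if disc <= 0:
        return 0.0  # the infinite line misses (or only touches) the circle
    root = math.sqrt(disc)
    t0 = max(0.0, (-qb - root) / (2 * seg_len_sq))
    t1 = min(1.0, (-qb + root) / (2 * seg_len_sq))
    return max(0.0, t1 - t0) * math.sqrt(seg_len_sq)

def line_density(segments, centre, radius=1000.0):
    """Sum of (length inside circle x traffic count), divided by the circle area."""
    weighted = sum(length_inside_circle(a, b, centre, radius) * count
                   for a, b, count in segments)
    return weighted / (math.pi * radius ** 2)

roads = [
    ((-2000.0, 0.0), (2000.0, 0.0), 10000),      # passes through the cell centre
    ((-2000.0, 5000.0), (2000.0, 5000.0), 500),  # too far away to contribute
]
intensity = line_density(roads, centre=(0.0, 0.0))
```

In this example only the first road contributes: 2,000m of it lies inside the circle, so the cell value is 2,000 × 10,000 divided by the circle area.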

#### **4. Normalisation**
- The traffic intensity raster was normalised to a 0–1 range using the Raster Calculator:
  - Expression:
    ```plaintext
    "Traffic_Intensity" / max_value
    ```
  - Here `max_value` is the maximum cell value of `Traffic_Intensity.tif`; dividing by it keeps the layer on the same 0–1 scale as the other predictors.

### **Output**
- **Final Predictor**: A normalised raster (`Normalised_Traffic_Intensity.tif`) with values ranging from **0** (low/no traffic) to **1** (high traffic intensity).
- This layer captures the spatial distribution of traffic pressure on amphibians and is ready for use in SDMs.


### **Rationale**
By including traffic intensity as a predictor, this layer accounts for the negative effects of roads on amphibian populations, enabling better-informed conservation and mitigation planning. The 1,000m radius reflects the extent of the road-effect zone as indicated by ecological studies (Carr & Fahrig, 2002; Hartel et al., 2010).


### **References**
- Carr, L. W., & Fahrig, L. (2002). *Effect of road traffic on two amphibian species of differing vagility*. Conservation Biology, 16(1), 331-340. https://conbio.onlinelibrary.wiley.com/doi/10.1046/j.1523-1739.2001.0150041071.x

- Cushman, S. A. (2006). *Effects of habitat loss and fragmentation on amphibians: A review and prospectus*. Biological Conservation, 128(2), 231-240. https://doi.org/10.1016/j.biocon.2005.09.031

- Hamer, A. J., & McDonnell, M. J. (2008). *Amphibian ecology and conservation in the urbanising world: A review*. Biological Conservation, 141(10), 2432-2449. https://doi.org/10.1016/j.biocon.2008.07.020

- Hartel, T., Schweiger, O., Öllerer, K., Cogălniceanu, D., & Arntzen, J. W. (2010). *Amphibian distribution in a traditionally managed rural landscape of Eastern Europe: Probing the effect of landscape composition*. Biological Conservation, 143(5), 1118-1124. https://doi.org/10.1016/j.biocon.2010.02.006

---

## 10. Pollution Levels

### Importance for Amphibians
Amphibians are highly susceptible to environmental pollution due to their permeable skin, which facilitates direct chemical absorption from the environment. Among various pollutants, **nitrogen oxides (NOx)** have been shown to directly impact amphibian populations by altering their aquatic habitats.

- **Aquatic Habitat Degradation**: NOx contributes to acid rain, which acidifies aquatic habitats, reducing breeding success and larval survival. Amphibians exposed to low pH environments experience higher mortality rates and developmental issues (Beebee & Griffiths, 2005).

Given the significant role of pollution in shaping amphibian distributions, the inclusion of NOx as a pollution predictor in species distribution modelling (SDM) provides a more accurate representation of environmental pressures.

### Process to Create Pollution Predictors

#### **Data Acquisition**
Pollution data for **NOx** was obtained from the [Scottish Air Quality Mapping Data](https://www.scottishairquality.scot/data/mapping/data). NOx was selected based on its well-documented impacts on amphibians as described in the literature. The dataset provided annual average concentrations across Scotland, ensuring comprehensive coverage of the study area.

#### **Data Processing**
The NOx dataset was processed using ArcGIS Pro to create a standardised predictor layer for inclusion in the SDM. The following steps were undertaken:

1. **Loading Data into GIS**:  
   The NOx dataset was imported into ArcGIS Pro. The extent of the dataset was inspected to confirm that it fully covered the study area. Since the data was initially in a geographic coordinate system (WGS84), it was reprojected to **British National Grid (EPSG:27700)** to maintain consistency with other spatial predictors.

2. **Interpolating Point Data Using IDW**:  
   Since the NOx dataset consisted of point measurements, an interpolated surface was created using the **Inverse Distance Weighting (IDW)** tool. IDW was selected for its efficiency in generating smooth, continuous surfaces from point data. The following parameters were applied:
   - **Power**: 2 (default value, emphasising nearby points)
   - **Cell size**: 30m (to match the resolution of other predictors)
   - **Search radius**: Variable, using the 12 nearest points

   The output of the IDW interpolation was saved as `NOx_Interpolated.tif`.

3. **Clipping to Study Area**:  
   The interpolated pollution dataset was clipped to the study area boundary using the **Clip Raster Tool**. This step ensured that only data relevant to the study area was retained, thereby improving computational efficiency.

4. **Resampling to Standard Resolution**:  
   To ensure uniform spatial resolution across all predictors, the clipped NOx raster was resampled to a **30m cell size** using the **Resample Tool**. Bilinear interpolation was applied to preserve the continuous nature of pollution data. This resolution was chosen to match other environmental predictors and maintain a balance between spatial detail and computational feasibility.

5. **Standardisation**:  
   The NOx raster was standardised to a **0–1 range** to facilitate comparison with other predictors. This process involved applying the following normalisation formula in the **Raster Calculator**:
   ```plaintext
   ("NOx_Interpolated" - min_value) / (max_value - min_value)
   ```
   where `min_value` and `max_value` represent the minimum and maximum NOx concentrations in the dataset. The resulting raster was saved as `Standardised_NOx.tif`.
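
The IDW interpolation in step 2 above can be sketched as follows. This is a minimal illustration of the weighting scheme with the stated parameters (power of 2, 12 nearest points), not the ArcGIS Pro tool itself; the function name `idw` and the sample coordinates and concentrations are hypothetical.

```python
import math

def idw(points, x, y, power=2, n_nearest=12):
    """Inverse-distance-weighted estimate at (x, y) from the n nearest sample points."""
    nearest = sorted(
        ((math.hypot(px - x, py - y), value) for px, py, value in points),
        key=lambda pair: pair[0],
    )[:n_nearest]
    if nearest[0][0] == 0:
        return nearest[0][1]  # the location coincides with a sample point
    weights = [1.0 / d ** power for d, _ in nearest]
    return sum(w * v for w, (_, v) in zip(weights, nearest)) / sum(weights)

samples = [(0.0, 0.0, 10.0), (100.0, 0.0, 30.0)]  # hypothetical (x, y, NOx) points
at_sample = idw(samples, 0.0, 0.0)
midpoint = idw(samples, 50.0, 0.0)
near_first = idw(samples, 25.0, 0.0)
```

With a power of 2, a point three times closer carries nine times the weight, which is why the estimate at 25m from the first sample stays close to that sample's value.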

### Output
The final output consisted of a standardised raster layer representing the relative concentrations of **NOx** within the study area. The layer contained values ranging from **0** (low pollution) to **1** (high pollution). This layer serves as a continuous predictor for modelling amphibian distributions.

### Rationale
Nitrogen oxides (**NOx**) were selected as a key pollution predictor due to their known role in altering amphibian habitats. NOx contributes to acidification of aquatic habitats, which directly impacts amphibian reproductive success and larval development (Beebee & Griffiths, 2005). Including NOx in the SDM enables the model to account for spatial variability in anthropogenic pressures, improving its accuracy in predicting suitable habitats for amphibians.

### References
- Beebee, T. J. C., & Griffiths, R. A. (2005). *The amphibian decline crisis: A watershed for conservation biology?* Biological Conservation, 125(3), 271-285. https://doi.org/10.1016/j.biocon.2005.04.009

- Egea-Serrano, A., Relyea, R. A., Tejedo, M., & Torralva, M. (2012). *Understanding of the impact of chemicals on amphibians: A meta-analytic review.* Ecology and Evolution, 2(7), 1382-1397. https://www.biology.pitt.edu/sites/default/files/facilities-images/Relyea_pubs/2012%20Egea-Serrano%20et%20al.pdf

---

## 11. Soil Moisture Variability

### Importance for Amphibians
Amphibians are highly sensitive to changes in moisture levels due to their permeable skin and reliance on moist environments for hydration and movement. Soil moisture plays a critical role in amphibian habitat suitability, particularly during terrestrial phases of their life cycle. Variability in soil moisture can lead to desiccation risks and reduced habitat connectivity, ultimately impacting amphibian populations (Rittenhouse et al., 2008; Ousterhout & Semlitsch, 2018).

Projected increases in climate variability may exacerbate soil moisture fluctuations, making it essential to include this predictor in species distribution models (SDMs) to assess habitat suitability under future climatic conditions.

### Process to Create the Soil Moisture Variability Layer

#### **Data Acquisition**
Soil moisture data was sourced from publicly available climate datasets that provide long-term spatial and temporal soil moisture information. The following sources were considered:
- **Copernicus Global Land Service (CGLS)**: Satellite-derived soil moisture data.
- **ESA Climate Change Initiative (CCI)**: Long-term soil moisture datasets.
- **ERA5-Land (ECMWF)**: High-resolution reanalysis data providing daily and monthly soil moisture values.

#### **Data Processing**

1. **Download Soil Moisture Data**  
   Monthly or daily soil moisture rasters covering the study area for a period of 10–20 years were downloaded to capture temporal variability.

2. **Load Data into GIS**  
   The downloaded rasters were imported into ArcGIS Pro, and their extents were inspected to ensure they fully covered the study area. All rasters were reprojected to **British National Grid (EPSG:27700)** for consistency with other predictors.

3. **Calculate Temporal Variability**  
   Temporal variability in soil moisture was calculated using the **Cell Statistics Tool** in ArcGIS Pro, applying the **standard deviation** function across the time series:
   ```plaintext
   StdDev("SoilMoisture_Month1", "SoilMoisture_Month2", ..., "SoilMoisture_MonthN")
   ```
   The resulting standard-deviation raster was rescaled to a 0–1 range and saved as `Standardised_SoilMoisture_Variability.tif`.
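
The per-cell calculation behind the standard deviation step can be illustrated with a short sketch. It assumes the population standard deviation (the tool's exact definition may differ) and uses three hypothetical 2×2 monthly rasters; the function name `temporal_std` is also hypothetical.

```python
from statistics import pstdev

def temporal_std(stack):
    """Per-cell standard deviation across a list of equally shaped rasters."""
    rows, cols = len(stack[0]), len(stack[0][0])
    return [[pstdev([layer[r][c] for layer in stack]) for c in range(cols)]
            for r in range(rows)]

# Hypothetical monthly soil moisture rasters (volumetric fraction).
month1 = [[0.30, 0.20], [0.25, 0.40]]
month2 = [[0.30, 0.30], [0.25, 0.10]]
month3 = [[0.30, 0.40], [0.25, 0.25]]
variability = temporal_std([month1, month2, month3])
```

Cells whose moisture never changes get a variability of 0, while cells with large month-to-month swings get the highest values.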

### Output
The final output is a standardised raster layer representing soil moisture variability across the study area. Values range from **0** (low variability) to **1** (high variability). This predictor will be used in the SDM to model amphibian habitat suitability more accurately by accounting for spatial and temporal variability in soil moisture.

### Rationale
Soil moisture variability is a key factor affecting amphibian distribution and survival. High variability in soil moisture increases desiccation risks and reduces the suitability of terrestrial habitats. By incorporating this predictor, the SDM captures critical habitat dynamics and improves predictions of amphibian distributions under different environmental conditions (Rittenhouse et al., 2008; Ousterhout & Semlitsch, 2018).

### References
- Rittenhouse, T. A. G., Harper, E. B., Rehard, L. R., & Semlitsch, R. D. (2008). *The role of microhabitats in the desiccation and survival of amphibians in recently harvested oak-hickory forest*. Copeia, 2008(4), 807-814. https://doi.org/10.1643/CH-07-265
- Ousterhout, B. H., & Semlitsch, R. D. (2018). *Measuring terrestrial movement behavior using passive integrated transponder (PIT) tags: effects of tag loss and habitat selection*. Herpetological Conservation and Biology, 13(2), 334-342.

---

## 12. Drought Risk

### Importance for Amphibians

Amphibians are highly sensitive to changes in moisture levels due to their permeable skin and reliance on aquatic environments for breeding and larval development. Projected increases in drought frequency and duration pose significant threats to amphibian populations by:

- **Habitat Desiccation**: Increased drought leads to the drying of wetlands and breeding ponds, resulting in loss of suitable habitats for amphibian reproduction. This can cause population declines, especially in species with specific habitat requirements (Kirkpatrick Baird et al., 2023).

- **Reduced Breeding Success**: Drought conditions can alter the hydroperiods of breeding sites, leading to desiccation of egg masses and increased mortality of larvae before metamorphosis. This disruption in reproductive cycles can have long-term impacts on population dynamics (Beebee & Griffiths, 2005).

- **Increased Competition and Predation**: As water bodies shrink, amphibians are forced into smaller areas, increasing competition for resources and vulnerability to predators. This heightened stress can further exacerbate population declines (Hamer & McDonnell, 2008).

Incorporating drought risk into species distribution models (SDMs) is crucial for accurately predicting amphibian distributions under changing climatic conditions.


### Process to Create Drought Risk Predictors

#### **Data Acquisition**

Drought risk data was obtained from the study by Kirkpatrick Baird et al. (2023), which provides projections of extreme drought frequency and duration in Scotland for the period 2021-2040. The data includes spatially explicit information on drought occurrence, allowing for detailed analysis within the study area.

#### **Data Processing**

The drought risk data was processed using ArcGIS Pro to create a predictor layer for inclusion in the SDM. The following steps were undertaken:

1. **Loading Data into GIS**:  
   The drought projection data was imported into ArcGIS Pro. The dataset was reviewed to ensure it encompassed the entire study area. If necessary, the data was reprojected to align with the coordinate system used for other environmental predictors.

2. **Clipping to Study Area**:  
   The dataset was clipped to the study area boundary to focus the analysis on the region of interest. This step also enhances computational efficiency by excluding extraneous data.

3. **Resampling to Standard Resolution**:  
   To maintain consistency with other predictors, the drought risk raster was resampled to a 30m cell size using bilinear interpolation. This resolution balances spatial detail with computational feasibility.

4. **Standardisation**:  
   The drought risk values were standardised to a 0–1 range to facilitate integration with other predictors. This was achieved using the following formula in the Raster Calculator:

   ```plaintext
   ("Drought_Risk_Layer" - min_value) / (max_value - min_value)
   ```
   where `min_value` and `max_value` represent the minimum and maximum drought risk values in the dataset. The resulting raster was saved as `Standardised_Drought_Risk.tif`.
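
The standardisation formula in step 4 can be checked on a small example. The sketch below applies min-max scaling to a nested list standing in for the drought risk raster; the values and the function name `min_max_standardise` are hypothetical.

```python
def min_max_standardise(raster):
    """Rescale cell values to 0-1 using (value - min) / (max - min)."""
    values = [v for row in raster for v in row]
    min_value, max_value = min(values), max(values)
    if max_value == min_value:
        raise ValueError("raster is constant; min-max scaling is undefined")
    span = max_value - min_value
    return [[(v - min_value) / span for v in row] for row in raster]

# Hypothetical drought risk values.
drought_risk = [
    [2.0, 4.0],
    [6.0, 10.0],
]
standardised = min_max_standardise(drought_risk)
```

Unlike division by the maximum, this maps the lowest observed value to 0 even when that value is not zero, so a standardised 0 means "lowest risk in the study area" rather than "no risk".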

### Output
The final output is a standardised raster layer representing the relative drought risk within the study area. Values range from **0** (low drought risk) to **1** (high drought risk). This layer serves as a continuous predictor in the SDM, enabling the assessment of how drought conditions may influence amphibian distributions.

### Rationale
Drought poses a significant threat to amphibians by altering the availability and quality of essential aquatic habitats. By incorporating a drought risk predictor into the SDM, the model can account for areas that may become unsuitable due to increased drought frequency and duration. This integration enhances the model's ability to predict future habitat suitability under climate change scenarios, providing valuable insights for conservation planning.

### References

- Beebee, T. J. C., & Griffiths, R. A. (2005). The amphibian decline crisis: A watershed for conservation biology? Biological Conservation, 125(3), 271-285. https://doi.org/10.1016/j.biocon.2005.04.009

- Hamer, A. J., & McDonnell, M. J. (2008). Amphibian ecology and conservation in the urbanising world: A review. Biological Conservation, 141(10), 2432-2449. https://doi.org/10.1016/j.biocon.2008.07.020

- Kirkpatrick Baird, F., Spray, D., Hall, J., & Partridge, J. S. (2023). Projected increases in extreme drought frequency and duration by 2040 affect specialist habitats and species in Scotland. Ecological Solutions and Evidence. https://doi.org/10.1002/2688-8319.12256

---

## 13. Distance to Rock-Gravel-Sand
### Impact on Amphibians
- The availability of rock, gravel, and sand can affect breeding habitats and the microhabitat conditions necessary for different amphibian species.

### Data Processing
- **Acquisition**: Map the locations of rock, gravel, and sand areas from geological surveys.
- **Processing**: Calculate distance to these areas, creating a distance raster layer to include in habitat models.

---

# Conclusion
Following these steps will ensure that each environmental predictor is effectively processed and integrated into habitat suitability models, enabling a comprehensive understanding of how these factors influence amphibian movement and population dynamics.