# Finding the major descriptors of species networks

Tanya Strydom ([ORCID: 0000-0001-6067-1349](https://orcid.org/0000-0001-6067-1349)) (School of Biosciences, University of Sheffield, Sheffield, UK)  
Andrew P. Beckerman ([ORCID: 0000-0002-7859-8394](https://orcid.org/0000-0002-7859-8394)) (School of Biosciences, University of Sheffield, Sheffield, UK)  
January 29, 2026

TODO

# Introduction

To bridge the gap between the original paper and your new objectives, your introduction could follow this logical flow:

## The Evolution of Ecological Network Theory

The Hook: Acknowledge the foundational shift from viewing biodiversity as a simple “species count” to viewing it as a complex web of interactions.

The Baseline: Summarize the core findings of the paper you’re expanding on—specifically, how the architecture (e.g., compartmentalization vs. nestedness) affects stability differently in mutualistic vs. trophic networks.

## The Need for Dimensionality

The Gap: Argue that while “connectance” and “nestedness” are vital, they don’t capture the full resolution of ecosystem dynamics.

The Expansion: Introduce the necessity of more nuanced metrics (e.g., motifs, centrality, modularity, and beta-diversity of interactions) to capture the “hidden” stability of diverse networks.

## Linking Structure to Ecosystem Function (EF)

The Framework: Explicitly connect structural metrics to the “Stability-Complexity” debate.

The Hypothesis: Propose how specific structural arrangements (like high modularity) act as “firewalls” to prevent the spread of perturbations, thereby maintaining ecosystem function under stress.

## Objectives

The overarching goal of this study is to move beyond bipartite generalizations and define a comprehensive “structural fingerprint” of ecosystem stability. To achieve this, we address two primary objectives:

**Identification of a Core Structural Subset**

Ecological networks are characterized by a high degree of collinearity among structural descriptors. We aim to determine whether the 31 metrics analyzed in this study can be reduced to a **Minimum Sufficient Set**—a small, non-redundant group of indicators that capture the essential topological features of an ecosystem. By employing multivariate techniques such as **Variable Clustering** and **SVD Complexity**, we seek to move away from arbitrary metric selection toward a data-driven framework for network characterization.
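The idea of pruning collinear descriptors can be illustrated with a minimal Python sketch. It is not the Variable Clustering procedure named above: it swaps in a simpler greedy filter that keeps a metric only if its absolute Pearson correlation with every metric already kept is below a cutoff. The function names, the cutoff of 0.8, and the toy values are all assumptions made for illustration and are not data from this study.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def greedy_non_redundant(metrics, cutoff=0.8):
    """Keep a metric only if |r| with every already-kept metric is below cutoff.

    `metrics` maps a metric name to its values across networks.
    The result depends on the order in which metrics are listed.
    """
    kept = []
    for name, values in metrics.items():
        if all(abs(pearson(values, metrics[k])) < cutoff for k in kept):
            kept.append(name)
    return kept


# Hypothetical metric values for five networks (illustration only).
toy_metrics = {
    "Links": [10, 20, 30, 40, 50],
    "L/S": [1.0, 2.1, 2.9, 4.2, 5.0],
    "Connectance": [0.30, 0.10, 0.25, 0.12, 0.20],
}

selected = greedy_non_redundant(toy_metrics)
print(selected)
```

In this toy case `L/S` tracks `Links` almost perfectly and is dropped, while `Connectance` is retained. A greedy filter is order-dependent, which is one reason a clustering-based approach may be preferable for the real analysis.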

**Mapping the Multi-dimensional Stability Landscape**

Building on the “stability-complexity” debate (McCann 2000, Ives & Carpenter 2007), we aim to map how these diverse structural metrics correlate with different facets of ecosystem health. Specifically, we test the following hypotheses:

- **The Robustness Hypothesis:** Metrics of redundancy (e.g., *Connectance, MaxSim*) will be the primary predictors of resistance to primary species loss.

- **The Containment Hypothesis:** Modular structures (e.g., *Clust, Modularity*) will correlate with system-wide resilience by preventing the propagation of local perturbations.

- **The Dynamic Capacity Hypothesis:** Information-theoretic measures (e.g., *SVD Complexity, Spectral Radius*) will provide a superior bridge between static topology and the dynamic ability of the system to return to equilibrium.

This study expands the taxonomic and structural scope of previous models to provide a generalized rulebook for network-mediated stability.

## Synthesis: Linking to “Stability”

These metrics can be grouped into **three functional categories**:

1.  **Robustness Metrics:** (Richness, Connectance, Robustness, MaxSim) — These describe how many “hits” the network can take before collapsing.

2.  **Efficiency/Flow Metrics:** (Path, ChLen, TL, Diameter) — These describe how quickly energy or perturbations move through the system.

3.  **Organization Metrics:** (ρ, Complexity, Modularity/Clust, Intervality) — These describe the “logic” of the arrangement, which dictates whether the system behaves predictably or chaotically.

Blah blah blah @vermaatMajorDimensionsFoodweb2009

*“It is incumbent on network ecologists to establish clearly the independence and uniqueness of the descriptive metrics used.”* - @lauEcologicalNetworkMetrics2017

| Dimension | Key Metrics | Expected Effect on Stability | Supporting Literature |
|------------------|------------------|------------------|------------------|
| Complexity & Redundancy | Connectance, MaxSim, Links | **Positive:** High redundancy allows for “functional compensation” if one species is lost. | @dunneFoodwebStructureNetwork2002; @mccannDiversityStabilityDebate2000 |
| Compartmentalization | Clust, Modularity, ρ | **Positive:** Limits the spread of perturbations; local collapses don’t become global. | @stoufferCompartmentalizationIncreasesFoodweb2011 |
| Feedback & Coupling | Omnivory (S2), Loop, ChLen | **Variable:** Omnivory can stabilize by diffusing energy, but long chains can amplify oscillations. | @mccannDiversityStabilityDebate2000; @neutelStabilityRealFood2002 |
| Hierarchy & Shape | Prey:Predator, Basal, Top | **Critical:** “Bottom-heavy” systems are generally more stable; inverted pyramids are fragile. |  |
| Information Heterogeneity | SVD Complexity, LinkSD | **Positive:** Diverse interaction strengths prevent “resonant” instabilities. | @ulanowiczInformationTheoryEcology2001 |


# Materials and Methods

<table style="width:100%;">
<colgroup>
<col style="width: 3%" />
<col style="width: 36%" />
<col style="width: 45%" />
<col style="width: 13%" />
</colgroup>
<thead>
<tr>
<th>Label</th>
<th>Definition</th>
<th>Ecological Significance</th>
<th>Reference (for maths)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Basal</td>
<td>Percentage of basal taxa, defined as species with a generality of zero (taxa without resources)</td>
<td>Measures the energy entry points; high basal % suggests a bottom-heavy, potentially more stable energy base.</td>
<td></td>
</tr>
<tr>
<td>Connectance</td>
<td><span class="math inline">\(L/S^2\)</span>, where <span class="math inline">\(S\)</span> is the number of species and <span class="math inline">\(L\)</span> the number of links</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Cannibal</td>
<td>Percentage of species that are cannibals</td>
<td></td>
<td></td>
</tr>
<tr>
<td>ChLen</td>
<td>Mean food chain length, averaged over all species (where a food chain is defined as a continuous path from a ‘basal’ to a ‘top’ species)</td>
<td>Reflects energy transfer efficiency. Longer chains may be more prone to top-down trophic cascades.</td>
<td></td>
</tr>
<tr>
<td>ChSD</td>
<td>Standard deviation of ChLen</td>
<td>High SD indicates a mix of energy pathways, which can buffer the system.</td>
<td></td>
</tr>
<tr>
<td>ChNum</td>
<td>Log of the number of food chains</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Clust</td>
<td>Mean clustering coefficient (probability that two taxa linked to the same taxon are also linked)</td>
<td>Quantifies local redundancy; high clustering can buffer the network against the loss of specific interaction pathways.</td>
<td><p><strong>TODO</strong></p>
<p><span class="citation" data-cites="wattsCollectiveDynamicsSmallworld1998">@wattsCollectiveDynamicsSmallworld1998</span></p></td>
</tr>
<tr>
<td>GenSD</td>
<td>Normalized standard deviation of generality of a species standardized by <span class="math inline">\(L/S\)</span></td>
<td>Interaction asymmetry. High variance in how links are distributed often points to the presence of ‘hubs’ (highly connected species), which makes the network robust to random loss but vulnerable to targeted ‘keystone’ removal.</td>
<td><span class="citation" data-cites="williamsSuccessItsLimits2008a">@williamsSuccessItsLimits2008a</span></td>
</tr>
<tr>
<td>Herbivore</td>
<td>Percentage of herbivores plus detritivores (taxa that feed only on basal taxa)</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Intermediate</td>
<td>Percentage of intermediate taxa (with both consumers and resources)</td>
<td></td>
<td></td>
</tr>
<tr>
<td>LinkSD</td>
<td>Normalized standard deviation of links (number of consumers plus resources per taxon)</td>
<td>Interaction asymmetry. High variance in how links are distributed often points to the presence of ‘hubs’ (highly connected species), which makes the network robust to random loss but vulnerable to targeted ‘keystone’ removal.</td>
<td></td>
</tr>
<tr>
<td>Loop</td>
<td>Percentage of taxa in loops (food chains in which a taxon occurs twice)</td>
<td>High percentages of loops can lead to feedback cycles (positive or negative) that either amplify or dampen oscillations, directly impacting local stability.</td>
<td></td>
</tr>
<tr>
<td>L/S</td>
<td>Links per species</td>
<td></td>
<td></td>
</tr>
<tr>
<td>MaxSim</td>
<td>Mean of the maximum trophic similarity of each taxon to other taxa, where trophic similarity is the number of predators and prey shared by a pair of species divided by their total number of predators and prey</td>
<td>Indicates functional redundancy; high similarity suggests species are replaceable, increasing robustness to individual extinctions.</td>
<td><p><strong>TODO</strong></p>
<p><span class="citation" data-cites="yodzisSearchOperationalTrophospecies1999">@yodzisSearchOperationalTrophospecies1999</span></p></td>
</tr>
<tr>
<td>Omnivory</td>
<td>Percentage of omnivores (taxa that feed on <span class="math inline">\(\geq\)</span> 2 taxa with different trophic levels)</td>
<td>Links to coupling of energy channels; historically debated, but often found to stabilize food webs by diffusing top-down pressure.</td>
<td><span class="citation" data-cites="mccannDiversityStabilityDebate2000">@mccannDiversityStabilityDebate2000</span></td>
</tr>
<tr>
<td>Path</td>
<td>Characteristic path length, the mean shortest food chain length between species pairs</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Richness</td>
<td>Number of nodes in the network</td>
<td></td>
<td></td>
</tr>
<tr>
<td>TL</td>
<td>Prey-weighted trophic level averaged across taxa</td>
<td></td>
<td><span class="citation" data-cites="williamsLimitsTrophicLevels2004">@williamsLimitsTrophicLevels2004</span></td>
</tr>
<tr>
<td>Top</td>
<td>Percentage of top taxa (taxa without consumers)</td>
<td></td>
<td></td>
</tr>
<tr>
<td>VulSD</td>
<td>Normalized standard deviation of vulnerability of a species standardized by <span class="math inline">\(L/S\)</span></td>
<td>Interaction asymmetry. High variance in how links are distributed often points to the presence of ‘hubs’ (highly connected species), which makes the network robust to random loss but vulnerable to targeted ‘keystone’ removal.</td>
<td></td>
</tr>
<tr>
<td>Links</td>
<td>The number of links in the network</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Diameter</td>
<td>Mean of the distances between each pair of nodes in the network (an alternative to the usual definition of diameter as the longest shortest path between any two nodes)</td>
<td></td>
<td><span class="citation" data-cites="delmasAnalysingEcologicalNetworks2019">@delmasAnalysingEcologicalNetworks2019</span></td>
</tr>
<tr>
<td><span class="math inline">\(\rho\)</span></td>
<td>Spectral radius is a conceptual analog to nestedness. It is defined as the absolute value of the largest real part of the eigenvalues of the <em>undirected</em> adjacency matrix</td>
<td>Acts as a proxy for system-wide resilience; captures the speed at which a system returns to equilibrium after a small pulse perturbation.</td>
<td><span class="citation" data-cites="staniczenkoGhostNestednessEcological2013">@staniczenkoGhostNestednessEcological2013</span></td>
</tr>
<tr>
<td>Complexity</td>
<td>SVD complexity of a network, defined as the Pielou entropy of its singular values</td>
<td>Captures structural heterogeneity; distinguishes between a truly complex system and one that is merely large or ‘random’.</td>
<td><span class="citation" data-cites="strydomSVDEntropyReveals2021">@strydomSVDEntropyReveals2021</span></td>
</tr>
<tr>
<td>Centrality</td>
<td>Centrality is a measure of how ‘influential’ a species is, under various definitions of ‘influence’.</td>
<td>Centrality can help in quantifying the importance of species in a network</td>
<td><span class="citation" data-cites="estradaUsingNetworkCentrality2008">@estradaUsingNetworkCentrality2008</span></td>
</tr>
<tr>
<td>S1</td>
<td>Number of linear chains</td>
<td>Building blocks of stability (compartmentalisation, Stouffer and Bascompte?)</td>
<td><span class="citation" data-cites="stoufferEvidenceExistenceRobust2007">@stoufferEvidenceExistenceRobust2007</span> <span class="citation" data-cites="miloNetworkMotifsSimple2002">@miloNetworkMotifsSimple2002</span></td>
</tr>
<tr>
<td>S2</td>
<td>Number of omnivory motifs</td>
<td>Building blocks of stability (compartmentalisation, Stouffer and Bascompte?)</td>
<td><span class="citation" data-cites="stoufferEvidenceExistenceRobust2007">@stoufferEvidenceExistenceRobust2007</span> <span class="citation" data-cites="miloNetworkMotifsSimple2002">@miloNetworkMotifsSimple2002</span></td>
</tr>
<tr>
<td>S4</td>
<td>Number of apparent competition motifs</td>
<td>Building blocks of stability (compartmentalisation, Stouffer and Bascompte?)</td>
<td><span class="citation" data-cites="stoufferEvidenceExistenceRobust2007">@stoufferEvidenceExistenceRobust2007</span> <span class="citation" data-cites="miloNetworkMotifsSimple2002">@miloNetworkMotifsSimple2002</span></td>
</tr>
<tr>
<td>S5</td>
<td>Number of direct competition motifs</td>
<td>Building blocks of stability (compartmentalisation, Stouffer and Bascompte?)</td>
<td><span class="citation" data-cites="stoufferEvidenceExistenceRobust2007">@stoufferEvidenceExistenceRobust2007</span> <span class="citation" data-cites="miloNetworkMotifsSimple2002">@miloNetworkMotifsSimple2002</span></td>
</tr>
<tr>
<td>Intervality</td>
<td>The degree to which the prey in a food web can be ordered so that all species can be placed along a single dimension</td>
<td>Measures niche dimension; high intervality suggests a simpler organization where species feeding habits are constrained by a single trait (like body size).</td>
<td><span class="citation" data-cites="stoufferRobustMeasureFood2006a">@stoufferRobustMeasureFood2006a</span></td>
</tr>
<tr>
<td>Prey:predator</td>
<td>Ratio of prey (basal + intermediate) to predators (top + intermediate)</td>
<td>A measure of food web ‘shape’. Values &lt;1 imply an inverted structure and might indicate instability</td>
<td></td>
</tr>
<tr>
<td>Robustness</td>
<td>Minimum level of secondary extinction that occurs in response to a particular perturbation</td>
<td></td>
<td><span class="citation" data-cites="jonssonReliabilityR50Measure2015">@jonssonReliabilityR50Measure2015</span></td>
</tr>
</tbody>
</table>
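Several of the definitions in the table can be made concrete with a small worked example. The Python sketch below computes Connectance, L/S, the Basal, Top and Intermediate percentages, Prey:predator, GenSD, VulSD and MaxSim for a hypothetical five-taxon food web. The web, the variable names, and the use of the sample standard deviation for GenSD and VulSD are assumptions made for illustration; this is not the code used to produce the results in this study.

```python
from statistics import stdev

# Hypothetical food web, one (resource, consumer) pair per link.
links = {
    ("plant", "herbivore"), ("algae", "herbivore"),
    ("plant", "omnivore"), ("herbivore", "omnivore"),
    ("herbivore", "top"), ("omnivore", "top"),
}
taxa = sorted({taxon for link in links for taxon in link})
S, L = len(taxa), len(links)

# Resources (prey) and consumers (predators) of every taxon.
prey = {t: {r for r, c in links if c == t} for t in taxa}
predators = {t: {c for r, c in links if r == t} for t in taxa}

connectance = L / S**2
links_per_species = L / S

# Basal taxa have no resources, top taxa have no consumers.
basal = [t for t in taxa if not prey[t]]
top = [t for t in taxa if not predators[t]]
intermediate = [t for t in taxa if prey[t] and predators[t]]


def percentage(group):
    return 100 * len(group) / S


prey_predator_ratio = (len(basal) + len(intermediate)) / (len(top) + len(intermediate))

# Generality = number of resources, vulnerability = number of consumers,
# each standardized by L/S before taking the (sample) standard deviation.
gen_sd = stdev([len(prey[t]) / links_per_species for t in taxa])
vul_sd = stdev([len(predators[t]) / links_per_species for t in taxa])


def trophic_similarity(a, b):
    """Shared predators and prey of a and b over their total predators and prey."""
    set_a = {("prey", x) for x in prey[a]} | {("pred", x) for x in predators[a]}
    set_b = {("prey", x) for x in prey[b]} | {("pred", x) for x in predators[b]}
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


# MaxSim: mean over taxa of the highest similarity to any other taxon.
max_sim = sum(
    max(trophic_similarity(a, b) for b in taxa if b != a) for a in taxa
) / S

print(f"Connectance={connectance:.2f}, L/S={links_per_species:.2f}")
print(f"Basal={percentage(basal):.0f}%, Top={percentage(top):.0f}%, "
      f"Intermediate={percentage(intermediate):.0f}%")
print(f"Prey:predator={prey_predator_ratio:.2f}, MaxSim={max_sim:.2f}")
```

Storing the web as a set of (resource, consumer) pairs keeps the direction of each link explicit, which matters because generality and vulnerability are easy to swap by accident.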

```r
library(knitr)
library(tidyverse)
```


  Property        PCA 1 (30%)   PCA 2 (20%)   PCA 3 (17%)
  --------------- ------------- ------------- -------------
  richness        0.3           **0.89**      -0.16
  links           0.62          **0.72**      0.04
  connectance     0.52          -0.62         0.49
  diameter        **0.74**      0.38          -0.3
  complexity      -0.52         0.09          -0.49
  distance        0             0.3           0.18
  basal           -0.47         0.29          **0.75**
  top             -0.58         0.2           -0.24
  intermediate    **0.69**      -0.35         -0.52
  predpreyRatio   -0.26         0.27          **0.76**
  herbivory       -0.54         0.22          0.07
  omnivory        **0.78**      -0.23         -0.21
  cannibal        **0.72**      0.07          0.31
  l_S             **0.83**      0.47          0.23
  GenSD           -0.4          0.58          0.45
  VulSD           -0.41         0.58          -0.26
  TL              0.52          -0.24         **-0.77**
  ChLen           0.51          -0.41         -0.62
  ChSD            0.32          0.2           -0.45
  ChNum           -0.2          **0.8**       -0.3
  path            0.26          0.4           -0.26
  LinkSD          -0.27         **0.74**      -0.23
  S1              **0.9**       0.03          0.03
  S2              **0.84**      -0.07         0.36
  S4              0.61          0.49          0.28
  S5              **0.67**      0.39          0.49
  ρ               0.57          -0.43         0.48
  centrality      -0.24         **-0.67**     0.18
  loops           **0.8**       0.32          0.12
  robustness      0.05          -0.05         0.66
  intervals       0.45          **0.7**       -0.05
  MaxSim          -0.03         -0.17         0.6
  Clust           **0.69**      -0.33         0.06


<figure>
<img src="attachment:figures/pca_vermaat.png" alt="Vermaat networks only" />
<figcaption aria-hidden="true">Vermaat networks only</figcaption>
</figure>

<figure>
<img src="attachment:figures/pca_allNetworks.png" alt="All networks. Vermaat subset = using only the structural measures from Vermaat" />
<figcaption aria-hidden="true">All networks. Vermaat subset = using only the structural measures from Vermaat</figcaption>
</figure>
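The loadings table that precedes the figures can be summarized programmatically. The Python sketch below parses a handful of rows copied from that table, finds the axis on which each property loads most heavily, and flags loadings at or above a cutoff. The cutoff of 0.67 is an assumption inferred from which values are bolded in the table, as the text does not state it, and the function and variable names are illustrative.

```python
# A few rows copied from the loadings table (bold markers left in place).
rows = """
richness      0.3       **0.89**   -0.16
connectance   0.52      -0.62      0.49
TL            0.52      -0.24      **-0.77**
S1            **0.9**   0.03       0.03
robustness    0.05      -0.05      0.66
"""

CUTOFF = 0.67  # assumed: inferred from the bolding pattern, not stated in the text


def parse_loadings(text):
    """Map each property to its list of loadings, dropping the ** markers."""
    table = {}
    for line in text.strip().splitlines():
        name, *values = line.split()
        table[name] = [float(v.strip("*")) for v in values]
    return table


def dominant_axis(values):
    """Return (1-based axis number, loading) for the largest absolute loading."""
    index = max(range(len(values)), key=lambda i: abs(values[i]))
    return index + 1, values[index]


loadings = parse_loadings(rows)
summary = {}
for name, values in loadings.items():
    axis, loading = dominant_axis(values)
    summary[name] = (axis, loading, abs(loading) >= CUTOFF)

for name, (axis, loading, strong) in summary.items():
    flag = "strong" if strong else "weak"
    print(f"{name:12s} PCA {axis}  {loading:+.2f}  ({flag})")
```

Properties such as connectance and robustness have no loading that reaches the assumed cutoff on any of the three axes, so they are not well represented by a single axis under this rule.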

# References