STEP Model
STEP (Short-Term Earthquake Probability) is the model that turned aftershock statistics into a map. Where Reasenberg–Jones answers "how many aftershocks, and how likely is a large one" with no sense of place, STEP answers "how likely is strong shaking here, in the next interval" — a near-real-time, gridded probability surface updated as the seismicity evolves. It is the canonical production template for "tomorrow's earthquakes" probabilistic maps, and therefore the closest published analogue of this product's daily gridded output. This page is a deep, self-contained treatment of that one model: intuition and history, the governing equations and how they assemble into a map, parameter estimation, assumptions, strengths and limitations, its operational role, a worked sketch, a structural diagram, and references.
Honest framing. STEP produces a bounded, calibrated probability surface, never a deterministic prediction and never an alarm. Each cell carries a probability strictly between 0 and 1.
- Intuition and history
- What STEP computes
- The three rate components
- Blending the components per cell
- From rate to shaking probability (the GMPE step)
- Parameter estimation
- Assumptions and failure modes
- Strengths and limitations
- Role in operational earthquake forecasting
- Worked sketch of a single cell
- How STEP informs this product
- References
By the early 2000s the Reasenberg–Jones model gave reliable aggregate aftershock probabilities, and seismic networks in California were dense enough to locate small earthquakes within minutes. The missing piece was space: a forecast that says "a damaging aftershock is 100× more likely near the rupture than 50 km away" is far more actionable than a single regional number.
STEP, developed by Matthew Gerstenberger, Stefan Wiemer, Lucile Jones and Paul Reasenberg (USGS + SCEC + ETH Zürich), was the first system to deliver this operationally. Published in Nature in 2005 under the title "Real-time forecasts of tomorrow's earthquakes in California", it produced hourly maps of the probability of experiencing strong shaking (Modified Mercalli intensity $\ge$ VI).
Two design choices made STEP both useful and honest:
- It separates a slowly-varying background rate from a fast, event-driven clustering rate, so the map is dominated by the background in quiet times and lights up around fresh sequences.
- It converts the seismicity rate into a shaking probability via a ground-motion model — the quantity people actually care about — rather than reporting an abstract event rate.
STEP's architecture was later adopted, in spirit, by OEF-Italy, where a STEP-type component sits in an ensemble with ETAS and ETES.
Why "tomorrow's earthquakes" is the right framing. STEP never claims to know which fault will move or when the next event strikes. It quantifies, cell by cell, the elevated probability that the current state of seismicity confers on the next interval — which is exactly the honest thing a short-term forecast can say.
STEP's output is a grid of cells; for each cell $s_j$ it reports two quantities:
- A conditional earthquake rate $\lambda(s_j, t)$: the expected rate of events (above a reference magnitude, with a Gutenberg–Richter magnitude distribution) in that cell, given the current catalog.
- A probability of strong shaking: the probability that ground motion in that cell exceeds a chosen intensity threshold (originally MMI $\ge$ VI) during the interval, obtained by pushing the rate through a ground-motion prediction equation (GMPE).
The pipeline is therefore: catalog → per-cell rate (background + clustering) → magnitude distribution → ground-motion conversion → exceedance probability map.
```mermaid
flowchart TD
    CAT["Real-time catalog<br/>(located events)"] --> BG["Background rate<br/>μ(s): smoothed, time-independent"]
    CAT --> CL["Clustering rate<br/>Σ Reasenberg–Jones contributions<br/>of recent events"]
    BG --> SUM["Per-cell conditional rate<br/>λ(s,t) = μ(s) + Σ R–J"]
    CL --> SUM
    SUM --> GR["Gutenberg–Richter<br/>magnitude distribution"]
    GR --> GMPE["GMPE / ground-motion model<br/>P(shaking ≥ threshold)"]
    GMPE --> MAP["Gridded shaking-probability map<br/>(updated each interval)"]
```
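The clustering rate in STEP is built entirely on the Reasenberg–Jones temporal–magnitude law

$$\lambda(t, M) = 10^{\,a + b\,(M_m - M)}\,(t + c)^{-p},$$

the rate of aftershocks of magnitude $\ge M$ at time $t$ after a mainshock of magnitude $M_m$, but STEP estimates the parameters three ways simultaneously and keeps all three: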
| Component | How parameters are obtained | When it is most informative |
|---|---|---|
| Generic | Fixed regional values of $(a, b, c, p)$. | The instant a new mainshock is located, before its aftershocks accumulate. |
| Sequence-specific | Parameters fitted to the aftershocks of the ongoing sequence. | Once a productive sequence has enough aftershocks to constrain the fit. |
| Spatially-varying | Parameters estimated per grid cell, capturing along-rupture variation in productivity and decay. | Where the aftershock distribution is spatially non-uniform (most large ruptures). |
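Each component yields its own estimate of the clustering rate at a cell. The background rate $\mu(s)$ is smoothed and time-independent, and is added to whichever clustering estimate is selected.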
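A minimal Python sketch of the Reasenberg–Jones rate that all three components rest on, assuming the standard form $\lambda(t, M) = 10^{a + b(M_m - M)}(t + c)^{-p}$. The function names are illustrative, and the parameter values are the generic ones quoted in the worked sketch further down this page, not a recommendation for any region.

```python
import math

def rj_rate(t, mag_min, mainshock_mag, a, b, c, p):
    """Rate (events/day) of aftershocks with magnitude >= mag_min, t days after the mainshock."""
    return 10 ** (a + b * (mainshock_mag - mag_min)) * (t + c) ** (-p)

def rj_expected_count(t1, t2, mag_min, mainshock_mag, a, b, c, p):
    """Expected number of events >= mag_min in [t1, t2] days (closed-form time integral)."""
    productivity = 10 ** (a + b * (mainshock_mag - mag_min))
    if math.isclose(p, 1.0):
        # The p = 1 limit of the Omori-Utsu integral is logarithmic.
        time_integral = math.log((t2 + c) / (t1 + c))
    else:
        time_integral = ((t2 + c) ** (1 - p) - (t1 + c) ** (1 - p)) / (1 - p)
    return productivity * time_integral

# Generic parameter set taken from the worked sketch on this page.
GENERIC = dict(a=-1.7, b=0.9, c=0.05, p=1.1)

# Expected M >= 4.0 aftershocks in the first day after an M 6.0 mainshock,
# and the Poisson probability of at least one such event.
n_first_day = rj_expected_count(0.0, 1.0, 4.0, 6.0, **GENERIC)
p_at_least_one = 1.0 - math.exp(-n_first_day)
print(f"expected count: {n_first_day:.2f}, P(at least one): {p_at_least_one:.2f}")
```

The generic, sequence-specific and spatially-varying components differ only in which parameter values are passed in, so one rate function can serve all three.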
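The key operational idea is that STEP does not commit to one parameterization. For each cell it selects the most informative of the three clustering estimates available there, and adds the background:

$$\lambda(s_j, t) = \mu(s_j) + \lambda_{\text{cluster}}(s_j, t),$$

where $\lambda_{\text{cluster}}(s_j, t)$ is the selected clustering estimate for cell $s_j$ and $\mu(s_j)$ is its background rate.

In quiet periods the clustering term is negligible and the map is dominated by the background; around a fresh sequence the clustering term takes over.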
```mermaid
flowchart LR
    SV["Spatially-varying<br/>per-cell fit"] -->|"enough local data?"| PICK{Select most<br/>informative}
    SS["Sequence-specific<br/>fit"] --> PICK
    GEN["Generic<br/>regional"] --> PICK
    PICK --> CL["λ_cluster(s,t)"]
    MU["Background μ(s)"] --> ADD["λ(s,t) = μ(s) + λ_cluster(s,t)"]
    CL --> ADD
```
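The per-cell selection can be sketched as below. This page says only that STEP picks the most informative estimate, so the event-count thresholds here are hypothetical stand-ins for that test, and the class and function names are illustrative rather than taken from any STEP code.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical thresholds: minimum aftershock counts before a fitted component
# is trusted. Illustrative only, not STEP's published selection criterion.
MIN_EVENTS_SEQUENCE = 100
MIN_EVENTS_CELL = 30

@dataclass
class CellEstimates:
    background: float                          # mu(s), events/day
    generic: float                             # available as soon as a mainshock is located
    sequence_specific: Optional[float] = None  # None until a sequence fit exists
    spatially_varying: Optional[float] = None  # None until a per-cell fit exists
    n_sequence_events: int = 0
    n_local_events: int = 0

def select_cluster_rate(cell: CellEstimates) -> float:
    """Return the most data-driven clustering estimate the cell can support."""
    if cell.spatially_varying is not None and cell.n_local_events >= MIN_EVENTS_CELL:
        return cell.spatially_varying
    if cell.sequence_specific is not None and cell.n_sequence_events >= MIN_EVENTS_SEQUENCE:
        return cell.sequence_specific
    return cell.generic

def total_rate(cell: CellEstimates) -> float:
    """lambda(s, t) = mu(s) + lambda_cluster(s, t)."""
    return cell.background + select_cluster_rate(cell)

# A cell minutes after the mainshock: only the generic estimate exists.
fresh = CellEstimates(background=1e-4, generic=0.02)
# A cell in a mature sequence with too few local events for a per-cell fit.
mature = CellEstimates(background=1e-4, generic=0.02, sequence_specific=0.035,
                       spatially_varying=0.05, n_sequence_events=400, n_local_events=12)
print(total_rate(fresh), total_rate(mature))
```

Falling back from per-cell to sequence-wide to generic parameters is what lets the map degrade sensibly where data are sparse.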
STEP reports shaking probability, not event rate, because shaking is what damages buildings and what civil protection acts on. Two conversions are layered on top of the rate.
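The cell rate $\lambda(s_j, t)$ is first distributed over magnitude with the Gutenberg–Richter relation, so that each magnitude bin carries its share of the expected events.

For an event of magnitude $M$, the GMPE gives the probability that shaking in the cell exceeds the chosen threshold. Weighting the magnitude-distributed rate by that probability and integrating over the interval gives the expected number of exceedances, $\Lambda_{\ge \text{VI}}$, which becomes a probability through the same Poisson form used for event counts: $P = 1 - e^{-\Lambda_{\ge \text{VI}}}$.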
Why this matters. The GMPE step is what makes STEP a hazard product rather than a seismicity product. It also means STEP inherits the GMPE's uncertainty — an honest map carries that scatter through to the published probability.
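The two conversions can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: the intensity relation `toy_intensity` and its coefficients are invented and are not a published GMPE, the scatter `sigma` is arbitrary, and all events are placed at a single source-to-site distance, whereas a real grid would sum contributions from many source cells.

```python
import math

def toy_intensity(mag, dist_km):
    """Hypothetical median MMI at dist_km from a magnitude-mag event (not a real GMPE)."""
    return 1.5 * mag - 3.0 * math.log10(dist_km + 10.0) + 2.0

def prob_exceed(mag, dist_km, threshold=6.0, sigma=0.7):
    """P(intensity >= threshold | mag, dist), assuming normally distributed scatter."""
    z = (threshold - toy_intensity(mag, dist_km)) / sigma
    return 0.5 * math.erfc(z / math.sqrt(2.0))  # normal survival function

def shaking_probability(n_events, dist_km, b=0.9, m_min=4.0, m_max=8.0, dm=0.1):
    """Probability of at least one threshold exceedance in the interval.

    n_events is the expected number of events >= m_min in the interval.
    """
    n_bins = round((m_max - m_min) / dm)
    exceed_count = 0.0
    for i in range(n_bins):
        lo = m_min + i * dm
        hi = lo + dm
        # Gutenberg-Richter share of events falling in [lo, hi).
        frac = 10 ** (-b * (lo - m_min)) - 10 ** (-b * (hi - m_min))
        exceed_count += n_events * frac * prob_exceed(lo + dm / 2.0, dist_km)
    return 1.0 - math.exp(-exceed_count)

for dist in (10.0, 50.0):
    print(dist, shaking_probability(0.02, dist))
```

Because the scatter term enters through `prob_exceed`, changing `sigma` changes the published probability, which is the sense in which the map inherits the ground-motion model's uncertainty.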
- Clustering parameters $(a, c, p)$: Omori–Utsu maximum likelihood (Ogata, 1983) on the ongoing sequence for the sequence-specific and spatially-varying components; regional priors for the generic component. See Reasenberg-Jones-Model §6.
- $b$-value: Aki–Utsu MLE above the completeness magnitude $M_c$, never hard-coded; see Models-Classical §1.
- Background $\mu(s)$: adaptive-kernel smoothed seismicity on a declustered catalog (Models-Classical §7).
- GMPE: a region-appropriate ground-motion model with its aleatory scatter; the choice of GMPE is itself a modelling decision that must match the tectonic setting.
- Early incompleteness: as with R–J, the minutes-to-hours after a large mainshock are incomplete; the fit must start past the time-of-completeness or model $M_c(t)$, or productivity is underestimated exactly when the map matters most.
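As one concrete instance of the estimators listed above, here is a minimal sketch of the Aki–Utsu maximum-likelihood $b$-value, assuming magnitudes binned at width `dm` and a completeness magnitude `m_c` that has already been determined. The function name and the sample magnitudes are illustrative, and the uncertainty is Aki's first-order $b/\sqrt{N}$.

```python
import math

def aki_utsu_b(mags, m_c, dm=0.1):
    """Aki-Utsu maximum-likelihood b-value for magnitudes binned at width dm.

    Only events at or above the completeness magnitude m_c are used.
    Returns (b, sigma_b) with sigma_b = b / sqrt(N).
    """
    complete = [m for m in mags if m >= m_c - 1e-9]
    if len(complete) < 2:
        raise ValueError("need at least two events at or above M_c")
    mean_mag = sum(complete) / len(complete)
    # Utsu's half-bin correction shifts the lower bound to m_c - dm / 2.
    b = math.log10(math.e) / (mean_mag - (m_c - dm / 2.0))
    sigma_b = b / math.sqrt(len(complete))
    return b, sigma_b

# Illustrative magnitudes; the 1.8 event lies below M_c = 2.0 and is ignored.
sample = [1.8, 2.0, 2.1, 2.0, 2.4, 3.1, 2.2, 2.0, 2.7, 2.3, 2.0]
b_hat, sigma_hat = aki_utsu_b(sample, m_c=2.0)
print(f"b = {b_hat:.2f} +/- {sigma_hat:.2f}")
```

With so few events the uncertainty is large, which is one reason a freshly started sequence leans on the generic parameters before switching to fitted ones.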
| Assumption | What breaks it |
|---|---|
| Clustering is aftershock-type, captured by Reasenberg–Jones. | Swarms, slow-slip-driven and induced sequences are not aftershock decay; STEP mis-fits them. |
| Background is stationary over the relevant horizon. | Transient background changes (fluid injection, volcanic unrest) violate it. |
| Non-homogeneous Poisson counts per cell. | Real counts are over-dispersed; secondary triggering is only partially captured. |
| GMPE transfers to the region and threshold. | A California-tuned GMPE does not describe subduction shaking; the wrong GMPE biases the whole map. |
| Generic parameters are regime-appropriate. | Reusing California generics outside California is invalid (the same pitfall as R–J). |
| Catalog is complete above $M_c$. | Early/near-source incompleteness suppresses the very cells that should light up. |
The signature failure mode is mainshock/foreshock anticipation: STEP is aftershock-dominated by design, so it has little skill at flagging the next large independent event before any clustering signal exists. It is excellent at mapping the elevated hazard after a sequence starts.
Strengths
- Gridded and spatial — the first operational map of short-term shaking probability; the output shape this product targets.
- Hazard-relevant — reports shaking, not abstract rate, via the GMPE step.
- Graceful data dependence — per-cell selection of generic / sequence-specific / spatial parameters means it degrades sensibly where data are sparse.
- Real-time and automatic — designed to update each interval with no human in the loop.
Limitations
- Aftershock-dominated — weak for anticipating independent mainshocks/foreshocks by design.
- No secondary triggering in the R–J core — large aftershocks' own sequences are under-modelled relative to ETAS.
- California-tuned origins — generic parameters and GMPE must be re-derived for any other region.
- GMPE-limited — map accuracy is capped by the ground-motion model's fidelity and scatter.
In direct CSEP-style comparison ETAS is the more complete generator, but STEP-type models often excel specifically during vigorous aftershock sequences, which is why operational ensembles keep both — see Evaluation-and-Tests.
STEP is a foundational operational system, not a paper model:
- It ran as a near-real-time California service producing hourly gridded shaking-probability maps — the first instance of operational earthquake forecasting in the modern sense.
- Its architecture (background + Reasenberg–Jones clustering → GMPE → gridded exceedance) is the template that this product's daily gridded map follows.
- OEF-Italy embeds a STEP-type component in a validated three-model ensemble (with ETAS and ETES) that runs as a scheduled civil-protection service and publishes calibrated probabilities with uncertainty; its long-term validation found the ensemble broadly reliable.
- Across a decade-long California experiment in which dozens of next-day models were scored with pyCSEP, the honest finding is that no single model dominates: STEP-type models shine during aftershock sequences while ETAS is the consistent generalist — the empirical justification for running an ensemble.
STEP's enduring contribution is conceptual as much as computational: it established that the right operational deliverable is a calibrated, gridded probability surface conditioned on present seismicity — never an alarm, never a countdown.
Illustrative only — not a forecast.
Consider a cell ~10 km from an $M$ 6.0 mainshock, over the one-day interval running from 2 to 3 days after it.
- Clustering rate (R–J core). Using generic $a = -1.7$, $b = 0.9$, $c = 0.05$ d, $p = 1.1$, the expected number of $\ge 4.0$ events in the whole sequence over that day is $10^{-1.7 + 0.9(6.0 - 4.0)} \int_2^3 (t + 0.05)^{-1.1}\, dt$. The magnitude term is $10^{0.1} \approx 1.26$; the day's time integral is $10\,(2.05^{-0.1} - 3.05^{-0.1}) \approx 0.36$, giving $\approx 0.46$ events $\ge 4.0$ in the sequence that day.
- Spatial share. The spatially-varying component assigns a fraction of that rate to this cell based on the local aftershock density, say 5 %, i.e. $\approx 0.023$ events $\ge 4.0$ in the cell that day.
- Add background. Add the (small) stationary background $\mu(s)$ for the cell.
- GMPE → shaking. Push the cell's magnitude-distributed rate through the GMPE to get the rate $\Lambda_{\ge \text{VI}}$ of MMI $\ge$ VI shaking, then $P = 1 - e^{-\Lambda_{\ge \text{VI}}}$.
The map is this calculation repeated over every cell, dominated by the cells hugging the rupture and fading to background away from it.
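The first two steps of the sketch can be checked numerically with a few lines of Python. The 5 % spatial share is the illustrative figure from the text, the variable names are hypothetical, and the background and GMPE steps are left out because the page gives no numbers for them.

```python
import math

# Generic parameters and scenario from the worked sketch.
a, b, c, p = -1.7, 0.9, 0.05, 1.1
mainshock_mag, mag_min = 6.0, 4.0
t1, t2 = 2.0, 3.0       # days after the mainshock
cell_share = 0.05       # illustrative fraction of the sequence assigned to this cell

# Magnitude (productivity) term: 10^(a + b (M_m - M)).
magnitude_term = 10 ** (a + b * (mainshock_mag - mag_min))

# Closed-form integral of (t + c)^(-p) from t1 to t2, valid for p != 1.
time_integral = ((t2 + c) ** (1 - p) - (t1 + c) ** (1 - p)) / (1 - p)

sequence_count = magnitude_term * time_integral   # events >= 4.0 in the whole sequence
cell_count = cell_share * sequence_count          # events >= 4.0 in this cell
p_event_in_cell = 1.0 - math.exp(-cell_count)     # P(at least one such event in the cell)

print(f"magnitude term : {magnitude_term:.2f}")
print(f"time integral  : {time_integral:.2f}")
print(f"sequence count : {sequence_count:.2f}")
print(f"cell count     : {cell_count:.3f}")
print(f"P(event in cell): {p_event_in_cell:.3f}")
```

Note that `p_event_in_cell` is the probability of an $M \ge 4.0$ event in the cell, not of MMI $\ge$ VI shaking; the latter still requires the ground-motion step.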
Read it honestly. A cell showing "2 % chance of MMI ≥ VI today" is a probability, scored over many cells and many days (see Evaluation-and-Tests) — not a promise about that one cell on that one day.
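A small simulation makes the "many cells, many days" reading concrete. It assumes a perfectly calibrated 2 % forecast and independent cell-days, both idealisations, and it uses the Brier score purely as a generic illustration; the scoring actually used is described in Evaluation-and-Tests. Function names are hypothetical.

```python
import random

def observed_frequency(p_forecast, n_cell_days, seed=0):
    """Fraction of simulated cell-days on which the forecast event occurs."""
    rng = random.Random(seed)
    hits = sum(rng.random() < p_forecast for _ in range(n_cell_days))
    return hits / n_cell_days

def brier_score(forecasts, outcomes):
    """Mean squared difference between forecast probabilities and 0/1 outcomes."""
    return sum((f - o) ** 2 for f, o in zip(forecasts, outcomes)) / len(forecasts)

# Over many cell-days the hit rate settles near the stated probability...
print(observed_frequency(0.02, 100_000))

# ...while any single cell-day either shakes or does not, and neither outcome
# confirms or refutes the 2 % figure on its own.
print(brier_score([0.02], [0]), brier_score([0.02], [1]))
```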
CAOS_SEISMIC adopts STEP's output shape and architecture, not its California parameters:
- The product emits a gridded, calibrated probability surface per cycle, exactly STEP's deliverable form — see Models-Employed, Pipeline and Technical-Architecture.
- The internal rate is assembled as background + time-dependent clustering, with the clustering term supplied by ETAS (more complete than R–J's single-trigger core) and cross-checked by Reasenberg–Jones.
- The magnitude tail comes from a rolling Gutenberg–Richter fit, and the per-cell exceedance probability uses the same $1 - e^{-N}$ machinery STEP pioneered (Methodology-History).
- All parameters and any ground-motion conversion are derived for the operating tectonic regime; California generics and California GMPEs are never reused unchanged.
- Every map cell is CSEP-scored (Evaluation-and-Tests) and the surface is published as a bounded probability, never an alarm — see Honest-Limits.
- Gerstenberger, M. C., Wiemer, S., Jones, L. M. & Reasenberg, P. A. (2005). Real-time forecasts of tomorrow's earthquakes in California. Nature 435, 328–331. doi:10.1038/nature03622
- Reasenberg, P. A. & Jones, L. M. (1989). Earthquake hazard after a mainshock in California. Science 243(4895), 1173–1176. doi:10.1126/science.243.4895.1173
- Page, M. T., van der Elst, N., Hardebeck, J., Felzer, K. & Michael, A. J. (2016). Three ingredients for improved global aftershock forecasts. Bulletin of the Seismological Society of America 106(5), 2290–2301. doi:10.1785/0120160073
- Ogata, Y. (1983). Estimation of the parameters in the modified Omori formula for aftershock frequencies by the maximum likelihood procedure. Journal of Physics of the Earth 31(2), 115–124. doi:10.4294/jpe1952.31.115
- Helmstetter, A., Kagan, Y. Y. & Jackson, D. D. (2007). High-resolution time-independent grid-based forecast for M ≥ 5 earthquakes in California. Seismological Research Letters 78(1), 78–86. doi:10.1785/gssrl.78.1.78
- Schorlemmer, D., Gerstenberger, M. C., Wiemer, S., Jackson, D. D. & Rhoades, D. A. (2007). Earthquake likelihood model testing. Seismological Research Letters 78(1), 17–29. doi:10.1785/gssrl.78.1.17
- Jordan, T. H., Chen, Y.-T., Gasparini, P., Madariaga, R., Main, I., Marzocchi, W., Papadopoulos, G., Sobolev, G., Yamaoka, K. & Zschau, J. (2011). Operational earthquake forecasting: state of knowledge and guidelines for utilization. Annals of Geophysics 54(4), 315–391. doi:10.4401/ag-5350
Related pages: Models-Classical · Reasenberg-Jones-Model · EEPAS-Model · Models-Employed · Methodology-History · Pipeline · Evaluation-and-Tests · Honest-Limits · Glossary.
⚠️ Disclaimer: read this. CAOS_SEISMIC produces probabilistic forecasts, not predictions. It is an independent research and education tool. It is NOT an official earthquake early-warning or civil-protection system, it does NOT predict when, where, or how large an earthquake will be, and it must NOT be used for life-safety, emergency, or evacuation decisions. Every number it publishes is a bounded, calibrated probability conditioned on the present state of seismicity, never an alarm, a countdown, or a "safe" state. A single outcome neither confirms nor refutes a probabilistic forecast. It complements, and does not replace or speak for, official agencies: always follow your national seismological and civil-protection authorities (e.g. USGS, INGV, CSN in Chile (with SENAPRED for civil protection), GeoNet, JMA). The software is provided "as is", without warranty of any kind (MIT License); the authors accept no liability for its use. Data are courtesy of their providers (USGS/ANSS, ISC/ISC-GEM, Global CMT, EMSC, CSN, and others) under their respective licenses and attribution terms. See Honest-Limits for the full epistemic context.
CAOS_SEISMIC · seismic.fasl-work.com · source · MIT
Conditional probabilistic seismic forecasting — forecasts, never predictions.