-
Notifications
You must be signed in to change notification settings - Fork 0
Glossary
A precise, sourced definitions reference for every domain term used across CAOS_SEISMIC. Terms are grouped by theme; within each group they read in a natural conceptual order. Equations are real LaTeX (GitHub renders MathJax); references are canonical, peer-reviewed, with DOIs. For the models that use these terms in context see Models-Classical, Models-ML and Models-Employed; for the tests see Evaluation-and-Tests.
Conventions used throughout: magnitudes are homogenized to moment magnitude
- Core framing
- Magnitude, completeness, and the size distribution
- Point processes and triggering
- Classical and operational models
- Machine-learning approaches
- Declustering and the dual catalog
- Evaluation: CSEP, tests, and scoring
- Calibration and uncertainty
- Tidal triggering and stress
- Geometry, regimes, and tiling
- Product and UI terms
- Data and catalogs
The field's organizing distinction (ICEF; Jordan et al., 2011). A prediction is a
deterministic statement that an event will or will not occur in a given region, time window
and magnitude range. A forecast assigns a probability strictly between 0 and 1 to such an
event. CAOS_SEISMIC is strictly a forecaster: every number it publishes is a probability in
The authoritative, deployed practice of issuing time-dependent, conditional earthquake probabilities as a scheduled service. OEF is what is achievable (as opposed to deterministic prediction). Real systems include the USGS Operational Aftershock Forecast and OEF-Italy (INGV). Jordan et al. (2011), doi:10.4401/ag-5350.
A probability of one or more qualifying events, conditioned on the present state of the system
(the recent catalog and covariates), scoped to a region × magnitude band × horizon. "Conditional"
means the number changes day to day as the observed history
The leading explanatory framework (not settled physics) for why deterministic prediction is effectively impossible: the crust sits near a critical state in which a small rupture may or may not cascade into a large one, governed by unmeasurably fine details. Presented as a leading hypothesis, alongside coexisting characteristic-earthquake / partial-predictability views. Bak, P., & Tang, C. (1989), Earthquakes as a self-organized critical phenomenon, JGR 94(B11), 15635–15637, doi:10.1029/JB094iB11p15635.
The empirical fact that short-term forecasting has a hard skill ceiling: the best operational models (ETAS family) beat a time-independent Poisson baseline by only a modest information gain, and mostly within aftershock-rich windows. Any claim beyond "modest, well-calibrated conditional probabilities" is overclaiming. Geller, R. J., Jackson, D. D., Kagan, Y. Y., & Mulargia, F. (1997), Earthquakes Cannot Be Predicted, Science 275(5306), 1616–1617, doi:10.1126/science.275.5306.1616.
A crucial honesty distinction. The relative gain over the long-term background during an active sequence can be large (one to three orders of magnitude); the absolute probability of a large event in the next day usually remains well under a few percent (< 1 %/day for great events). Every number is therefore shown next to its baseline so "elevated" and "still small" are visible at once.
The frequency–magnitude distribution of earthquakes is, to first order, a power law:
The slope
The lowest magnitude above which (essentially) all events in a catalog are reliably detected. A
model trained below
Two standard
A maximum-likelihood
Immediately after a large mainshock — exactly the highest-stakes, highest-traffic moment — the
catalog is grossly incomplete:
magType field. A wrong conversion shifts the entire GR
distribution and every rate forecast.
The upper magnitude bound that truncates the exceedance integral and sets the tail probability of the rare, high-impact events that dominate risk. It is specified per region from regional hazard models and carried as an explicit, documented assumption with sensitivity reported — never left undefined.
The single public number: the probability of at least one event above a target magnitude $M^$ in a
region over a horizon. For a non-homogeneous Poisson process with expected count $N_{\ge M^}$,
$$P(\ge 1\text{ event} \ge M^) = 1 - e^{-N_{\ge M^}}, \qquad
N_{\ge M^} = \iint \lambda(t,x,y\mid\mathcal H_t),\Phi(M^),dx,dy,dt, \quad
\Phi(M^) = 10^{-b(M^ - M_c)}.$$
The formula
A stochastic model for events (points) scattered in time (and space and magnitude). A marked spatio-temporal point process attaches a magnitude (the "mark") to each space–time point. All models in this project are special cases of a marked point process.
The instantaneous expected rate of events at time
The set of all events (times, locations, magnitudes) observed strictly before
A point process in which each event increases the intensity of future events — a stationary background plus the summed, decaying contribution of all past events. ETAS is the seismology-specific Hawkes process. Hawkes (1971); for ETAS see Ogata (1988).
The empirical power-law decay of aftershock rate after a mainshock:
The scaling of the number of direct offspring with mainshock magnitude,
The expected number of direct offspring (triggered events) per event in an ETAS / Hawkes process. It controls stability — see subcriticality.
ETAS stability requires two logically separate conditions: (1) finite branching — the
productivity × magnitude integral converges only if
The stationary, time-independent part of the intensity — where earthquakes occur independently of recent triggering. Estimated by smoothing a declustered catalog (see smoothed seismicity), it both feeds ETAS and serves as the floor for cold-start cells.
The de-facto operational baseline: a self-exciting Hawkes point process combining a stationary
background, Utsu productivity, the Omori–Utsu time kernel, and a spatial kernel into one conditional
intensity,
The inverse-power spatial decay used in space–time ETAS,
The most transparent operational aftershock model: a GR magnitude term times a modified-Omori time
decay,
The production reference for the output shape: it wraps Reasenberg–Jones clustering plus a background term into gridded shaking-probability maps — i.e. a "one-inference-per-short-interval" probabilistic regional map, exactly the form of this product. Gerstenberger, M. C., Wiemer, S., Jones, L. M., & Reasenberg, P. A. (2005), Nature 435, 328–331, doi:10.1038/nature03622.
The USGS scheduled cloud service that monitors ComCat and issues calibrated aftershock probabilities (running both Reasenberg–Jones and ETAS), with the first forecast roughly 30 minutes after a significant event. A real, honest operational analogue. Page et al. (2016), doi:10.1785/0120160073.
INGV's operational forecasting system, an ensemble of three distinct models — ETAS + ETES + STEP — running as a true prospective service for ~10 years. Its validation found it broadly reliable, with a documented underestimation during the 2016–2017 Central Italy (Amatrice–Norcia) sequence caused by post-mainshock catalog incompleteness. Spassiani, I., Falcone, G., Murru, M., & Marzocchi, W. (2023), GJI 234(3), 2501–2518.
A medium-term (months–years) precursory-scaling model: each event of precursor magnitude
A stationary, time-independent estimate of where earthquakes occur, obtained by smoothing a
declustered catalog with an adaptive kernel (the Helmstetter–Kagan–Jackson adaptive power-law
kernel, bandwidth set by the distance to the
A long-term, time-dependent recurrence model used where paleoseismic data constrain a fault's mean
recurrence interval. The inter-event-time density is inverse-Gaussian,
The change in stress driving a fault toward or away from failure from a nearby slip event:
Dieterich's constitutive theory predicting how the seismicity rate responds to a stress step,
deriving the Omori-like
The time a rupture takes to nucleate. The decisive argument for why daily tides barely trigger
ordinary earthquakes: if nucleation (
The unifying language: a model of an event stream defined by its conditional intensity
A TPP whose intensity (or its components) is parameterized by a neural network (RNN, LSTM, attention/ transformer), learning event dynamics from data. Foundational architectures (validated on non-seismic streams — their log-likelihood wins do not automatically transfer to seismicity) include RMTPP (Du et al., 2016, doi:10.1145/2939672.2939875), the Neural Hawkes Process (Mei & Eisner, 2017, arXiv:1612.09328), Self-Attentive Hawkes (Zhang et al., 2020, arXiv:1907.07561), and the Transformer Hawkes Process (Zuo et al., 2020, PMLR v119).
A design principle for the project's gated neural challenger: keep the additive background + summed-triggering skeleton of a Hawkes/ETAS process, but replace the fixed kernels with small MLPs / attention, and model magnitude explicitly. This anchors the network to known physics rather than learning triggering from scratch (the FERN spirit).
A GRU-based (recurrent) encoder–decoder neural TPP for earthquakes. It improves on temporal ETAS
only when the training catalog is large (
An ETAS-generalizing neural encoder (MLPs replace the fixed kernels). The FERN+ variant (which
ingests sub-$M_c$ events) reports a 4–12 % information-gain-per-earthquake improvement and learns
fault-aligned anisotropy ~1000× faster than ETAS. Crucially, the authors note it was not
CSEP-tested, provided no uncertainty quantification, and its test period ended before the
2011 Tohoku
The decisive 2026 benchmark: five modern neural point processes (NSTPP, DeepSTPP, AutoSTPP, DSTPP, SMASH) on California 1971–2021 with strict chronological splits and CSEP consistency tests. None outperformed ETAS — the gap is largest on the spatial test (best NPP ≈ 68.6 % vs ETAS ≈ 92.0 % pass rate), exactly where forecasting value lives. It also repaired a data-leakage flaw in earlier work (non-chronological splits + excluding the Tohoku sequence inflate metrics). The honest conclusion: no pure ML / NPP has robustly beaten ETAS in prospective CSEP as of 2026. Stockman, S., Lawson, D., & Werner, M. J. (2026), TMLR, arXiv:2410.08226.
The canonical cautionary tale: a deep net (~13,451 parameters, AUC 0.85 for aftershock spatial pattern) was matched by a 2-parameter logistic regression ("one neuron") on a single physical feature. Root causes — massive over-parameterization vs ~199 effective mainshocks, a per-cell "computer-vision" framing inflating the apparent sample size, and AUC being the wrong metric for rate forecasting. DeVries, P. M. R., et al. (2018), Nature 560, 632–634, doi:10.1038/s41586-018-0438-y; rebuttal Mignan, A., & Broccardo, M. (2019), Nature 575, E1–E3, doi:10.1038/s41586-019-1582-8.
The hard line between two distinct tasks. ML waveform models (PhaseNet, EQTransformer, PhaseNO,
SeisBench, the SeisLM foundation model) are mature/production for phase-picking, detection,
association and characterization — they build better, more complete catalogs (lower, more stable
A non-default model kept behind a feature flag. The neural challenger reaches the public map only if it beats ETAS in the project's own prospective CSEP harness (positive IGPE, T-test CI excluding zero) and passes calibration (a release blocker). Otherwise ETAS remains what ships.
Separating "independent" mainshocks from triggered foreshocks/aftershocks. Methods: Gardner–Knopoff space–time windowing (transparent cross-check) and Zaliapin–Ben-Zion nearest-neighbour (primary).
The most common pipeline mistake, made explicit: feed the declustered catalog only to the
stationary Poisson / smoothed-seismicity background
A windowing declustering method with magnitude-dependent space and time windows. OpenQuake hmtk
coefficients:
A declustering and feature method based on the Baiesi–Paczuski nearest-neighbour proximity
The community-endorsed framework (grown out of RELM, California) for prospective earthquake forecast testing. Using its standard tests — not bespoke metrics — is what makes the product defensible. Schorlemmer, D., et al. (2007), SRL 78(1), 17–29, doi:10.1785/gssrl.78.1.17.
The community Python toolkit implementing every CSEP test (catalog access, both forecast representations, the grid tests and catalog tests). Using it means reviewers can dispute the model, not the test code. Savran, W. H., et al. (2022), SRL 93(5), 2858–2870, doi:10.1785/0220220033; docs.cseptesting.org.
One forecast representation: a Poisson expected count per space–magnitude cell, evaluated with the Poisson consistency tests. Directly comparable to the published CSEP California daily benchmark.
The other representation: an ensemble of
Regional seismicity has variance
A test of whether one model is calibrated (its forecast is statistically consistent with what
happened). Necessary but not sufficient for skill. The CSEP consistency tests are N / M / S / L /
CL, all built on the Poisson joint log-likelihood
Tests whether the total forecast count matches the observed count, via Poisson quantile scores
Tests whether the forecast magnitude distribution (the GR shape) matches the observed magnitudes;
quantile
Tests whether the forecast spatial distribution matches where events actually occurred; quantile
The L-test scores the joint pseudo-likelihood (quantile
A test of whether model A is better than model B — the only thing that establishes skill. Skill is claimed only by winning a comparison test against a real baseline (smoothed-seismicity and ETAS) with a confidence interval excluding zero.
The comparison metric, in nats:
Natural-logarithm units of information — the CSEP convention for information gain (a gain in
The paired tests on per-earthquake information gain. The T-test is the paired Student-t,
The catalog-based analogue of the L-test statistic,
A scoring rule that is optimized (in expectation) only by the true probability — it cannot be gamed
by miscalibration. The project reports the logarithmic score
A proper score for the bounded binary exceedance output,
A proper score for a full predictive distribution,
An alarm-style / ROC view (a communication aid, never a primary skill metric): miss rate
The normalized area above the Molchan trajectory: 1 = perfect, 0.5 = random, 0 = perfectly unskilled. Zechar, J. D., & Jordan, T. H. (2010), PAGEOPH 167, 893–906.
ROC plots hit rate against false-alarm rate; AUC is the area under it. AUC is banned as a primary forecasting metric: it is invariant to monotone rescaling, hence blind to the calibration of the very probabilities a forecast publishes, and on a rare per-cell-per-day task it degenerates into a region classifier (the DeVries trap). Shown only as a communication aid.
The credibility ordering: true prospective (gold) > pseudo-prospective (the primary back-analysis mode) > retrospective out-of-sample (weak) > in-sample (fit diagnostic only). Mizrahi, L., et al. (2024), Reviews of Geophysics 62, doi:10.1029/2023RG000823.
Back-analysis that mimics a true prospective run via the forecast clock: at each daily issue time the
model is handed only the catalog slice
The strict driver enforcing pseudo-prospective discipline: features come only from
Any way information from the future, or a retroactively improved catalog, contaminates a forecast and
inflates its apparent skill. The five engineered-against modes: temporal leakage,
catalog-revision leakage,
The immutable, versioned record — for each daily issue — of the exact catalog state, the
The property that forecast probabilities match observed frequencies ("when we said 5 %, it happened ~5 % of the time"). Calibration is a release blocker — an uncalibrated probability does not ship. The public probability is recalibrated (isotonic / Platt) and validated per horizon.
A plot of forecast probability against observed frequency; the diagonal is perfect calibration. It is the single most credibility-building artifact the product ships, and is validated specifically in the cold-start / quiet regime, which dominates the diagram.
Post-hoc monotone recalibration maps (isotonic regression / logistic Platt scaling) that adjust raw model probabilities so the reliability diagram lands on the diagonal.
A calibration diagnostic: under a well-calibrated forecast, the forecast CDF evaluated at the observation is uniformly distributed. Deviations from uniformity reveal miscalibration.
Epistemic uncertainty is reducible (parameter and model uncertainty); aleatory uncertainty is the irreducible randomness of the process. The published bounds must be a real decomposition of both — not a cosmetic Poisson interval.
The published triad. The bounds are sourced from ETAS parameter uncertainty (MLE covariance /
bootstrap / INLAbru posterior), propagated
The dominant regime (most of any map's area-and-time is quiet). The conditional rate floors to the principled smoothed-seismicity background, never an arbitrary per-day constant; strength is borrowed spatially via hierarchical / empirical-Bayes pooling. Three honest, visually distinct UI states: low-but-poorly-constrained (wide bounds), genuinely quiescent (tight bounds near baseline), and no data / out-of-coverage (explicit mask). Blank must never read as "safe."
The small, regime-dependent modulation of earthquake rate by solid-Earth and ocean tides. Tidal stresses (~0.1–10 kPa) are ~$10^{-3}$–$10^{-4}$ of earthquake stress drops (~1–10 MPa), so tides can only advance/retard a rupture already near failure, never cause one. A real but small correlation (~0.5–1 % global rate excess; up to factor ~2–3 only for shallow ocean-loaded thrusts). Useless as a standalone predictor; encoded as a regularized covariate that may shrink to ~0. Métivier, L., et al. (2009), EPSL 278, 370–375, doi:10.1016/j.epsl.2008.12.024; Cochran, E. S., Vidale, J. E., & Tanaka, S. (2004), Science 306, 1164–1166, doi:10.1126/science.1103961.
The scalar resolved onto a fault from the tidal stress tensor,
The dominant periodic components of the tide: semidiurnal M2 (12.421 h) and S2 (12.000 h), diurnal O1 (25.819 h) and K1 (23.934 h), and the long-period fortnightly Mf (~13.66–14.77 d). The Mf band, where the period approaches nucleation timescales, is where clean correlation is most expected.
The crustal stress from the weight of ocean tides. At coastal/subduction margins OTL dominates the
body tide — skipping it is the single biggest tidal-modeling error. Computed with SPOTL
(nloadf/hartid) and a global ocean-tide model (TPXO/GOT/FES); the body tide via pygtide (ETERNA
PREDICT).
The standard test for tidal phase selectivity: each event is a unit vector at its tidal phase angle;
the length of the vector sum gives a p-value against the null of random occurrence ("Schuster walk").
Caveat: the p-value depends on
Slow-earthquake phenomena that respond to the same ~kPa tidal stresses far more strongly than ordinary earthquakes — the strongest, least-disputed tidal signal. Tremor rate is exponential in tidal shear stress; fortnightly (Mf) modulation of tremor acts as an in-situ stressmeter. Kept as a separate channel from the fast-earthquake model where tremor/SSE catalogs exist. van der Elst, N. J., et al. (2016), PNAS 113, 8601–8606, doi:10.1073/pnas.1524316113; Rubinstein, J. L., et al. (2008), Science 319, 186–189, doi:10.1126/science.1150558.
A classification of the stress/faulting environment that conditions the global model. The project uses five regimes — subduction interface, intraslab, crustal / strike-slip, intraplate, and ridge — each with its own ETAS priors anchored on the USGS OAF tectonic-regime study. Page, M. T., et al. (2016), BSSA 106(5), 2290–2301, doi:10.1785/0120160073.
The unit of the tiled global forecaster: the globe is partitioned into tiles, each fit on its halo
events (a buffer that makes triggering edge-correct) while owning only its interior cells (for
aggregation). Per-tile fitting avoids the global
In the global scope, any country/region is a view into one global forecast field — not a separately trained model. This is what makes cross-country bias comparison (high- vs low-seismicity) a first-class evaluation goal.
The USGS global subduction-zone geometry model (depth-to-slab, dip, strike, interface distance), a primary enricher for subduction regimes. Hayes, G. P., et al. (2018), Science 362, 58–61, doi:10.1126/science.aat4723.
For great subduction earthquakes the isotropic point-source kernel of generic ETAS breaks down; fault-aligned (anisotropic) and finite-rupture triggering is needed. A key reason a subduction build must not reuse generic California parameters.
The default Monitoring view: a continuous expected-count/rate surface on a perceptually-uniform sequential colormap (viridis/magma-class — not a red traffic-light ramp), with a calibrated numeric legend. The honest object is "where the conditional rate is elevated relative to its own long-term baseline," never "alerts."
The stationary background rate against which every forecast is shown, both as a ratio
(
The forecast window: 1 day / 2 days / 7 days, re-issued daily, always visible in the legend. A probability with no horizon is meaningless. Long horizons have near-zero gain outside active sequences.
A compact always-present badge driven by the CSEP tests. The green/amber/red triad is reserved exclusively for model quality (green = within CSEP consistency, amber = borderline, red = rejected/under-tested) — the only place red appears, describing model quality, never earthquake danger.
Honest degradation signals. The coverage mask hatches regions outside the validated footprint (blank never means safe). The staleness banner shows "generated {UTC} · next run {UTC}" — with one inference per day, the user must know the data's age; a failed run degrades visibly rather than silently serving a stale artifact.
The single small (few hundred KB – few MB), gzipped, committed JSON file the daily job produces and the static viewer consumes: per-cell rates for {1d,2d,7d} × {P10,median,P90}, the baseline, a CSEP test summary + reliability points, the coverage mask, and provenance. The raw ~6.48 M-cell global grid is never shipped to the browser.
The deploy pattern: the daily job commits the compact artifact (scoped git add results/ — never
git add -A) and pushes to the public repo; the static site auto-rebuilds. No processing backend on
the request path.
The international standard web-service API (event / station / dataselect) for seismological
data, exposed by USGS, ISC, EarthScope/IRIS, EMSC, and regional networks. Accessed uniformly via
ObsPy's Client. Has a 20,000-event-per-request cap (tile larger queries) and updatedafter for
incremental deltas.
The USGS real-time, no-auth, daily-current global catalog — the daily inference spine. The
magType field is first-class (mixing magnitude types distorts the GR tail). US Government work,
public domain (non-US networks keep their own attribution).
The Global Instrumental Earthquake Catalogue (1904–2021,
The catalog of centroid moment tensors for ~all
The source of short-horizon skill (low, stable
Static / slow-moving geophysical covariates that are upside, not foundation (the catalog dominates skill), each gated on measured incremental information gain over a catalog-only ETAS: Slab2 (subduction geometry) > GEM Active Faults + Bird PB2002 plate model > GNSS strain (Nevada Geodetic Lab MIDAS, feeding the background term) > focal-mechanism stress. InSAR and heat flow are deferred.
The versioned record stamping every pipeline stage (source catalog versions, query params,
retrieved-at timestamps, row counts, checksums, conversion coefficients,
See also: Methodology-History · Models-Classical · Models-ML · Models-Employed · Data-Sources · Technical-Architecture · Pipeline · Evaluation-and-Tests · Honest-Limits · Changelog-and-Progress.
⚠️ Disclaimer — read this. CAOS_SEISMIC produces probabilistic forecasts, not predictions. It is an independent research and education tool. It is NOT an official earthquake early-warning or civil-protection system, it does NOT predict when, where, or how large an earthquake will be, and it must NOT be used for life-safety, emergency, or evacuation decisions. Every number it publishes is a bounded, calibrated probability conditioned on the present state of seismicity — never an alarm, a countdown, or a "safe" state. A single outcome neither confirms nor refutes a probabilistic forecast.It complements, and does not replace or speak for, official agencies — always follow your national seismological and civil-protection authorities (e.g. USGS, INGV, CSN (Chile, SENAPRED for civil protection), GeoNet, JMA). The software is provided "as is", without warranty of any kind (MIT License); the authors accept no liability for its use. Data are courtesy of their providers (USGS/ANSS, ISC/ISC-GEM, Global CMT, EMSC, CSN, and others) under their respective licenses and attribution terms. See Honest-Limits for the full epistemic context.
CAOS_SEISMIC · seismic.fasl-work.com · source · MIT
Conditional probabilistic seismic forecasting — forecasts, never predictions.
Overview
Methodology & History
Classical models
- Models-Classical · index
- Gutenberg-Richter-Law
- Omori-Utsu-Law
- ETAS-Model
- Reasenberg-Jones-Model
- STEP-Model
- EEPAS-Model
- Smoothed-Seismicity
- Brownian-Passage-Time
- Rate-and-State-and-Coulomb
ML & analytical methods
- Models-ML · index
- Temporal-Point-Processes
- RMTPP
- Neural-Hawkes-Process
- Transformer-Hawkes-Process
- RECAST-and-FERN
- CNN-Spatial-Models
- Graph-and-Recurrent-Networks
- Detection-vs-Forecasting
Models employed
Data
Architecture
Evaluation
Progress
Reference