Question: why is RMS weighted by arrival weight? #29

luca-s · 2022-08-17T18:12:50Z

I stumbled across this part of the code and I realized that the event RMS is weighted by arrival weight. While the weighting scheme has its interesting value, especially when comparing different NonLinLoc solutions this weighted RMS works as a score, I find the weighting to be an issue when comparing solutions across several locators or across different velocity models. In those cases I would rather have the RMS computed in the "standard" way, without arrival weights. That would make the value meaningful for comparison.

For this reason I would like to understand better why the RMS is computed the way it is.

Thanks.

alomax · 2022-08-18T07:41:41Z

Good question. The basic answer is that an RMS that is not weighted by arrival weight (quality, pick uncertainty, travel-time error, ...) can be highly biased by bad or outlier data and so not very informative. A weighted RMS is used in Hypoinverse:

https://pubs.usgs.gov/of/1978/0694/report.pdf

However, there are complications in NLL depending on if the L2 GAU_ANALYTIC formulation of Tarantola and Valette 1982 http://www.ipgp.fr/~tarantola/Files/Professional/Papers_PDF/IP_QI_latex.pdf is used, in which case the weights are prior covariances on the picks and travel-time (Eq 10.9 in Tarantola and Valette 1982), and not sensitive to bad or outlier data.

Or a NLL EDT formulation, in which case the weights are the posterior contribution that the arrival makes to the EDT pdf stack. This is important because EDT intrinsically and efficiently down-weights outlier data so that their residuals can be very large, not including this posterior weight can give extremely large RMS in the presence of outlier data. But the EDT weights are a somewhat ad-hoc, hybrid of sums of probabilities, not simple covariances, so I doubt there is a clear or robust statistical basis for EDT weights and the resulting RMS. I tend to use the ellipsoid extent (len3 or se3, simple proxy for PDF extent) instead of RMS for filtering location results, along with sometimes number of readings, gap, or other prior measures.

In any case, the residuals are listed in the output, so an unweighted RMS can always be calculated.

Anthony

luca-s · 2022-08-18T08:13:44Z

Thank you very much for taking the time for answering, this is all good information.

FMassin · 2022-08-18T10:50:10Z

Interesting!

So, it is currently unfair to compare NLL RMS from other location methods such as those included in SeisComP?

LOCSAT uses unweighted rms
hypo71 uses unweighted rms
iloc has both (called uRMS and wRMS) but reports unweighted rms

luca-s · 2022-08-18T11:10:24Z

@FMassin As @alomax wrote "the residuals are listed in the output, so an unweighted RMS can always be calculated" so we should consider doing so in SED NLL plugin for SeisComP

alomax · 2022-08-18T13:15:12Z

The rms from the hypo71 output seems to be parsed here. But hypo71 does use an rms "corrected for average P & S residual"
https://pubs.usgs.gov/of/1972/0224/report.pdf

But, in general (i.e. always), I would suppose that statistics from two different procedures (or even the same procedure with very different input configurations) cannot be directly compared. Perhaps, for the case of hypocenter location, only the statistics between events with similar station distribution, proportion of P and S picks, etc, within a single location configuration (velocity model, ...) and location algorithm, can be directly compared.

FMassin · 2022-08-19T07:56:24Z

The rms from the hypo71 output seems to be parsed here.

I think this is for the SeisComP interface. The actual hypo71 code is in https://github.com/SeisComP/contrib-ipgp/tree/master/apps/3rd-party/Hypo71PC.

alomax · 2022-08-19T08:27:05Z

Yeah - I was a bit confused, as the link in the e-mail version of your comment pointed to the SCP interface...

In any case, what hypo71 is doing with weights and AVRPS is not immediately clear
(FORTRAN!), but there is some possible weight XWT.

FMassin · 2022-08-19T08:30:36Z

Me too ! It took me a while to get it and edited my comment sorry! I'm still confused about this FNO variable that seem to be incremented by 1 for each data point anyway...

luca-s closed this as completed Aug 18, 2022

luca-s mentioned this issue Aug 18, 2022

[nll] Unweighted RMS computation swiss-seismological-service/sed-SeisComP-contributions#12

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: why is RMS weighted by arrival weight? #29

Question: why is RMS weighted by arrival weight? #29

luca-s commented Aug 17, 2022

alomax commented Aug 18, 2022

luca-s commented Aug 18, 2022 •

edited

Loading

FMassin commented Aug 18, 2022 •

edited

Loading

luca-s commented Aug 18, 2022 •

edited

Loading

alomax commented Aug 18, 2022

FMassin commented Aug 19, 2022

alomax commented Aug 19, 2022

FMassin commented Aug 19, 2022 •

edited

Loading

Question: why is RMS weighted by arrival weight? #29

Question: why is RMS weighted by arrival weight? #29

Comments

luca-s commented Aug 17, 2022

alomax commented Aug 18, 2022

luca-s commented Aug 18, 2022 • edited Loading

FMassin commented Aug 18, 2022 • edited Loading

luca-s commented Aug 18, 2022 • edited Loading

alomax commented Aug 18, 2022

FMassin commented Aug 19, 2022

alomax commented Aug 19, 2022

FMassin commented Aug 19, 2022 • edited Loading

luca-s commented Aug 18, 2022 •

edited

Loading

FMassin commented Aug 18, 2022 •

edited

Loading

luca-s commented Aug 18, 2022 •

edited

Loading

FMassin commented Aug 19, 2022 •

edited

Loading