EWMA Comparisons #1

lifenjoiner · 2021-03-20T05:36:55Z

Here is an easy-to-use helper for comparing different EWMA strategies at a glance:
EWMA_cmp.xlsx

Just refresh the random samples or change N :)


lifenjoiner · 2022-04-04T11:44:12Z

Explanation:
0. Imagine a moving window selecting the data samples.

Average in column I is the exact average of the selected samples. It is the ideal result, the baseline that the others are compared to.
N in cell A2 is the moving window size.
Alpha in cell B2 is the fixed decay, the W (weight) of EWMA.
n in column C is the index of the samples.
V in column D is the random value used as the sample.
alpha in column E is the just-in-time (per-sample) decay, the W (weight) of EWMA.
EWMA Variant is the result of adding the 1st sample by using Add.
EWMA Continuing is the result of adding the 1st sample by using Set.
EWMA Warmup is the result of the VividCortex implementation with WARMUP_SAMPLES = N - 1, and adding the 1st sample by using Add.
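
For readers without the spreadsheet at hand, here is a minimal Python sketch of the baseline comparison: the exact moving-window average against an EWMA with a fixed decay. It assumes Alpha = 2 / (N + 1), which is a common convention but is not stated above, and it seeds the EWMA by setting the 1st sample. The function names and sample values are made up for illustration.

```python
from collections import deque


def window_average(samples, n):
    """Exact average over the last n samples at each step (the baseline)."""
    window = deque(maxlen=n)
    out = []
    for v in samples:
        window.append(v)
        out.append(sum(window) / len(window))
    return out


def ewma_fixed(samples, n):
    """EWMA with a fixed decay; ASSUMPTION: alpha = 2 / (n + 1)."""
    alpha = 2.0 / (n + 1)
    avg = None
    out = []
    for v in samples:
        # The 1st sample is set directly; later samples are blended in.
        avg = v if avg is None else alpha * v + (1 - alpha) * avg
        out.append(avg)
    return out


samples = [10.0, 12.0, 11.0, 50.0, 10.0, 11.0]  # hypothetical values
N = 3
exact = window_average(samples, N)
smooth = ewma_fixed(samples, N)
for i, (v, a, e) in enumerate(zip(samples, exact, smooth), start=1):
    print(f"n={i} V={v:5.1f} average={a:6.2f} ewma={e:6.2f}")
```

The exact average forgets the value 50 as soon as it leaves the window, while the EWMA only decays it gradually; that difference is what the columns in the sheet let you see.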

Compared with the VividCortex implementation:
EWMA Continuing should be the same when both add the 1st sample by using Set.
EWMA Warmup should be the same when both use a moving window size equal to WARMUP_SAMPLES.
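
A rough Python sketch of a warm-up scheme of this kind, for orientation only: the first `warmup` samples are averaged plainly, and every later sample is blended in with a fixed decay. This is a simplified model, not the VividCortex code. The decay 2 / (n + 1), the function name, and the choice to report the running mean during warm-up are all assumptions made here.

```python
def ewma_warmup(samples, n, warmup):
    """Plain mean over the first `warmup` samples, then a fixed-decay EWMA.

    ASSUMPTIONS: alpha = 2 / (n + 1); the running mean is reported while
    warming up (a real implementation may report something else).
    """
    if warmup < 1:
        raise ValueError("warmup must be at least 1")
    alpha = 2.0 / (n + 1)
    avg = 0.0
    out = []
    for k, v in enumerate(samples, start=1):
        if k <= warmup:
            avg += (v - avg) / k  # incremental running mean
        else:
            avg = alpha * v + (1 - alpha) * avg
        out.append(avg)
    return out


N = 3
warm = ewma_warmup([10.0, 12.0, 11.0, 50.0], N, warmup=N - 1)
print(warm)
```

With `warmup = N - 1` as in the sheet, the first N - 1 samples all carry equal weight, so a single early outlier is diluted rather than taken at face value.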

More discussions: DNSCrypt/dnscrypt-proxy#2079

lifenjoiner · 2022-04-06T01:55:51Z

There are 2 strategies at the "warmup" stage, and they differ in how they deal with outliers:

Add the 1st sample by using Add. The samples that follow carry more weight, decreasing to a stable value.
Add the 1st sample by using Set. The samples that follow carry less weight.
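
The two strategies can be sketched in Python as follows. The formulas are assumptions, not taken from the sheet: the just-in-time alpha for sample k is taken as 2 / (min(k, N) + 1), and the fixed Alpha as 2 / (N + 1). The function names and sample values are hypothetical.

```python
def ewma_first_add(samples, n):
    """1st sample via Add. ASSUMPTION: alpha_k = 2 / (min(k, n) + 1)."""
    avg = 0.0
    out = []
    for k, v in enumerate(samples, start=1):
        alpha = 2.0 / (min(k, n) + 1)  # 1, 2/3, 1/2, ... down to 2/(n+1)
        avg = alpha * v + (1 - alpha) * avg
        out.append(avg)
    return out


def ewma_first_set(samples, n):
    """1st sample via Set. ASSUMPTION: fixed alpha = 2 / (n + 1) afterwards."""
    alpha = 2.0 / (n + 1)
    avg = samples[0]
    out = [avg]
    for v in samples[1:]:
        avg = alpha * v + (1 - alpha) * avg
        out.append(avg)
    return out


N = 3
first_is_outlier = [100.0, 10.0, 10.0, 10.0]
second_is_outlier = [10.0, 100.0, 10.0, 10.0]
for name, data in [("1st", first_is_outlier), ("2nd", second_is_outlier)]:
    print(name, "outlier, Add:", ewma_first_add(data, N))
    print(name, "outlier, Set:", ewma_first_set(data, N))
```

Under these assumptions, Add recovers faster when the 1st sample is the outlier, because the 2nd and 3rd samples get a larger weight. Set reacts less to an outlier in the 2nd position, because every later sample gets only the stable weight.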

Things you may need to consider:

Can you tell which sample is the outlier: the 1st, the 2nd, or the 3rd? If none is, the closer to the exact average the better. If the 1st is, the average should adjust more quickly/heavily. If one of the later samples is, the average should adjust more slowly/lightly.
What is the outlier ratio of a server? If it is high, the server is less reliable and adjusting more quickly is better; the stable server will win in the end. If the ratios are all low, the servers are reliable, and the closer to the exact average the better.

Anyway, the above is a rough/lazy approach. If you really care about outliers, you should deal with them at an earlier stage, the validate/cleanup stage, for all samples.
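
As one possible shape for such a cleanup stage (not something the sheet or the linked PR does), here is a Python sketch that drops samples lying far from the median before they reach the EWMA. The rule, the threshold `k`, and the function name are all assumptions chosen for illustration.

```python
from statistics import median


def drop_outliers(samples, k=3.0):
    """Keep samples within k * MAD of the median.

    MAD is the median absolute deviation. This rule is only one common
    heuristic, picked here as an example.
    """
    if len(samples) < 3:
        return list(samples)  # too few samples to judge
    m = median(samples)
    mad = median(abs(v - m) for v in samples)
    if mad == 0:
        return list(samples)  # no spread estimate, so keep everything
    return [v for v in samples if abs(v - m) <= k * mad]


raw = [10.0, 12.0, 11.0, 50.0, 10.0, 11.0]  # hypothetical samples
clean = drop_outliers(raw)
print(clean)
```

A filter like this needs a batch of samples to work on, so it suits a validation pass over collected measurements rather than the very first samples of a fresh average.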

lifenjoiner pinned this issue Mar 20, 2021

lifenjoiner mentioned this issue Apr 5, 2022

regression: fix ewma warmup again DNSCrypt/dnscrypt-proxy#2079

Merged
