-
Notifications
You must be signed in to change notification settings - Fork 25.5k
Open
Labels
:Analytics/AggregationsAggregationsAggregations>enhancementTeam:AnalyticsMeta label for analytical engine team (ESQL/Aggs/Geo)Meta label for analytical engine team (ESQL/Aggs/Geo)
Description
ExtendedStats uses a naive algorithm for calculating variance which can lead to floating point errors when the difference between sumSq
and sum
is very small (see here and here). This won't be caught by our existing Kahan summation which is more concerned about accumulated errors when summing values.
There are alternative algos that behave better, which will be increasingly important as other tools build on variance (like t-test).
mysticaltechmysticaltech
Metadata
Metadata
Assignees
Labels
:Analytics/AggregationsAggregationsAggregations>enhancementTeam:AnalyticsMeta label for analytical engine team (ESQL/Aggs/Geo)Meta label for analytical engine team (ESQL/Aggs/Geo)