Optimise PromQL (#3966)

* Move range logic to 'eval'
* Make aggregate range aware
* PromQL is statically typed, so don't eval to find the type.
* Extend rangewrapper to multiple exprs
* Start making function evaluation ranged
* Make instant queries a special case of range queries
* Eliminate evalString
* Evaluate range vector functions one series at a time
* Make unary operators range aware
* Make binops range aware
* Pass time to range-aware functions.
* Make simple _over_time functions range aware
* Reduce allocs when working with matrix selectors
* Add basic benchmark for range evaluation
* Reuse objects for function arguments
* Do dropmetricname and allocating output vector only once.
* Add range-aware support for range vector functions with params
* Optimise holt_winters, cut cpu and allocs by ~25%
* Make rate&friends range aware
* Make more functions range aware. Document calling convention.
* Make date functions range aware
* Make simple math functions range aware
* Convert more functions to be range aware
* Make more functions range aware
* Specialcase timestamp() with vector selector arg for range awareness
* Remove transition code for functions
* Remove the rest of the engine transition code
* Remove more obsolete code
* Remove the last uses of the eval* functions
* Remove engine finalizers to prevent corruption

  The finalizers set by matrixSelector were being called just before the value they were returning to the pool was provided to the caller, so a concurrent query could corrupt data that had just been returned to the user.
* Add new benchmark suite for range functions
* Migrate existing benchmarks to new system
* Expand promql benchmarks
* Simplify test by removing unused range code
* When testing instant queries, check range queries too.

  To protect against subsequent steps in a range query being affected by the previous steps, add a test that re-evaluates a known-good instant query as a range query in which the timestamp we care about is not the first step.
* Reuse ring for matrix iters. Put query results back in pool.
* Reuse buffer when iterating over matrix selectors
* Unary minus should remove metric name

  Cut down benchmarks for faster runs.
* Reduce repetition in benchmark test cases
* Work series by series when doing normal vectorSelectors
* Optimise benchmark setup, cuts time by 60%
* Have rangeWrapper use an evalNodeHelper to cache across steps
* Use evalNodeHelper with functions
* Cache dropMetricName within a node evaluation.

  This saves both the calculations and allocs done by dropMetricName across steps.
* Reuse input vectors in rangewrapper
* Reuse the point slices in the matrixes input/output by rangeWrapper
* Make benchmark setup faster using AddFast
* Simplify benchmark code.
* Add caching in VectorBinop
* Use xor to have one-level resultMetric hash key
* Add more benchmarks
* Call Query.Close in apiv1

  This allows point slices allocated for the response data to be reused by later queries, saving allocations.
* Optimise histogram_quantile

  It's now 5-10% faster with 97% less garbage generated for 1k steps
* Make the input collection in rangeVector linear rather than quadratic
* Optimise label_replace, for 1k steps 15x fewer allocs and 3x faster
* Optimise label_join, 1.8x faster and 11x less memory for 1k steps
* Expand benchmarks, cleanup comments, simplify numSteps logic.
* Address Fabian's comments
* Comments from Alin.
* Address jrv's comments
* Remove dead code
* Address Simon's comments.
* Rename populateIterators, pre-init some sizes
* Handle case where function has non-matrix args first
* Split rangeWrapper out to rangeEval function, improve comments
* Cleanup and make things more consistent
* Make EvalNodeHelper public
* Fabian's comments.

Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>
prometheus · Jun 4, 2018 · dd6781a
1 parent 9dc763c
commit dd6781a

Showing 12 changed files with 1,204 additions and 968 deletions.
diff --git a/promql/ast.go b/promql/ast.go
@@ -132,9 +132,8 @@ type MatrixSelector struct {
 	Offset        time.Duration
 	LabelMatchers []*labels.Matcher
 
-	// The series iterators are populated at query preparation time.
-	series    []storage.Series
-	iterators []*storage.BufferedSeriesIterator
+	// The series are populated at query preparation time.
+	series []storage.Series
 }
 
 // NumberLiteral represents a number.
@@ -166,9 +165,8 @@ type VectorSelector struct {
 	Offset        time.Duration
 	LabelMatchers []*labels.Matcher
 
-	// The series iterators are populated at query preparation time.
-	series    []storage.Series
-	iterators []*storage.BufferedSeriesIterator
+	// The series are populated at query preparation time.
+	series []storage.Series
 }
 
 func (e *AggregateExpr) Type() ValueType  { return ValueTypeVector }

diff --git a/promql/bench_test.go b/promql/bench_test.go
@@ -13,36 +13,183 @@
 
 package promql
 
-import "testing"
+import (
+	"context"
+	"fmt"
+	"strconv"
+	"strings"
+	"testing"
+	"time"
 
-// A Benchmark holds context for running a unit test as a benchmark.
-type Benchmark struct {
-	b         *testing.B
-	t         *Test
-	iterCount int
-}
+	"github.com/prometheus/prometheus/pkg/labels"
+	"github.com/prometheus/prometheus/util/testutil"
+)
+
+func BenchmarkRangeQuery(b *testing.B) {
+	storage := testutil.NewStorage(b)
+	defer storage.Close()
+	engine := NewEngine(nil, nil, 10, 100*time.Second)
 
-// NewBenchmark returns an initialized empty Benchmark.
-func NewBenchmark(b *testing.B, input string) *Benchmark {
-	t, err := NewTest(b, input)
-	if err != nil {
-		b.Fatalf("Unable to run benchmark: %s", err)
+	metrics := []labels.Labels{}
+	metrics = append(metrics, labels.FromStrings("__name__", "a_one"))
+	metrics = append(metrics, labels.FromStrings("__name__", "b_one"))
+	for j := 0; j < 10; j++ {
+		metrics = append(metrics, labels.FromStrings("__name__", "h_one", "le", strconv.Itoa(j)))
 	}
-	return &Benchmark{
-		b: b,
-		t: t,
+	metrics = append(metrics, labels.FromStrings("__name__", "h_one", "le", "+Inf"))
+
+	for i := 0; i < 10; i++ {
+		metrics = append(metrics, labels.FromStrings("__name__", "a_ten", "l", strconv.Itoa(i)))
+		metrics = append(metrics, labels.FromStrings("__name__", "b_ten", "l", strconv.Itoa(i)))
+		for j := 0; j < 10; j++ {
+			metrics = append(metrics, labels.FromStrings("__name__", "h_ten", "l", strconv.Itoa(i), "le", strconv.Itoa(j)))
+		}
+		metrics = append(metrics, labels.FromStrings("__name__", "h_ten", "l", strconv.Itoa(i), "le", "+Inf"))
 	}
-}
 
-// Run runs the benchmark.
-func (b *Benchmark) Run() {
-	defer b.t.Close()
-	b.b.ReportAllocs()
-	b.b.ResetTimer()
-	for i := 0; i < b.b.N; i++ {
-		if err := b.t.RunAsBenchmark(b); err != nil {
-			b.b.Error(err)
+	for i := 0; i < 100; i++ {
+		metrics = append(metrics, labels.FromStrings("__name__", "a_hundred", "l", strconv.Itoa(i)))
+		metrics = append(metrics, labels.FromStrings("__name__", "b_hundred", "l", strconv.Itoa(i)))
+		for j := 0; j < 10; j++ {
+			metrics = append(metrics, labels.FromStrings("__name__", "h_hundred", "l", strconv.Itoa(i), "le", strconv.Itoa(j)))
+		}
+		metrics = append(metrics, labels.FromStrings("__name__", "h_hundred", "l", strconv.Itoa(i), "le", "+Inf"))
+	}
+	refs := make([]uint64, len(metrics))
+
+	// A day of data plus 10k steps.
+	numIntervals := 8640 + 10000
+
+	for s := 0; s < numIntervals; s += 1 {
+		a, err := storage.Appender()
+		if err != nil {
+			b.Fatal(err)
+		}
+		ts := int64(s * 10000) // 10s interval.
+		for i, metric := range metrics {
+			err := a.AddFast(metric, refs[i], ts, float64(s))
+			if err != nil {
+				refs[i], _ = a.Add(metric, ts, float64(s))
+			}
+		}
+		if err := a.Commit(); err != nil {
+			b.Fatal(err)
 		}
-		b.iterCount++
+	}
+
+	type benchCase struct {
+		expr  string
+		steps int
+	}
+	cases := []benchCase{
+		// Simple rate.
+		{
+			expr: "rate(a_X[1m])",
+		},
+		{
+			expr:  "rate(a_X[1m])",
+			steps: 10000,
+		},
+		// Holt-Winters and long ranges.
+		{
+			expr: "holt_winters(a_X[1d], 0.3, 0.3)",
+		},
+		{
+			expr: "changes(a_X[1d])",
+		},
+		{
+			expr: "rate(a_X[1d])",
+		},
+		// Unary operators.
+		{
+			expr: "-a_X",
+		},
+		// Binary operators.
+		{
+			expr: "a_X - b_X",
+		},
+		{
+			expr:  "a_X - b_X",
+			steps: 10000,
+		},
+		{
+			expr: "a_X and b_X{l=~'.*[0-4]$'}",
+		},
+		{
+			expr: "a_X or b_X{l=~'.*[0-4]$'}",
+		},
+		{
+			expr: "a_X unless b_X{l=~'.*[0-4]$'}",
+		},
+		// Simple functions.
+		{
+			expr: "abs(a_X)",
+		},
+		{
+			expr: "label_replace(a_X, 'l2', '$1', 'l', '(.*)')",
+		},
+		{
+			expr: "label_join(a_X, 'l2', '-', 'l', 'l')",
+		},
+		// Combinations.
+		{
+			expr: "rate(a_X[1m]) + rate(b_X[1m])",
+		},
+		{
+			expr: "sum without (l)(rate(a_X[1m]))",
+		},
+		{
+			expr: "sum without (l)(rate(a_X[1m])) / sum without (l)(rate(b_X[1m]))",
+		},
+		{
+			expr: "histogram_quantile(0.9, rate(h_X[5m]))",
+		},
+	}
+
+	// X in an expr will be replaced by different metric sizes.
+	tmp := []benchCase{}
+	for _, c := range cases {
+		if !strings.Contains(c.expr, "X") {
+			tmp = append(tmp, c)
+		} else {
+			tmp = append(tmp, benchCase{expr: strings.Replace(c.expr, "X", "one", -1), steps: c.steps})
+			tmp = append(tmp, benchCase{expr: strings.Replace(c.expr, "X", "ten", -1), steps: c.steps})
+			tmp = append(tmp, benchCase{expr: strings.Replace(c.expr, "X", "hundred", -1), steps: c.steps})
+		}
+	}
+	cases = tmp
+
+	// No step will be replaced by cases with the standard step.
+	tmp = []benchCase{}
+	for _, c := range cases {
+		if c.steps != 0 {
+			tmp = append(tmp, c)
+		} else {
+			tmp = append(tmp, benchCase{expr: c.expr, steps: 1})
+			tmp = append(tmp, benchCase{expr: c.expr, steps: 10})
+			tmp = append(tmp, benchCase{expr: c.expr, steps: 100})
+			tmp = append(tmp, benchCase{expr: c.expr, steps: 1000})
+		}
+	}
+	cases = tmp
+	for _, c := range cases {
+		name := fmt.Sprintf("expr=%s,steps=%d", c.expr, c.steps)
+		b.Run(name, func(b *testing.B) {
+			b.ReportAllocs()
+			for i := 0; i < b.N; i++ {
+				qry, err := engine.NewRangeQuery(
+					storage, c.expr,
+					time.Unix(int64((numIntervals-c.steps)*10), 0),
+					time.Unix(int64(numIntervals*10), 0), time.Second*10)
+				if err != nil {
+					b.Fatal(err)
+				}
+				res := qry.Exec(context.Background())
+				if res.Err != nil {
+					b.Fatal(res.Err)
+				}
+				qry.Close()
+			}
+		})
 	}
 }
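
Note on the benchmark case expansion above: the two rewriting passes (substituting `X` with a series-count suffix, then fanning out cases that have no explicit step count) can be pulled into a standalone function to see how many sub-benchmarks result. The sketch below is illustrative only. The function name `expand` and its `sizes` and `defaultSteps` parameters are assumptions introduced here; the diff hard-codes those values inline.

```go
package main

import (
	"fmt"
	"strings"
)

type benchCase struct {
	expr  string
	steps int
}

// expand is a hypothetical helper mirroring the two passes in the benchmark:
// first replace "X" with each size suffix, then give every case that has no
// explicit step count one copy per default step count.
func expand(cases []benchCase, sizes []string, defaultSteps []int) []benchCase {
	var sized []benchCase
	for _, c := range cases {
		if !strings.Contains(c.expr, "X") {
			sized = append(sized, c)
			continue
		}
		for _, s := range sizes {
			sized = append(sized, benchCase{expr: strings.Replace(c.expr, "X", s, -1), steps: c.steps})
		}
	}

	var out []benchCase
	for _, c := range sized {
		if c.steps != 0 {
			// An explicit step count is kept as-is.
			out = append(out, c)
			continue
		}
		for _, n := range defaultSteps {
			out = append(out, benchCase{expr: c.expr, steps: n})
		}
	}
	return out
}

func main() {
	cases := []benchCase{
		{expr: "rate(a_X[1m])"},
		{expr: "rate(a_X[1m])", steps: 10000},
	}
	got := expand(cases, []string{"one", "ten", "hundred"}, []int{1, 10, 100, 1000})
	fmt.Println(len(got)) // 15: 3 sizes x 4 default steps, plus 3 sizes x 1 explicit step
	for _, c := range got {
		fmt.Printf("expr=%s,steps=%d\n", c.expr, c.steps)
	}
}
```

By the same arithmetic, the 18 cases in the diff (16 without an explicit step count, 2 with `steps: 10000`) should expand to 16 x 3 x 4 + 2 x 3 = 198 sub-benchmarks, which is why a full run of `BenchmarkRangeQuery` takes a while.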