
JSON Marshaling of large responses is excessively expensive #3601

Closed

jacksontj opened this issue Dec 19, 2017 · 12 comments

jacksontj (Contributor) commented Dec 19, 2017

What did you do?
Sent a query looking like metricname[3d] to Prometheus.

What did you expect to see?
metricname has ~5k labelsets, and from my math I was expecting a large data response (~2G).

What did you see instead? Under which circumstances?
Instead, I seemingly never get a response from Prometheus. More interestingly, I see a large increase in memory utilization, to the point that Prometheus stops scraping and eventually OOMs. From digging in more, I found that the issue comes down to how Prometheus marshals out the response on the HTTP API. The repro script below generates 3d worth of data (at a 15s period) for 5k timeseries. Generating that data takes ~500ms and ~1.2G of RAM; JSON-marshaling it takes ~2m and consumes ~11G of RAM.

Issues

  • json.Marshal uses ~11G of memory, significantly more than either the input data (1.2G) or the output (2G)
  • json.Marshal calls aren't cancelable, so if a request like this ever hits Prometheus it will either run to completion or kill the process
  • in addition to the large memory footprint, the marshaling (in the example below) takes ~2m on my desktop -- which is excessive, especially when you consider that the data generation took ~500ms

Suggestions
Suggestion 1
In an effort to alleviate both problems, I suggest making the JSON marshaling stream the data to the wire. There's no need to make a copy of it all in memory first, especially in the API case, where we literally just write to the buffer. A terrible-hacky example would be something like:

enc := json.NewEncoder(w)
w.Write([]byte{'['})

for i, item := range m {
	if err := enc.Encode(item); err != nil {
		fmt.Println(err)
		return
	}
	if i < len(m)-1 {
		w.Write([]byte{','})
	}
}
w.Write([]byte{']'})

In this example we would spin over every entry in the response and marshal it out. This means that each samplestream (in this example) would still need to be in memory, but once written to the wire it no longer needs to be kept there. In addition, this makes the request "cancelable" at each encode step (if the client disconnects, you get a stream-closed error). Of course, the "correct" implementation of this would require a bit of type switching.
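
For illustration, here is a minimal sketch of what that type switching might look like. respondStream is a hypothetical helper, not existing Prometheus code; the imports match the repro script below (plus io), and the non-Matrix cases fall back to a single Encode since those values are small:

func respondStream(w io.Writer, data model.Value) error {
	enc := json.NewEncoder(w)
	switch v := data.(type) {
	case model.Matrix:
		if _, err := w.Write([]byte{'['}); err != nil {
			return err // a client disconnect surfaces here, canceling the work
		}
		for i, ss := range v {
			if i > 0 {
				if _, err := w.Write([]byte{','}); err != nil {
					return err
				}
			}
			// Only one SampleStream is encoded at a time, so the full
			// response never exists as a single in-memory buffer.
			if err := enc.Encode(ss); err != nil {
				return err
			}
		}
		_, err := w.Write([]byte{']'})
		return err
	default:
		// Scalars, vectors, and strings are small; encode them in one shot.
		return enc.Encode(data)
	}
}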

Suggestion 2
Change the marshaling of the various structs to be codegen'd. Most of them are partially there, but there are some minor improvements that could be made that would give a ~2x boost in performance (mostly copying less, and reflecting less).
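
As a sketch of the kind of code such generation would emit (illustrative only, not the actual prometheus/common marshaler; the [timestamp,"value"] layout follows the API's wire format for a sample pair), a hand-rolled function can append straight into a byte slice and avoid reflection entirely:

// appendSamplePair marshals one model.SamplePair into buf.
// model.Time is milliseconds since the epoch; the wire format is seconds
// with millisecond precision, e.g. [1513728000.000,"42"].
func appendSamplePair(buf []byte, p model.SamplePair) []byte {
	buf = append(buf, '[')
	buf = strconv.AppendFloat(buf, float64(p.Timestamp)/1000, 'f', 3, 64)
	buf = append(buf, ',', '"')
	buf = strconv.AppendFloat(buf, float64(p.Value), 'f', -1, 64)
	return append(buf, '"', ']')
}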

For both of these I'd be more than happy to implement it (it's not that bad), but I wanted to get some feedback prior to implementation.

Repro Script

package main

import (
	"encoding/json"
	"fmt"
	"strconv"
	"time"

	"github.com/prometheus/common/model"
)

func generateData() model.Matrix {
	NUM_TIMESERIES := 5000
	NUM_DATAPOINTS := 17280

	// Create the top-level matrix
	m := make(model.Matrix, 0)

	for i := 0; i < NUM_TIMESERIES; i++ {
		lset := map[model.LabelName]model.LabelValue{
			model.MetricNameLabel: model.LabelValue("timeseries_" + strconv.Itoa(i)),
		}

		now := model.Now()

		values := make([]model.SamplePair, NUM_DATAPOINTS)

		for x := NUM_DATAPOINTS; x > 0; x-- {
			values[x-1] = model.SamplePair{
				Timestamp: now.Add(time.Second * -15 * time.Duration(x)), // Set the time back assuming a 15s interval
				Value:     model.SampleValue(float64(x)),
			}
		}

		ss := &model.SampleStream{
			Metric: model.Metric(lset),
			Values: values,
		}

		m = append(m, ss)
	}
	return m
}

func main() {
	start := time.Now()
	m := generateData()
	took := time.Since(start)

	fmt.Println("done generatingData took:", took)

	start = time.Now()
	if _, err := json.Marshal(m); err != nil {
		fmt.Println("marshal error:", err)
	}
	took = time.Since(start)
	fmt.Println("done marshaling took:", took)
}
brian-brazil (Contributor) commented:

See #3536, but in general if you request something that takes 1GB of memory you're likely to have problems.

brian-brazil (Contributor) commented:

Also, we have protections against this type of large query, so querying 3d of data at 15s resolution should immediately return an error. This is to encourage such requests to be broken up into smaller ones.

brancz (Member) commented Dec 19, 2017

Sorry if this is too far out of context, but what do we generally think of optionally marshaling as protobuf instead of json? (something I recently had a discussion with @DirectXMan12 about)

jacksontj (Contributor, Author) commented Dec 19, 2017 via email

jacksontj (Contributor, Author) commented Dec 19, 2017 via email

brian-brazil (Contributor) commented:

> Sorry if this is too far out of context, but what do we generally think of optionally marshaling as protobuf instead of json?

It doesn't make a difference; we still need to build up all the data in RAM before we can start writing it out.

jacksontj (Contributor, Author) commented:

I've spent some time working on this over the last day, and I've made significant progress. I have a PR open (prometheus/common#112) which speeds up marshaling with the current encoding/json marshaler by ~15-85% in the benchmarks.

Using this PR'd version of prometheus/common, my repro script goes from a 3m runtime to 27s. If I change the script to use easyjson.MarshalToWriter, the runtime drops even further, to ~16s.
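
For reference, that swap looks roughly like this (a sketch: it assumes easyjson-generated marshalers exist for the model types, which is what makes model.Matrix satisfy easyjson.Marshaler; needs "io/ioutil" and "github.com/mailru/easyjson" imported):

// Replaces the json.Marshal(m) call in main(). MarshalToWriter streams the
// encoded bytes to the writer instead of building one large []byte first.
if _, err := easyjson.MarshalToWriter(m, ioutil.Discard); err != nil {
	fmt.Println("marshal error:", err)
}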

smcquay commented Dec 27, 2017

In discussing a related PR on #prometheus-dev on Freenode, it came up that the other PR was a microbenchmark and missed some performance cases.

For completeness I replaced encoding/json with json-iter and ran the numbers; the easyjson version is still the fastest:

$ ./slowcli_easyjson
done generatingData took: 488.309301ms
done marshaling took: 15.330267979s

# with jsoniter
$ ./slowcli_jsoniter
done generatingData took: 494.28975ms
done marshaling took: 2m26.32011921s

# with encoding json
$ ./slowcli_encoding_json
done generatingData took: 478.830941ms
done marshaling took: 2m52.064084961s

brian-brazil (Contributor) commented:

That is quite different from the benchmarks jsoniter publish themselves, so I'm a bit suspicious. I'd really like to see an end-to-end test.

One way or the other, this is all getting assembled in memory before being marshaled, so smaller queries are needed for this volume of data.

jacksontj (Contributor, Author) commented:

I'm not sure what you are suspicious of. Neither of the benchmarks is black-boxed; they are open source, meaning you can look at them and determine whether something is wrong.

If you don't want to bother looking into it, I've added end-to-end data (hitting the /query/ endpoint) on #3608 (comment)

brian-brazil (Contributor) commented:

#3536 improved this. Your use case continues to not be sane.
