InfluxDB: take advantage of influx chunked reponses #863

sanga · 2014-09-24T13:21:23Z

Fair warning up front. This is possibly not that trivial and I not even entirely sure as to the feasibility of doing this but anyway...

Influx supports chunked http responses, so that it will send data back in, well, chunks, as it calculates the data. According to the docs, it should send all data for the time period it has calculated and then move on to the next chunk of time from the requested time period. So I'm wondering, might it be possible to read the data in in chunks and paint the graph chunk by chunk (a very tertiary glance at flot docs would appear to suggest this is possible at least)?

The problem I'm trying to get around is basically this. I have some graphs that plot an awful lot of data. And they take a long time to paint. During that time we currently just get a spinner in the graph. Much nicer would be if the graph would gradually "fill in" i.e. paint backwards in time (like Splunk if you ever happen to have used that tool).

What do you think? Reasonable use of time/complexity to implement?

torkelo · 2014-09-24T13:46:47Z

have you considered using influxdb continuous queries to pre aggregate and speed up the queries? I havent used influxdb in a production setup yet, but with graphite (in production usage with hundreds of thousands of metrics) I have yet to find queries that take more than a second even for long time ranges and many metrics.

Also are you using columns to distinguish metrics or are you using a series per metric (with single a single value column). Because I think that using a where clause and using columns like "host" is very bad for performance when using influxdb.

As to streaming in the results, it would definitely be possible. But more would have to +1 vote it for me to spend time on it (might be 1-2 days work at least).

sahilthapar · 2014-09-25T06:35:22Z

I get this too ... Maybe I should look into continuous queries but I get this issue with single value columns.

sanga · 2014-11-11T08:05:06Z

Sorry I never replied to this earlier, apparently it was lost in a sea of open browser tabs.... Anyway, cont- queries would, I guess, work fine if I knew beforehand what I wanted to query. However most of the time I spend in grafana is exploring perf. problems, so I don't know what is interesting before I start exploring.

Having said that, a 90% solution to this is just to use a long enough group_by period, by which you can drastically reduce the amount of data send back from influx (cursory investigation suggests it's influx's speed is inversely proportional to the raw amount of data that it needs to pass back). The other benefit of doing this is that grafana needs to store less in memory so the ui remains snappy.

nbrownus · 2015-07-16T17:18:32Z

Support for this would clear up #2266 which the influxdb team has identified as the root issue influxdata/influxdb#3242

sknaumov · 2019-06-11T09:58:07Z

I have a related question - could Grafana be configured to retrieve data by chunks, using new request per chunk? The problem is that I have a lot of metrics in a blob in DB (write-optimized, storage size optimized, reads are infrequent and quite expensive) and a dashboard where for each metric Grafana creates an HTTP request for a big period of time (say, 1 day or 1 week). A natural thing would be to try to aggregate these requests to process all metrics simultaneously, as ultimately they point to the very same blob - but modern browsers allow only about 6 concurrent connections to the server by default. Caching blob decompression and parsing results would resolve the problem, but for long periods of data (say, 1 week) it is no longer an option, as data for the first 6 requests has to be processed first before follow-up requests for other metrics will be sent => I need to cache the whole 1 week of per-second data for all possible metrics. If Grafana would be able to perform data retrieval in configurable chunks (say, retrieve no more than 1 hour of data with 1 request), and for all metrics ask for the first hour first, then, when completed, ask for the second hour and so on... It would resolve the problem. Are there some configuration options like this?

ryantxu · 2019-07-10T18:24:35Z

With the new streaming infrastructure, this is now something we can consider (it is still a ways off!) but there is a path for it.

The things we need are:

backend_srv using fetch. See: Migrate backend_srv from angular to fetch #18049
most likely in influxdb-flux-datasource process each chunk and then stream results.

However, I a bit skeptical that the browser will be able to do anything useful if there is too much data.

vpapavas · 2019-11-07T00:57:58Z

Hi everyone! What is the status on supporting chunked responses? I am working on developing a plugin for a streaming data source whose responses are chunked.

torkelo · 2019-11-07T05:29:51Z

The datasource query function can return a rxjs Observable to stream results

gabor · 2021-07-16T08:56:04Z

@sknaumov i think the use-case you described in #863 (comment) is slightly different from what is requested in this issue, could you please open a separate feature-request for discussing it? thanks!

gabor · 2021-07-16T08:59:05Z

@sanga i'm trying to understand your use-case better:

is it about influxdb returning too much data? in this case, i'm worried that even if it gets solved, the browser will just stop working if it gets tens of thousands of data-points, in this case (as you also mentioned) longer group_by periods should solve the problem
is it about influxdb returning the results very slowly, where by using chunked responses we would see the start of the data faster in the graph?

gabor · 2021-07-16T09:05:58Z

NOTE: in general, i wonder how much is this use-case still supported in influxdb2. i did some tests with influxdb 2.0.7:

there is no mention of chunked responses in the influxql API in the docs ( https://docs.influxdata.com/influxdb/v2.0/api/v1-compatibility/#tag/Query , https://docs.influxdata.com/influxdb/v2.0/reference/api/influxdb-1x/query/ )
there is no mention of chunked responses in the flux API in the docs ( https://docs.influxdata.com/influxdb/v2.0/api/#operation/PostQuery )
it is documented in the docs for the 1.x version at https://docs.influxdata.com/influxdb/v1.8/tools/api/, where it says chunked=true|number
getting chunked responses does work in influxql mode in influxdb 2.0.7 when adding &chunked=true&chunk_size=10 to the end of the HTTP query.
- note that this is different from what the api-docs for influxdb 1.x say (chunk_size is not documented at all)

gabor · 2021-07-16T09:35:24Z

NOTE: for flux-mode, we are consuming the flux-response csv-row by csv-row ( https://github.com/grafana/grafana/blob/main/pkg/tsdb/influxdb/flux/executor.go#L78 ), so at least in theory there is a way to return non-full data to the browser. still, the question remains how useful this would be.

sanga · 2021-07-18T06:04:40Z

@gabor to be honest this ticket is so old that I can't anymore recall precisely what my problem was. I think is was a combination of both. Which is to say: Slowness caused by a large amount of data. I agree with your assertion that having a large amount of data will probably cause the browser to be unusably slow in any case. Given that and the fact that chunked queries don't exist anymore, I think this ticket can be closed

gabor · 2021-07-19T05:55:26Z

@sanga thanks for the info, closing it then.

torkelo added type/feature-request datasource/InfluxDB labels Sep 24, 2014

Andy2003 mentioned this issue Jan 28, 2017

Influxdb: Support InfluxDB 1.2+ Partial data responses #7380

Closed

marefr added the area/datasource label Mar 30, 2019

apurvam mentioned this issue Nov 8, 2019

refactor: remove logging for every pull query MINOR confluentinc/ksql#3736

Closed

2 tasks

vpapavas mentioned this issue Nov 11, 2019

Pull queries: Create new REST endpoint for pull queries confluentinc/ksql#3742

Closed

aocenas added the prio/low It's a good idea, but not scheduled for any release label Sep 2, 2020

daniellee changed the title ~~take advantage of influx chunked reponses~~ InfluxDB: take advantage of influx chunked reponses Nov 9, 2020

gabor added the needs investigation for unconfirmed bugs. use type/bug for confirmed bugs, even if they "need" more investigating label Jul 16, 2021

gabor self-assigned this Jul 16, 2021

gabor added needs more info Issue needs more information, like query results, dashboard or panel json, grafana version etc and removed needs investigation for unconfirmed bugs. use type/bug for confirmed bugs, even if they "need" more investigating labels Jul 16, 2021

gabor removed their assignment Jul 16, 2021

gabor closed this as completed Jul 19, 2021

This was referenced Dec 5, 2022

[Snyk] Security upgrade simple-git from 1.132.0 to 3.15.0 ekmixon/grafana#382

Open

[Snyk] Security upgrade simple-git from 1.132.0 to 3.15.0 kevinjm39/grafana#207

Open

This was referenced Dec 6, 2022

[Snyk] Security upgrade simple-git from 1.132.0 to 3.15.0 jinnu92/grafana#196

Open

[Snyk] Security upgrade simple-git from 2.48.0 to 3.15.0 turkdevops/grafana#740

Open

ekmixon mentioned this issue Jun 18, 2023

[Snyk] Security upgrade simple-git from 1.132.0 to 3.15.0 ekmixon/grafana#444

Open

ekmixon mentioned this issue Oct 27, 2023

[Snyk] Security upgrade simple-git from 1.132.0 to 3.16.0 ekmixon/grafana#495

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

InfluxDB: take advantage of influx chunked reponses #863

InfluxDB: take advantage of influx chunked reponses #863

sanga commented Sep 24, 2014

torkelo commented Sep 24, 2014

sahilthapar commented Sep 25, 2014

sanga commented Nov 11, 2014

nbrownus commented Jul 16, 2015

sknaumov commented Jun 11, 2019

ryantxu commented Jul 10, 2019

vpapavas commented Nov 7, 2019

torkelo commented Nov 7, 2019

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

sanga commented Jul 18, 2021

gabor commented Jul 19, 2021

InfluxDB: take advantage of influx chunked reponses #863

InfluxDB: take advantage of influx chunked reponses #863

Comments

sanga commented Sep 24, 2014

torkelo commented Sep 24, 2014

sahilthapar commented Sep 25, 2014

sanga commented Nov 11, 2014

nbrownus commented Jul 16, 2015

sknaumov commented Jun 11, 2019

ryantxu commented Jul 10, 2019

vpapavas commented Nov 7, 2019

torkelo commented Nov 7, 2019

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

gabor commented Jul 16, 2021

sanga commented Jul 18, 2021

gabor commented Jul 19, 2021