Postgres stacked bar graph panel with $__timeGroup has incorrect graph data #10073

rezolutiontech · 2017-12-04T19:27:57Z

Grafana version: 4.6.2
OS: Linux (Amazon Linux AMI)

I'm trying to replicate a graph I use from a influx datasource regularly using a postgres datasource which is a stacked bar graph with a 10m grouping.

When I view this graph however I get data outside the 10m grouping referenced in bars they shouldn't be in which results in floating values. I'll attach a screenshot in a follow up to this report.

Also if you hover over the graph data points will show values for all for metrics in each group.

Example of my postgres query/data is below:

SELECT (extract(epoch from "purchase_datetime")/extract(epoch from '10m'::interval))::int*extract(epoch from '10m'::interval) as time, company as metric, sum(paid_price) as value FROM purchases WHERE extract(epoch from purchase_datetime) BETWEEN 1512379200 AND 1512381417 GROUP BY 1,2 ORDER BY 1,2 ASC;
    time    | metric | value 
------------+--------+-------
 1512379200 | CO1    |   100
 1512379200 | CO2    |    20
 1512379800 | CO3    |    47
 1512380400 | CO3    |  62.5
 1512381000 | CO1    |    93
 1512381600 | CO3    |  30.5
 1512381600 | CO4    |    37
(4 rows)

My panel JSON is

{
  "id": 28,
  "title": "Panel Title",
  "span": 12,
  "type": "graph",
  "datasource": "Warehouse Reporting",
  "targets": [
    {
      "policy": "default",
      "dsType": "influxdb",
      "resultFormat": "time_series",
      "orderByTime": "ASC",
      "tags": [],
      "groupBy": [
        {
          "type": "time",
          "params": [
            "$__interval"
          ]
        },
        {
          "type": "fill",
          "params": [
            "null"
          ]
        }
      ],
      "select": [
        [
          {
            "type": "field",
            "params": [
              "value"
            ]
          },
          {
            "type": "mean",
            "params": []
          }
        ]
      ],
      "refId": "A",
      "format": "time_series",
      "alias": "",
      "rawSql": "SELECT\n  $__timeGroup(purchase_datetime,'10m') as time,\n  company as metric,\n  sum(paid_price) as value\nFROM\n  purchases\nWHERE\n  $__timeFilter(purchase_datetime)\nGROUP BY 1,2\nORDER BY 1,2 ASC\n"
    }
  ],
  "renderer": "flot",
  "yaxes": [
    {
      "label": null,
      "show": true,
      "logBase": 1,
      "min": null,
      "max": null,
      "format": "currencyUSD"
    },
    {
      "label": null,
      "show": true,
      "logBase": 1,
      "min": null,
      "max": null,
      "format": "short"
    }
  ],
  "xaxis": {
    "show": true,
    "mode": "time",
    "name": null,
    "values": [],
    "buckets": null
  },
  "lines": false,
  "fill": 1,
  "linewidth": 1,
  "dashes": false,
  "dashLength": 10,
  "spaceLength": 10,
  "points": false,
  "pointradius": 5,
  "bars": true,
  "stack": true,
  "percentage": false,
  "legend": {
    "show": true,
    "values": true,
    "min": false,
    "max": false,
    "current": false,
    "total": true,
    "avg": false,
    "hideEmpty": true,
    "hideZero": true,
    "alignAsTable": false
  },
  "nullPointMode": "null as zero",
  "steppedLine": false,
  "tooltip": {
    "value_type": "individual",
    "shared": true,
    "sort": 0
  },
  "timeFrom": null,
  "timeShift": null,
  "aliasColors": {},
  "seriesOverrides": [],
  "thresholds": []
}

The text was updated successfully, but these errors were encountered:

rezolutiontech · 2017-12-04T19:29:05Z

Example panel

svenklemm · 2017-12-04T20:53:10Z

The problem is you are not resetting the series. For every interval where a series has no data you need to insert a row for that series with NULL or 0.

Your query should produce something like the following query

SELECT * FROM (
SELECT 1512379200 as time, 'CO1' as metric, 100.0 UNION 
SELECT 1512379800 as time, 'CO1' as metric, NULL UNION 
SELECT 1512380400 as time, 'CO1' as metric, NULL UNION
SELECT 1512381000 as time, 'CO1' as metric, 93 UNION

SELECT 1512379200 as time, 'CO2' as metric, 20 UNION
SELECT 1512379800 as time, 'CO2' as metric, NULL UNION 
SELECT 1512380400 as time, 'CO2' as metric, NULL UNION
SELECT 1512381000 as time, 'CO2' as metric, NULL UNION

SELECT 1512379200 as time, 'CO3' as metric, NULL UNION 
SELECT 1512379800 as time, 'CO3' as metric, 47 UNION 
SELECT 1512380400 as time, 'CO3' as metric, 62.5 UNION
SELECT 1512381000 as time, 'CO3' as metric, 30.5 UNION

SELECT 1512379200 as time, 'CO4' as metric, NULL UNION 
SELECT 1512379800 as time, 'CO4' as metric, NULL UNION 
SELECT 1512380400 as time, 'CO4' as metric, NULL UNION
SELECT 1512381000 as time, 'CO4' as metric, 37
) as purchases ORDER BY 1,2

This is certainly not optimal and this will make the query you have to write more complicated cause you have to do multiple joins to get that result.

Its probably much better to generate those NULLs in the grafana backend so I'm considering implementing it there. I'm not yet sure though how to best control this behaviour.

rezolutiontech · 2017-12-04T21:03:42Z

Right. I thought it might be something like that. I was trying to do something using coalesce to make the equivalent happen but haven't stumbled on the right solution yet.

I agree that having grafana fill in the missing values would be ideal to keep the query complexity down.

The equivalent we use for influxdb with a min time interval options value of ">10m" looks like

SELECT sum("usdPrice") as price FROM "purchases" WHERE usdPrice > 0 AND $timeFilter GROUP BY company, time($interval) fill(0)
which is nice and simple but sadly storing a copy of the data from postgres in influx isn't an option for me currently.

Thanks for the quick response

svenklemm · 2017-12-04T21:45:35Z

This should produce the results you want/need:

SELECT 
  base.time,
  base.company as metric,
  paid_price 
FROM 
  (
    SELECT 
      time,
      company 
    FROM generate_series(($__unixEpochFrom()/600)::int*600,($__unixEpochTo()/600)::int*600,600) as times(time), 
    (SELECT distinct company from purchases) as companies
  ) as base 
  LEFT OUTER JOIN (
    SELECT
      $__timeGroup(purchase_datetime,'10m') as time,
      company,
      sum(paid_price) as paid_price
    FROM purchases
    WHERE
      $__timeFilter(purchase_datetime)
      GROUP BY 1,2
  ) as p ON (p.company=base.company and p.time = base.time) ORDER BY 1,2;

rezolutiontech · 2017-12-04T22:10:01Z

Wow! This was unexpected and awesome. Thank you! I completely forgot generate_series existed.

daniellee · 2017-12-05T09:29:46Z

Great work @svenklemm Is this something we want to add to the pg data source? Having a fill macro or option would be a good addition I think. The query above would be too complicated for some people. We could add a new macro if we to preserve backwards compatability.

svenklemm · 2017-12-05T10:29:18Z

I was thinking about either adding a new macro $__timeGroupFill(column,'5m',NULL) or adding a $__fill(NULL) macro which would be used in conjunction with the $__timeGroup macro. $__timeGroupFill seems like the cleaner way to implement this as you need the interval length for the filling but I'm open for other ideas. Not sure we need the fill value configurable but might aswell.

svenklemm · 2017-12-05T22:17:19Z

@daniellee Alternatively I could add an optional 3rd parameter to $__timeGroup() which would be the fill value. Any preferences?
I dont like the $__fill macro variant because it would just be a dangling macro with no real relation to any part of the sql query.

daniellee · 2017-12-06T12:26:43Z

@torkelo as you closed this - should I open a new issue for the fill macro functionality?

daniellee · 2017-12-06T12:29:24Z

Should probably add the fill function for MySQL at the same time. Ref #9487

torkelo · 2017-12-06T15:49:16Z

I closed it as it seemed the problem was more a usage issue and not a bug / feat req

daniellee added the datasource/Postgres label Dec 5, 2017

torkelo closed this as completed Dec 6, 2017

marefr added area/datasource and removed area/datasource labels Mar 30, 2019

wolph mentioned this issue May 1, 2020

Convert $__interval variable for Postgres to a valid SQL interval #24152

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Postgres stacked bar graph panel with $__timeGroup has incorrect graph data #10073

Postgres stacked bar graph panel with $__timeGroup has incorrect graph data #10073

rezolutiontech commented Dec 4, 2017

rezolutiontech commented Dec 4, 2017

svenklemm commented Dec 4, 2017 •

edited

Loading

rezolutiontech commented Dec 4, 2017 •

edited

Loading

svenklemm commented Dec 4, 2017 •

edited

Loading

rezolutiontech commented Dec 4, 2017

daniellee commented Dec 5, 2017

svenklemm commented Dec 5, 2017

svenklemm commented Dec 5, 2017

daniellee commented Dec 6, 2017

daniellee commented Dec 6, 2017

torkelo commented Dec 6, 2017

Postgres stacked bar graph panel with $__timeGroup has incorrect graph data #10073

Postgres stacked bar graph panel with $__timeGroup has incorrect graph data #10073

Comments

rezolutiontech commented Dec 4, 2017

rezolutiontech commented Dec 4, 2017

svenklemm commented Dec 4, 2017 • edited Loading

rezolutiontech commented Dec 4, 2017 • edited Loading

svenklemm commented Dec 4, 2017 • edited Loading

rezolutiontech commented Dec 4, 2017

daniellee commented Dec 5, 2017

svenklemm commented Dec 5, 2017

svenklemm commented Dec 5, 2017

daniellee commented Dec 6, 2017

daniellee commented Dec 6, 2017

torkelo commented Dec 6, 2017

svenklemm commented Dec 4, 2017 •

edited

Loading

rezolutiontech commented Dec 4, 2017 •

edited

Loading

svenklemm commented Dec 4, 2017 •

edited

Loading