Fix featureCount on postgres view when flag estimatedmetadata is set #32114

troopa81 · 2019-10-04T10:53:15Z

We can't estimate the number of rows of a Postgres view.

Checklist

Commit messages are descriptive and explain the rationale for changes
Commits which fix bugs include Fixes #11111 at the bottom of the commit message
I have read the QGIS Coding Standards and this PR complies with them
New unit tests have been added for core changes
I have run the scripts/prepare-commit.sh script before each commit
I have evaluated whether it is appropriate for this PR to be backported, backport requests are left as label or comment

troopa81 · 2019-10-04T10:55:07Z

Not sure about the backport to 3.4, though. It's quite an easy fix, so I would say yes.

src/providers/postgres/qgspostgresprovider.cpp

nyalldawson · 2019-10-04T22:01:39Z

What's the consequence of the bug? Only fixes for crashes or data corruption should be backported now.

troopa81 · 2019-10-07T06:21:39Z

What's the consequence of the bug?

If you choose the option "Use estimated table metadata" in the PostGIS connection window and then add a view, the feature count will always be 0. It neither crashes nor corrupts data, so I removed the backport label.

jef-n · 2019-10-23T06:43:32Z

Doesn't this make loading expensive views painfully slow?

strk · 2019-10-23T08:36:44Z

It surely does. I suggest using EXPLAIN and parsing its output to estimate the number of rows returned by a query.

strk · 2019-10-23T08:41:52Z

EXPLAIN could actually also be used for queries, removing another performance bottleneck when the user asks for "estimated" metadata...

strk

I think EXPLAIN should be used for the estimation

src/providers/postgres/qgspostgresprovider.cpp

tests/src/python/test_provider_postgres.py

troopa81 · 2019-10-28T08:31:59Z

It surely does. I suggest using EXPLAIN and parsing its output to estimate the number of rows returned by a query.

Isn't it dangerous to rely on EXPLAIN output to estimate the number of rows? How can we be sure that the output is not going to change in the future, for example that someday rows will be renamed to number_of_rows?

strk · 2019-10-28T10:17:39Z

Isn't it dangerous to rely on EXPLAIN output to estimate the number of rows? How can we be sure that the output is not going to change in the future, for example that someday rows will be renamed to number_of_rows?

EXPLAIN has supported XML output since 9.0 (https://www.postgresql.org/docs/9.0/sql-explain.html).
Should the format change in the future, we'll deal with it. I don't see it as that dangerous.

troopa81 · 2019-10-31T10:15:31Z

EXPLAIN has supported XML output since 9.0 (https://www.postgresql.org/docs/9.0/sql-explain.html).

Thanks for the tip, I was not aware of the EXPLAIN formats. I chose JSON over XML because it seemed easier to write: 7352e3f.
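To illustrate the approach discussed above, here is a minimal Python sketch that reads the planner's row estimate from EXPLAIN (FORMAT JSON) output instead of running COUNT(*). The function name plan_rows_from_explain is hypothetical and the sample string only mimics what PostgreSQL returns; the provider itself is written in C++ and its actual code may differ.

```python
import json
from typing import Optional


def plan_rows_from_explain(explain_json: str) -> Optional[int]:
    """Return the planner's estimated row count, or None if it cannot be read."""
    try:
        doc = json.loads(explain_json)
        # EXPLAIN (FORMAT JSON) yields a one-element array whose object
        # carries the top-level plan node under the "Plan" key.
        return int(doc[0]["Plan"]["Plan Rows"])
    except (ValueError, KeyError, IndexError, TypeError):
        # Malformed JSON or an unexpected layout: report "no estimate".
        return None


# Sample output, shaped like the result of
# EXPLAIN (FORMAT JSON) SELECT 1 FROM some_view
sample = (
    '[{"Plan": {"Node Type": "Seq Scan", "Relation Name": "contact", '
    '"Plan Rows": 510, "Plan Width": 4}}]'
)
print(plan_rows_from_explain(sample))
```

Reading a named key from the structured output, rather than scraping the text format, is what makes this less fragile than the concern raised earlier in the thread.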

jef-n · 2019-10-31T19:51:42Z

Should be guarded by connectionRO()->pgVersion() >= 90200, like the other occurrences of json. Also, the featureCount is unreliable with "estimated metadata" anyway, and that option is widely used to avoid expensive queries. I think we should just return -1 instead of 0 if there's no shortcut like EXPLAIN.

troopa81 · 2019-11-04T08:13:12Z

should be guarded by connectionRO()->pgVersion() >= 90200

OK, I'll add it (with 90000, since this feature was already available in 9.0).

I think we should just return -1 instead of 0, if there's no shortcut like EXPLAIN

And display -1 in the UI, or "Unknown"?

troopa81 · 2019-11-07T13:44:00Z

I left the feature count display as it is, so if the estimate doesn't work, it will display -1.
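The guard and fallback agreed on here can be sketched as follows. This is only an illustration in Python under stated assumptions: estimated_feature_count and fetch_plan_rows are hypothetical names, the threshold 90000 is the value mentioned in this thread, and the real provider code is C++.

```python
from typing import Callable

# PostgreSQL 9.0, the version cited in this thread for EXPLAIN output formats.
MIN_EXPLAIN_FORMAT_VERSION = 90000


def estimated_feature_count(pg_version: int,
                            fetch_plan_rows: Callable[[], int]) -> int:
    """Return a cheap row estimate, or -1 when none is available."""
    if pg_version < MIN_EXPLAIN_FORMAT_VERSION:
        # Server too old for EXPLAIN (FORMAT JSON): count is unknown.
        return -1
    try:
        rows = fetch_plan_rows()
    except Exception:
        # Any failure while estimating also means "unknown", not 0.
        return -1
    return rows if rows >= 0 else -1


print(estimated_feature_count(90600, lambda: 510))
print(estimated_feature_count(80400, lambda: 510))
```

Returning -1 rather than 0 lets callers tell "unknown" apart from "empty".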

troopa81 · 2019-11-14T14:07:38Z

@strk are the requested changes OK now?

strk

Thanks, looks fine here

haubourg · 2019-11-21T17:20:42Z

Can anyone merge this?

haubourg · 2019-11-21T19:28:56Z

thanks @elpaso !

Gustry · 2020-11-11T11:49:49Z

@troopa81 I understand why you made this change for views; it was indeed correct.
But why didn't you keep the previous behavior for tables?

Plan Rows gives quite different results and does not follow the VACUUM ANALYSE that people might run on their database.

Look at this example:

EXPLAIN (FORMAT JSON) SELECT 1 FROM pgmetadata.contact
-- QUERY PLAN [{'Plan': {'Node Type': 'Seq Scan', 'Parallel Aware': False, 'Relation Name': 'contact', 'Alias': 'contact', 'Startup Cost': 0.0, 'Total Cost': 15.1, 'Plan Rows': 510, 'Plan Width': 4}}]
-- > 'Plan Rows': 510

VACUUM ANALYSE pgmetadata.contact;
SELECT reltuples as approximate_row_count  FROM pg_class WHERE oid = 'pgmetadata.contact'::regclass;
-- 0.0

SELECT COUNT(*) FROM pgmetadata.contact;
-- 0

EXPLAIN (FORMAT JSON) SELECT 1 FROM pgmetadata.contact;
-- Still 510

troopa81 · 2020-11-12T10:42:08Z

@Gustry But why didn't you keep the previous behavior for tables?

Because it would cost an extra request to check whether mQuery is a view or a table (querying relkind).

But I can reproduce your issue when all the data is removed from the table. I'll try to find some time to fix this.
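For illustration, the trade-off described here (one extra relkind lookup in exchange for using reltuples on plain tables) could look like the Python sketch below. The helper name estimation_query is hypothetical, the relation string is assumed to be a trusted, already-quoted identifier, and treating only relkind 'r' as a plain table is a simplifying assumption; this is not the provider's actual code.

```python
# The extra round trip mentioned above: look up the relation kind once.
RELKIND_QUERY = "SELECT relkind FROM pg_class WHERE oid = %s::regclass"


def estimation_query(relation: str, relkind: str) -> str:
    """Pick the SQL used to estimate the row count of `relation`.

    `relation` must be a trusted, already-quoted identifier; this sketch
    does no escaping of its own.
    """
    if relkind == "r":
        # Ordinary table: statistics kept by VACUUM/ANALYZE are available.
        return f"SELECT reltuples FROM pg_class WHERE oid = '{relation}'::regclass"
    # Views (relkind 'v') and anything else: ask the planner instead.
    return f"EXPLAIN (FORMAT JSON) SELECT 1 FROM {relation}"


print(estimation_query("pgmetadata.contact", "r"))
print(estimation_query("pgmetadata.contact", "v"))
```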

Gustry · 2020-11-12T12:48:21Z

Because it would cost an extra request to check whether mQuery is a view or a table

Hmm, I thought we would already have this info in the context.

But I can reproduce your issue when all the data is removed from the table.

My tables are indeed nearly all empty for now.

But I tried with a single row, and it's still wrong.

SELECT COUNT(*) FROM pgmetadata.contact;
-- 1

EXPLAIN (FORMAT JSON) SELECT 1 FROM pgmetadata.contact;
-- Still 510

Should I create a ticket?

troopa81 · 2020-11-12T15:58:54Z

Hmm, I thought we would already have this info in the context.

It could be; I don't see any reason to retrieve this information several times.

But I tried with a single row, and it's still wrong.

My guess is that the Plan Rows approach is more approximate than the reltuples one. I tried with a table containing 3e+06 rows and then removed all rows: reltuples said 0 while Plan Rows said 300. Both changed, and both are kind of valid as estimates, but reltuples is closer to the truth :)

Should I create a ticket?

Yes, please

troopa81 added the Bug and backport release-3_4 labels Oct 4, 2019

mhugo reviewed Oct 4, 2019

src/providers/postgres/qgspostgresprovider.cpp (outdated)

ponceta reviewed Oct 4, 2019

src/providers/postgres/qgspostgresprovider.cpp (outdated)

troopa81 removed the backport release-3_4 label Oct 7, 2019

nyalldawson added the Frozen label Oct 13, 2019

rldhont added the Needs Backporting label Oct 22, 2019

rldhont added this to the 3.12 milestone Oct 22, 2019

rldhont mentioned this pull request Oct 22, 2019

[Release-3_4]Fix featureCount on postgres view when flag estimatedmetadata is set #32344

Closed

6 tasks

strk requested changes Oct 23, 2019

src/providers/postgres/qgspostgresprovider.cpp (outdated)

tests/src/python/test_provider_postgres.py (outdated)

nyalldawson added the backport release-3_10 label and removed the Frozen label Oct 25, 2019

troopa81 force-pushed the fix_feature_count_estimated_metadata branch from 2a47c63 to 7352e3f Compare October 31, 2019 10:12

troopa81 force-pushed the fix_feature_count_estimated_metadata branch from 7352e3f to f3705ab Compare October 31, 2019 15:39

troopa81 added 2 commits November 7, 2019 14:21

Fix featureCount on postgres view when flag estimatedmetadata is set

be4c4d3

Add pg version guard and test on estimated count for view

5f43b3f

troopa81 force-pushed the fix_feature_count_estimated_metadata branch from f3705ab to 5f43b3f Compare November 7, 2019 13:41

strk approved these changes Nov 14, 2019

elpaso merged commit 8913fb3 into qgis:master Nov 21, 2019

troopa81 mentioned this pull request Nov 22, 2019

Backport Fix featureCount on postgres view when estimatedmetada #33019

Merged

6 tasks

wonder-sk mentioned this pull request Jun 22, 2020

[postgres] Wrong feature counts when using estimated metadata #37342

Closed

troopa81 mentioned this pull request Jul 6, 2020

Manage Postgres parallel plans when estimating row count #37619

Merged

Gustry mentioned this pull request Nov 18, 2020

Estimated row count from PostgresSQL might quite wrong #40162

Closed

troopa81 mentioned this pull request Feb 2, 2021

Record count regression on views #41188

Open
