Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3285

CharlesLgn · 2024-06-14T08:59:50Z

In Postgres, if we have a collumn stored as a "numeric", but store "double in it", the execution plan will not be optimized.

Driver Version

postgresql-42.7.3

Java Version

JDK 21

To Reproduce

Create a database with numeric column

create table INVOICE (
    id varchar(25) not null constraint pk_invoice primary key,
    amount numeric,
    contract_id varchar(25) not null
);

create table CONTRACT (
    id varchar(25) not null constraint pk_contract primary key,
    reference varchar(25)
);

use the request : select id from invoice i join contract c on c.id = i.contract_id where i.amount between ? and ?

When we use a PreparedStatement, the system should use an implicite cast as a numeric to calculate the execution plan, but it used ('the value')::double precision, so the execution plan could be false leading to performance issue (in my case, we have 8 million + invoice in the database, and the prepared request take 3 sec to execute)

The problème is generate by this change : 06abfb7

Now, the executed request for data -10 and 3.4 will be select id from invoice i join contract c on c.id = i.contract_id where i.amount between ('-10'::double precision) and ('3.4'::double precision).

As I understand, this modification is due to a security problème with thing like -? that could generate son SQL injection if the value is a negative number.

Correction proposition

Instead of using a cast, only put the data between parenthesis for number, so we have -? becomming -(-2.548) istead of -('-2.548'::double precision)

Fixes #3284

…ecution plan calculation

vlsi · 2024-06-14T12:17:16Z

When we use a PreparedStatement, the system should use an implicite cast as a numeric to calculate the execution plan

Please clarify why do you think so.
When you use PreparedStatement, the driver passes the value according to the set... method.
For instance, the execution plan might differ if you use setDouble(1, ...) vs setBigDecimal(1, ..).

I think the PR is invalid, and we should rather add a documentation sample that highlights PostgreSQL behavior.

CharlesLgn · 2024-06-14T15:06:10Z

Please clarify why do you think so.

before this two changes : 06abfb7 ; 93b0fcb

The behavior was different regarding number object value in PreparedStatement in 01-2024.

but, to go further, here is my request (simplified regarding some company privacy):

SELECT  Facture.ID, Facture.REFERENCE, Facture.MONTANTTTC, (
  SELECT Acteur.NAME || ' ' || CASE WHEN Acteur.LASTNAME IS NULL THEN '' ELSE Acteur.LASTNAME END
  FROM Acteur
  WHERE Contrat.Acteur_ID = Acteur.ID), Contrat.REFERENCE, Offre.LIBELLE
FROM Facture
  join Contrat on Facture.CONTRAT_ID = Contrat.ID
  join Offre on Contrat.OFFREPRODUIT_ID = Offre.ID
where Facture.MONTANTTTC between ? and ?

This request took less than 1ms in the last pgjdbc version. now it took more than 3 seconds.

In the object, montantTTC is a double, so we use st.setDouble(....
Now we should use st.setBigDecimal([...], BigDecimal.valueOf(...)). We must do the same things in our batchs, leading to billion conversion object,

Moreover, if I use a query tool like DBeaver/DataGrip/..., this conversion does not have this contrainst to cast the value.

here are the explained plan :

from the application using pgjdbc : PE_REQ_RECH_FACT_MONTANT_appli.txt
from dbeaver : PE_REQ_RECH_FACT_MONTANT_dbeaver.txt

davecramer · 2024-06-15T21:56:37Z

Are you requesting SimpleQuery mode when executing this ?

CharlesLgn · 2024-06-17T06:30:41Z

Are you requesting SimpleQuery mode when executing this ?

In fact I am not. so as I understand, and as @vlsi said in the issue post :

There's no way driver could do about it, and if it worked previously, it was a pure luck 🤷‍♂️

Sorry to have created a PR for no reason 😓

Do not cast a number value to in a prepared request to avoid wrong ex…

a9b4392

…ecution plan calculation

CharlesLgn changed the title ~~Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3284~~ Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" Jun 14, 2024

CharlesLgn closed this Jun 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3285

Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3285

CharlesLgn commented Jun 14, 2024 •

edited

Loading

vlsi commented Jun 14, 2024

CharlesLgn commented Jun 14, 2024 •

edited

Loading

davecramer commented Jun 15, 2024

CharlesLgn commented Jun 17, 2024

Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3285

Postgres explain plan is not correctly optimezed if the data is a "double" but the collumn is a "numeric" #3285

Conversation

CharlesLgn commented Jun 14, 2024 • edited Loading

Driver Version

Java Version

To Reproduce

Correction proposition

vlsi commented Jun 14, 2024

CharlesLgn commented Jun 14, 2024 • edited Loading

davecramer commented Jun 15, 2024

CharlesLgn commented Jun 17, 2024

CharlesLgn commented Jun 14, 2024 •

edited

Loading

CharlesLgn commented Jun 14, 2024 •

edited

Loading