Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #77

kminder · 2018-06-26T13:05:58Z

Performance is rather poor for table output format when two conditions occur for the same result set.

The result set has a large number of columns.
The driver being used has a slow implementation of DatabaseMetaData.getPrimaryKeys.

For example testing has shown that for a query with ~100 columns using the HBase Phoenix thin driver the execution time can be cut from ~30 seconds to ~2 seconds by using CSV output format vs table output format. For example: select * from system.catalog;

This is due to how primary keys are detected. Currently the Rows implementation will make a metadata call for every column to determine it is a primary key for display purposes. I propose optimizing this such that a metadata call is only made for each unique table in the result set's columns.

The text was updated successfully, but these errors were encountered:

kminder · 2018-06-26T14:47:14Z

Proposed PR for this here: #78

julianhyde · 2018-09-04T05:53:35Z

Fixed in b14152a, PR #78.

kminder mentioned this issue Jun 26, 2018

SQLLINE-77: Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #78

Closed

julianhyde closed this as completed Sep 4, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #77

Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #77

kminder commented Jun 26, 2018

kminder commented Jun 26, 2018 •

edited

julianhyde commented Sep 4, 2018

Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #77

Poor performance with drivers having slow DatabaseMetaData.getPrimaryKeys impl #77

Comments

kminder commented Jun 26, 2018

kminder commented Jun 26, 2018 • edited

julianhyde commented Sep 4, 2018

kminder commented Jun 26, 2018 •

edited