Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release notes for 405 #15058

Closed
martint opened this issue Nov 16, 2022 · 58 comments · Fixed by #15139
Closed

Release notes for 405 #15058

martint opened this issue Nov 16, 2022 · 58 comments · Fixed by #15139
Milestone

Comments

@martint
Copy link
Member

martint commented Nov 16, 2022

No description provided.

@martint martint added this to the 404 milestone Nov 16, 2022
@martint martint pinned this issue Nov 16, 2022
@ebyhr
Copy link
Member

ebyhr commented Nov 16, 2022

# BigQuery
* Add support for retrying to create a BigQuery read session. ({issue}`15013`)

#15013

@ebyhr
Copy link
Member

ebyhr commented Nov 16, 2022

# General
* Improve performance of specific queries which compare table columns of type 
  `date` with `timestamp with time zone` literals. ({issue}`5798`)

#14945

@ebyhr
Copy link
Member

ebyhr commented Nov 17, 2022

# Google Sheets
* Rename config properties from `credentials-path` to `gsheets.credentials-path`, 
  from `metadata-sheet-id` to `gsheets.metadata-sheet-id`, 
  from `sheets-data-max-cache-size` to `gsheets.max-data-cache-size`, 
  from `sheets-data-expire-after-write` to `gsheets.data-cache-ttl`. ({issue}`15042`)

#15042

@hashhar
Copy link
Member

hashhar commented Nov 18, 2022

# PostgreSQL
* Prevent query failures when a `FULL JOIN` is pushed down to PostgreSQL connector. ({issue}`14841`)

#14841 (maybe #14929 is the better link)

@ebyhr
Copy link
Member

ebyhr commented Nov 19, 2022

# Delta Lake
* Improve performance and storage requirements when running the `vacuum` procedure on S3-compatible storage. ({issue}`15072`)

#15072

@raunaqmorarka
Copy link
Member

raunaqmorarka commented Nov 20, 2022

# Hive, Iceberg, Delta, Hudi
* Improve performance of reading Parquet files for boolean, tinyint, short, int, long, float, double and short decimal data types. The catalog configuration property `parquet.optimized-reader.enabled` can be set to `false` to disable the optimized implementation. ({issue}`14423`)

#14423 #15308

@sopel39
Copy link
Member

sopel39 commented Nov 21, 2022

General
* Improve performance and resource utilization of inserts into partitioned and bucketed tables. ({issue}`14718`)

#14718

@hashhar
Copy link
Member

hashhar commented Nov 21, 2022

# BigQuery
* Fixed an issue where queries would get executed against the project id specified in the credentials
  instead of the project id specified in `bigquery.project-id` catalog property. ({issue}`14083`)

#14083 #14961

@ebyhr
Copy link
Member

ebyhr commented Nov 22, 2022

# Hive
* Fix failure when access denied exceptions happened during listing tables or views in a Glue. ({issue}`14746`)

#14998

@huberty89
Copy link
Contributor

# Security
* Check access to catalog and schema during execution of USE statement. ({issue}`14208`)

#14209

@huberty89
Copy link
Contributor

# Security
* Filter out values from system.metadata tables to which user has no access ({issue}`14000`)

#14197

@raunaqmorarka
Copy link
Member

raunaqmorarka commented Nov 22, 2022

# Hive, Iceberg, Delta, Hudi
* Improve performance of reading parquet files for column of type UUID, time, decimal, varchar and char. ({issue}`14667`)

#14667 #15308

@raunaqmorarka
Copy link
Member

# Delta
* Improve CBO estimates when the nulls fraction statistic is not available for some columns. ({issue}`15132`)

#15132

@ebyhr
Copy link
Member

ebyhr commented Nov 24, 2022

# Delta Lake
* Allow registering existing table files in the metastore with the new
  [`register_table` procedure](delta-lake-register-table). ({issue}`13568`)
* Deprecate creating a new table with existing table content.
  This can be enabled using the `delta.legacy-create-table-with-existing-location.enabled`
  config property or `legacy_create_table_with_existing_location_enabled` session property. ({issue}`13568`)

#14779

@sopel39
Copy link
Member

sopel39 commented Nov 24, 2022

Hive:
* Improve performance for queries which write data into multiple existing partitions. ({issue}`15066`)

#15066

@ebyhr
Copy link
Member

ebyhr commented Nov 24, 2022

# Hive
* Store query ID when creating a new schema. ({issue}`15174`)
* Fix incorrect `schema already exists` error caused by a client timeout when
  creating a new schema. ({issue}`15174`)

#15177

@ebyhr
Copy link
Member

ebyhr commented Nov 24, 2022

# MongoDB
* Upgrade minimum required MongoDB version to 4.2. ({issue}`15062`)
* Add support for [fault-tolerant execution](/admin/fault-tolerant-execution). ({issue}`15062`)

#15062

@findepi
Copy link
Member

findepi commented Nov 25, 2022

Iceberg
* Fix `REFRESH MATERIALIZED VIEW` failure when the materialized view is based on non-Iceberg tables. ({issue}`13131`)

#13131 #15108

@findepi
Copy link
Member

findepi commented Nov 25, 2022

* Improve CBO estimates when the nulls fraction statistic is not available for some columns. ({issue}`15132`)

@raunaqmorarka let's phrase it in the view of end-user benefit.
e.g. is it improving query performance?

@raunaqmorarka
Copy link
Member

* Improve CBO estimates when the nulls fraction statistic is not available for some columns. ({issue}`15132`)

@raunaqmorarka let's phrase it in the view of end-user benefit.
e.g. is it improving query performance?

How about
"Improve query performance when nulls fraction statistic is not available for some columns" ?

@sopel39
Copy link
Member

sopel39 commented Nov 28, 2022

General:
* Fix incorrect query results where unpartitioned `WINDOW` operator is followed by partitioned
  and ordered `WINDOW` operators. {(issue)`15203`)

#15203

@ebyhr
Copy link
Member

ebyhr commented Nov 29, 2022

# MongoDB
* Delete a MongoDB field from collections when dropping a column. 
  Previously, the connector deleted only metadata. ({issue}`15226`)

#15227

@ebyhr
Copy link
Member

ebyhr commented Nov 30, 2022

# MongoDB
* Rename config properties from `mongodb.ssl.enabled` to `mongodb.tls.enabled`. ({issue}`15240`)
* Add support for setting file path and password for truststore and keystore. ({issue}`15240`)

#15240

@findepi
Copy link
Member

findepi commented Nov 30, 2022

General
* Provide an optional limit for `query_max_run_time` session property values.
  This can be set using `query.max-run-time.hard-limit` configuration property. ({issue}`15115`)

#15115

@findepi
Copy link
Member

findepi commented Nov 30, 2022

JDBC
* Return correct values in NULLABLE column of `DatabaseMetaData.getColumns` result. ({issue}`15214`)

#15214

@sopel39
Copy link
Member

sopel39 commented Nov 30, 2022

# Hive, Iceberg, Delta
* Improve performance when reading Parquet files. ({issue}`15257`)

#15257

@ebyhr
Copy link
Member

ebyhr commented Dec 5, 2022

# MongoDB
* Fix `NullPointerException` when a column name contains uppercase characters in `query` table function. ({issue}`15294`)

#15294

@ebyhr
Copy link
Member

ebyhr commented Dec 6, 2022

# General
* Fix incorrect results when adding or subtracting `interval` from `timestamp with time zone` type. ({issue}`15103`)

#15290

@ebyhr
Copy link
Member

ebyhr commented Dec 7, 2022

# Google Sheets
* Add support for setting read timeout using the `gsheets.read-timeout` config property. ({issue}`15322`)

#15322

@ebyhr
Copy link
Member

ebyhr commented Dec 8, 2022

# Delta Lake
* Add support for views. ({issue}`11609`)

#11763

@sopel39
Copy link
Member

sopel39 commented Dec 8, 2022

Hive:
* Improve performance for queries which write data into multiple partitions. ({issue}`15241`)

#15241

Merge with previous one with same name

@sopel39
Copy link
Member

sopel39 commented Dec 8, 2022

General:
* Print Trino version in EXPLAIN and EXPLAIN ANALYZE. ({issue}`15317`)

#15317

@findepi
Copy link
Member

findepi commented Dec 9, 2022

General
* Improve query planning performance of queries with many `GROUP BY` clauses. ({issue}`15292`)

#15292

@findepi
Copy link
Member

findepi commented Dec 9, 2022

Hive, Iceberg, Delta
* Improve `DROP TABLE` performance for tables stored on AWS S3. ({issue}`13974`)

#13974

Delta
* Fix `DROP TABLE` leaving files behind for managed tables stored on S3 and created by Databricks runtime. ({issue}`13017`)

#13017

@raunaqmorarka
Copy link
Member

# General
* Fix recording of `Projection CPU time` in EXPLAIN ANALYZE VERBOSE. ({issue}`15364`)

#15364

@ebyhr
Copy link
Member

ebyhr commented Dec 13, 2022

# Delta Lake
* Fix failure when the path contains special characters. ({issue}`15183`)

#15201

@ebyhr
Copy link
Member

ebyhr commented Dec 15, 2022

# General
* Allow configuring a refresh interval for DB resource group manager with the `resource-groups.refresh-interval` config property. ({issue}`14514`)

#15389

@sopel39
Copy link
Member

sopel39 commented Dec 15, 2022

General
* Improve query performance for large clusters and skewed queries. ({issue}`15369`)

#15369

@ebyhr
Copy link
Member

ebyhr commented Dec 16, 2022

# Iceberg
* Fix incorrect results when predicates over `row` column on Parquet files are pushed into the connector. ({issue}`15408`)

#15408

@sopel39
Copy link
Member

sopel39 commented Dec 16, 2022

General:
* Show task input/output size distribution in `EXPLAIN ANALYZE VERBOSE`. ({issue}`15286`)
* Display stage skewness warnings in `EXPLAIN ANALYZE`. ({issue}`15286`)

#15286

@raunaqmorarka
Copy link
Member

# Hive, Delta, Iceberg, Hudi
* Improve performance of reading parquet files for timestamp and timestamp with timezone data types. ({issue}`15204`)

#15204

@ebyhr
Copy link
Member

ebyhr commented Dec 19, 2022

# MongoDB
* Fix potential incorrect results when `objectid` function is used more than once. ({issue}`15426`)

#15426

@ebyhr
Copy link
Member

ebyhr commented Dec 20, 2022

# MongoDB
* Support case insensitive name matching in `query` table function. ({issue}`15329`)

#15329

@hashhar
Copy link
Member

hashhar commented Dec 21, 2022

# BigQuery
* Support using Arrow serialization when reading from BigQuery. This is an experimental feature and can be enabled by setting the catalog configuration property `bigquery.experimental.arrow-serialization.enabled` to `true`. ({issue}`14972`)

#14972

@sopel39
Copy link
Member

sopel39 commented Dec 21, 2022

General:
* Improve performance of selective queries, queries that read small number of
  columns and queries that proces tables with large Parquet row groups or Orc stripes.
  Config property `node-scheduler.max-pending-splits-per-task` got renamed to
  `node-scheduler.min-pending-splits-per-task` and new config property
  `node-scheduler.max-adjusted-pending-splits-per-task` is added to control
  behavior of new optimization. ({issue}`15168`)
  

#15168

@ebyhr
Copy link
Member

ebyhr commented Dec 21, 2022

# General
* Reset to the initial time zone of the client session when executing `SET TIME ZONE LOCAL`. ({issue}`15314`)

#15341

@ebyhr
Copy link
Member

ebyhr commented Dec 22, 2022

# Google Sheets
* Add support for base64 encoded credentials using the `gsheets.credentials-key` config property. ({issue}`15477`)

#15477

@ebyhr
Copy link
Member

ebyhr commented Dec 22, 2022

# Hive
* Fix failure when reading Hive views with `char` types. ({issue}`15470`)

#15470

@sopel39
Copy link
Member

sopel39 commented Dec 22, 2022

# Hive/Iceberg/Delta
* Improve resilience of insert queries. ({issue}`15476`)

#15476

@ebyhr
Copy link
Member

ebyhr commented Dec 23, 2022

# General
* Add support for `ALTER COLUMN ... SET DATA TYPE` statement. ({issue}`11608`)

#11608

@raunaqmorarka
Copy link
Member

# Hive, Delta, Hudi, Iceberg
* Avoid large memory allocations in parquet reader by limiting the maximum size of reads from file. This improves stability and reduces peak memory requirements. The catalog configuration property `parquet.max-buffer-size` can be used to change the maximum size of reads performed by the parquet reader from the default value of 8MB. ({issue}`15374`)
* Fix query failures when reading parquet files with large row groups. ({issue}`5729`)

#15374 #5729

@martint martint changed the title Release notes for 404 Release notes for 405 Dec 30, 2022
@martint martint unpinned this issue Dec 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging a pull request may close this issue.

8 participants