Don't skip some columns in `column_types` on Postgres #42933

ghiculescu · 2021-08-02T23:33:37Z

Fixes #41651, by partially reverting #39097. I just reverted the line that @terracatta highlighted as problematic.

I ran the same benchmark as #39097 and it seems like this change does not cause a perf regression.

```ruby
require "bundler/inline"

gemfile(true) do
  source "https://rubygems.org"

  git_source(:github) { |repo| "https://github.com/#{repo}.git" }

  gem "rails", path: "/Users/alex/code/rails" # github: "rails/rails", branch: "main"
  gem "sqlite3"
  gem "benchmark-ips"
end

require "active_record"

ActiveRecord::Base.establish_connection(adapter: "sqlite3", database: ":memory:")

ActiveRecord::Schema.define do
  create_table :active_storage_blobs do |t|
    t.string   :key,          null: false
    t.string   :filename,     null: false
    t.string   :content_type
    t.text     :metadata
    t.string   :service_name, null: false
    t.bigint   :byte_size,    null: false
    t.string   :checksum,     null: false
    t.datetime :created_at,   null: false

    t.index [ :key ], unique: true
  end
end

class ActiveStorageBlob < ActiveRecord::Base
end

Benchmark.ips do |x|
  x.report("find_by") { ActiveStorageBlob.find_by(id: 1) }
end
```

This branch:

```
Warming up --------------------------------------
             find_by     1.940k i/100ms
Calculating -------------------------------------
             find_by     17.928k (± 4.8%) i/s -     91.180k in   5.098301s
```

Main:

```
Warming up --------------------------------------
             find_by     1.912k i/100ms
Calculating -------------------------------------
             find_by     17.961k (± 4.8%) i/s -     89.864k in   5.015252s
```

So it's basically the same. cc @terracatta @boblail

ghiculescu · 2021-08-03T19:42:08Z

activerecord/test/cases/calculations_test.rb

+    expected = if current_adapter?(:PostgreSQLAdapter)
+      # Postgres returns the same name for each column in the given query, so each column is named "coalesce"
+      # As a result Rails cannot accurately type cast each value.
+      # To work around this, you should use aliases in your select statement (see test_pluck_functions_with_alias).


This is a bit yucky, but it only happens in very specific cases: when you select two (or more) columns that don't have an alias and use the same SQL function (e.g. COALESCE). This happens because of how Postgres returns column names; see #36186 (comment) for a summary.
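To make the collision concrete, here is a minimal sketch in plain Ruby. It assumes, as a simplification, that the type map is a hash keyed by column name. `build_column_types` and the sample column lists are hypothetical names made up for illustration, not the Rails implementation or its API.

```ruby
# Hypothetical, simplified model of a name-keyed type map.
# This is NOT the Rails implementation; it only illustrates the collision.
def build_column_types(columns)
  columns.each_with_object({}) do |(name, type), types|
    # A later column with the same name overwrites the earlier entry.
    types[name] = type
  end
end

# Two unaliased COALESCE(...) expressions: Postgres names both columns "coalesce".
unaliased = [["coalesce", :integer], ["coalesce", :string]]

# The same expressions with explicit aliases (alias names are made up).
aliased = [["first_value", :integer], ["second_value", :string]]

puts build_column_types(unaliased).size # 1: only one type survives
puts build_column_types(aliased).size   # 2: each column keeps its own type
```

With distinct aliases every column gets its own entry, which is why aliasing works around the problem.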

As far as I can tell we have a few options:

1. The current state, which results in the bug reported in #41651 ("ActiveRecord::Result#column_types is often missing data").

2. This PR, which fixes #41651 but introduces this inconsistency between databases.

3. Implement the refactor in #36186 ("Avoid coercing all SELECTs with the same column name to the same value"), which adds a fair bit of overhead to every query for a relatively niche issue.

In my opinion this PR is the least bad option.


rails-bot · 2021-12-14T15:21:55Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
Thank you for your contributions.

Don't skip some columns in `column_types` on Postgres

rails-bot bot added the activerecord label Aug 2, 2021

ghiculescu force-pushed the postgres-column-types branch from 1df9439 to a8e51c6 Compare August 2, 2021 23:38

ghiculescu requested a review from kamipo August 2, 2021 23:38

ghiculescu force-pushed the postgres-column-types branch from a8e51c6 to 2afda52 Compare August 3, 2021 19:37


ghiculescu force-pushed the postgres-column-types branch from 2afda52 to 05c7090 Compare September 3, 2021 19:48

ghiculescu force-pushed the postgres-column-types branch from 05c7090 to 587522e Compare September 15, 2021 14:52

rails-bot bot added the stale label Dec 14, 2021

rails-bot bot closed this Dec 21, 2021

rafaelfranca reopened this Sep 16, 2022

rails-bot bot removed the stale label Sep 16, 2022

rafaelfranca merged commit 26ceb7f into rails:main Sep 16, 2022

rafaelfranca added a commit that referenced this pull request Sep 16, 2022

Merge pull request #42933 from ghiculescu/postgres-column-types

81295f9

Don't skip some columns in `column_types` on Postgres

fatkodima mentioned this pull request Aug 2, 2023

Setting a postgres timestamptz attribute sets the value's timezone to UTC #48346

Closed

Flixt mentioned this pull request Aug 26, 2023

Fix column typecasting with duplicate column names for PostgreSQL #49043

Open

4 tasks
