Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads #34528

DmitryTsepelev · 2018-11-26T08:50:55Z

First of all, let me explain my use case: I have a heavy JSONB DB column which I want to exclude from most of the queries, that's why I've decided to use ignored_columns feature. This column is populated via DB trigger, and sometimes I do want to have it loaded, in such cases I explicitly load it (for instance, using .select("*")). Also, I have a custom ActiveRecord::Type to make interactions with the underlying JSON easier. The problem is that with this combination the value I get when I call that accessor is a value from the DB, custom type is completely ignored.

Here is a simplier example with a string instead of JSON:

class DeveloperName < ActiveRecord::Type::String
  def deserialize(value)
    "Developer: #{value}"
  end
end

class AttributedDeveloper < ActiveRecord::Base
  self.table_name = "developers"

  attribute :name, DeveloperName.new

  self.ignored_columns += ["name"]
end

developer = AttributedDeveloper.create
developer.update_column :name, "name"

loaded_developer = AttributedDeveloper.where(id: developer.id).select("*").first
puts loaded_developer.name # should be "Developer: name" but it's just "name"

Here is a reason why it happens:

ignored_columns removes the column from the column_hash
#select_all can fill column_types for the result set when DB provides the types (only PostgreSQL does it, so other tests for other DBs cannot reproduce the problem)
when #find_by_sql prepares the ActiveRecord::Result, it uses column_hash to remove all the types coming from the DB because we know how to handle them
Ignored columns are not in the column_hash so #find_by_sql defines the accessor method with the DB-provided type and custom attribute is ignored

My solution is to remove attributes_to_define_after_schema_loads keys from the column_types along with column_hash keys because having the custom attributes means that we do know how to deserialize this field.

rafaelfranca · 2018-11-26T17:22:45Z

What you are seeing is the expected behavior in my opinion. Because you ignored the column Rails should not know how to handle it coming from the database.

DmitryTsepelev · 2018-11-26T17:35:36Z

Thanks for the feedback!

This behaviour seemed confusing to me, because the doc says that "Ignored columns won't have attribute accessors defined", but since we do have the accessor (because of select("*")) I expect it to work :) Also, I believe that behavior should be consistent across the databases - my test passes for MySQL and SQLite

rafaelfranca · 2018-11-26T18:11:06Z

I agree the behavior should be consistent, so we need to understand why with no changes your test pass in MySQL and SQLite and not in PostgreSQL and make it work in PostgreSQL without adding attributes_to_define_after_schema_loads back to column_hash again before removing it from column_type.

rafaelfranca · 2018-11-26T18:24:01Z

activerecord/lib/active_record/querying.rb

@@ -40,7 +40,7 @@ module Querying
    def find_by_sql(sql, binds = [], preparable: nil, &block)
      result_set = connection.select_all(sanitize_sql(sql), "#{name} Load", binds, preparable: preparable)
      column_types = result_set.column_types.dup
-      columns_hash.each_key { |k| column_types.delete k }
+      columns_hash.merge(attributes_to_define_after_schema_loads).each_key { |k| column_types.delete k }


It seems that it is better to always use the connection.schema_cache.columns_hash here. attributes_to_define_after_schema_loads can contain attributes that are not in the database and we use this just to remove from column_types

DmitryTsepelev · 2018-11-26T21:00:01Z

I've traced the code to figure out why this issue does not affect other adapters. DatabaseStatements#exec_query returns ActiveRecord::Result, which accepts columns, rows and column_types to it's initializer. PostgreSQL DatabaseStatements provide all three arguments, while MySQL one - only two, so column_types are always empty in that case. Querying#find_by_sql calls result_set.column_types as a part of instantiation.

rafaelfranca · 2018-11-26T21:03:27Z

Cool! Thank you for investigating. What do you think on my suggestion of always using the column_hash returned by the schema cache on that line?

DmitryTsepelev · 2018-11-26T21:04:29Z

I've added it to my commit, waiting for the tests to pass :) Thanks for the suggestion

…ypes

DmitryTsepelev · 2018-11-27T14:18:12Z

connection.schema_cache.columns_hash works perfectly fine! I only had to add a small warmup setup to the instrumentation test (please let me know if there is a better way to do it) to make sure we have cache initialized before the first test - otherwise, the first event would be about the warmup instead of the desired one. Now all the specs are green

rafaelfranca · 2018-11-27T19:43:15Z

Great work on this

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads

rails-bot bot added the activerecord label Nov 26, 2018

rafaelfranca reviewed Nov 26, 2018

View reviewed changes

DmitryTsepelev force-pushed the fix-ignored-attributes branch from c67ee5d to 0fbdcad Compare November 26, 2018 20:52

DmitryTsepelev force-pushed the fix-ignored-attributes branch 7 times, most recently from f14b765 to 6e531b9 Compare November 27, 2018 12:55

Cached columns_hash fields should be excluded from ResultSet#column_t…

d34c1fc

…ypes

DmitryTsepelev force-pushed the fix-ignored-attributes branch from 6e531b9 to d34c1fc Compare November 27, 2018 13:30

rafaelfranca merged commit ddaca7c into rails:master Nov 27, 2018

rafaelfranca added a commit that referenced this pull request Nov 27, 2018

Merge pull request #34528 from DmitryTsepelev/fix-ignored-attributes

446bc02

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads #34528

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads #34528

DmitryTsepelev commented Nov 26, 2018 •

edited

rafaelfranca commented Nov 26, 2018

DmitryTsepelev commented Nov 26, 2018

rafaelfranca commented Nov 26, 2018 •

edited

rafaelfranca Nov 26, 2018

DmitryTsepelev commented Nov 26, 2018

rafaelfranca commented Nov 26, 2018

DmitryTsepelev commented Nov 26, 2018 •

edited

DmitryTsepelev commented Nov 27, 2018

rafaelfranca commented Nov 27, 2018

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads #34528

Additional types of ResultSet should not contain keys of #attributes_to_define_after_schema_loads #34528

Conversation

DmitryTsepelev commented Nov 26, 2018 • edited

rafaelfranca commented Nov 26, 2018

DmitryTsepelev commented Nov 26, 2018

rafaelfranca commented Nov 26, 2018 • edited

rafaelfranca Nov 26, 2018

Choose a reason for hiding this comment

DmitryTsepelev commented Nov 26, 2018

rafaelfranca commented Nov 26, 2018

DmitryTsepelev commented Nov 26, 2018 • edited

DmitryTsepelev commented Nov 27, 2018

rafaelfranca commented Nov 27, 2018

DmitryTsepelev commented Nov 26, 2018 •

edited

rafaelfranca commented Nov 26, 2018 •

edited

DmitryTsepelev commented Nov 26, 2018 •

edited