New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add #create_or_find_by to lean on unique constraints #31989
Conversation
This is really good. I have multiple use cases for this. 👍 |
def create_or_find_by!(attributes, &block) | ||
create!(attributes, &block) | ||
rescue ActiveRecord::RecordNotUnique | ||
find_by(attributes) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should raise in the fallback find
case? It'd be surprising to ever get a nil
result (for whatever reason, including a missing unique index) from these methods.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, good point. Let's change that to find_by!.
Better than nil.
Subscriber.create_or_find_by(nick: "bob", name: "the cat") | ||
end | ||
end | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is missing a test with create_with
(to change the behavior of the create
) and with where
(to change the behavior of find!
I'm not sure if those method should be used with create_or_find_by
but right now they can and I have no idea of the effect they would have, so better to test them to make sure it is what we expect.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't see how that differs? create_or_find_by
uses the same flow as find_or_create_by
, except instead of an ||
we're using a rescue
. But the flow is the same.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mean, it's not that we can't test it, just that I don't think it teaches us anything interesting. The test would be identical to the one for test_find_or_create_by_with_create_with
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If it is the same behavior we want so I guess it is fine being tested in that test
# matching record, which will then return nil, rather than a record will the given attributes. | ||
# * It relies on exception handling to handle control flow, which may be marginally slower. And | ||
# | ||
# This method will always returns a record if all given attributes are covered by unique constraints, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This isn't totally true: another client could update or delete the row between the rejected INSERT and the subsequent SELECT. This race condition is complementary to the one in #find_or_create_by
.
That caveat probably belongs in the "drawbacks" section, but even still I don't think we can make this claim here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. Will add that point. It's a much rarer race condition in many apps, I'd say.
# #create returns in such situation. | ||
def create_or_find_by(attributes, &block) | ||
create(attributes, &block) | ||
rescue ActiveRecord::RecordNotUnique |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs transaction(requires_new: true) do
around the create
to work in an ongoing surrounding transaction (on at least PostgreSQL)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this not necessary for find_or_create_by?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because it doesn't cause an SQL error and then attempt to recover. PostgreSQL remembers when an error has occurred inside a transaction, and disallows all further operations until that transaction has been rolled back.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we are willing to require PG 9.5 or later for this (when using the PG adapter), we could just stick ON CONFLICT DO NOTHING
on the end instead. This would let us wrap the whole thing in a transaction instead of just the create, which eliminates any possibility of race conditions.
def create_or_find_by!(attributes, &block) | ||
create!(attributes, &block) | ||
rescue ActiveRecord::RecordNotUnique | ||
find_by!(attributes) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to retry
here instead of raising if the find fails? Seems confusing for create_or_find_by!
to raise RecordNotFound, just because of a racing delete.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can't retry this failure as it should only occur when you use attributes where not all of them are covered by unique constraints. It's more like a INVALID QUERY kind of failure.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh right, I was assuming the delete race, rather than operator error. I guess we could retry once, so we're safe from a poorly-timed simple delete. That way we'd only be tripped up by a racing delete; insert; delete
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
True. I'm not sure whether having just a single retry is a good warranty, though. Since it's hard for us to tell the difference between legit race condition and invalid query.
TIL! Do you have an idea for a good test for that?
…On Tue, Feb 13, 2018 at 4:49 PM, Matthew Draper ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In activerecord/lib/active_record/relation.rb
<#31989 (comment)>:
> + #
+ # There are several drawbacks to #create_or_find_by, though:
+ #
+ # * The underlying table must have the relevant columns defined with unique constraints.
+ # * A unique constraint violation may be triggered by only one, or at least less than all,
+ # of the given attributes. This means that the subsequent #find_by may fail to find a
+ # matching record, which will then raise an `ActiveRecord::NotFound` exception,
+ # rather than a record will the given attributes.
+ # * It relies on exception handling to handle control flow, which may be marginally slower. And
+ #
+ # This method will always returns a record if all given attributes are covered by unique constraints,
+ # but if creation was attempted and failed due to validation errors it won't be persisted, you get what
+ # #create returns in such situation.
+ def create_or_find_by(attributes, &block)
+ create(attributes, &block)
+ rescue ActiveRecord::RecordNotUnique
Because it doesn't cause an SQL error and then attempt to recover.
PostgreSQL remembers when an error has occurred inside a transaction, and
disallows all further operations until that transaction has been rolled
back.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#31989 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AAAKtbTMH-JrUyizELV10shNLxOgesnAks5tUi2lgaJpZM4SElV8>
.
|
I think this should do it: def test_create_or_find_by_within_transaction
assert_nil Subscriber.find_by(nick: "bob")
subscriber = Subscriber.create!(nick: "bob")
Subscriber.transaction do
assert_equal subscriber, Subscriber.create_or_find_by(nick: "bob")
assert_not_equal subscriber, Subscriber.create_or_find_by(nick: "cat")
end
end |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Documentation edits.
# * The underlying table must have the relevant columns defined with unique constraints. | ||
# * A unique constraint violation may be triggered by only one, or at least less than all, | ||
# of the given attributes. This means that the subsequent #find_by may fail to find a | ||
# matching record, which will then raise an `ActiveRecord::NotFound` exception, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
`ActiveRecord::NotFound`
needs <tt>
for Rdoc code formatting, not backticks.
# * A unique constraint violation may be triggered by only one, or at least less than all, | ||
# of the given attributes. This means that the subsequent #find_by may fail to find a | ||
# matching record, which will then raise an `ActiveRecord::NotFound` exception, | ||
# rather than a record will the given attributes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
will
=> with
# Attempts to create a record with the given attributes in a table that has a unique constraint | ||
# on one or several of its columns. If a row already exists with one or several of these | ||
# unique constraints, the exception such an insertion would normally raise is caught, | ||
# and the existing record with those attributes is sought found using #find_by. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is sought found using #find_by
— "sought" should be deleted.
# and the existing record with those attributes is sought found using #find_by. | ||
# | ||
# This is similar to #find_or_create_by, but avoids the problem of stale reads between the SELECT | ||
# and the INSERT, as that methods needs to first query the table, then attempt to insert a row |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
methods
=> method
Awesome! |
Good stuff. |
Happy to explore that as a PG 9.5 level-up somehow. But the whole feature shouldn’t depend on it imo.
… On Feb 21, 2018, at 12:47, Sean Griffin ***@***.***> wrote:
@sgrif commented on this pull request.
In activerecord/lib/active_record/relation.rb:
> + #
+ # There are several drawbacks to #create_or_find_by, though:
+ #
+ # * The underlying table must have the relevant columns defined with unique constraints.
+ # * A unique constraint violation may be triggered by only one, or at least less than all,
+ # of the given attributes. This means that the subsequent #find_by may fail to find a
+ # matching record, which will then raise an `ActiveRecord::NotFound` exception,
+ # rather than a record will the given attributes.
+ # * It relies on exception handling to handle control flow, which may be marginally slower. And
+ #
+ # This method will always returns a record if all given attributes are covered by unique constraints,
+ # but if creation was attempted and failed due to validation errors it won't be persisted, you get what
+ # #create returns in such situation.
+ def create_or_find_by(attributes, &block)
+ create(attributes, &block)
+ rescue ActiveRecord::RecordNotUnique
If we are willing to require PG 9.5 or later for this (when using the PG adapter), we could just stick ON CONFLICT DO NOTHING on the end instead. This would let us wrap the whole thing in a transaction instead of just the create, which eliminates any possibility of race conditions.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub, or mute the thread.
|
I want to use these awesome class methods on Rails 5.2, so much. |
I think we don't, "New features are only added to the master branch and will not be made available in point releases." by Maintenance Policy for Ruby on Rails. |
I see, thank you for your explanation. |
Hi everybody, As we start relying on the database constraints more, I'd suggest (and few people like the idea) to have the improved versions of I just opened the issue #34650 where we can discuss the topic. Any feedback is very appreciated! |
create_or_find_by will be implemented in Rails 6.0 (rails/rails#31989) However we've decided to use it before it comes out, and would love it to be in rubocop scope as well. I guess we're not the first ones to do it, so it could benefit to most.
### Description If `LegacyAppeal.find_or_create_by_vacols_id` is hit multiple times for an appeal that doesn't yet exist, it's possible for a race condition to arise where the appeal is saved by one process in between the time another process initializes and attempts to save it. I observed this happening when loading the case details page for appeals that exist in VACOLS but not Caseflow, when `find_or_create_by_vacols_id` is invoked from the appeals controller and the tasks controller in quick succession. When the second process attempts to save its version of the appeal, it fails with `PG::UniqueViolation` / `ActiveRecord::RecordNotUnique`. I changed the method to capture the `ActiveRecord::RecordNotUnique` and return the existing record via `find_by!` (which'll raise a `ActiveRecord::RecordNotFound` if it's unsuccessful), following [this pattern](rails/rails#31989), from the Rails 6 `create_or_find_by` method. It's not ideal to [use an exception as flow control](https://wiki.c2.com/?DontUseExceptionsForFlowControl), but I think it's okay in this case to let the database constraint determine the action.
Attempts to create a record with the given attributes in a table that has a unique constraint
on one or several of its columns. If a row already exists with one or several of these
unique constraints, the exception such an insertion would normally raise is caught,
and the existing record with those attributes is sought found using #find_by.
This is similar to #find_or_create_by, but avoids the problem of stale reads, as that methods needs
to first query the table, then attempt to insert a row if none is found. That leaves a timing gap
between the SELECT and the INSERT statements that can cause problems in high throughput applications.
There are several drawbacks to #create_or_find_by, though:
of the given attributes. This means that the subsequent #find_by may fail to find a
matching record, which will then return nil, rather than a record will the given attributes.
This method will always returns a record if all given attributes are covered by unique constraints,
but if creation was attempted and failed due to validation errors it won't be persisted, you get what
#create returns in such situation.