Need advice about race condition

fxn · September 1, 2006, 1:20pm

There’s a field in my User model that sometimes needs bulk updates:

User.update_all(“cluster = #{new_cluster}”, “cluster = #
{current_cluster}”)

Now, suppose user with id 37 is affected by that particular
update_all and updates his profile:

user 37 is read
update_all happens and changes 37’s cluster asynchronously
user 37 is saved

now user 37 belongs to the wrong cluster. Unlikely but possible. Is
there a robust way to write that?

– fxn

fxn · September 1, 2006, 1:50pm

Xavier N. wrote:

user 37 is saved

now user 37 belongs to the wrong cluster. Unlikely but possible. Is
there a robust way to write that?

You might want to read:

zsombor

–
Company - http://primalgrasp.com
Thoughts - http://deezsombor.blogspot.com

fxn · September 1, 2006, 2:00pm

Does user 37 have the ability to change his own cluster? If not, then
you’ll
only update the fields that your form submits… not everything.
(provided
you use update_attributes to save the results)

If he can update his cluster, then the only thing that comes to mind
is to
look at the modified date/time of the record you have out and compare it
to
what’s in the database.

Add a field to your table called updated_at

then in your User model, do a callback

before_update :check_if_stale

protected
def check_if_stale
user_updated_at = User.find(self.id).updated_at
if user_updated_at == self.updated_at
#record is ok to update
else
# your copy is stale… better do some corrective stuff…
could
return false which would prevent the record from being saved.
end
end

Not sure if that’s the best approach… someone else could have a
cleaner
way of doing that.

fxn · September 1, 2006, 4:58pm

On Sep 1, 2006, at 1:51 PM, Brian H. wrote:

Does user 37 have the ability to change his own cluster? If not,
then you’ll only update the fields that your form submits… not
everything. (provided you use update_attributes to save the results)

Yeah, I don’t want to force all the system to use update_attributes
though, sounds brittle. I prefer a solution that still lets people
use AR models normally (except for non-bang-saves, who can raise now
an exception as save! does).

– fxn

fxn · September 1, 2006, 11:11pm

On Sep 1, 2006, at 2:23 PM, Xavier N. wrote:

On Sep 1, 2006, at 1:41 PM, Dee Z. wrote:

You might want to read:
Optimistic concurrency control - Wikipedia

Oh yes, I had forgotten lock_version. Thank you!

Too quick, since that field is changed by an update_all lock_version
is untouched. I could do a select+assign in a before_save but there’s
still room for a race condition there, hmmmmm.

– fxn

fxn · September 1, 2006, 11:41pm

@Xavier:

Did you try my solution that used the updated_at column in the
before_update?

fxn · September 1, 2006, 2:36pm

On Sep 1, 2006, at 1:41 PM, Dee Z. wrote:

You might want to read:
Optimistic concurrency control - Wikipedia

Oh yes, I had forgotten lock_version. Thank you!

– fxn

fxn · September 2, 2006, 12:15am

On Sep 1, 2006, at 11:31 PM, Brian H. wrote:

@Xavier:

Did you try my solution that used the updated_at column in the
before_update?

That solution has another race condition, the fatal sequence within
before_update is:

updated_at is read and coincides, fine
update_all happens in a separate process
save --> wrong

which is the same sequence as in the more straightforward

cluster is read and coincides, fine
update_all happens in a separate process
save --> wrong

Perhaps that approach plus a mandatory transaction in all user
updates will be the only robust solution.

– fxn

fxn · September 2, 2006, 1:25am

On Sep 2, 2006, at 12:02 AM, Xavier N. wrote:

which is the same sequence as in the more straightforward

cluster is read and coincides, fine

update_all happens in a separate process

save → wrong

Perhaps that approach plus a mandatory transaction in all user
updates will be the only robust solution.

I was kind of assuming callbacks did not run in a transaction, but
they actually do according to

Peak Obsession
ClassMethods.html

so I think I am pretty close to getting this right, great!

– fxn

fxn · September 2, 2006, 1:41am

On 9/1/06, Xavier N. [email protected] wrote:

updates will be the only robust solution.

I was kind of assuming callbacks did not run in a transaction, but
they actually do according to

Peak Obsession
ClassMethods.html

so I think I am pretty close to getting this right, great!

Your original update_all is sufficient if you include
lock_version=lock_version+1. Then dirty readers will get a
StaleObjectError
if they save after the cluster change.

jeremy