[BACKPORT] Make reloading the owned partitions in map service context thread safe #11475

mmedenjak · 2017-09-28T10:44:14Z

Migration finalizations can run concurrently, so the owned partitions
can also be reloaded concurrently. As a result, the set of owned
partitions might first be set to a newer version and then overwritten
with an older one, leaving the member with an incorrect set of owned
partitions.
This affects the query engine when it runs queries off the partition
thread: every member reports its own set of owned partitions, which
in this case is incorrect. If the results from the actual partition
owner reach the query engine later than those from the "lying"
partition owner, they are discarded. The query engine can therefore
return incorrect results until the partitions are reloaded by
another migration.
The fix reloads the partitions in a CAS (compare-and-set) loop,
ensuring that the newest partition state is always the one applied.
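
For readers unfamiliar with the pattern, here is a minimal sketch of a CAS reload loop. It is not the code from this PR: the class `OwnedPartitionsHolder`, the `partitionSource` supplier and the method names are hypothetical stand-ins for the map service context and the partition service, and it uses Java 8 syntax for brevity. The key point is that the fresh state is read after the current reference, so a successful `compareAndSet` can never replace a newer snapshot with an older one.

```java
import java.util.Collection;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Supplier;

// Hypothetical holder used for illustration only; not the actual Hazelcast class.
final class OwnedPartitionsHolder {

    // Currently published, immutable snapshot of the owned partition IDs.
    private final AtomicReference<Set<Integer>> ownedPartitions =
            new AtomicReference<>(Collections.<Integer>emptySet());

    // Assumed stand-in for "ask the partition service what this member owns right now".
    private final Supplier<Collection<Integer>> partitionSource;

    OwnedPartitionsHolder(Supplier<Collection<Integer>> partitionSource) {
        this.partitionSource = partitionSource;
    }

    void reload() {
        for (;;) {
            // 1. Remember which snapshot is currently published.
            Set<Integer> expected = ownedPartitions.get();
            // 2. Only then read the latest state and copy it into a new immutable set.
            Set<Integer> latest = Collections.unmodifiableSet(
                    new LinkedHashSet<>(partitionSource.get()));
            // 3. Publish only if nobody else published in the meantime; otherwise
            //    retry, so that a stale read cannot overwrite a newer snapshot.
            if (ownedPartitions.compareAndSet(expected, latest)) {
                return;
            }
        }
    }

    Set<Integer> get() {
        return ownedPartitions.get();
    }
}
```

A plain volatile write in place of step 3 would allow a thread that read its state early to publish last, which is the race described above.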

This PR also adds some type parameters and improves the Javadoc.

Backport of: #11471

Fixes:
#10107
#9870
#10776

mmedenjak · 2017-09-28T10:45:12Z

@mdogan @ahmetmircik I had to backport this manually because the classes have changed a lot and the cherry-pick failed with many conflicts. It is essentially the same PR, though.

mmedenjak added Team: Core Type: Defect labels Sep 28, 2017

mmedenjak added this to the 3.8.7 milestone Sep 28, 2017

mmedenjak self-assigned this Sep 28, 2017

mmedenjak requested review from mdogan and ahmetmircik September 28, 2017 10:44

ahmetmircik approved these changes Sep 28, 2017

mdogan approved these changes Sep 28, 2017

mdogan added the Backport label Sep 28, 2017

mdogan merged commit 028d9a7 into hazelcast:maintenance-3.x Sep 28, 2017

mmedenjak deleted the QueryBounceTest-failure-backport branch March 5, 2018 14:21

mmedenjak added the Source: Internal PR or issue was opened by an employee label Apr 13, 2020
