My commits for ISPN-1106 that got in 5.0.x were not quite the same as in master, this pull request gets 5.0.x to the same state as master (re: ISPN-1106) #518

Closed
wants to merge 8 commits


Dan Berindei added 8 commits September 7, 2011 16:45
…y were waiting for the default cache to finish rehashing.
Fixed mismatchedTestNames.sh script.
* In LockManagerImpl log the other keys owned by the current transaction.
* In DefaultCacheManager push the cache name to the NDC during cache startup.
* Improved toString() for RehashControlCommand and DistributedExecuteCommand.
* In InboundInvocationHandler log the cache name.
* Log cache start/stop.
* Log the read lock owners in JGroupsDistSync.
…nceTask can invalidate the keys after rehashing is done but before the cache listeners (e.g. KeyAffinityService) know it.
…hash from finishing

The generic scenario involves multiple caches.
Say we have transactions Tx1 and Tx2 spanning caches C1 and C2.
A new node joins the cluster, starting C1 and C2.
With the following sequence of events, rehashing will be blocked for the full lockAcquisitionTimeout.

1. Tx1 prepares on C1 locking K1
2. Tx2 wants to prepare on C2, Tx2 gets the tx lock
3. Tx2 now waits to lock K1 while holding the tx lock on C2
4. Rehash starts on C2 but it can't proceed because Tx2 has the tx lock
5. Tx1 now wants to prepare on C2, but can't acquire the tx lock held by Tx2

This closes the cycle: Tx1 waits for the tx lock held by Tx2, Tx2 waits for K1 held by Tx1, and the rehash on C2 waits for the tx lock as well.

I've implemented a crude "deadlock detection" scheme: a new tx will wait
the full lockAcquisitionTimeout for the tx lock, but a tx that already
has locks acquired will only wait 1/100 of that. So if there is a cycle,
it will break much more quickly and allow rehashing to proceed.
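The timeout-scaling idea might be sketched like this with plain java.util.concurrent. The class and method names (TxLockGuard, acquireForPrepare) are illustrative only, not Infinispan's actual code; the real tx lock is assumed here to behave like the read side of a read/write lock, with rehashing taking the write side:

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch: transactions share the read side of the tx lock,
// rehashing needs the write side. A tx that already holds key locks
// elsewhere waits only 1/100 of the configured timeout, so a lock cycle
// breaks quickly and the rehash can acquire the write lock.
public class TxLockGuard {
    private final ReentrantReadWriteLock txLock = new ReentrantReadWriteLock();
    private final long lockAcquisitionTimeoutMillis;

    public TxLockGuard(long lockAcquisitionTimeoutMillis) {
        this.lockAcquisitionTimeoutMillis = lockAcquisitionTimeoutMillis;
    }

    public boolean acquireForPrepare(boolean txAlreadyHoldsLocks) {
        // New txs wait the full timeout; txs that already hold locks
        // back off after 1/100 of it to break potential cycles.
        long timeout = txAlreadyHoldsLocks
                ? lockAcquisitionTimeoutMillis / 100
                : lockAcquisitionTimeoutMillis;
        try {
            return txLock.readLock().tryLock(timeout, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public void release() {
        txLock.readLock().unlock();
    }
}
```

When the shortened wait expires, the prepare fails and is retried later, releasing the tx lock long enough for the rehash (or the other transaction) to make progress.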

There is also a simpler variant where the transactions work with a single cache.
In that case, if the remote command can't acquire the tx lock with a 0 timeout, it knows
that the transaction already holds the tx lock on the origin node and is in a deadlock situation.
This is no longer strictly necessary for ISPN-1106, as we now wait
with a shorter timeout on transactions that hold locks, so the rehash
does not block for a very long time.
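The single-cache variant might look roughly like this; again the names are hypothetical and the tx lock is modeled as the read side of a read/write lock:

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Illustrative sketch: a remote prepare tries the tx lock with zero wait.
// If it can't get the lock instantly, the same transaction is assumed to
// hold the tx lock on the origin node already, so waiting here could only
// deadlock; the command fails fast instead of blocking the rehash.
public class RemotePrepareSketch {
    private final ReentrantReadWriteLock txLock = new ReentrantReadWriteLock();

    public boolean tryRemotePrepare() {
        // tryLock() with no timeout: do not wait at all.
        if (!txLock.readLock().tryLock()) {
            // Assume a deadlock with our own tx lock on the origin node
            // and give up immediately so rehashing is not held up.
            return false;
        }
        try {
            // ... perform the actual prepare work here ...
            return true;
        } finally {
            txLock.readLock().unlock();
        }
    }
}
```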

It is recommended, however, to start all caches on application startup,
and this method gives users an easy way to start all their caches.