"Could not acquire a lock from redis" error when creating an observe request #832

Closed
madhushreegc opened this issue Apr 17, 2020 · 10 comments
Labels: question (Any question about leshan)

madhushreegc commented Apr 17, 2020

Hi,

Leshan version: M10.

I am triggering more than 20 observe requests towards the same device from the Leshan server. Sometimes I see the error below. If I change the lock key currently used by the add, addObservation and removeObservation methods in RedisRegistrationStore to something like LOCK:EP:deviceName:lwm2mpath (example: LOCK:EP:urn:imei:deviceName:3:0:3), will I face any issues anywhere for any operations?

04:36:39,985 ERROR [StripedExchangeJob] Exception in striped thread: Could not acquire a lock from redis
java.lang.IllegalStateException: Could not acquire a lock from redis
        at org.eclipse.leshan.server.cluster.RedisLock.acquire(RedisLock.java:52) ~[leshan-server-cluster-1.0.0-M10.jar:?]
        at org.eclipse.leshan.server.cluster.RedisRegistrationStore.add(RedisRegistrationStore.java:551) ~[leshan-server-cluster-1.0.0-M10.jar:?]
        at org.eclipse.leshan.server.cluster.RedisRegistrationStore.putIfAbsent(RedisRegistrationStore.java:533) ~[leshan-server-cluster-1.0.0-M10.jar:?]
        at org.eclipse.californium.core.network.BaseMatcher.registerObserve(BaseMatcher.java:187) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.UdpMatcher.sendRequest(UdpMatcher.java:115) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.CoapEndpoint$OutboxImpl.sendRequest(CoapEndpoint.java:713) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.BaseCoapStack$StackBottomAdapter.sendRequest(BaseCoapStack.java:187) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.ReliabilityLayer.sendRequest(ReliabilityLayer.java:107) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.BlockwiseLayer.sendRequest(BlockwiseLayer.java:296) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.AbstractLayer.sendRequest(AbstractLayer.java:66) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.AbstractLayer.sendRequest(AbstractLayer.java:66) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.ExchangeCleanupLayer.sendRequest(ExchangeCleanupLayer.java:45) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.BaseCoapStack$StackTopAdapter.sendRequest(BaseCoapStack.java:142) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.stack.BaseCoapStack.sendRequest(BaseCoapStack.java:80) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.CoapEndpoint$4.runStriped(CoapEndpoint.java:600) ~[californium-core-2.0.0-M12.jar:?]
        at org.eclipse.californium.core.network.StripedExchangeJob.run(StripedExchangeJob.java:65) [californium-core-2.0.0-M12.jar:?]
        at eu.javaspecialists.tjsn.concurrency.stripedexecutor.StripedExecutorService$SerialJob.run(StripedExecutorService.java:548) [element-connector-2.0.0-M12.jar:?]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_131]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_131]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) [?:1.8.0_131]
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) [?:1.8.0_131]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_131]
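
The error above comes from the per-endpoint lock that RedisRegistrationStore takes before touching a registration and its observations. Based only on what is described in this thread (a LOCK:EP:<endpointName> key, a roughly 5 s acquire budget, a 10 ms retry sleep), the general shape of such a lock is the classic Redis SET NX PX pattern; the sketch below is illustrative Jedis 3.x code, not the actual Leshan RedisLock implementation:

    import java.util.Arrays;
    import java.util.Random;

    import redis.clients.jedis.Jedis;
    import redis.clients.jedis.params.SetParams;

    // Illustrative sketch only, NOT the actual Leshan RedisLock code (Jedis 3.x API assumed).
    // Key format and timings follow what is described in this thread:
    //   key:     LOCK:EP:<endpointName>
    //   acquire: retried for up to ~5 s, sleeping 10 ms between attempts
    public class EndpointLockSketch {

        private static final Random RANDOM = new Random();

        /** Acquires the lock or throws, mirroring the IllegalStateException in the stack trace above. */
        public static byte[] acquire(Jedis jedis, byte[] lockKey) throws InterruptedException {
            byte[] lockValue = new byte[10];
            RANDOM.nextBytes(lockValue);
            long deadline = System.currentTimeMillis() + 5_000; // ~5 s acquire budget
            while (System.currentTimeMillis() < deadline) {
                // SET key value NX PX <ttl>: set only if absent, with an expiration so a
                // crashed node cannot hold the lock forever (the TTL value here is illustrative).
                if ("OK".equals(jedis.set(lockKey, lockValue, SetParams.setParams().nx().px(500)))) {
                    return lockValue;
                }
                Thread.sleep(10); // back off before retrying
            }
            throw new IllegalStateException("Could not acquire a lock from redis");
        }

        /** Releases the lock only if we still own it (a production lock would do this compare-and-delete atomically, e.g. with a Lua script). */
        public static void release(Jedis jedis, byte[] lockKey, byte[] lockValue) {
            if (Arrays.equals(jedis.get(lockKey), lockValue)) {
                jedis.del(lockKey);
            }
        }
    }

Because the key covers only the endpoint name, all concurrent observe requests to the same device serialize on this one lock, which is the per-endpoint atomicity the maintainer refers to below.
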
sbernard31 (Contributor)

Will I face any issues anywhere for any operations?

Yes. We lock on the endpoint name because we want atomic operations for a given endpoint name.

The question is why you get the "could not acquire" error... We try to acquire the lock for 5 s, and Redis access should be really fast...

sbernard31 added the question label Apr 17, 2020
madhushreegc (Author)

I am facing this issue very often when there are many concurrent observe requests towards the device. Redis has a huge number of keys.

So your advice is not to change the key used by the locking mechanism for observe requests, but to increase the time allowed to acquire the Redis lock, correct?

yemkay commented Apr 19, 2020

Under a race condition, RedisLock could block a Jedis resource for a long duration while it waits to acquire the lock. Just wondering if it makes sense to release the Jedis resource back to the pool when RedisLock.acquire goes to sleep for 10 ms.
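
A minimal sketch of the pattern being suggested, assuming a JedisPool-backed lock (hypothetical class and method names, not Leshan's API): each attempt borrows a connection and returns it to the pool before the 10 ms sleep, so a thread that is merely waiting does not pin a pooled connection.

    import redis.clients.jedis.Jedis;
    import redis.clients.jedis.JedisPool;
    import redis.clients.jedis.params.SetParams;

    // Illustrative only (Jedis 3.x API assumed): borrow a connection per attempt so the
    // pool is not exhausted by threads that are merely sleeping between lock attempts.
    public class PoolFriendlyLockSketch {

        public static byte[] acquire(JedisPool pool, byte[] lockKey, byte[] lockValue)
                throws InterruptedException {
            long deadline = System.currentTimeMillis() + 5_000; // ~5 s acquire budget
            while (System.currentTimeMillis() < deadline) {
                try (Jedis jedis = pool.getResource()) {
                    if ("OK".equals(jedis.set(lockKey, lockValue, SetParams.setParams().nx().px(500)))) {
                        return lockValue;
                    }
                } // the connection goes back to the pool here, before we sleep
                Thread.sleep(10);
            }
            throw new IllegalStateException("Could not acquire a lock from redis");
        }
    }

The trade-off is one pool checkout per attempt instead of one per acquire call.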

sbernard31 (Contributor)

So your advice is not to change the key used by the locking mechanism for observe requests, but to increase the time allowed to acquire the Redis lock, correct?

Not really.
You face this after only 20 observe requests on the same endpoint?
I'm still surprised that the lock could not be acquired after 5 s... My guess is that this should not happen, and you should rather try to understand the issue more deeply, maybe by monitoring your Redis server or instrumenting the Leshan code to find the time-consuming part.

If I find time, I will try to reproduce this on my side.

Under a race condition, RedisLock could block a Jedis resource for a long duration while it waits to acquire the lock. Just wondering if it makes sense to release the Jedis resource back to the pool when RedisLock.acquire goes to sleep for 10 ms.

Maybe it could 🤔
This would be useful if you face a JedisExhaustedPoolException("Could not get a resource since the pool is exhausted", ...) because of RedisLock.
But this means we would need to rewrite the code.

Not directly linked, but I think we should:

  • Log a WARN if the lock expires before we try to unlock.
  • Extract a RedisLock interface to allow several implementations (a rough sketch follows below).
  • Make the timing of the default implementation configurable.

I also wonder whether a log is enough when we fail to acquire the lock. 🤔
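
A rough sketch of what such an interface could look like (hypothetical names, written here only to illustrate the idea; the actual design is up to the project):

    import redis.clients.jedis.Jedis;

    // Hypothetical interface sketch, not Leshan's final API: it separates the locking
    // strategy from RedisRegistrationStore so timings (or the whole algorithm) can vary.
    public interface RedisLockSketch {

        /**
         * Acquires the lock identified by lockKey, blocking up to an
         * implementation-defined timeout.
         *
         * @return an opaque lock value that must be passed back to release()
         * @throws IllegalStateException if the lock could not be acquired in time
         */
        byte[] acquire(Jedis jedis, byte[] lockKey);

        /** Releases the lock, only if lockValue still matches the stored value. */
        void release(Jedis jedis, byte[] lockKey, byte[] lockValue);
    }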

yemkay commented Apr 22, 2020

Thanks Simon. We narrowed the issue down to another application that was running a "KEYS *" command on Redis, which eventually slowed down the other Redis clients. We don't see a problem in Leshan per se, but this extension would help someone facing a similar issue:

Extract a RedisLock interface to allow several implementations.
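
For anyone who hits the same symptom: KEYS * is O(N) over the whole keyspace and blocks the single-threaded Redis server while it runs, which stalls every other client, including the lock acquisition above. The incremental SCAN command is the usual replacement; a small Jedis 3.x example:

    import redis.clients.jedis.Jedis;
    import redis.clients.jedis.ScanParams;
    import redis.clients.jedis.ScanResult;

    // Iterate over keys with SCAN instead of the blocking "KEYS *".
    public class ScanInsteadOfKeys {
        public static void main(String[] args) {
            try (Jedis jedis = new Jedis("localhost", 6379)) {
                ScanParams params = new ScanParams().match("*").count(100);
                String cursor = ScanParams.SCAN_POINTER_START; // "0"
                do {
                    ScanResult<String> page = jedis.scan(cursor, params);
                    for (String key : page.getResult()) {
                        System.out.println(key);
                    }
                    cursor = page.getCursor(); // "0" again means the iteration is complete
                } while (!ScanParams.SCAN_POINTER_START.equals(cursor));
            }
        }
    }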

sbernard31 (Contributor)

Glad to see you finally found the issue.
Using the KEYS * command in production is rarely a good idea 🙂

I will create an issue as a reminder to improve the RedisLock code.

Could we close this issue?

sbernard31 (Contributor)

By the way, as you are using Leshan, could you possibly take some time to give us some information about that in #830? 🙏

sbernard31 (Contributor) commented Apr 22, 2020

I created an issue about improving RedisLock flexibility: #836
And a fix for a possible race condition: #837

sbernard31 (Contributor)

I hope I didn't scare you with my "Leshan user research" stuff 😅

I'm closing the issue; feel free to reopen if needed.

yemkay commented Apr 30, 2020 via email
