
Release BLOCKED triggers in releaseAcquiredTrigger #146

Merged 1 commit on Feb 12, 2019

Conversation

shelmling
Contributor

No description provided.

@shelmling
Contributor Author

Dear Quartz Team,

Based on my findings from Issue 145, I'd like to propose the following change. I hope this is useful for you.

Thanks,
Sebastian

@mstead

mstead commented Nov 6, 2017

Any update on when this will get merged? We are currently getting hit by this issue and need it fixed ASAP.

@pbuckley

pbuckley commented Dec 1, 2017

👍 on fixing, I think we've been hit by this issue as well

@IshwarKhandelwal

When will this fix be available in the next release of the Quartz scheduler?

@dersteve

Why don't we merge this fix in? We are seeing similar problems in our environment.

@fbokovikov

Let's merge this fix. @zemian @jhouserizer @chrisdennis

@zemian
Contributor

zemian commented Feb 11, 2019

Hello folks, sorry it took so long to respond. I will take a look at this and try to merge it in the next day or so.

zemian merged commit d8497ff into quartz-scheduler:master on Feb 12, 2019
zemian added a commit to zemian/quartz that referenced this pull request Feb 12, 2019
@dersteve

@zemian Thanks for the merge! What are your plans for releasing this fix in any new versions?

@zemian
Contributor

zemian commented Feb 27, 2019

Hi @dersteve, the next release should be 2.3.1. See https://github.com/quartz-scheduler/quartz/blob/quartz-2.3.x/docs/changelog.adoc

I don't have a date, but it should be soon. I am trying to get it published with the help of the Terracotta folks.

@fbokovikov

@zemian Thanks for the merge! Can you specify the release date of new quartz version please? We really need this fix!

@zemian
Contributor

zemian commented Mar 6, 2019

Hi @fbokovikov, no release date yet :( Hopefully soon. In the meantime, you can simply do a local build from the latest branch.

@vincentjames501

I think we've tracked down an issue related to this commit/fix. When running in a clustered environment, with DisallowConcurrentExecution and lots of triggers for that job, something appears to "hang" for several minutes doing nothing (all triggers are in the WAITING state, none are in COMPLETED/BLOCKED/ACQUIRED, and the fire time is still valid and within our 30 min misfire range). I'm not sure why this would be, as I don't know the Quartz data model too well; however, if I comment out this line:

getDelegate().updateTriggerStateFromOtherState(conn,
        trigger.getKey(), STATE_WAITING, STATE_BLOCKED);

The issue goes away. Also, I merged these two into a single one locally and was also not able to reproduce the hang issue:

getDelegate().updateTriggerStateFromOtherState(conn,
        trigger.getKey(), STATE_WAITING, STATE_ACQUIRED);
getDelegate().updateTriggerStateFromOtherState(conn,
        trigger.getKey(), STATE_WAITING, STATE_BLOCKED);

with

getDelegate().updateTriggerStateFromOtherStates(conn,
        trigger.getKey(), STATE_WAITING, STATE_ACQUIRED, STATE_BLOCKED);

Can anyone hypothesize why this would be?
Is there something about this being done in two separate queries that could introduce race conditions?

Here is my theory:

  • Node A acquires a trigger (one trigger acquired, the rest blocked).
  • Node A begins to release the acquired trigger by executing getDelegate().updateTriggerStateFromOtherState(conn, trigger.getKey(), STATE_WAITING, STATE_ACQUIRED); (now all triggers are in WAITING).
  • Before Node A gets to execute getDelegate().updateTriggerStateFromOtherState(conn, trigger.getKey(), STATE_WAITING, STATE_BLOCKED);, Node B acquires a trigger, since everything looks WAITING to it (RACE CONDITION) (one trigger acquired, the rest blocked).
  • Node A then executes getDelegate().updateTriggerStateFromOtherState(conn, trigger.getKey(), STATE_WAITING, STATE_BLOCKED); while Node B has one trigger acquired and the rest blocked (now those triggers are incorrectly set back to WAITING).
  • Node B then releases its acquired trigger and things are hosed (I'm not sure exactly why this last step breaks, but I do think there is a race condition above).

It would probably not be a bad idea to merge these into a single query anyway, if only for performance. CC @zemian @shelmling
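
For what it's worth, the atomicity argument can be sketched directly at the SQL level: a single UPDATE that accepts both previous states leaves no window in which another node can observe a half-released trigger. The JDBC snippet below is only my own illustration of that idea, assuming the default QRTZ_ table prefix and the standard QRTZ_TRIGGERS columns; the real change would of course live in the driver delegate, not in application code.

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;

// Sketch only: put a trigger back to WAITING from either ACQUIRED or BLOCKED
// in one atomic statement instead of two consecutive updates.
public final class ReleaseTriggerSketch {

    private static final String RELEASE_SQL =
        "UPDATE QRTZ_TRIGGERS SET TRIGGER_STATE = 'WAITING' "
      + "WHERE SCHED_NAME = ? AND TRIGGER_NAME = ? AND TRIGGER_GROUP = ? "
      + "AND TRIGGER_STATE IN ('ACQUIRED', 'BLOCKED')";

    public static int releaseTrigger(Connection conn, String schedName,
                                     String triggerName, String triggerGroup) throws SQLException {
        try (PreparedStatement ps = conn.prepareStatement(RELEASE_SQL)) {
            ps.setString(1, schedName);
            ps.setString(2, triggerName);
            ps.setString(3, triggerGroup);
            // One statement: no other node can see the trigger in WAITING
            // while the release is still half done.
            return ps.executeUpdate();
        }
    }
}

Because the transition happens in one statement, a concurrent acquirer either sees the trigger before the release (ACQUIRED/BLOCKED) or after it (WAITING), never in between.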

lahma added a commit to quartznet/quartznet that referenced this pull request Nov 7, 2019
@oridool

oridool commented Feb 9, 2021

I still have a similar issue on v2.3.2, when using cluster mode and enabling @DisallowConcurrentExecution.
I'm not sure the issue is fixed.
It happens only occasionally, not always.
I have a log line just before my job execution ends, so I'm sure the job itself has finished. From the application side everything seems normal, but the trigger still hangs in the BLOCKED state.

Is there any workaround or fix?

Thanks.

@IovanAlexandru


@zemian @oridool Having the same issue on 2.3.2, as stated in #145, while using the @DisallowConcurrentExecution annotation (#145 says the problem is fixed via this PR #146 in version 2.3.2).

If we set up the following:

  • TRIGGER table: next_fire_time in the past, trigger_state set to BLOCKED
  • FIRED_TRIGGERS table: empty, or at least not containing the blocked trigger

then the job associated with this blocked trigger will never run again. PR Release BLOCKED triggers in releaseAcquiredTrigger #146 (this one) mostly ensures proper clean-up, but that is not guaranteed when an instance dies suddenly or in the middle of releasing its BLOCKED triggers. I think the problem should be addressed at the moment Quartz polls triggers from the DB; it currently polls only for the WAITING state (a detection sketch follows below):

SELECT TRIGGER_NAME, TRIGGER_GROUP, NEXT_FIRE_TIME, PRIORITY FROM TRIGGERS WHERE SCHED_NAME = '?' AND TRIGGER_STATE = 'WAITING' AND NEXT_FIRE_TIME <= ? AND (MISFIRE_INSTR = -1 OR (MISFIRE_INSTR != -1 AND NEXT_FIRE_TIME >= ?)) ORDER BY NEXT_FIRE_TIME ASC, PRIORITY DESC
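
To make the stuck state easier to spot, here is a rough JDBC sketch (my own illustration, not existing Quartz code) that looks for triggers sitting in BLOCKED with no matching row in the fired-triggers table, assuming the default QRTZ_ table prefix:

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

// Sketch only: list triggers stuck in BLOCKED that no instance is actually
// executing anymore (no corresponding row in QRTZ_FIRED_TRIGGERS).
public final class StuckBlockedTriggerCheck {

    private static final String STUCK_SQL =
        "SELECT T.TRIGGER_NAME, T.TRIGGER_GROUP, T.NEXT_FIRE_TIME "
      + "FROM QRTZ_TRIGGERS T "
      + "WHERE T.SCHED_NAME = ? AND T.TRIGGER_STATE = 'BLOCKED' "
      + "AND NOT EXISTS (SELECT 1 FROM QRTZ_FIRED_TRIGGERS F "
      + "  WHERE F.SCHED_NAME = T.SCHED_NAME "
      + "  AND F.TRIGGER_NAME = T.TRIGGER_NAME "
      + "  AND F.TRIGGER_GROUP = T.TRIGGER_GROUP)";

    public static void printStuckTriggers(Connection conn, String schedName) throws SQLException {
        try (PreparedStatement ps = conn.prepareStatement(STUCK_SQL)) {
            ps.setString(1, schedName);
            try (ResultSet rs = ps.executeQuery()) {
                while (rs.next()) {
                    // These triggers are never picked up again, because the
                    // acquisition query only selects TRIGGER_STATE = 'WAITING'.
                    System.out.printf("stuck: %s.%s (next fire %d)%n",
                            rs.getString("TRIGGER_GROUP"),
                            rs.getString("TRIGGER_NAME"),
                            rs.getLong("NEXT_FIRE_TIME"));
                }
            }
        }
    }
}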

@borisvaningelgom

@zemian We also face this issue in our production environment and have to manually correct the job trigger state in the database to solve it. Our job runs every 5 minutes.
We see no exceptions or errors in the logs. It just stops.
Even when the job DOES throw an exception, it doesn't necessarily get blocked.

Quartz version v2.3.2
Job is marked with @DisallowConcurrentExecution

@koti-muppavarapu

We are facing the exact same issue in our production. Did you find a solution or workaround for this issue? I am also using Quartz version 2.3.2 and my job is marked with @DisallowConcurrentExecution as well.

@borisvaningelgom

@koti-muppavarapu We solved it by properly configuring Quartz to run in clustered mode. It looks like if you don't do this, @DisallowConcurrentExecution creates issues.

We had some missing properties that were the root cause.
Make sure org.quartz.jobStore.isClustered is set to true (a configuration sketch follows below).
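
For reference, a minimal sketch of what a clustered JDBC-JobStore configuration can look like when built programmatically; the instance name, data source name, thread count and check-in interval below are only illustrative placeholders, and the actual data source definition (org.quartz.dataSource.*) is omitted:

import java.util.Properties;
import org.quartz.Scheduler;
import org.quartz.SchedulerException;
import org.quartz.impl.StdSchedulerFactory;

// Sketch only: a clustered JDBC-JobStore scheduler configuration.
public final class ClusteredSchedulerSketch {

    public static Scheduler build() throws SchedulerException {
        Properties props = new Properties();
        props.setProperty("org.quartz.scheduler.instanceName", "MyClusteredScheduler"); // illustrative name
        props.setProperty("org.quartz.scheduler.instanceId", "AUTO");
        props.setProperty("org.quartz.threadPool.threadCount", "5");
        props.setProperty("org.quartz.jobStore.class", "org.quartz.impl.jdbcjobstore.JobStoreTX");
        props.setProperty("org.quartz.jobStore.driverDelegateClass", "org.quartz.impl.jdbcjobstore.StdJDBCDelegate");
        props.setProperty("org.quartz.jobStore.dataSource", "quartzDS"); // must match a data source defined elsewhere
        // The clustering switch: without it, nodes sharing the same database do not
        // coordinate, and @DisallowConcurrentExecution bookkeeping can go wrong.
        props.setProperty("org.quartz.jobStore.isClustered", "true");
        props.setProperty("org.quartz.jobStore.clusterCheckinInterval", "20000"); // illustrative value, in ms
        return new StdSchedulerFactory(props).getScheduler();
    }
}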

@koti-muppavarapu

Thanks for your reply @borisvaningelgom, I will try this property and see if it fixes the issue. It's a very random issue that happens only rarely. Hopefully this will fix it.

@vincentjames501

@borisvaningelgom @koti-muppavarapu we’ve been running clustered mode since the beginning and that doesn’t solve it for us.
