don't remove LOCK file when PGSQL stopped with quorum lost. #28

greenx · 2013-11-01T12:21:53Z

Hi!
I again found trouble. )
I doing experiment with quorum lost.
After quorum lost LOCK not deleted.
I see code.

    if  [ "$1" = "master" -a "$OCF_RESKEY_CRM_meta_notify_slave_uname" = " " ]; then
        ocf_log info "Removing $PGSQL_LOCK."
        rm -f $PGSQL_LOCK
    fi

Where defined $OCF_RESKEY_CRM_meta_notify_slave_uname ? what he mean?

greenx · 2013-11-01T12:57:27Z

One answer found http://clusterlabs.org/doc/en-US/Pacemaker/1.1/html-single/Pacemaker_Explained/#_multi_state_notifications

But the meaning is not quite clear.
Lost quorum - the resources will stopped.
I made a stand of four nodes.
If two node down - quorum lost and pacemaker apply policy 'stop'.
But there is one slave.

Or should we switched on and off in the correct order.

t-matsuo · 2013-11-02T03:01:58Z

If two node down - quorum lost and pacemaker apply policy 'stop'.
But there is one slave.

It may be a bug of pacemaker.

greenx · 2013-11-05T04:35:44Z

You mean to say, that after the loss of a quorum pacemaker should send a message to stop all the resources and reset this variable($OCF_RESKEY_CRM_meta_notify_slave_uname), because resource master-slave too must be stopped?

t-matsuo · 2013-11-05T12:41:41Z

You mean to say, that after the loss of a quorum pacemaker should send a message to stop all the resources
and reset this variable($OCF_RESKEY_CRM_meta_notify_slave_uname), because resource master-slave too
must be stopped?

Yes.

Wintermute3 · 2015-07-23T20:36:22Z

In the comparison above:

"$OCF_RESKEY_CRM_meta_notify_slave_uname" = " "

Why the space between the " " for the comparison target? Seems like an error to me. The OCF_RESKEY_CRM_meta_notify_slave_uname got trim'ed upon import, so blank strings would have become empty strings, no?

playmobil77d · 2015-11-12T09:29:35Z

Hi,
Is there a solution about this problem ?

greenx · 2015-11-12T12:09:32Z

I dont know. My project was closed. I almost was not interested in the topic.

furynick · 2019-01-15T12:54:32Z

I have the same problem with 2 nodes cluster.

The LOCK is never deleted as OCF_RESKEY_CRM_meta_notify_slave_uname always contains so the slave can't be restarted.

Perhaps this file can be deleted on slave successful stop after wal sync check

t-matsuo · 2019-01-20T14:04:05Z

Hi furynick

Sorry, I switched to another task 5 years ago.

This agent was merged into ClusterLabs repository and maintained at its community.
Could you open new topic at ClusterLabs resource-agent repository ?
Someone may respond.

furynick · 2019-01-20T17:16:57Z

Thanks for reply, I already found related issue at ClusterLabs#699

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

don't remove LOCK file when PGSQL stopped with quorum lost. #28

don't remove LOCK file when PGSQL stopped with quorum lost. #28

greenx commented Nov 1, 2013

greenx commented Nov 1, 2013

t-matsuo commented Nov 2, 2013

greenx commented Nov 5, 2013

t-matsuo commented Nov 5, 2013

Wintermute3 commented Jul 23, 2015

playmobil77d commented Nov 12, 2015

greenx commented Nov 12, 2015

furynick commented Jan 15, 2019

t-matsuo commented Jan 20, 2019

furynick commented Jan 20, 2019

don't remove LOCK file when PGSQL stopped with quorum lost. #28

don't remove LOCK file when PGSQL stopped with quorum lost. #28

Comments

greenx commented Nov 1, 2013

greenx commented Nov 1, 2013

t-matsuo commented Nov 2, 2013

greenx commented Nov 5, 2013

t-matsuo commented Nov 5, 2013

Wintermute3 commented Jul 23, 2015

playmobil77d commented Nov 12, 2015

greenx commented Nov 12, 2015

furynick commented Jan 15, 2019

t-matsuo commented Jan 20, 2019

furynick commented Jan 20, 2019