[WIP] Fix: crm_mon: try to connect CIB while pacemakerd shutting down #2342

wenningerk · 2021-04-12T13:12:03Z

actually while resources are evacuated from the node. But atm
there is no clean and easy way to tell when this is done or if
pacemakerd is just shutting down leftover daemons. So try to
connect anyway.

This showed up to be an issue - introduced by checking for
pacemakerd in full up & running state - when resources are
using crm_mon in their stop-operation.
Maybe don't merge this right away.
I've opened this PR with the simplest solution to just handle
the state where pacemakerd is sequentially shutting down
the subdaemons exactly like as if it was fully up and running.
So we can discuss here.
Alternative might be introduction of an additional state so
that when querying the state of pacemakerd the caller can
know that it is actually still trying to get down all resources
on the node before starting to shutdown all the subdaemons.

actually while resources are evacuated from the node. But atm there is no clean and easy way to tell when this is done and pacemakerd is just shutting down leftover daemons. So try to connect anyway.

HideoYamauchi · 2021-04-12T21:43:15Z

Hi Klaus,
Hi Ken,

Thank you for the fix.
I'll test this fix today just in case.

Many thanks,
Hideo Yamauchi.

wenningerk · 2021-04-13T05:19:05Z

I'll test this fix today just in case.

Thank you - cool if you can give it a try ...

HideoYamauchi · 2021-04-13T06:02:00Z

Hi Klaus,

Unfortunately, the problem still seems to remain.
I will check it a bit more.

I will also contact you with details of the issue.

Best Regards,
Hideo Yamauchi.

HideoYamauchi · 2021-04-13T06:47:48Z

Hi Klaus,

In RA of pgsql, the option of xml may be used to call crm_mon, so the following modifications are also required.

static int
pacemakerd_status(void){
(snip)
                             case pcmk_pacemakerd_state_running:
                                 rc = pcmk_rc_ok;
                                 break;
+                            case pcmk_pacemakerd_state_shutting_down:
+                                /* try our luck maybe CIB is still accessible */
+                                rc = pcmk_rc_ok;
+                                break;
                             default:
                                 break;
(snip)

Best Regards,
Hideo Yamauchi.

wenningerk · 2021-04-13T07:53:06Z

In RA of pgsql, the option of xml may be used to call crm_mon, so the following modifications are also required.

Of course - thanks for the pointer ...
Did you verify if it is working properly with the additional change?

HideoYamauchi · 2021-04-13T07:55:36Z

In RA of pgsql, the option of xml may be used to call crm_mon, so the following modifications are also required.

Of course - thanks for the pointer ...
Did you verify if it is working properly with the additional change?

I took a lot of time to build the test environment.
The test will be completed tomorrow.
Please wait for a while.

Best Regards,
Hideo Yamauchi.

wenningerk · 2021-04-13T08:04:30Z

Thanks Hideo. I know testing with timing-critical stuff is a pain.
Opened a new PR for it #2343

HideoYamauchi · 2021-04-13T22:31:47Z

Thanks Hideo. I know testing with timing-critical stuff is a pain.
Opened a new PR for it #2343

Hi Klaus,

Okay!
When my test is over, I will comment the results on a new PR.

Many thanks,
Hideo Yamauchi.

Fix: crm_mon: try to connect CIB while pacemakerd shutting down

49ebe4c

actually while resources are evacuated from the node. But atm there is no clean and easy way to tell when this is done and pacemakerd is just shutting down leftover daemons. So try to connect anyway.

kgaillot merged commit f2d8a3d into ClusterLabs:master Apr 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Fix: crm_mon: try to connect CIB while pacemakerd shutting down #2342

[WIP] Fix: crm_mon: try to connect CIB while pacemakerd shutting down #2342

wenningerk commented Apr 12, 2021 •

edited

HideoYamauchi commented Apr 12, 2021

wenningerk commented Apr 13, 2021

HideoYamauchi commented Apr 13, 2021 •

edited

HideoYamauchi commented Apr 13, 2021

wenningerk commented Apr 13, 2021 •

edited

HideoYamauchi commented Apr 13, 2021

wenningerk commented Apr 13, 2021 •

edited

HideoYamauchi commented Apr 13, 2021 •

edited

[WIP] Fix: crm_mon: try to connect CIB while pacemakerd shutting down #2342

[WIP] Fix: crm_mon: try to connect CIB while pacemakerd shutting down #2342

Conversation

wenningerk commented Apr 12, 2021 • edited

HideoYamauchi commented Apr 12, 2021

wenningerk commented Apr 13, 2021

HideoYamauchi commented Apr 13, 2021 • edited

HideoYamauchi commented Apr 13, 2021

wenningerk commented Apr 13, 2021 • edited

HideoYamauchi commented Apr 13, 2021

wenningerk commented Apr 13, 2021 • edited

HideoYamauchi commented Apr 13, 2021 • edited

wenningerk commented Apr 12, 2021 •

edited

HideoYamauchi commented Apr 13, 2021 •

edited

wenningerk commented Apr 13, 2021 •

edited

wenningerk commented Apr 13, 2021 •

edited

HideoYamauchi commented Apr 13, 2021 •

edited