Support OCF 1.1 reload-agent action #2349

kgaillot · 2021-04-15T21:47:25Z

Pacemaker previously used the "reload" action to reload agent parameters, but most agents used it to reload the service configuration. Pacemaker also misused the OCF 1.0 "unique" parameter attribute to indicate reloadability.

OCF 1.1 created the "reload-agent" action and "reloadable" parameter attribute for the Pacemaker usage.

Pacemaker now supports the OCF 1.1 usage. The old usage is now deprecated, but will be supported if the agent does not claim OCF 1.1 or later compliance and does not advertise the reload-agent action.

Special care must be taken so that rolling upgrades work (i.e. an older DC will still schedule reload commands), and that the controller chooses reload or reload-agent based on what the resource meta-data advertises, regardless of what command the DC schedules. (Technically that means we didn't have to change what the DC schedules, but I think having it schedule reload-agent will be less confusing.)
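As a rough illustration of the selection rule described above, here is a minimal sketch in C. It is not the controller's actual code: `struct agent_caps`, `enum reload_choice`, and `choose_reload()` are hypothetical names, and how a pre-1.1 agent that nonetheless advertises reload-agent is handled is an assumption on my part.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical summary of what an agent's meta-data advertises;
 * not Pacemaker's actual data structure. */
struct agent_caps {
    bool claims_ocf_1_1;          /* agent claims OCF 1.1 or later */
    bool advertises_reload_agent; /* meta-data lists "reload-agent" */
    bool advertises_reload;       /* meta-data lists "reload" */
};

enum reload_choice { RELOAD_NONE, RELOAD_LEGACY, RELOAD_AGENT };

/* Pick the action to execute from the meta-data alone, ignoring
 * whether the DC scheduled "reload" or "reload-agent". */
enum reload_choice choose_reload(const struct agent_caps *caps)
{
    assert(caps != NULL);

    if (caps->advertises_reload_agent) {
        /* Assumption: advertising reload-agent is enough to use it */
        return RELOAD_AGENT;
    }
    if (!caps->claims_ocf_1_1 && caps->advertises_reload) {
        /* Deprecated usage: "reload" reloads agent parameters */
        return RELOAD_LEGACY;
    }
    /* No usable reload: the caller would fall back to a restart */
    return RELOAD_NONE;
}
```

Because the function never looks at the scheduled command, an older DC that still schedules reload gets the same result as a newer one that schedules reload-agent.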

kgaillot · 2021-04-15T21:47:48Z

@wenningerk , can you review this when you get a chance? Thanks

wenningerk · 2021-04-20T12:02:19Z

lgtm

With the release of the OCF 1.1 standard, this information may come in handy during troubleshooting. We warn if the RA does not advertise an OCF version, or if it advertises a major version higher than what Pacemaker supports, and otherwise log a debug message with the supported version.
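The version check described above could look roughly like the following sketch. The names (`SUPPORTED_OCF_MAJOR`, `enum ocf_version_status`, `check_ocf_version()`) are hypothetical, and treating an unparseable version string like a missing one is an assumption rather than something stated in this PR.

```c
#include <stddef.h>
#include <stdio.h>

/* Assumption: the highest OCF major version supported is 1 */
#define SUPPORTED_OCF_MAJOR 1

enum ocf_version_status {
    OCF_VERSION_MISSING,  /* caller would log a warning */
    OCF_VERSION_TOO_NEW,  /* caller would log a warning */
    OCF_VERSION_OK        /* caller would log a debug message */
};

/* Classify the version string an agent advertises in its meta-data */
enum ocf_version_status check_ocf_version(const char *version)
{
    int major = 0;

    /* Treat an empty string the same as no version at all */
    if (version == NULL || version[0] == '\0') {
        return OCF_VERSION_MISSING;
    }
    /* Assumption: an unparseable version counts as missing */
    if (sscanf(version, "%d", &major) != 1) {
        return OCF_VERSION_MISSING;
    }
    if (major > SUPPORTED_OCF_MAJOR) {
        return OCF_VERSION_TOO_NEW;
    }
    return OCF_VERSION_OK;
}
```

Only the major version decides between the warning and the debug message here, so "1.0" and "1.1" are both classified as supported.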

kgaillot · 2021-04-20T18:11:44Z

Rebased on current master and added 5 commits on top to address review

kgaillot · 2021-04-21T14:46:07Z

After further testing, I added 2 more commits on top -- one to fix a regression since 1.1.18 (!) and one to always advertise OCF 1.1 support to agents (since we do support 1.1 agents regardless of compatibility settings).

@wenningerk , can you review the top 7 commits? Thanks ...

wenningerk · 2021-04-21T15:00:11Z

> @wenningerk , can you review the top 7 commits? Thanks ...

The top 4 are fine with me; I have only looked that far.
Maybe we could use inotify on platforms that support it to monitor the agent scripts, or add an external interface where a platform-specific script (a simple inotifywait loop in the standard Linux case) handles the platform-specific part.
I was already hesitant about the 1.1 advertisement and wanted to discuss it, but as you see it the same way ...

Upon further consideration, Pacemaker should advertise OCF 1.1 support to agents even when built with --enable-compat-2.0, because Pacemaker supports the OCF 1.1 role names and reload-agent action in the agent either way, even if it's not using the new role names in log messages and output itself.

kgaillot · 2021-04-21T17:25:42Z

OK, I still have a little testing to do, but hopefully this is the final form. The last 4 commits are different from last time.

wenningerk · 2021-04-22T08:54:22Z

> OK, I still have a little testing to do, but hopefully this is the final form. The last 4 commits are different from last time.

Changes look good to me.

wenningerk reviewed Apr 19, 2021

daemons/controld/controld_metadata.c (resolved)

daemons/controld/controld_execd.c (resolved)

daemons/controld/controld_execd.c (outdated, resolved)

daemons/controld/controld_execd.c (outdated, resolved)

kgaillot added 19 commits April 20, 2021 13:11

API: libcrmcommon: add CRMD_ACTION_RELOAD_AGENT string constant

53f9f86

... for new OCF 1.1 "reload-agent" action

Doc: resources: improve Pacemaker Remote meta-data

34181b2

... and formatting

Feature: resources: support OCF 1.1 standard in ocf:pacemaker:remote

2287df2

Feature: resources: support OCF 1.1 standard in ocf:pacemaker:Dummy

7f40818

Refactor: controller: reorganize build_parameter_list()

0877832

This will make it easier to add new criteria.

Refactor: controller: enhance resource meta-data getter

56296db

... for better code isolation and future reuse

Feature: scheduler: support OCF 1.1 reload-agent action

34cbe57

Feature: libcrmservice: advertise OCF 1.1 support to resource agents

27fc165

... unless Pacemaker was built using --enable-compat-2.0, in which case it's really 1.0 with extensions.

Feature: libcrmcommon: bump feature set for reload-agent support

e4457d9

In a rolling upgrade, the DC can't schedule reload-agent actions until all nodes have been upgraded.
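A minimal sketch of the feature-set gate described above, assuming feature sets are dotted numeric strings and that the DC knows the lowest feature set among the cluster nodes. The function names and any version numbers passed in are hypothetical; this PR does not state the actual feature set value.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Parse "major.minor.patch" into v[]; missing parts stay 0 */
static void parse_feature_set(const char *s, int v[3])
{
    v[0] = v[1] = v[2] = 0;
    if (s != NULL && sscanf(s, "%d.%d.%d", &v[0], &v[1], &v[2]) < 1) {
        v[0] = v[1] = v[2] = 0;
    }
}

/* Return <0, 0, or >0 as feature set a is older, equal, or newer than b */
static int compare_feature_sets(const char *a, const char *b)
{
    int av[3];
    int bv[3];

    parse_feature_set(a, av);
    parse_feature_set(b, bv);
    for (int i = 0; i < 3; i++) {
        if (av[i] != bv[i]) {
            return (av[i] < bv[i]) ? -1 : 1;
        }
    }
    return 0;
}

/* The DC may schedule reload-agent only once the lowest feature set
 * in the cluster has reached the one that introduced it. */
bool can_schedule_reload_agent(const char *lowest_in_cluster,
                               const char *required)
{
    return compare_feature_sets(lowest_in_cluster, required) >= 0;
}
```

The comparison is numeric per component, so a hypothetical "3.10.0" counts as newer than "3.8.0" even though it sorts earlier as a string.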

Test: cts-scheduler: update expected test results for reload vs reload-agent change

45c8d74

Doc: Pacemaker Explained: update for OCF 1.1 reload-agent action

0e155ef

Refactor: controller: reorganize build_parameter_list() again

df4033a

... to make future changes less error-prone

API: libcrmcommon: add PCMK_OCF_MAJOR_VERSION string constant

389f816

API: libcrmcommon: add PCMK_OCF_MINOR_VERSION string constant

75818c3

API: libcrmcommon: add PCMK_OCF_VERSION string constant

029e290

Log: controller: info message when agent supports newer OCF 1.x standard

b168e00

Refactor: controller: streamline OCF 1.1 compliance check

7db36a4

kgaillot force-pushed the ocf11 branch from c10ecbe to 7db36a4 Compare April 20, 2021 18:11

Fix: controller: always refresh agent meta-data after start

4826a8d

c820651 (in 1.1.18) introduced a regression where cached meta-data was allowed to be used after a start action. Meta-data should always be refreshed from the agent after a start, in case the resource agent was updated.
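The rule this fix restores can be stated as a small predicate. This is only a sketch under assumptions: `may_use_cached_metadata()` and its parameters are hypothetical, and the real controller logic may consider more than the action name.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Decide whether cached meta-data may be reused after an action
 * completes. After a start, the agent may have been updated on disk,
 * so the cache must not be trusted. */
bool may_use_cached_metadata(const char *completed_action, bool cache_present)
{
    assert(completed_action != NULL);

    if (!cache_present) {
        return false;   /* nothing cached; must query the agent */
    }
    return strcmp(completed_action, "start") != 0;
}
```

For any other action, such as a recurring monitor, the cached copy remains usable in this sketch.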

kgaillot added 3 commits April 21, 2021 12:08

Refactor: controller: rename supports_reload to supports_legacy_reload

80277ee

... to make its purpose clearer

Low: controller: check for empty OCF version as well as NULL

6225f7b

kgaillot force-pushed the ocf11 branch from 045eef5 to 6225f7b Compare April 21, 2021 17:21

kgaillot merged commit d469c63 into ClusterLabs:master Apr 22, 2021

kgaillot deleted the ocf11 branch April 22, 2021 16:36
