
Modifying the options of an existing Gluster volume leads to remove-brick command #56781

Closed
omarkohl opened this issue May 22, 2019 · 5 comments · Fixed by #58635
Labels
affects_2.8 · bug · glusterfs · community · module · storage · support:core · traceback

Comments

@omarkohl

SUMMARY

When trying to update the options of an existing Gluster volume, Ansible tries to remove bricks from that volume. The removal fails, but it should never be attempted in the first place. The problem does NOT exist with Ansible 2.7.10; it only appears in 2.8.

ISSUE TYPE
  • Bug Report
COMPONENT NAME

glusterfs

ANSIBLE VERSION
ansible 2.8.0
  config file = /redacted/path/ansible.cfg
  configured module search path = [u'/home/someuser/.ansible/plugins/modules', u'/usr/share/ansible/plugins/modules']
  ansible python module location = /home/someuser/.local/lib/python2.7/site-packages/ansible
  executable location = /home/someuser/bin/ansible
  python version = 2.7.16 (default, Apr  6 2019, 01:42:57) [GCC 8.3.0]
CONFIGURATION
DEFAULT_JINJA2_EXTENSIONS(/redacted/path/ansible.cfg) = jinja2.ext.do
OS / ENVIRONMENT
STEPS TO REPRODUCE
  - name: "Set the retention-period to 120s"
    gluster_volume:
      state: present
      name: "volume1"
      options:
        { features.default-retention-period: "120" }
EXPECTED RESULTS

The option for the volume is updated and nothing else is done.

ACTUAL RESULTS

Ansible tries to remove bricks from the volume. This fails.

TASK [Set the retention-period to 120s] ****************************************
An exception occurred during task execution. To see the full traceback, use -vvv. The error was: None
fatal: [host-n1]: FAILED! => {"changed": false, "msg": "error running gluster (/usr/sbin/gluster --mode=script volume remove-brick volume1 host-n1:/gluster/brick1/glusterbrick host-n2:/gluster/brick1/glusterbrick host-n3:/gluster/brick1/glusterbrick host-n4:/gluster/brick1/glusterbrick start) command (rc=1): volume remove-brick start: failed: Removing bricks from replicate configuration is not allowed without reducing replica count explicitly.\n"}
@omarkohl
Author

I think it is likely related to #38269.

@ansibot
Contributor

ansibot commented May 22, 2019

@ansibot added the affects_2.8, bug, glusterfs, community, module, needs_triage, storage, support:core and traceback labels on May 22, 2019
@devyanikota
Contributor

Hi @omarkohl,
The module expects the volume's existing bricks to be listed in the playbook as well. When the 'bricks' field is absent, a remove-brick operation is triggered, because the module reads the missing field as permission to remove those bricks. In simple words, the playbook is expected to describe the exact current state of the volume, which includes the bricks on each node.
So, to change the 'options' field for the volume, please also list its existing bricks in the task to avoid this issue.
Please let us know in case you need more help. Thanks.
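
For reference, a workaround task would look something like the sketch below. The brick path and host names are read from the remove-brick command in the error output above; the replica count of 4 is an assumption and must match the volume's actual layout.

  - name: "Set the retention-period to 120s"
    gluster_volume:
      state: present
      name: "volume1"
      # Restate the volume's current bricks so their absence is not
      # interpreted as a request to remove them.
      bricks: "/gluster/brick1/glusterbrick"
      replicas: 4  # assumed; must match the volume's real replica count
      cluster:
        - host-n1
        - host-n2
        - host-n3
        - host-n4
      options:
        features.default-retention-period: "120"

The current brick list can be read from the output of gluster volume info volume1 on any node of the cluster.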

@ansibot removed the needs_triage label on May 22, 2019
@omarkohl
Author

OK, I understand... but I think this behaviour is bad:

  • It is a major and destructive change from version 2.7.10 to 2.8: bricks could suddenly be removed from existing volumes just by performing a trivial operation such as the one above.
  • Requiring all existing bricks to be specified is a lot of overhead, and I don't have easy access to that list when running the playbook. Why is it necessary? In general, Ansible does not require EVERY option to be specified in order to change something; whatever is not part of the task is left unchanged by default (at least that is my understanding).

I would suggest adding and removing bricks only when a list of bricks is specified in the playbook, and leaving them alone when it is not (illustrated below).
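
To make the suggested semantics concrete, here is a sketch (intent only, not a description of any existing patch; host and brick names reuse the values from the error output above):

  # Option-only task: no 'bricks' field, so the brick layout is not touched.
  - name: "Set the retention-period to 120s"
    gluster_volume:
      state: present
      name: "volume1"
      options:
        features.default-retention-period: "120"

  # Brick management is opt-in: adding/removing happens only when 'bricks'
  # (and the matching 'cluster' hosts) are declared explicitly.
  - name: "Reconcile bricks for volume1"
    gluster_volume:
      state: present
      name: "volume1"
      replicas: 4
      bricks: "/gluster/brick1/glusterbrick"
      cluster:
        - host-n1
        - host-n2
        - host-n3
        - host-n4

This would keep the destructive operation behind an explicit declaration instead of inferring it from an omitted field.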

@sac
Contributor

sac commented Jul 2, 2019

@omarkohl you are right. If the existing bricks are large in number, listing them all becomes a lot of overhead. And more importantly, it is error-prone.

sac added a commit to sac/ansible that referenced this issue Jul 2, 2019
@ansible locked and limited conversation to collaborators on Aug 26, 2019