Adding a new node to a DeepOps Slurm cluster fails #1178

@karanveersingh5623

The -l (--limit) option is not working when I add a new node to the Slurm cluster.
The error trace is below:

(env) (base) [root@adas-ml deepops]# ansible-playbook -l adas-ml-2 playbooks/slurm-cluster.yml -vvv

TASK [DeepOps.hosts : Install custom hosts file] ***************************************************************************************************************
task path: /root/deepops/roles/galaxy/DeepOps.hosts/tasks/main.yml:23
[WARNING]: conditional statements should not include jinja2 templating delimiters such as {{ }} or {% %}. Found: {{ slurm_configure_etc_hosts | default(true) }}
<192.168.61.3> ESTABLISH SSH CONNECTION FOR USER: None
<192.168.61.3> SSH: EXEC ssh -o ControlMaster=auto -o ControlPersist=5m -o ConnectionAttempts=100 -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -o KbdInteractiveAuthentication=no -o PreferredAuthentications=gssapi-with-mic,gssapi-keyex,hostbased,publickey -o PasswordAuthentication=no -o ConnectTimeout=60 -o 'ControlPath=~/.ssh/ansible-%r@%h:%p' 192.168.61.3 '/bin/sh -c '"'"'( umask 77 && mkdir -p "` echo /tmp `"&& mkdir "` echo /tmp/ansible-tmp-1654670502.5229077-43700-14549810058325 `" && echo ansible-tmp-1654670502.5229077-43700-14549810058325="` echo /tmp/ansible-tmp-1654670502.5229077-43700-14549810058325 `" ) && sleep 0'"'"''
<192.168.61.3> (0, b'ansible-tmp-1654670502.5229077-43700-14549810058325=/tmp/ansible-tmp-1654670502.5229077-43700-14549810058325\n', b'')
<192.168.61.3> ESTABLISH SSH CONNECTION FOR USER: None
<192.168.61.3> SSH: EXEC ssh -o ControlMaster=auto -o ControlPersist=5m -o ConnectionAttempts=100 -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -o KbdInteractiveAuthentication=no -o PreferredAuthentications=gssapi-with-mic,gssapi-keyex,hostbased,publickey -o PasswordAuthentication=no -o ConnectTimeout=60 -o 'ControlPath=~/.ssh/ansible-%r@%h:%p' 192.168.61.3 '/bin/sh -c '"'"'rm -f -r /tmp/ansible-tmp-1654670502.5229077-43700-14549810058325/ > /dev/null 2>&1 && sleep 0'"'"''
<192.168.61.3> (0, b'', b'')
fatal: [adas-ml-2]: FAILED! => changed=false
  msg: 'AnsibleUndefinedVariable: ''dict object'' has no attribute ''adas-ml-2'''

PLAY RECAP *****************************************************************************************************************************************************
adas-ml-2                  : ok=19   changed=0    unreachable=0    failed=1    skipped=7    rescued=0    ignored=1
