ceph-ansible: "Timeout when waiting for file /etc/ceph/ceph.client.admin.keyring" #1380
Comments
The monitor seems to have started, but Ceph needs the keys on the monitors. There must be some simple issue. From the deploy node (called mgmt) I can ssh root@mon-node1 without a password. Does that mean I did everything needed for the keys? [root@mon-node1 ceph]# ceph -s |
Check the monitor options in group_vars/all.yml: set either monitor_interface or monitor_address. |
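A minimal sketch for confirming which monitor network settings are actually active in the vars file. The function name show_mon_settings is hypothetical, and the default path assumes the command is run from the ceph-ansible checkout; only uncommented lines are printed, since a commented-out setting has no effect.

```shell
# Print the active (uncommented) monitor network settings from a vars file.
# Default path is an assumption: run from the root of the ceph-ansible checkout.
show_mon_settings() {
    local file="${1:-group_vars/all.yml}"
    grep -E '^[[:space:]]*(monitor_interface|monitor_address|public_network):' "$file"
}
```

Calling `show_mon_settings` with no argument reads group_vars/all.yml; if it prints nothing, none of the three settings is set in that file.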
I used eth0 for monitor interface |
Which version did you select? In 10.2.5 the admin key is built before ceph-mon starts; in version 11.xxx.xx it is created by the ansible-playbook, in roles/ceph-mon/tasks/ceph_keys.yml |
I tried 9 and 10, not 11. I am not sure what "Because the 10.2.5 the admin key build before ceph-mon start the monitor service file like this..." means. Does it mean 10 and 9 won't work? I can try 11. |
I tried 11 and got pretty much the same results. It can't generate the keyring in /etc/ceph on the monitor nodes. TASK [ceph-mon : wait for ceph.client.admin.keyring exists] ******************** |
More output:
TASK [ceph-mon : start the monitor service] ************************************
TASK [ceph-mon : include] ******************************************************
TASK [ceph-mon : collect admin and bootstrap keys (for or after kraken release)] ***
TASK [ceph-mon : wait for ceph.client.admin.keyring exists] ********************
RUNNING HANDLER [ceph.ceph-common : restart ceph mons] *************************
RUNNING HANDLER [ceph.ceph-common : restart ceph osds] *************************
RUNNING HANDLER [ceph.ceph-common : restart ceph mdss] *************************
RUNNING HANDLER [ceph.ceph-common : restart ceph rgws] *************************
RUNNING HANDLER [ceph.ceph-common : restart ceph nfss] *************************
PLAY RECAP ********************************************************************* |
@gator1 please check if your ceph-mon daemons are running. Look at the system logs for ceph-mon as well. |
How do I check if ceph-mon is running?
I tried this on one monitor node and got this.
Where do I get the system logs for it? Sorry, I am a newbie.
[root@mon-node2 ~]# ceph -s
2017-04-04 09:51:13.217859 7f16a0b18700 -1 auth: unable to find a keyring on /etc/ceph/ceph.client.admin.keyring,/etc/ceph/ceph.keyring,/etc/ceph/keyring,/etc/ceph/keyring.bin: (2) No such file or directory
On Tue, Apr 4, 2017 at 3:22 AM, Sébastien Han wrote:
@gator1 please check if your ceph-mon daemons are running. Look at the system logs for ceph-mon as well.
|
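A minimal sketch of how the monitor daemon and its logs could be inspected on a systemd host without the admin keyring. It assumes the cluster is named ceph and that the monitor ID is the node's short hostname (as in mon.mon-node2); the function names are hypothetical. The status query goes through the daemon's local admin socket, so it works even while /etc/ceph/ceph.client.admin.keyring is missing.

```shell
# Helpers that build the conventional names for a monitor with the given ID.
# Assumption: cluster name "ceph" unless a second argument says otherwise.
mon_unit() { printf 'ceph-mon@%s\n' "$1"; }
mon_asok() { printf '/var/run/ceph/%s-mon.%s.asok\n' "${2:-ceph}" "$1"; }
mon_log()  { printf '/var/log/ceph/%s-mon.%s.log\n' "${2:-ceph}" "$1"; }

# Show service state, recent journal entries, the log tail and the monitor's
# own view of the quorum. Run as root on the monitor node.
check_mon() {
    local id="${1:-$(hostname -s)}"
    systemctl --no-pager status "$(mon_unit "$id")"
    journalctl --no-pager -n 50 -u "$(mon_unit "$id")"
    tail -n 50 "$(mon_log "$id")"
    ceph --admin-daemon "$(mon_asok "$id")" mon_status
}
```

Run `check_mon mon-node2` on the monitor node. If the "state" field in the mon_status output stays at "probing" or "electing", the monitors have not formed a quorum, and the admin keyring cannot be created until they do.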
On a monitor node:
[root@mon-node2 ~]# ceph -s
2017-04-04 09:51:13.217859 7f16a0b18700 -1 auth: unable to find a keyring on /etc/ceph/ceph.client.admin.keyring,/etc/ceph/ceph.keyring,/etc/ceph/keyring,/etc/ceph/keyring.bin: (2) No such file or directory |
ceph-mon is running. Does ceph-ansible create the user ceph? [root@mon-node2 ceph]# ps -aux | grep ceph |
The ceph-mon log in /var/log doesn't seem to contain any errors, only:
2017-04-04 01:46:46.474482 7f448d3497c0 0 set uid:gid to 167:167 (ceph:ceph)
2017-04-04 01:46:46.518318 7f448d3497c0 1 leveldb: Delete type=3 #4
2017-04-04 01:46:46.519499 7f448d3497c0 0 starting mon.mon-node2 rank 1 at 10.145.82.102:6789/0 mon_data /var/lib/ceph/mon/ceph-mon-node2 fsid ba09e207-d873-4bca-ad02-ee76d76e335d
I attach the entire log here |
@gator1 try to run |
Usually this error occurs when you use the wrong public address, the wrong cluster address, or the wrong monitor interface. |
ceph-create-keys --cluster ceph --id mon-node1 and |
Here is the generated ceph.conf. This is a network without DHCP; each node is assigned a static IP. The setup is on a physical server (36 cores, 128 GB RAM, 20 TB total SSD) running Ubuntu 16.04.
Please do not change this file directly since it is managed by Ansible and will
[global]
[client.libvirt]
[osd] |
Did you get it fixed? I seem to have the same issue. I tried it first on 4 VMs and it worked perfectly; now the same config on hardware gives the same issue you have. |
I got it fixed by using the FQDN for the mon and making sure the mon interface is the same one that the FQDN maps to. |
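A minimal sketch of the check described above: does the name the monitor is known by resolve to an address that is actually configured on the monitor interface? The function names are hypothetical and eth0 is only a default taken from earlier in this thread. `getent` also consults /etc/hosts, so the check works without a DNS server.

```shell
# IPv4 addresses configured on an interface, one per line.
iface_ipv4() { ip -4 -o addr show dev "$1" | awk '{print $4}' | cut -d/ -f1; }

# First IPv4 address a host name resolves to (DNS or /etc/hosts).
host_ipv4() { getent ahostsv4 "$1" | awk 'NR==1 {print $1}'; }

# addr_in_list ADDR CANDIDATE...: succeed if ADDR equals one of the candidates.
addr_in_list() {
    local want="$1" a
    shift
    for a in "$@"; do
        [ "$a" = "$want" ] && return 0
    done
    return 1
}

# Compare the resolved address of NAME with the addresses on IFACE.
check_mon_addr() {
    local iface="${1:-eth0}" name="${2:-$(hostname -f)}" resolved
    resolved="$(host_ipv4 "$name")"
    # Unquoted on purpose: each interface address becomes one candidate.
    if addr_in_list "$resolved" $(iface_ipv4 "$iface"); then
        echo "OK: $name resolves to $resolved, which is on $iface"
    else
        echo "MISMATCH: $name resolves to '$resolved', which is not on $iface"
        return 1
    fi
}
```

Run `check_mon_addr eth0` on each monitor node; a MISMATCH line suggests the monitor is being addressed on an interface other than the one set as monitor_interface.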
I don't have an FQDN; there is no DNS server on my network. I finally bypassed this by following this link. I did everything in this link about the system, so I am not sure which step fixed it, maybe SELinux? Now ceph -s reports health HEALTH_WARN. It seems to be another long battle to fix that. |
No activity. Closing this, feel free to re-open. |
I am having the same issue. The ceph-ansible deployment fails.
And I can't create keyrings:
ansible hosts file:
this is the ceph.conf file:
Any ideas? |
I fixed my issue by changing the number of monitors from 2 to 1. |
@Masber normally we advise going with odd numbers when it comes to monitors. So please stick with this recommendation. |
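The arithmetic behind the odd-number advice can be sketched as follows: monitors need a strict majority to form a quorum, so an even count adds a node without tolerating any additional failure. The function names are hypothetical.

```shell
# Monitors required for a strict majority of N.
quorum_needed() { echo $(( $1 / 2 + 1 )); }

# Monitors that can fail while a majority still remains.
failures_tolerated() { echo $(( $1 - ($1 / 2 + 1) )); }

for n in 1 2 3 4 5; do
    printf 'monitors=%d quorum=%d tolerated_failures=%d\n' \
        "$n" "$(quorum_needed "$n")" "$(failures_tolerated "$n")"
done
```

With two monitors both must be up and able to reach each other, which is why a two-monitor setup with a connectivity problem between the nodes cannot form a quorum while a single monitor can.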
I got it fixed by running:
systemctl disable firewalld
sed -i 's/SELINUX=enforcing/SELINUX=disabled/g' /etc/selinux/config
reboot |
@ist0ne that was the answer! |
Use these rules to let Ceph create the cluster while the firewall keeps running. They cover OSD, MON, RADOSGW and MDS.
|
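The rules from the comment above are not preserved in this thread, so the following is only a sketch of what such firewalld rules could look like, not the commenter's originals. It assumes Ceph's default ports (6789/tcp for monitors, 6800-7300/tcp for OSD and MDS daemons), the firewalld zone public, and 7480/tcp for RADOSGW; the zone and the RADOSGW port are assumptions to adjust for your deployment. The hypothetical function ceph_firewall_cmds only prints the commands so they can be reviewed before running them.

```shell
# Print (not execute) firewalld commands that open the Ceph ports.
# Arguments: ZONE (default "public", an assumption) and
#            RGW_PORT (default 7480, an assumption; use your RADOSGW port).
ceph_firewall_cmds() {
    local zone="${1:-public}" rgw_port="${2:-7480}" port
    # 6789: monitors; 6800-7300: OSD and MDS daemons; rgw_port: RADOSGW.
    for port in 6789/tcp 6800-7300/tcp "${rgw_port}/tcp"; do
        printf 'firewall-cmd --zone=%s --add-port=%s --permanent\n' "$zone" "$port"
    done
    printf 'firewall-cmd --reload\n'
}

ceph_firewall_cmds
```

After checking the printed commands, they can be executed as root on each node, for example with `ceph_firewall_cmds public 8080 | sh`. Opening every port on every node is the simple variant; a stricter setup would open only the ports for the daemons that node runs.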
I ran into the same issue. Executing the firewall commands above fixed it; the SELinux commands above those did not. Try the firewall first. Cheers! |
Hi,
I set up seven KVM VMs running CentOS-7-x86_64-Minimal-1611.
They use a bridged network on the host with static IP addresses.
I set up a user cephuser in the wheel group, but ceph-ansible doesn't like that user.
Now I use root instead. I tried to set up three monitors (mon-node1 to mon-node3) and three OSDs (osd-node1 to osd-node3).
I use this all file and run ansible-playbook site.yml -u root, with site.yml copied unchanged from the sample.
I checked /etc/ceph on mon-node1 and there is no keyring file.
Do I need a certain utility for this to work? ssh-keygen works.
Please help.
all.txt
TASK [ceph-mon : wait for ceph.client.admin.keyring exists] ********************
fatal: [mon-node1]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for file /etc/ceph/ceph.client.admin.keyring"}
fatal: [mon-node3]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for file /etc/ceph/ceph.client.admin.keyring"}
fatal: [mon-node2]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for file /etc/ceph/ceph.client.admin.keyring"}
to retry, use: --limit @/root/ceph-ansible/site.retry
PLAY RECAP *********************************************************************
mon-node1 : ok=41 changed=1 unreachable=0 failed=1
mon-node2 : ok=39 changed=1 unreachable=0 failed=1
mon-node3 : ok=39 changed=1 unreachable=0 failed=1
osd-node1 : ok=2 changed=0 unreachable=0 failed=0
osd-node2 : ok=2 changed=0 unreachable=0 failed=0
osd-node3 : ok=2 changed=0 unreachable=0 failed=0