[BUG] Newly created VM stays stuck in "Starting" #17213
Comments
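I recreated a VM with exactly the same configuration and key, and it started successfully.
When the hostname is
Startup failure log:
@saltfishh This error is raised when qemu sets the NIC ifname while starting the VM; the NIC if-up script may have failed. The detailed logs are under /opt/cloud/workspace/servers/logs/<server_id>.
@saltfishh Could you run climc server-network-list --details --limit 0 --scope system and share the output?
@saltfishh I could not find an interface named LANVPCSub-13. Was it deleted?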
This has been happening more and more often lately: when batch-creating three hosts, one of them hits this problem and cannot start.
{
"__reason__": "Async start server failed: uld not configure /dev/net/tun (LANVPCSub-13): Invalid argument\n",
"__stage__": "OnStartComplete",
"__status__": "error"
}
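To follow up on the question about the missing interface, the name can be pulled out of the `__reason__` string and checked on the host. This is only a rough sketch: the helper name `tun_ifname_from_reason` is made up for illustration, and it assumes the reason text keeps the `/dev/net/tun (<name>)` shape shown above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of climc or the host agent):
# read a failure reason on stdin and print the interface name that
# appears in parentheses right after "/dev/net/tun".
tun_ifname_from_reason() {
  sed -n 's/.*\/dev\/net\/tun (\([^)]*\)).*/\1/p'
}

# Example use on the host (assumes iproute2 is installed):
#   ifname=$(echo "$reason" | tun_ifname_from_reason)
#   ip link show "$ifname"
```

If `ip link show` reports that the device does not exist, that matches what was observed for LANVPCSub-13 in this thread.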
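Three were created at that time, named kfc-cluster-#.
@saltfishh Do you still have an environment where the start fails? If so, please keep it so we can take a look.
Resolved.
After investigation: two VMs had already been deleted in the web UI, but on the host
kubectl logs -n onecloud -c host default-host-frl4l | grep -B1 'load server error'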
---
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 068830db-318d-452c-8b7d-0bdab8081967
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 76f8f8e8-3dfe-46af-89c3-2228394c62fd
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 9a5f7d21-1eae-4909-8599-5b1ceab586b8
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest bc82500a-a045-4e1b-84b0-8dd64bb1b516
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
These 4 machines hit a compatibility problem when the cloud platform was upgraded to 3.10 earlier.
cd /opt/cloud/workspace/servers/<server_uuid> && mv source-desc desc
Restart the agent:
kubectl rollout restart ds -n onecloud default-host
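For anyone with many guests on a host, the log check can be scripted so that only the affected UUIDs are printed. A minimal sketch, assuming the log lines keep the "Find existing guest <uuid>" / "On load server error" pairing shown in this thread; the function name `failed_guest_uuids` is made up for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read host-agent log text on stdin and print the
# UUID of every guest whose "Find existing guest" line is immediately
# followed by an "On load server error" line.
failed_guest_uuids() {
  awk '
    # Remember the UUID (last field) and wait for the next line.
    /Find existing guest/ { uuid = $NF; next }
    # The very next line is a load error: report the remembered UUID.
    /On load server error/ && uuid != "" { print uuid }
    # Any other line (or a handled error line) clears the pending UUID.
    { uuid = "" }
  '
}

# Example use (pod name taken from this thread; yours will differ):
#   kubectl logs -n onecloud -c host default-host-frl4l | failed_guest_uuids
```

It works on either the full log or the `grep -B1` output, since the `--` separator lines simply reset the pending UUID.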
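done
This is a compatibility problem that appeared when upgrading from 3.9 to 3.10.1, and it has been fixed in 3.10.2. The workaround is as described above: upgrade to a newer version, find the incompatible machines, overwrite their desc file, and restart host-agent.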
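The overwrite step can be wrapped in a small function so it is applied per UUID and can be previewed first. This is a sketch under assumptions, not an official tool: `restore_desc`, `SERVERS_DIR`, and `DRY_RUN` are names invented here, the default directory is the path quoted in this thread, and the `desc.bak` backup is an extra precaution that the original fix does not include.

```shell
#!/usr/bin/env bash
# Hypothetical helper: for each guest UUID given as an argument, replace
# desc with source-desc, as in the manual fix from this thread.
# SERVERS_DIR (assumed default below) is where guest directories live.
# DRY_RUN=1 (the default) only prints what would be done.
restore_desc() {
  local servers_dir="${SERVERS_DIR:-/opt/cloud/workspace/servers}"
  local uuid dir
  for uuid in "$@"; do
    dir="$servers_dir/$uuid"
    if [ ! -f "$dir/source-desc" ]; then
      echo "skip $uuid: no source-desc found" >&2
      continue
    fi
    if [ "${DRY_RUN:-1}" = "1" ]; then
      echo "would run: mv $dir/source-desc $dir/desc"
    else
      # Backup of the old desc: an addition of this sketch only.
      if [ -f "$dir/desc" ]; then
        cp -p "$dir/desc" "$dir/desc.bak"
      fi
      mv "$dir/source-desc" "$dir/desc"
    fi
  done
}

# Example use on the host, then restart the agent as described above:
#   restore_desc <server_uuid>             # preview only
#   DRY_RUN=0 restore_desc <server_uuid>   # apply
```

Defaulting to a dry run is a deliberate choice here, since the desc file is what the host agent loads for the guest.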
What happened:
Using batch creation, I created three VMs. Two were fine, but one of them stayed stuck in "Starting".
Environment:
cat /etc/os-release
uname -a
dmidecode | egrep -i 'manufacturer|product' |sort -u
kubectl exec -n onecloud $(kubectl get pods -n onecloud | grep climc | awk '{print $1}') -- climc version-list
Below is part of the host log (showing only the UUID of the VM that failed to start):