Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] 创建虚拟机一直显示启动中 #17213

Closed
saltfishh opened this issue May 31, 2023 · 16 comments
Closed

[BUG] 创建虚拟机一直显示启动中 #17213

saltfishh opened this issue May 31, 2023 · 16 comments
Labels
bug Something isn't working

Comments

@saltfishh
Copy link

问题描述/What happened:
使用批量创建, 创建了三台虚拟机, 两台OK, 其中一台虚拟机一直显示启动中.
环境/Environment:

  • OS (e.g. cat /etc/os-release):
    • centos7.9_2009
  • Kernel (e.g. uname -a):
  • Host: (e.g. dmidecode | egrep -i 'manufacturer|product' |sort -u)
    • dell R730XD
  • Service Version (e.g. kubectl exec -n onecloud $(kubectl get pods -n onecloud | grep climc | awk '{print $1}') -- climc version-list):

| ansible | release/3.10(84b6b97ba823050507)
Yunion API client version:
{
"major": "0",
"minor": "0",
"gitVersion": "v3.10.1",
"gitBranch": "release/3.10",
"gitCommit": "84b6b97ba8",
"gitTreeState": "clean",
"buildDate": "2023-05-05T06:55:34Z",
"goVersion": "go1.18.3",
"compiler": "gc",
"platform": "linux/amd64"
}

| cloudevent | release/3.10(84b6b97ba823050507) |
| cloudid | release/3.10(84b6b97ba823050507) |
|
|
|
|
|
|
|
| cloudmon | release/3.10(84b6b97ba823050507) |
| cloudproxy | release/3.10(84b6b97ba823050507) |
| compute_v2 | release/3.10(84b6b97ba823050507) |
| devtool | release/3.10(84b6b97ba823050507) |
| etcd | {"etcdserver":"3.4.6","etcdcluster":"3.4.0"} |
| identity | release/3.10(84b6b97ba823050507) |
| image | release/3.10(84b6b97ba823050507) |
| influxdb | 404 page not found |
| k8s | heads/v3.10.1-20230503.0(8962b3823050507) |
| log | release/3.10(84b6b97ba823050507) |
| monitor | release/3.10(84b6b97ba823050507) |
| notify | release/3.10(84b6b97ba823050507) |
| scheduledtask | release/3.10(84b6b97ba823050507) |
| scheduler | release/3.10(84b6b97ba823050507) |
| torrent-tracker | <title>Not Found</title> |
| webconsole | release/3.10(84b6b97ba823050507) |

以下是部分 host 日志(仅显示无法启动的虚拟机的 UUID):

[error 2023-05-31 07:06:34 guestman.(*SKVMGuestInstance).onMonitorTimeout(qemu-kvm.go:1330)] Monitor connect timeout, VM 75cae0b5-dfcb-41b1-857c-85da24dc5931 frozen: read tcp 127.0.0.1:55868->127.0.0.1:56408: read: connection reset by peer force restart!!!!
[error 2023-05-31 07:06:38 guestman.(*SKVMGuestInstance).StartMonitor(qemu-kvm.go:793)] Guest 75cae0b5-dfcb-41b1-857c-85da24dc5931 start monitor failed, can't get qmp monitor port or monitor path
[error 2023-05-31 07:06:38 guestman.(*SKVMGuestInstance).StartMonitor(qemu-kvm.go:793)] Guest 75cae0b5-dfcb-41b1-857c-85da24dc5931 start monitor failed, can't get qmp monitor port or monitor path
[info 2023-05-31 07:06:38 guestman.(*SKVMGuestInstance).asyncScriptStart(qemu-kvm.go:574)] VM started kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) ...
[info 2023-05-31 07:06:38 guestman.(*SKVMGuestInstance).asyncScriptStart(qemu-kvm.go:580)] Async start server kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) success!
[info 2023-05-31 07:06:38 monitor.(*QmpMonitor).read(qmp.go:227)] Scan over kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) ...
[info 2023-05-31 07:06:38 monitor.(*QmpMonitor).read(qmp.go:230)] QMP Disconnected kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931): read tcp 127.0.0.1:58226->127.0.0.1:56409: read: connection reset by peer
[error 2023-05-31 07:06:38 guestman.(*SKVMGuestInstance).onMonitorTimeout(qemu-kvm.go:1330)] Monitor connect timeout, VM 75cae0b5-dfcb-41b1-857c-85da24dc5931 frozen: read tcp 127.0.0.1:58226->127.0.0.1:56409: read: connection reset by peer force restart!!!!
[error 2023-05-31 07:06:42 guestman.(*SKVMGuestInstance).StartMonitor(qemu-kvm.go:793)] Guest 75cae0b5-dfcb-41b1-857c-85da24dc5931 start monitor failed, can't get qmp monitor port or monitor path
[error 2023-05-31 07:06:42 guestman.(*SKVMGuestInstance).StartMonitor(qemu-kvm.go:793)] Guest 75cae0b5-dfcb-41b1-857c-85da24dc5931 start monitor failed, can't get qmp monitor port or monitor path
[info 2023-05-31 07:06:42 guestman.(*SKVMGuestInstance).asyncScriptStart(qemu-kvm.go:574)] VM started kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) ...
[info 2023-05-31 07:06:42 guestman.(*SKVMGuestInstance).asyncScriptStart(qemu-kvm.go:580)] Async start server kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) success!
[info 2023-05-31 07:06:42 monitor.(*QmpMonitor).read(qmp.go:227)] Scan over kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931) ...
[info 2023-05-31 07:06:42 monitor.(*QmpMonitor).read(qmp.go:230)] QMP Disconnected kfc-cluster-1(75cae0b5-dfcb-41b1-857c-85da24dc5931): read tcp 127.0.0.1:34242->127.0.0.1:56410: read: connection reset by peer
[error 2023-05-31 07:06:42 guestman.(*SKVMGuestInstance).onMonitorTimeout(qemu-kvm.go:1330)] Monitor connect timeout, VM 75cae0b5-dfcb-41b1-857c-85da24dc5931 frozen: read tcp 127.0.0.1:34242->127.0.0.1:56410: read: connection reset by peer force restart!!!!

@saltfishh saltfishh added the bug Something isn't working label May 31, 2023
@saltfishh
Copy link
Author

重新创建了一台配置及密钥完全一致的虚拟机, 成功启动.
该问题可能无法复现

@saltfishh
Copy link
Author

当主机名为 kfc-cluster-1 时, 虚拟机会一直保持 启动中 的状态.
奇怪的是, 当前所有虚拟机名字都没有重复,包括回收站.
我换了个名字就能创建并开机成功, 只要是上面那个名字, 就无法启动.

@saltfishh
Copy link
Author

启动失败日志:
{
"reason": "Async start server failed: uld not configure /dev/net/tun (LANVPCSub-13): Invalid argument\n",
"stage": "OnStartComplete",
"status": "error"
}

@wanyaoqi
Copy link
Member

wanyaoqi commented May 31, 2023

启动失败日志: { "reason": "Async start server failed: uld not configure /dev/net/tun (LANVPCSub-13): Invalid argument\n", "stage": "OnStartComplete", "status": "error" }

@saltfishh 这个报错是 qemu 启动虚机设置 网卡 ifname报错,可能是执行网卡 if-up 脚本失败了,详细日志可以看下 /opt/cloud/workspace/servers/logs/<server_id> .

@saltfishh
Copy link
Author

启动失败日志: { "reason": "Async start server failed: uld not configure /dev/net/tun (LANVPCSub-13): Invalid argument\n", "stage": "OnStartComplete", "status": "error" }

@saltfishh 这个报错是 qemu 启动虚机设置 网卡 ifname报错,可能是执行网卡 if-up 脚本失败了,详细日志可以看下 /opt/cloud/workspace/servers/logs/ .

char device redirected to /dev/pts/14 (label charserial0)
qemu-system-x86_64: could not configure /dev/net/tun (LANVPCSub-13): Invalid argument
2023-05-31 07:09:10 Run command: ['/usr/local/qemu-4.2.0/bin/qemu-system-x86_64', '-enable-kvm', '-S', '-cpu', 'host,kvm=off,kvm_pv_eoi=on', '-chardev', 'socket,id=hmqmondev,port=56249,host=127.0.0.1,nodelay,server,nowait', '-mon', 'chardev=hmqmondev,id=hmqmon,mode=readline', '-chardev', 'socket,id=qmqmondev,port=56449,host=127.0.0.1,nodelay,server,nowait', '-mon', 'chardev=qmqmondev,id=qmqmon,mode=control', '-rtc', 'base=utc,clock=host,driftfix=none', '-nodefaults', '-no-user-config', '-global', 'kvm-pit.lost_tick_policy=discard', '-machine', 'pc,accel=kvm', '-k', 'en-us', '-smp', 'cpus=4,sockets=2,cores=120,maxcpus=240', '-name', "'kfc-cluster-1',debug-threads=on", '-uuid', '75cae0b5-dfcb-41b1-857c-85da24dc5931', '-m', '4096M,slots=4,maxmem=524288M', '-object', 'memory-backend-ram,id=mem,size=4096M', '-numa', 'node,memdev=mem', '-boot', 'cdn', '-device', 'VGA,id=video0,bus=pci.0,addr=0x02', '-vnc', ':349,password', '-object', 'iothread,id=iothread0', '-device', 'virtio-serial-pci,id=virtio-serial0,bus=pci.0,addr=0x03', '-device', 'virtio-scsi-pci,id=scsi,bus=pci.0,addr=0x04', '-drive', 'file=/data/storage/5e9ea4f8-ea14-4961-8ad2-1d767e8a1e69,if=none,id=drive_0,cache=writeback', '-device', 'scsi-hd,drive=drive_0,bus=scsi.0,id=drive_0', '-drive', 'file=/data/storage/a5e2089f-b72d-4381-8b22-bc320b9f6c15,if=none,id=drive_1,cache=writeback', '-device', 'scsi-hd,drive=drive_1,bus=scsi.0,id=drive_1', '-drive', 'id=ide0-cd0,if=none,media=cdrom', '-device', 'ide-cd,drive=ide0-cd0,bus=ide.1', '-netdev', 'type=tap,id=LANVPCSub-13,ifname=LANVPCSub-13,vhost=on,vhostforce=off,queues=2,script=/opt/cloud/workspace/servers/75cae0b5-dfcb-41b1-857c-85da24dc5931/if-up-brvpc-LANVPCSub-13.sh,downscript=/opt/cloud/workspace/servers/75cae0b5-dfcb-41b1-857c-85da24dc5931/if-down-brvpc-LANVPCSub-13.sh', '-device', 'virtio-net-pci,id=netdev-LANVPCSub-13,bus=pci.0,addr=0x05,netdev=LANVPCSub-13,mac=00:22:be:d1:5d:56,mq=on,vectors=5,speed=1000,host_mtu=1440', '-usb', '-device', 'usb-kbd', '-device', 'usb-tablet', '-device', 'qemu-xhci,id=usb,bus=pci.0,addr=0x06', '-pidfile', '/opt/cloud/workspace/servers/75cae0b5-dfcb-41b1-857c-85da24dc5931/pid', '-chardev', 'socket,id=qga0,server,nowait,path=/opt/cloud/workspace/servers/75cae0b5-dfcb-41b1-857c-85da24dc5931/qga.sock', '-device', 'virtserialport,bus=virtio-serial0.0,chardev=qga0,name=org.qemu.guest_agent.0', '-object', 'rng-random,id=rng0,filename=/dev/urandom', '-device', 'virtio-rng-pci,id=random0,bus=pci.0,addr=0x07,max-bytes=1024,period=1000,rng=rng0', '-chardev', 'pty,id=charserial0', '-device', 'isa-serial,chardev=charserial0,id=serial0', '-device', 'pvpanic,id=pvpanic,ioport=0x505']
char device redirected to /dev/pts/14 (label charserial0)
qemu-system-x86_64: could not configure /dev/net/tun (LANVPCSub-13): Invalid argument

LANVPCSub-13 我没有这个子网. 这个问题刚才能复现, 就是批量创建多个虚拟机, 后缀为 1 的会无法启动. 我创建了 3 次, 两次都这样.

@wanyaoqi
Copy link
Member

wanyaoqi commented May 31, 2023

@saltfishh 能否执行一下 climc server-network-list --details --limit 0 --scope system,看下输出

@saltfishh
Copy link
Author

@saltfishh 能否执行一下 climc server-network-list --details --limit 0 --scope system,看下输出

+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| Guest_ID | Guest | Network_ID | Network | Mac_addr | Mapped_Ip_Addr | IP_addr | Driver | BW_limit | Index | Virtual | Ifname |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| 401636b9-e8e8-4ea5-8731-093ae0010e97 | logcluster1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b9:81:1e:c9 | 100.64.126.228 | 10.11.1.157 | virtio | 1000 | 0 | false | LANVPCSub-29 |
| 1481391e-a7b3-4ffe-8507-248311f92144 | logcluster3 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:45:c3:69:2a | 100.64.126.238 | 10.11.1.156 | virtio | 1000 | 0 | false | LANVPCSub-28 |
| 42b734d8-186f-4c77-8b22-82249527857b | logcluster2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:30:63:ed:f0 | 100.64.126.244 | 10.11.1.155 | virtio | 1000 | 0 | false | LANVPCSub-27 |
| f3b80131-fcf0-4236-8b7d-5c73955fc9c9 | ncc-test-HA-2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:da:d5:0e:09 | 100.64.126.231 | 10.11.1.154 | virtio | 1000 | 0 | false | LANVPCSub-26 |
| bc5fd41c-7e96-4a57-8f19-9095568a696f | ncc-test-HA-1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:e1:2c:6b:70 | 100.64.126.237 | 10.11.1.150 | virtio | 1000 | 0 | false | LANVPCSub-22 |
| 34ac8df2-5663-4988-8506-8ae6d3862e39 | chenlun-work-win16 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:1e:29:63:fd | 100.64.126.234 | 10.11.1.151 | virtio | 0 | 0 | false | LANVPCSub-23 |
| ba48086e-db80-47ba-8c69-065c15a8e232 | ensp_public | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:56:db:1b:32 | 100.64.126.221 | 10.11.1.160 | virtio | 1000 | 0 | false | LANVPCSub-32 |
| c66a8f0f-c7d7-4c7e-8090-2e72c307def0 | mstsc-support-bak | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c0:b7:a0:68 | 100.64.126.250 | 10.11.1.135 | virtio | 1000 | 0 | false | LANVPCSub-7 |
| 0275c732-c015-4605-847c-210be0a0c482 | jumpserver_test_sxz | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b1:04:4f:76 | 100.64.126.218 | 10.11.1.167 | virtio | 1000 | 0 | false | LANVPCSub-39 |
| d664b9eb-4c18-4924-80fd-5406715130e5 | sxz_test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:73:2c:78:9d | 100.64.126.225 | 10.11.1.158 | virtio | 1000 | 0 | false | LANVPCSub-30 |
| 58cc90dc-7a79-485f-8f3c-20931aef9888 | zgw-test-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:29:a8:c3:30 | 100.64.126.213 | 10.11.1.172 | virtio | 1000 | 0 | false | LANVPCSub-44 |
| 81ea9a3f-6e02-496a-8e71-de109f9bb1d1 | zgw-host-windows | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:08:8e:d0:df | 100.64.126.214 | 10.11.1.171 | virtio | 1000 | 0 | false | LANVPCSub-43 |
| 892e9712-06d2-4bb8-8f0b-10de53cff4d1 | host-fan-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:70:a1:9a:25 | 100.64.126.223 | 10.11.1.162 | virtio | 1000 | 0 | false | LANVPCSub-34 |
| d7eaea5d-fcd3-4a08-877a-035d2d630902 | zgw-host-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:53:ba:8d:35 | 100.64.126.235 | 10.11.1.149 | virtio | 1000 | 0 | false | LANVPCSub-21 |
| bdb033fd-29fa-414d-8c7a-1a800ac9c612 | thx-linux-test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:8d:c0:d2:1f | 100.64.126.236 | 10.11.1.146 | virtio | 1000 | 0 | false | LANVPCSub-18 |
| 84bcb84d-d920-440c-897f-0c6fd3092371 | yangqi-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:a9:7f:aa:43 | 100.64.126.252 | 10.11.1.133 | virtio | 1000 | 0 | false | LANVPCSub-5 |
| d8adab62-0be9-4229-8f13-7efd8bab9fa7 | chenlun-kali-server | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c3:f8:d1:ce | 100.64.126.222 | 10.11.1.163 | virtio | 1000 | 0 | false | LANVPCSub-35 |
| 3539092c-d140-4f6e-82a6-a07bd596c25b | thx_linux_ansible | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:40:66:67:ac | 100.64.126.241 | 10.11.1.136 | virtio | 1000 | 0 | false | LANVPCSub-8 |
| 29a407a6-150b-442a-8074-9c2ba3952f99 | ncc-linux-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ce:73:3c:df | 100.64.126.224 | 10.11.1.161 | virtio | 1000 | 0 | false | LANVPCSub-33 |
| b03cecc0-ed37-4ab6-8166-991386744a35 | ssh-support-customer-jumpserver | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d1:dc:c0:26 | 100.64.126.251 | 10.11.1.134 | virtio | 1000 | 0 | false | LANVPCSub-6 |
| d3f3ed4d-07e9-45a9-852f-7828ab8f8b8f | yangqi-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:54:a0:88:94 | 100.64.126.240 | 10.11.1.144 | virtio | 1000 | 0 | false | LANVPCSub-16 |
| a7485391-39ac-4bb2-87e7-2e0fbdee1d9e | thx-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d3:11:ea:02 | 100.64.126.247 | 10.11.1.138 | virtio | 1000 | 0 | false | LANVPCSub-10 |
| 69145a18-4d26-4b86-8ef5-2ab5845dfda1 | fan-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ac:d4:45:4c | 100.64.126.242 | 10.11.1.140 | virtio | 1000 | 0 | false | LANVPCSub-12 |
| bcd42b9b-47a3-491a-822f-f84bf0c7441d | thx-win10 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b6:9a:a0:2f | 100.64.126.245 | 10.11.1.130 | virtio | 1000 | 0 | false | LANVPCSub-52n |
| acf64584-a6a3-4273-85b8-88e0f65f02f6 | ncc-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:fb:b2:6c:8d | 100.64.126.239 | 10.11.1.145 | virtio | 1000 | 0 | false | LANVPCSub-17 |
| bbb3e1b0-1906-475e-8220-ee77ddee2a2d | lan-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:5e:8f:a3:a3 | 100.64.126.255 | 10.11.1.148 | virtio | 1000 | 0 | false | LANVPCSub-2 |
| 93f22b6e-6e2c-429b-8a0d-34fe14a86d1b | yangqi_win | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:13:d8:67:00 | 100.64.126.232 | 10.11.1.153 | virtio | 1000 | 0 | false | LANVPCSub-25 |
| ed9ba29b-ac18-463d-8566-61248daf61b1 | sxz-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:6c:aa:5e | 100.64.126.246 | 10.11.1.139 | virtio | 1000 | 0 | false | LANVPCSub-11 |
| bc82500a-a045-4e1b-84b0-8dd64bb1b516 | DevDeptSelfUse | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:76:bf:44:4a | 100.64.126.248 | 10.11.1.137 | virtio | 1000 | 0 | false | LANVPCSub-9 |
| dcfd04dc-47fc-4c90-8961-fa107df4ff12 | mstsc_support | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:67:3c:c5:e1 | 100.64.126.253 | 10.11.1.132 | virtio | 1000 | 0 | false | LANVPCSub-4 |
| 9a5f7d21-1eae-4909-8599-5b1ceab586b8 | lan-db | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:88:a5:fa:88 | 100.64.126.243 | 10.11.1.142 | virtio | 1000 | 0 | false | LANVPCSub-14 |
| 068830db-318d-452c-8b7d-0bdab8081967 | VM-MgrNode | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:83:b0:ca | 100.64.126.254 | 10.11.1.131 | virtio | 0 | 0 | false | LANVPCSub-3 |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
*** Total: 32 Pages: 1 Limit: 2048 Offset: 0 Page: 1 ***

@wanyaoqi
Copy link
Member

@saltfishh 能否执行一下 climc server-network-list --details --limit 0 --scope system,看下输出

+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| Guest_ID | Guest | Network_ID | Network | Mac_addr | Mapped_Ip_Addr | IP_addr | Driver | BW_limit | Index | Virtual | Ifname |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| 401636b9-e8e8-4ea5-8731-093ae0010e97 | logcluster1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b9:81:1e:c9 | 100.64.126.228 | 10.11.1.157 | virtio | 1000 | 0 | false | LANVPCSub-29 |
| 1481391e-a7b3-4ffe-8507-248311f92144 | logcluster3 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:45:c3:69:2a | 100.64.126.238 | 10.11.1.156 | virtio | 1000 | 0 | false | LANVPCSub-28 |
| 42b734d8-186f-4c77-8b22-82249527857b | logcluster2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:30:63:ed:f0 | 100.64.126.244 | 10.11.1.155 | virtio | 1000 | 0 | false | LANVPCSub-27 |
| f3b80131-fcf0-4236-8b7d-5c73955fc9c9 | ncc-test-HA-2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:da:d5:0e:09 | 100.64.126.231 | 10.11.1.154 | virtio | 1000 | 0 | false | LANVPCSub-26 |
| bc5fd41c-7e96-4a57-8f19-9095568a696f | ncc-test-HA-1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:e1:2c:6b:70 | 100.64.126.237 | 10.11.1.150 | virtio | 1000 | 0 | false | LANVPCSub-22 |
| 34ac8df2-5663-4988-8506-8ae6d3862e39 | chenlun-work-win16 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:1e:29:63:fd | 100.64.126.234 | 10.11.1.151 | virtio | 0 | 0 | false | LANVPCSub-23 |
| ba48086e-db80-47ba-8c69-065c15a8e232 | ensp_public | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:56:db:1b:32 | 100.64.126.221 | 10.11.1.160 | virtio | 1000 | 0 | false | LANVPCSub-32 |
| c66a8f0f-c7d7-4c7e-8090-2e72c307def0 | mstsc-support-bak | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c0:b7:a0:68 | 100.64.126.250 | 10.11.1.135 | virtio | 1000 | 0 | false | LANVPCSub-7 |
| 0275c732-c015-4605-847c-210be0a0c482 | jumpserver_test_sxz | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b1:04:4f:76 | 100.64.126.218 | 10.11.1.167 | virtio | 1000 | 0 | false | LANVPCSub-39 |
| d664b9eb-4c18-4924-80fd-5406715130e5 | sxz_test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:73:2c:78:9d | 100.64.126.225 | 10.11.1.158 | virtio | 1000 | 0 | false | LANVPCSub-30 |
| 58cc90dc-7a79-485f-8f3c-20931aef9888 | zgw-test-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:29:a8:c3:30 | 100.64.126.213 | 10.11.1.172 | virtio | 1000 | 0 | false | LANVPCSub-44 |
| 81ea9a3f-6e02-496a-8e71-de109f9bb1d1 | zgw-host-windows | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:08:8e:d0:df | 100.64.126.214 | 10.11.1.171 | virtio | 1000 | 0 | false | LANVPCSub-43 |
| 892e9712-06d2-4bb8-8f0b-10de53cff4d1 | host-fan-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:70:a1:9a:25 | 100.64.126.223 | 10.11.1.162 | virtio | 1000 | 0 | false | LANVPCSub-34 |
| d7eaea5d-fcd3-4a08-877a-035d2d630902 | zgw-host-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:53:ba:8d:35 | 100.64.126.235 | 10.11.1.149 | virtio | 1000 | 0 | false | LANVPCSub-21 |
| bdb033fd-29fa-414d-8c7a-1a800ac9c612 | thx-linux-test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:8d:c0:d2:1f | 100.64.126.236 | 10.11.1.146 | virtio | 1000 | 0 | false | LANVPCSub-18 |
| 84bcb84d-d920-440c-897f-0c6fd3092371 | yangqi-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:a9:7f:aa:43 | 100.64.126.252 | 10.11.1.133 | virtio | 1000 | 0 | false | LANVPCSub-5 |
| d8adab62-0be9-4229-8f13-7efd8bab9fa7 | chenlun-kali-server | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c3:f8:d1:ce | 100.64.126.222 | 10.11.1.163 | virtio | 1000 | 0 | false | LANVPCSub-35 |
| 3539092c-d140-4f6e-82a6-a07bd596c25b | thx_linux_ansible | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:40:66:67:ac | 100.64.126.241 | 10.11.1.136 | virtio | 1000 | 0 | false | LANVPCSub-8 |
| 29a407a6-150b-442a-8074-9c2ba3952f99 | ncc-linux-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ce:73:3c:df | 100.64.126.224 | 10.11.1.161 | virtio | 1000 | 0 | false | LANVPCSub-33 |
| b03cecc0-ed37-4ab6-8166-991386744a35 | ssh-support-customer-jumpserver | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d1:dc:c0:26 | 100.64.126.251 | 10.11.1.134 | virtio | 1000 | 0 | false | LANVPCSub-6 |
| d3f3ed4d-07e9-45a9-852f-7828ab8f8b8f | yangqi-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:54:a0:88:94 | 100.64.126.240 | 10.11.1.144 | virtio | 1000 | 0 | false | LANVPCSub-16 |
| a7485391-39ac-4bb2-87e7-2e0fbdee1d9e | thx-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d3:11:ea:02 | 100.64.126.247 | 10.11.1.138 | virtio | 1000 | 0 | false | LANVPCSub-10 |
| 69145a18-4d26-4b86-8ef5-2ab5845dfda1 | fan-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ac:d4:45:4c | 100.64.126.242 | 10.11.1.140 | virtio | 1000 | 0 | false | LANVPCSub-12 |
| bcd42b9b-47a3-491a-822f-f84bf0c7441d | thx-win10 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b6:9a:a0:2f | 100.64.126.245 | 10.11.1.130 | virtio | 1000 | 0 | false | LANVPCSub-52n |
| acf64584-a6a3-4273-85b8-88e0f65f02f6 | ncc-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:fb:b2:6c:8d | 100.64.126.239 | 10.11.1.145 | virtio | 1000 | 0 | false | LANVPCSub-17 |
| bbb3e1b0-1906-475e-8220-ee77ddee2a2d | lan-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:5e:8f:a3:a3 | 100.64.126.255 | 10.11.1.148 | virtio | 1000 | 0 | false | LANVPCSub-2 |
| 93f22b6e-6e2c-429b-8a0d-34fe14a86d1b | yangqi_win | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:13:d8:67:00 | 100.64.126.232 | 10.11.1.153 | virtio | 1000 | 0 | false | LANVPCSub-25 |
| ed9ba29b-ac18-463d-8566-61248daf61b1 | sxz-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:6c:aa:5e | 100.64.126.246 | 10.11.1.139 | virtio | 1000 | 0 | false | LANVPCSub-11 |
| bc82500a-a045-4e1b-84b0-8dd64bb1b516 | DevDeptSelfUse | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:76:bf:44:4a | 100.64.126.248 | 10.11.1.137 | virtio | 1000 | 0 | false | LANVPCSub-9 |
| dcfd04dc-47fc-4c90-8961-fa107df4ff12 | mstsc_support | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:67:3c:c5:e1 | 100.64.126.253 | 10.11.1.132 | virtio | 1000 | 0 | false | LANVPCSub-4 |
| 9a5f7d21-1eae-4909-8599-5b1ceab586b8 | lan-db | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:88:a5:fa:88 | 100.64.126.243 | 10.11.1.142 | virtio | 1000 | 0 | false | LANVPCSub-14 |
| 068830db-318d-452c-8b7d-0bdab8081967 | VM-MgrNode | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:83:b0:ca | 100.64.126.254 | 10.11.1.131 | virtio | 0 | 0 | false | LANVPCSub-3 |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
*** Total: 32 Pages: 1 Limit: 2048 Offset: 0 Page: 1 ***

@saltfishh 没有找到 LANVPCSub-13 这个 interface,是删除了么?

@saltfishh
Copy link
Author

@saltfishh 能否执行一下 climc server-network-list --details --limit 0 --scope system,看下输出

+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| Guest_ID | Guest | Network_ID | Network | Mac_addr | Mapped_Ip_Addr | IP_addr | Driver | BW_limit | Index | Virtual | Ifname |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
| 401636b9-e8e8-4ea5-8731-093ae0010e97 | logcluster1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b9:81:1e:c9 | 100.64.126.228 | 10.11.1.157 | virtio | 1000 | 0 | false | LANVPCSub-29 |
| 1481391e-a7b3-4ffe-8507-248311f92144 | logcluster3 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:45:c3:69:2a | 100.64.126.238 | 10.11.1.156 | virtio | 1000 | 0 | false | LANVPCSub-28 |
| 42b734d8-186f-4c77-8b22-82249527857b | logcluster2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:30:63:ed:f0 | 100.64.126.244 | 10.11.1.155 | virtio | 1000 | 0 | false | LANVPCSub-27 |
| f3b80131-fcf0-4236-8b7d-5c73955fc9c9 | ncc-test-HA-2 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:da:d5:0e:09 | 100.64.126.231 | 10.11.1.154 | virtio | 1000 | 0 | false | LANVPCSub-26 |
| bc5fd41c-7e96-4a57-8f19-9095568a696f | ncc-test-HA-1 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:e1:2c:6b:70 | 100.64.126.237 | 10.11.1.150 | virtio | 1000 | 0 | false | LANVPCSub-22 |
| 34ac8df2-5663-4988-8506-8ae6d3862e39 | chenlun-work-win16 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:1e:29:63:fd | 100.64.126.234 | 10.11.1.151 | virtio | 0 | 0 | false | LANVPCSub-23 |
| ba48086e-db80-47ba-8c69-065c15a8e232 | ensp_public | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:56:db:1b:32 | 100.64.126.221 | 10.11.1.160 | virtio | 1000 | 0 | false | LANVPCSub-32 |
| c66a8f0f-c7d7-4c7e-8090-2e72c307def0 | mstsc-support-bak | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c0:b7:a0:68 | 100.64.126.250 | 10.11.1.135 | virtio | 1000 | 0 | false | LANVPCSub-7 |
| 0275c732-c015-4605-847c-210be0a0c482 | jumpserver_test_sxz | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b1:04:4f:76 | 100.64.126.218 | 10.11.1.167 | virtio | 1000 | 0 | false | LANVPCSub-39 |
| d664b9eb-4c18-4924-80fd-5406715130e5 | sxz_test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:73:2c:78:9d | 100.64.126.225 | 10.11.1.158 | virtio | 1000 | 0 | false | LANVPCSub-30 |
| 58cc90dc-7a79-485f-8f3c-20931aef9888 | zgw-test-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:29:a8:c3:30 | 100.64.126.213 | 10.11.1.172 | virtio | 1000 | 0 | false | LANVPCSub-44 |
| 81ea9a3f-6e02-496a-8e71-de109f9bb1d1 | zgw-host-windows | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:08:8e:d0:df | 100.64.126.214 | 10.11.1.171 | virtio | 1000 | 0 | false | LANVPCSub-43 |
| 892e9712-06d2-4bb8-8f0b-10de53cff4d1 | host-fan-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:70:a1:9a:25 | 100.64.126.223 | 10.11.1.162 | virtio | 1000 | 0 | false | LANVPCSub-34 |
| d7eaea5d-fcd3-4a08-877a-035d2d630902 | zgw-host-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:53:ba:8d:35 | 100.64.126.235 | 10.11.1.149 | virtio | 1000 | 0 | false | LANVPCSub-21 |
| bdb033fd-29fa-414d-8c7a-1a800ac9c612 | thx-linux-test | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:8d:c0:d2:1f | 100.64.126.236 | 10.11.1.146 | virtio | 1000 | 0 | false | LANVPCSub-18 |
| 84bcb84d-d920-440c-897f-0c6fd3092371 | yangqi-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:a9:7f:aa:43 | 100.64.126.252 | 10.11.1.133 | virtio | 1000 | 0 | false | LANVPCSub-5 |
| d8adab62-0be9-4229-8f13-7efd8bab9fa7 | chenlun-kali-server | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:c3:f8:d1:ce | 100.64.126.222 | 10.11.1.163 | virtio | 1000 | 0 | false | LANVPCSub-35 |
| 3539092c-d140-4f6e-82a6-a07bd596c25b | thx_linux_ansible | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:40:66:67:ac | 100.64.126.241 | 10.11.1.136 | virtio | 1000 | 0 | false | LANVPCSub-8 |
| 29a407a6-150b-442a-8074-9c2ba3952f99 | ncc-linux-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ce:73:3c:df | 100.64.126.224 | 10.11.1.161 | virtio | 1000 | 0 | false | LANVPCSub-33 |
| b03cecc0-ed37-4ab6-8166-991386744a35 | ssh-support-customer-jumpserver | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d1:dc:c0:26 | 100.64.126.251 | 10.11.1.134 | virtio | 1000 | 0 | false | LANVPCSub-6 |
| d3f3ed4d-07e9-45a9-852f-7828ab8f8b8f | yangqi-linux | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:54:a0:88:94 | 100.64.126.240 | 10.11.1.144 | virtio | 1000 | 0 | false | LANVPCSub-16 |
| a7485391-39ac-4bb2-87e7-2e0fbdee1d9e | thx-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:d3:11:ea:02 | 100.64.126.247 | 10.11.1.138 | virtio | 1000 | 0 | false | LANVPCSub-10 |
| 69145a18-4d26-4b86-8ef5-2ab5845dfda1 | fan-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:ac:d4:45:4c | 100.64.126.242 | 10.11.1.140 | virtio | 1000 | 0 | false | LANVPCSub-12 |
| bcd42b9b-47a3-491a-822f-f84bf0c7441d | thx-win10 | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:b6:9a:a0:2f | 100.64.126.245 | 10.11.1.130 | virtio | 1000 | 0 | false | LANVPCSub-52n |
| acf64584-a6a3-4273-85b8-88e0f65f02f6 | ncc-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:fb:b2:6c:8d | 100.64.126.239 | 10.11.1.145 | virtio | 1000 | 0 | false | LANVPCSub-17 |
| bbb3e1b0-1906-475e-8220-ee77ddee2a2d | lan-prometheus | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:5e:8f:a3:a3 | 100.64.126.255 | 10.11.1.148 | virtio | 1000 | 0 | false | LANVPCSub-2 |
| 93f22b6e-6e2c-429b-8a0d-34fe14a86d1b | yangqi_win | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:13:d8:67:00 | 100.64.126.232 | 10.11.1.153 | virtio | 1000 | 0 | false | LANVPCSub-25 |
| ed9ba29b-ac18-463d-8566-61248daf61b1 | sxz-host | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:6c:aa:5e | 100.64.126.246 | 10.11.1.139 | virtio | 1000 | 0 | false | LANVPCSub-11 |
| bc82500a-a045-4e1b-84b0-8dd64bb1b516 | DevDeptSelfUse | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:76:bf:44:4a | 100.64.126.248 | 10.11.1.137 | virtio | 1000 | 0 | false | LANVPCSub-9 |
| dcfd04dc-47fc-4c90-8961-fa107df4ff12 | mstsc_support | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:67:3c:c5:e1 | 100.64.126.253 | 10.11.1.132 | virtio | 1000 | 0 | false | LANVPCSub-4 |
| 9a5f7d21-1eae-4909-8599-5b1ceab586b8 | lan-db | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:88:a5:fa:88 | 100.64.126.243 | 10.11.1.142 | virtio | 1000 | 0 | false | LANVPCSub-14 |
| 068830db-318d-452c-8b7d-0bdab8081967 | VM-MgrNode | 249c0bc2-c1fd-45a4-8fc3-c8bb69af39ff | LANVPCSubnet | 00:22:58:83:b0:ca | 100.64.126.254 | 10.11.1.131 | virtio | 0 | 0 | false | LANVPCSub-3 |
+--------------------------------------+---------------------------------+--------------------------------------+--------------+-------------------+----------------+-------------+--------+----------+-------+---------+---------------+
*** Total: 32 Pages: 1 Limit: 2048 Offset: 0 Page: 1 ***

@saltfishh 没有找到 LANVPCSub-13 这个 interface,是删除了么?

没有动过, 这是我目前的 vpc subnet:
2023-06-01_15-10

@saltfishh
Copy link
Author

这个情况最近愈发频繁了, 批量创建三台主机, 会有一台显示这个问题. 无法启动.

{
    "__reason__": "Async start server failed: uld not configure /dev/net/tun (LANVPCSub-13): Invalid argument\n",
    "__stage__": "OnStartComplete",
    "__status__": "error"
}

@saltfishh
Copy link
Author

当时创建了三台, kfc-cluster-#.
kfc-cluster-2 无法启动.
把2删除后, 单独创建 kfc-cluster-2, 配置与批量创建时都一致.
就可以开机了...很离谱

@wanyaoqi
Copy link
Member

wanyaoqi commented Jun 5, 2023

@saltfishh 你现在有启动失败的环境吗,可以保留一下看一下

@saltfishh
Copy link
Author

已解决.

@saltfishh
Copy link
Author

经查, 是有两台虚拟机已经在web界面删除了, 但是在宿主机上 ps aux | grep qemu | grep <vm_name> 还能看到他.
当前版本 3.10.1.
升级到 3.10.2

kubectl logs -n onecloud -c host default-host-frl4l | grep -B1 'load server error'

---
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 068830db-318d-452c-8b7d-0bdab8081967
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 76f8f8e8-3dfe-46af-89c3-2228394c62fd
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 9a5f7d21-1eae-4909-8599-5b1ceab586b8
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest bc82500a-a045-4e1b-84b0-8dd64bb1b516
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f

这 4 台机器在之前升级云平台到 3.10 的时候, 出现了兼容性问题.

cd /opt/cloud/workspace/servers/<server_uuid> && mv source-desc desc

重启下 agent

 kubectl rollout restart ds -n onecloud default-host

@saltfishh saltfishh reopened this Jun 6, 2023
@saltfishh
Copy link
Author

done

@wanyaoqi
Copy link
Member

wanyaoqi commented Jun 6, 2023

经查, 是有两台虚拟机已经在web界面删除了, 但是在宿主机上 ps aux | grep qemu | grep <vm_name> 还能看到他. 当前版本 3.10.1. 升级到 3.10.2

kubectl logs -n onecloud -c host default-host-frl4l | grep -B1 'load server error'

---
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 068830db-318d-452c-8b7d-0bdab8081967
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 76f8f8e8-3dfe-46af-89c3-2228394c62fd
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest 9a5f7d21-1eae-4909-8599-5b1ceab586b8
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f
--
[info 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadExistingGuests(guestman.go:364)] Find existing guest bc82500a-a045-4e1b-84b0-8dd64bb1b516
[error 2023-06-06 08:57:36 guestman.(*SGuestManager).LoadServer(guestman.go:374)] On load server error: load desc ensure pci address: ensure disk 0 pci address: slot 00 out of range 01~1f

这 4 台机器在之前升级云平台到 3.10 的时候, 出现了兼容性问题.

cd /opt/cloud/workspace/servers/<server_uuid> && mv source-desc desc

重启下 agent

 kubectl rollout restart ds -n onecloud default-host

这个问题是3.9升级到3.10.1出现的兼容性问题,已经在3.10.2修复,解决方法如上,升级到高版本,找到不兼容的机器,覆盖 desc 文件后重启 host-agent

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants