Description
problem
So I have a problem adding a host which made me take a deep dive in the logs. I was horrified to see so many commands being issued by cloudstack-agent that are completely wrong and inappropriate for the system I'm running.
So on a Rocky linux 9.6, the agent tries to use iptables (the actual firewall is firewalld), looks for nfs.service (actual service is nfs-server.service), and does a bunch of stuff overriding system configuration as if I am on Ubuntu and not RhEl derivative.
Unsurprisingly, this doesn't work and any attempt to add the host fails with about few 1000s of lines of logs.
Using
cloudstack-common-4.20.1.0-1.noarch
cloudstack-agent-4.20.1.0-1.noarch
The agent ultimately get stuck in a loop - slice of the log below.
2025-06-13 14:22:08,741 INFO [utils.linux.KVMHostInfo] (Agent-Handler-1:[]) (logid:) Fetching host CPU arch
2025-06-13 14:22:08,746 INFO [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:[]) (logid:) Host uses control group [cgroup2fs].
2025-06-13 14:22:08,746 INFO [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:[]) (logid:) Calculating the max shares of the host.
2025-06-13 14:22:08,747 INFO [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:[]) (logid:) The max shares of the host is [14400].
2025-06-13 14:22:08,791 WARN [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:[]) (logid:) Execution of process [113893] for command [sudo grep InitiatorName= /etc/iscsi/initiatorname.iscsi ] failed.
2025-06-13 14:22:08,791 WARN [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:[]) (logid:) Process [113893] for command [sudo grep InitiatorName= /etc/iscsi/initiatorname.iscsi ] encountered the error: [grep: /etc/iscsi/initiatorname.iscsi: No such file or directory].
2025-06-13 14:22:08,802 WARN [utils.script.Script] (Agent-Handler-1:[]) (logid:) Execution of process [113899] for command [/bin/bash -c virt-v2v --version ] failed.
2025-06-13 14:22:08,802 WARN [utils.script.Script] (Agent-Handler-1:[]) (logid:) Process [113899] for command [/bin/bash -c virt-v2v --version ] encountered the error: [127].
2025-06-13 14:22:08,813 INFO [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:[]) (logid:) Attempting to create storage pool f142354b-9aee-486d-9be0-625921e37195 (Filesystem) in libvirt
2025-06-13 14:22:08,813 INFO [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:[]) (logid:) Found existing defined storage pool f142354b-9aee-486d-9be0-625921e37195, using it.
2025-06-13 14:22:08,813 INFO [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:[]) (logid:) Trying to fetch storage pool f142354b-9aee-486d-9be0-625921e37195 from libvirt
2025-06-13 14:22:09,848 INFO [cloud.agent.Agent] (Agent-Handler-2:[]) (logid:) Process agent startup answer, agent [id: 0, uuid: null, name: null] connected to the server
2025-06-13 14:22:09,848 ERROR [cloud.agent.Agent] (Agent-Handler-2:[]) (logid:) Error parsing task java.lang.NullPointerException
at java.base/java.util.concurrent.ConcurrentHashMap.putVal(ConcurrentHashMap.java:1011)
at java.base/java.util.concurrent.ConcurrentHashMap.put(ConcurrentHashMap.java:1006)
at java.base/java.util.Properties.put(Properties.java:1301)
at java.base/java.util.Properties.setProperty(Properties.java:229)
at com.cloud.agent.dao.impl.PropertiesStorage.persist(PropertiesStorage.java:55)
at com.cloud.agent.AgentShell.setPersistentProperty(AgentShell.java:215)
at com.cloud.agent.Agent.setUuid(Agent.java:423)
at com.cloud.agent.Agent.processStartupAnswer(Agent.java:688)
at com.cloud.agent.Agent.processResponse(Agent.java:932)
at com.cloud.agent.Agent$ServerHandler.doTask(Agent.java:1220)
at com.cloud.utils.nio.Task.call(Task.java:83)
at com.cloud.utils.nio.Task.call(Task.java:29)
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
at java.base/java.lang.Thread.run(Thread.java:840)
2025-06-13 14:22:09,936 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-5:[]) (logid:) Attempting to create storage pool 22594f15-9678-347e-9627-0f935db15e9f (NetworkFilesystem) in libvirt
2025-06-13 14:22:09,939 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-5:[]) (logid:) Found existing defined storage pool 22594f15-9678-347e-9627-0f935db15e9f, using it.
2025-06-13 14:22:09,947 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-5:[]) (logid:) Trying to fetch storage pool 22594f15-9678-347e-9627-0f935db15e9f from libvirt
2025-06-13 14:22:10,069 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-1:[]) (logid:) Attempting to create storage pool 1cfe58ff-e09e-3b31-ae0d-449aab610592 (NetworkFilesystem) in libvirt
2025-06-13 14:22:10,072 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-1:[]) (logid:) Found existing defined storage pool 1cfe58ff-e09e-3b31-ae0d-449aab610592, using it.
2025-06-13 14:22:10,089 INFO [kvm.storage.LibvirtStorageAdaptor] (AgentRequest-Handler-1:[]) (logid:) Trying to fetch storage pool 1cfe58ff-e09e-3b31-ae0d-449aab610592 from libvirt
versions
The versions of ACS, hypervisors, storage, network etc..
non-working host:
Rocky 9.6, 5.14.0-570.21.1.el9_6.x86_64
libvirt-10.10.0-7.3.el9_6.x86_64
qemu-kvm-9.1.0-15.el9_6.4.x86_64
cloudstack-common-4.20.1.0-1.noarch
cloudstack-agent-4.20.1.0-1.noarch
Also have working hosts which run :
Rocky 9.5, 5.14.0-503.35.1.el9_5.x86_64
cloudstack-common-4.20.0.0-1.x86_64
cloudstack-agent-4.20.0.0-1.x86_64
libvirt-10.5.0-7.5.el9_5.x86_64
qemu-kvm-9.0.0-10.el9_5.2.x86_64
The steps to reproduce the bug
- Clean install Rocky 9.6
- Follow instructions on https://docs.cloudstack.apache.org/en/latest/installguide/configuration.html#adding-a-host which is terribly outdated, so also you need to look at https://docs.cloudstack.apache.org/en/latest/quickinstallationguide/qig.html to get things going
- Try to add the host
...
What to do about it?
No response