-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
functional tests are failing on 'shell to vm' #399
Comments
I would say that this is exactly why we need to make some order between SSH and root passwords, between password-based and public-key based authentication. We need to see that such a VM:
Currently, there's quite a bit of a mixup. It's very similar to the situation with Hosted-Engine. |
We've already experienced this issue before: #318 |
On Thu, Dec 22, 2016 at 5:50 PM, Gal Ben Haim ***@***.***> wrote:
We've already experienced this issue before: #318
<#318>
The interface is shown in the vm, but for some reason ifup/ifdown can't
recognize it. I don't think that this issue relates to ssh but also don't
have a good explanation why it relates to the boot menu.
Reminds me of an infamous issue in OpenStack - the network (and
specifically, the DHCP services) are not fast enough to be up and running
before the VM is already requesting the network.
Y.
… —
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#399 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEcJ6cLM54qtXgQ8J6OCYiRdfSsvW8bKks5rKpwugaJpZM4LSy0t>
.
|
As I see it we have two options:
@mykaul @nvgoldin @ifireball @david-caro |
I've opened a bug on libvirt on the issue.
Y.
…On Dec 23, 2016 1:03 PM, "Gal Ben Haim" ***@***.***> wrote:
As I see it we have two options:
1. Enable the boot menu.
2. Search for another image to use in the functional tests.
@mykaul <https://github.com/mykaul> @nvgoldin
<https://github.com/nvgoldin> @ifireball <https://github.com/ifireball>
@david-caro <https://github.com/david-caro>
Thoughts ?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#399 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEcJ6TQb2K9IZJs2PRK-KtrfRRytQPe0ks5rK6psgaJpZM4LSy0t>
.
|
I looked at dnsmasq logs:
and after the reboot:
So I'm not sure yet why it is complaining about duplicate definitions(and we try to add 4 hosts), and also why rebooting the CirrOS VM resolves the issue. Also nothing this doesn't always happen. |
We need to ensure Lago is not creating those definitions - that it is not racy. I think the networks are not prepared in parallel (but worth checking). |
@mykaul - will check. also need to check if the for reference another failure: http://jenkins.ovirt.org/job/lago_master_check-merged-fc25-x86_64/3/console |
On Sun, Dec 25, 2016 at 3:30 PM, Nadav Goldin ***@***.***> wrote:
@mykaul <https://github.com/mykaul> - will check. also need to check if
the networkCreateXML call to libvirt is blocking, in the sense that it
returns when completed.
perhaps we should ask isActive() on it?
for reference another failure: http://jenkins.ovirt.org/job/
lago_master_check-merged-fc25-x86_64/3/console
Notice that I've found #402 and
fixed it in #404 (I hope). Perhaps
it can explain some of the issues.
… —
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#399 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEcJ6SFFatca_b3pL-nJVQOSW9M9P7ATks5rLnASgaJpZM4LSy0t>
.
|
exactly what I'm checking now.. is #404 ok to merge? we can try merging and see how others go. |
On Sun, Dec 25, 2016 at 4:00 PM, Nadav Goldin ***@***.***> wrote:
perhaps we should ask isActive() on it?
exactly what I'm checking now..
is #404 <#404> ok to merge? we
can try merging and see how others go.
I sincerely hope so ;-)
… —
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#399 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEcJ6RJRSc_ym-ezFcXBexH-D5lVgpWiks5rLncPgaJpZM4LSy0t>
.
|
might have been resolved by #405, lets keep this ticket open for a while to see if it happens again in the following weeks. |
Unfortunately another failure: |
Moved the jobs back to VMs only for now. |
Finally got some true logs from the cirros vm on bootup:
For now I'm going to try add a work-around inside the image itself. |
lago functional tests fail when they run "too quickly", specifically this happens on bare-metals. So bringing them back to VMs for now. bug-url: lago-project/lago#399 Change-Id: I8c08145a4209e56e9e06b339fb853720314eb7b7 Signed-off-by: Nadav Goldin <ngoldin@redhat.com>
Did you open a bug on cirros?
…On Jan 5, 2017 12:30 PM, "Nadav Goldin" ***@***.***> wrote:
Finally got some true logs from the cirros vm on bootup:
Sending discover...
Sending discover...
Usage: /sbin/cirros-dhcpc <up|down>
No lease, failing
WARN: /etc/rc3.d/S40-network failed
For now I'm going to try add a work-around inside the image itself.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#399 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AEcJ6VzEOU_op79MU1loYRCciKI3E3NFks5rPMYrgaJpZM4LSy0t>
.
|
Not sure if this is cirros bug, I need some more work to debug this on the libvirt/dnsmasq, I would suspect them for as I didn't see those dhcp request get to dnsmasq at all(only after restarting the network/dhcp). |
libvirt bug: https://bugzilla.redhat.com/show_bug.cgi?id=1411025 |
Safe to say this was fixed with the cirros workaround., haven't seen it for a while. |
The CirrOS VM doesn't get an IP, logging in with lago console and executing
/etc/init.d/S40network restart
resolves the issue. The only change done to automation was afec1ac, could it be thatdnsmasq
is not yet configured when we runlago start
(now with KVM enabled it is probably done faster than before)? @david-caro - any ideas?The text was updated successfully, but these errors were encountered: