CLOUDSTACK-9707: While using hostid parameter, vm gets deployed on an… #1868

Merged (1 commit) on Jun 6, 2017

anshul1886

…other if the host

given is running out of capacity. If host id is specified the deployment should happen
on the given host and it should fail if the host is out of capacity. We are retrying
deployment on the entire zone without the given host id if we fail once. The retry,
which will retry on other hosts, should only be attempted if host id isn't given.
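
The intended control flow can be sketched in a few lines of Python (the language of the Marvin tests). This is a simplified model and not the actual CloudStack code: `plan_deployment`, `InsufficientCapacity`, and the capacity dictionary are hypothetical names used only for illustration.

```python
class InsufficientCapacity(Exception):
    """Raised when no acceptable host can fit the VM (hypothetical name)."""


def plan_deployment(free_capacity, required, host_id=None):
    """Pick a host for a VM that needs `required` units of capacity.

    free_capacity: dict mapping host id -> free capacity (illustrative model).
    host_id: host explicitly requested by the caller, or None.
    """
    if host_id is not None:
        # A host was requested: deploy there or fail. No zone-wide retry.
        if free_capacity.get(host_id, 0) >= required:
            return host_id
        raise InsufficientCapacity(f"host {host_id} is out of capacity")

    # No host requested: any host in the zone with enough room is acceptable.
    for candidate, free in free_capacity.items():
        if free >= required:
            return candidate
    raise InsufficientCapacity("no host in the zone has enough capacity")
```

With `hosts = {"h1": 1, "h2": 8}`, calling `plan_deployment(hosts, 4)` may pick `h2`, while `plan_deployment(hosts, 4, host_id="h1")` raises instead of silently moving the VM to `h2`.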

@cloudmonger

ACS CI BVT Run

Summary:
Build Number 323
Hypervisor xenserver
NetworkType Advanced
Passed=104
Failed=0
Skipped=7

Link to logs Folder (search by build_no): https://www.dropbox.com/sh/yj3wnzbceo9uef2/AAB6u-Iap-xztdm6jHX9SjPja?dl=0

Failed tests:

Skipped tests:
test_01_test_vm_volume_snapshot
test_vm_nic_adapter_vmxnet3
test_static_role_account_acls
test_11_ss_nfs_version_on_ssvm
test_nested_virtualization_vmware
test_3d_gpu_support
test_deploy_vgpu_enabled_vm

Passed test suites:
test_deploy_vm_with_userdata.py
test_affinity_groups_projects.py
test_portable_publicip.py
test_over_provisioning.py
test_global_settings.py
test_scale_vm.py
test_service_offerings.py
test_routers_iptables_default_policy.py
test_loadbalance.py
test_routers.py
test_reset_vm_on_reboot.py
test_deploy_vms_with_varied_deploymentplanners.py
test_network.py
test_router_dns.py
test_non_contigiousvlan.py
test_login.py
test_deploy_vm_iso.py
test_list_ids_parameter.py
test_public_ip_range.py
test_multipleips_per_nic.py
test_regions.py
test_affinity_groups.py
test_network_acl.py
test_pvlan.py
test_volumes.py
test_nic.py
test_deploy_vm_root_resize.py
test_resource_detail.py
test_secondary_storage.py
test_vm_life_cycle.py
test_routers_network_ops.py
test_disk_offerings.py

@koushik-das
Contributor

Code changes LGTM, also verified that deployment is not retried in case of failure if host is specified

@ustcweizhou
Contributor

Do we need to add a global setting to determine whether to deploy to other hosts if the specified host is out of capacity?

@anshul1886
Author

@ustcweizhou, in my opinion we should not do that, because it would give users the option of silently deploying a VM on a host they might not be aware of. After specifying a host, the user expects the VM to be deployed on that host. If they are OK with deploying the VM on a different host, they can retry the deployment without specifying hostid.
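
A minimal client-side sketch of that workflow, assuming a hypothetical `deploy` callable that wraps the deployment request and raises `RuntimeError` on failure. The point is that falling back to another host becomes an explicit choice of the caller rather than something the server does silently.

```python
def deploy_with_explicit_fallback(deploy, host_id, allow_fallback=False):
    """Try the requested host first; fall back only if the caller opted in.

    deploy: callable taking a host id (or None) and returning the host used,
            raising RuntimeError on failure (hypothetical client wrapper).
    """
    try:
        return deploy(host_id)
    except RuntimeError:
        if not allow_fallback:
            # The caller wanted this host specifically: surface the failure.
            raise
        # Second, deliberate attempt without hostid: the caller knows the VM
        # may land on a different host.
        return deploy(None)
```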

Member

@sateesh-chodapuneedi left a comment

LGTM code changes.

@ustcweizhou
Contributor

@anshul1886
Considering backwards compatibility, I would like to add the global setting; then users will have two options:
(1) VM deployment fails if the specified host does not have enough capacity. This is what this PR wants to do.
(2) Deploy to other hosts if the specified host does not have enough capacity. In this case, the specified host can be regarded as the preferred host. This is also how CloudStack currently works.

@anshul1886
Author

@ustcweizhou, actually I am restoring the old behaviour of the 3.x releases, which was changed to the current behaviour because of a bug.

@anshul1886
Author

@ustcweizhou, Is it looking good to you now?

@ustcweizhou
Contributor

@anshul1886
Sorry, I think it would be better to add a global setting.
This is compatible with 3.x, but not with 4.x, which has more users.

@anshul1886
Author

@ustcweizhou Added the global setting allow.deploy.vm.if.deploy.on.given.host.fails to control the behaviour.
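
As a rough illustration of how such a setting gates the retry, here is a small Python model. The setting name comes from this PR and `updateConfiguration` is the CloudStack API command for changing a configuration value; the function names and the way the setting value is passed in are assumptions made for the sketch, and request signing and authentication are not shown.

```python
SETTING_NAME = "allow.deploy.vm.if.deploy.on.given.host.fails"


def retry_on_other_hosts(host_id, allow_retry_if_given_host_fails):
    """Decide whether a failed deployment may be retried zone-wide."""
    if host_id is None:
        # No host was requested, so retrying elsewhere is always acceptable.
        return True
    # A host was requested: retry elsewhere only if the setting allows it.
    return bool(allow_retry_if_given_host_fails)


def update_configuration_params(value):
    """Build the parameters for an `updateConfiguration` API call."""
    return {
        "command": "updateConfiguration",
        "name": SETTING_NAME,
        "value": "true" if value else "false",
    }
```

Passing `True` for the setting models the pre-PR 4.x behaviour (the given host acts as a preferred host), and `False` models the strict behaviour this PR introduces.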

@ustcweizhou
Contributor

@anshul1886 good, I will test it.
Could you please change the default value to true?

@anshul1886
Author

@ustcweizhou I think the default value should be false, as that lets the user make an informed decision instead of misleading them, in keeping with the original purpose of the parameter. @sateesh-chodapuneedi @koushik-das What do you guys think?

@borisstoyanov
Contributor

@blueorangutan package

@blueorangutan

@borisstoyanov a Jenkins job has been kicked to build packages. I'll keep you posted as I make progress.

@blueorangutan

Packaging result: ✔centos6 ✔centos7 ✔debian. JID-579

@borisstoyanov
Contributor

@blueorangutan test

@blueorangutan

@borisstoyanov a Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests

@blueorangutan

Trillian test result (tid-942)
Environment: kvm-centos7 (x2), Advanced Networking with Mgmt server 7
Total time taken: 38934 seconds
Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr1868-t942-kvm-centos7.zip
Intermittent failure detected: /marvin/tests/smoke/test_network.py
Intermittent failure detected: /marvin/tests/smoke/test_privategw_acl.py
Intermittent failure detected: /marvin/tests/smoke/test_snapshots.py
Intermittent failure detected: /marvin/tests/smoke/test_templates.py
Intermittent failure detected: /marvin/tests/smoke/test_vpc_redundant.py
Test completed. 47 look ok, 2 have error(s)

Test Result Time (s) Test File
test_04_rvpc_privategw_static_routes Failure 351.00 test_privategw_acl.py
test_02_list_snapshots_with_removed_data_store Error 0.04 test_snapshots.py
test_01_vpc_site2site_vpn Success 165.58 test_vpc_vpn.py
test_01_vpc_remote_access_vpn Success 66.14 test_vpc_vpn.py
test_01_redundant_vpc_site2site_vpn Success 251.76 test_vpc_vpn.py
test_02_VPC_default_routes Success 285.21 test_vpc_router_nics.py
test_01_VPC_nics_after_destroy Success 520.60 test_vpc_router_nics.py
test_05_rvpc_multi_tiers Success 521.81 test_vpc_redundant.py
test_04_rvpc_network_garbage_collector_nics Success 1301.65 test_vpc_redundant.py
test_03_create_redundant_VPC_1tier_2VMs_2IPs_2PF_ACL_reboot_routers Success 569.66 test_vpc_redundant.py
test_02_redundant_VPC_default_routes Success 757.51 test_vpc_redundant.py
test_01_create_redundant_VPC_2tiers_4VMs_4IPs_4PF_ACL Success 1295.30 test_vpc_redundant.py
test_09_delete_detached_volume Success 157.72 test_volumes.py
test_08_resize_volume Success 156.45 test_volumes.py
test_07_resize_fail Success 161.44 test_volumes.py
test_06_download_detached_volume Success 156.31 test_volumes.py
test_05_detach_volume Success 150.77 test_volumes.py
test_04_delete_attached_volume Success 151.20 test_volumes.py
test_03_download_attached_volume Success 151.26 test_volumes.py
test_02_attach_volume Success 124.25 test_volumes.py
test_01_create_volume Success 715.42 test_volumes.py
test_03_delete_vm_snapshots Success 275.14 test_vm_snapshots.py
test_02_revert_vm_snapshots Success 100.79 test_vm_snapshots.py
test_01_create_vm_snapshots Success 159.74 test_vm_snapshots.py
test_deploy_vm_multiple Success 282.82 test_vm_life_cycle.py
test_deploy_vm Success 0.03 test_vm_life_cycle.py
test_advZoneVirtualRouter Success 0.02 test_vm_life_cycle.py
test_10_attachAndDetach_iso Success 26.74 test_vm_life_cycle.py
test_09_expunge_vm Success 125.21 test_vm_life_cycle.py
test_08_migrate_vm Success 31.03 test_vm_life_cycle.py
test_07_restore_vm Success 0.13 test_vm_life_cycle.py
test_06_destroy_vm Success 125.80 test_vm_life_cycle.py
test_03_reboot_vm Success 125.85 test_vm_life_cycle.py
test_02_start_vm Success 10.17 test_vm_life_cycle.py
test_01_stop_vm Success 40.32 test_vm_life_cycle.py
test_CreateTemplateWithDuplicateName Success 45.54 test_templates.py
test_08_list_system_templates Success 0.04 test_templates.py
test_07_list_public_templates Success 0.04 test_templates.py
test_05_template_permissions Success 0.06 test_templates.py
test_04_extract_template Success 5.15 test_templates.py
test_03_delete_template Success 5.11 test_templates.py
test_02_edit_template Success 90.14 test_templates.py
test_01_create_template Success 25.34 test_templates.py
test_10_destroy_cpvm Success 161.69 test_ssvm.py
test_09_destroy_ssvm Success 198.81 test_ssvm.py
test_08_reboot_cpvm Success 101.64 test_ssvm.py
test_07_reboot_ssvm Success 164.15 test_ssvm.py
test_06_stop_cpvm Success 131.89 test_ssvm.py
test_05_stop_ssvm Success 138.84 test_ssvm.py
test_04_cpvm_internals Success 1.31 test_ssvm.py
test_03_ssvm_internals Success 3.67 test_ssvm.py
test_02_list_cpvm_vm Success 0.13 test_ssvm.py
test_01_list_sec_storage_vm Success 0.13 test_ssvm.py
test_01_snapshot_root_disk Success 11.18 test_snapshots.py
test_04_change_offering_small Success 237.78 test_service_offerings.py
test_03_delete_service_offering Success 0.05 test_service_offerings.py
test_02_edit_service_offering Success 0.06 test_service_offerings.py
test_01_create_service_offering Success 0.11 test_service_offerings.py
test_02_sys_template_ready Success 0.13 test_secondary_storage.py
test_01_sys_vm_start Success 0.18 test_secondary_storage.py
test_09_reboot_router Success 35.31 test_routers.py
test_08_start_router Success 30.30 test_routers.py
test_07_stop_router Success 10.17 test_routers.py
test_06_router_advanced Success 0.06 test_routers.py
test_05_router_basic Success 0.04 test_routers.py
test_04_restart_network_wo_cleanup Success 5.70 test_routers.py
test_03_restart_network_cleanup Success 55.47 test_routers.py
test_02_router_internal_adv Success 1.10 test_routers.py
test_01_router_internal_basic Success 0.62 test_routers.py
test_router_dns_guestipquery Success 76.80 test_router_dns.py
test_router_dns_externalipquery Success 0.08 test_router_dns.py
test_router_dhcphosts Success 278.05 test_router_dhcphosts.py
test_router_dhcp_opts Success 21.77 test_router_dhcphosts.py
test_01_updatevolumedetail Success 0.07 test_resource_detail.py
test_01_reset_vm_on_reboot Success 146.02 test_reset_vm_on_reboot.py
test_createRegion Success 0.04 test_regions.py
test_create_pvlan_network Success 5.27 test_pvlan.py
test_dedicatePublicIpRange Success 0.45 test_public_ip_range.py
test_03_vpc_privategw_restart_vpc_cleanup Success 495.05 test_privategw_acl.py
test_02_vpc_privategw_static_routes Success 380.24 test_privategw_acl.py
test_01_vpc_privategw_acl Success 92.17 test_privategw_acl.py
test_01_primary_storage_nfs Success 35.92 test_primary_storage.py
test_createPortablePublicIPRange Success 15.21 test_portable_publicip.py
test_createPortablePublicIPAcquire Success 15.43 test_portable_publicip.py
test_isolate_network_password_server Success 90.49 test_password_server.py
test_UpdateStorageOverProvisioningFactor Success 0.16 test_over_provisioning.py
test_oobm_zchange_password Success 30.71 test_outofbandmanagement.py
test_oobm_multiple_mgmt_server_ownership Success 16.37 test_outofbandmanagement.py
test_oobm_issue_power_status Success 10.35 test_outofbandmanagement.py
test_oobm_issue_power_soft Success 15.54 test_outofbandmanagement.py
test_oobm_issue_power_reset Success 15.33 test_outofbandmanagement.py
test_oobm_issue_power_on Success 15.32 test_outofbandmanagement.py
test_oobm_issue_power_off Success 15.33 test_outofbandmanagement.py
test_oobm_issue_power_cycle Success 15.34 test_outofbandmanagement.py
test_oobm_enabledisable_across_clusterzones Success 92.93 test_outofbandmanagement.py
test_oobm_enable_feature_valid Success 5.23 test_outofbandmanagement.py
test_oobm_enable_feature_invalid Success 0.10 test_outofbandmanagement.py
test_oobm_disable_feature_valid Success 5.20 test_outofbandmanagement.py
test_oobm_disable_feature_invalid Success 0.11 test_outofbandmanagement.py
test_oobm_configure_invalid_driver Success 0.08 test_outofbandmanagement.py
test_oobm_configure_default_driver Success 0.08 test_outofbandmanagement.py
test_oobm_background_powerstate_sync Success 23.47 test_outofbandmanagement.py
test_extendPhysicalNetworkVlan Success 15.54 test_non_contigiousvlan.py
test_01_nic Success 429.23 test_nic.py
test_releaseIP Success 258.11 test_network.py
test_reboot_router Success 413.43 test_network.py
test_public_ip_user_account Success 10.25 test_network.py
test_public_ip_admin_account Success 40.35 test_network.py
test_network_rules_acquired_public_ip_3_Load_Balancer_Rule Success 66.96 test_network.py
test_network_rules_acquired_public_ip_2_nat_rule Success 61.95 test_network.py
test_network_rules_acquired_public_ip_1_static_nat_rule Success 163.82 test_network.py
test_delete_account Success 282.74 test_network.py
test_02_port_fwd_on_non_src_nat Success 55.66 test_network.py
test_01_port_fwd_on_src_nat Success 111.69 test_network.py
test_nic_secondaryip_add_remove Success 202.52 test_multipleips_per_nic.py
login_test_saml_user Success 19.28 test_login.py
test_assign_and_removal_lb Success 133.97 test_loadbalance.py
test_02_create_lb_rule_non_nat Success 187.43 test_loadbalance.py
test_01_create_lb_rule_src_nat Success 219.08 test_loadbalance.py
test_03_list_snapshots Success 0.07 test_list_ids_parameter.py
test_02_list_templates Success 0.04 test_list_ids_parameter.py
test_01_list_volumes Success 0.03 test_list_ids_parameter.py
test_07_list_default_iso Success 0.06 test_iso.py
test_05_iso_permissions Success 0.06 test_iso.py
test_04_extract_Iso Success 5.29 test_iso.py
test_03_delete_iso Success 95.21 test_iso.py
test_02_edit_iso Success 0.06 test_iso.py
test_01_create_iso Success 21.00 test_iso.py
test_04_rvpc_internallb_haproxy_stats_on_all_interfaces Success 208.59 test_internal_lb.py
test_03_vpc_internallb_haproxy_stats_on_all_interfaces Success 148.20 test_internal_lb.py
test_02_internallb_roundrobin_1RVPC_3VM_HTTP_port80 Success 495.86 test_internal_lb.py
test_01_internallb_roundrobin_1VPC_3VM_HTTP_port80 Success 431.76 test_internal_lb.py
test_dedicateGuestVlanRange Success 10.27 test_guest_vlan_range.py
test_UpdateConfigParamWithScope Success 0.14 test_global_settings.py
test_rolepermission_lifecycle_update Success 6.81 test_dynamicroles.py
test_rolepermission_lifecycle_list Success 6.06 test_dynamicroles.py
test_rolepermission_lifecycle_delete Success 5.86 test_dynamicroles.py
test_rolepermission_lifecycle_create Success 5.93 test_dynamicroles.py
test_rolepermission_lifecycle_concurrent_updates Success 6.00 test_dynamicroles.py
test_role_lifecycle_update_role_inuse Success 5.91 test_dynamicroles.py
test_role_lifecycle_update Success 10.97 test_dynamicroles.py
test_role_lifecycle_list Success 5.93 test_dynamicroles.py
test_role_lifecycle_delete Success 11.11 test_dynamicroles.py
test_role_lifecycle_create Success 5.90 test_dynamicroles.py
test_role_inuse_deletion Success 5.99 test_dynamicroles.py
test_role_account_acls_multiple_mgmt_servers Success 8.16 test_dynamicroles.py
test_role_account_acls Success 8.29 test_dynamicroles.py
test_default_role_deletion Success 6.00 test_dynamicroles.py
test_04_create_fat_type_disk_offering Success 0.08 test_disk_offerings.py
test_03_delete_disk_offering Success 0.04 test_disk_offerings.py
test_02_edit_disk_offering Success 0.06 test_disk_offerings.py
test_02_create_sparse_type_disk_offering Success 0.08 test_disk_offerings.py
test_01_create_disk_offering Success 0.24 test_disk_offerings.py
test_deployvm_userdispersing Success 20.60 test_deploy_vms_with_varied_deploymentplanners.py
test_deployvm_userconcentrated Success 106.26 test_deploy_vms_with_varied_deploymentplanners.py
test_deployvm_firstfit Success 105.94 test_deploy_vms_with_varied_deploymentplanners.py
test_deployvm_userdata_post Success 10.50 test_deploy_vm_with_userdata.py
test_deployvm_userdata Success 55.73 test_deploy_vm_with_userdata.py
test_02_deploy_vm_root_resize Success 5.98 test_deploy_vm_root_resize.py
test_01_deploy_vm_root_resize Success 5.99 test_deploy_vm_root_resize.py
test_00_deploy_vm_root_resize Success 247.79 test_deploy_vm_root_resize.py
test_deploy_vm_from_iso Success 217.83 test_deploy_vm_iso.py
test_DeployVmAntiAffinityGroup Success 75.99 test_affinity_groups.py
test_change_service_offering_for_vm_with_snapshots Skipped 0.00 test_vm_snapshots.py
test_01_test_vm_volume_snapshot Skipped 0.00 test_vm_snapshots.py
test_06_copy_template Skipped 0.00 test_templates.py
test_static_role_account_acls Skipped 0.02 test_staticroles.py
test_11_ss_nfs_version_on_ssvm Skipped 0.02 test_ssvm.py
test_01_scale_vm Skipped 0.00 test_scale_vm.py
test_01_primary_storage_iscsi Skipped 0.04 test_primary_storage.py
test_nested_virtualization_vmware Skipped 0.00 test_nested_virtualization.py
test_06_copy_iso Skipped 0.00 test_iso.py
test_deploy_vgpu_enabled_vm Skipped 0.03 test_deploy_vgpu_enabled_vm.py
test_3d_gpu_support Skipped 0.04 test_deploy_vgpu_enabled_vm.py

@sateesh-chodapuneedi
Member

Specifying a host in the deploy request itself signals an inclination towards using that host for the instance. Hence I think the default value should be false.
While a global setting makes sense, it would be better to add an optional parameter to the API itself that indicates whether the deployment should strictly be attempted on the specified host, if any, or look for other hosts as well. That way there is more granular, case-by-case control.
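
One way to picture this suggestion is a per-request value that takes precedence over the global setting. The sketch below is purely illustrative: the `request_override` argument is hypothetical, and nothing in this thread shows such an API parameter being added by this PR.

```python
def fallback_allowed(host_id, global_setting, request_override=None):
    """Resolve whether a deployment may fall back to other hosts.

    host_id: host requested by the caller, or None.
    global_setting: zone-wide default (True means fall back to other hosts).
    request_override: hypothetical per-request flag; None means "not given".
    """
    if host_id is None:
        # Nothing to be strict about when no host was requested.
        return True
    if request_override is not None:
        # The per-request choice wins over the global default.
        return bool(request_override)
    return bool(global_setting)
```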

@cloudmonger

ACS CI BVT Run

Summary:
Build Number 448
Hypervisor xenserver
NetworkType Advanced
Passed=104
Failed=1
Skipped=7

Link to logs Folder (search by build_no): https://www.dropbox.com/sh/yj3wnzbceo9uef2/AAB6u-Iap-xztdm6jHX9SjPja?dl=0

Failed tests:

  • test_routers_network_ops.py
      • test_01_isolate_network_FW_PF_default_routes_egress_true Failed

Skipped tests:
test_01_test_vm_volume_snapshot
test_vm_nic_adapter_vmxnet3
test_static_role_account_acls
test_11_ss_nfs_version_on_ssvm
test_nested_virtualization_vmware
test_3d_gpu_support
test_deploy_vgpu_enabled_vm

Passed test suites:
test_deploy_vm_with_userdata.py
test_affinity_groups_projects.py
test_portable_publicip.py
test_over_provisioning.py
test_global_settings.py
test_scale_vm.py
test_service_offerings.py
test_routers_iptables_default_policy.py
test_loadbalance.py
test_routers.py
test_reset_vm_on_reboot.py
test_deploy_vms_with_varied_deploymentplanners.py
test_network.py
test_router_dns.py
test_non_contigiousvlan.py
test_login.py
test_deploy_vm_iso.py
test_list_ids_parameter.py
test_public_ip_range.py
test_multipleips_per_nic.py
test_regions.py
test_affinity_groups.py
test_network_acl.py
test_pvlan.py
test_volumes.py
test_nic.py
test_deploy_vm_root_resize.py
test_resource_detail.py
test_secondary_storage.py
test_vm_life_cycle.py
test_disk_offerings.py

Contributor

@borisstoyanov left a comment

LGTM based on code review, manual verification and test results.

@ramkatru

tag:mergeready

…other if the host

given is running out of capacity. If host id is specified the deployment should happen
on the given host and it should fail if the host is out of capacity. We are retrying
deployment on the entire zone without the given host id if we fail once. The retry,
which will retry on other hosts, should only be attempted if host id isn't given.

Also, introduces global setting
allow.deploy.vm.if.deploy.on.given.host.fails with which old behaviour
can be restored
@karuturi merged commit ac4a02f into apache:master on Jun 6, 2017