Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

create: When timing out, display last failing operators from CVO #1230

Merged

Conversation

cgwalters
Copy link
Member

My libvirt installs are breaking today because there are no workers,
and Prometheus wants one. This makes it a more obvious
what went wrong without having to dive into the debug log.

@openshift-ci-robot openshift-ci-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Feb 11, 2019
@cgwalters
Copy link
Member Author

cgwalters commented Feb 11, 2019

Example output:

$ ~/src/github/cgwalters/homegit/bin/okdinst create osiris
Using libvirt NIC: br-osiris                                                                         
INFO Fetching OS image: redhat-coreos-maipo-47.312-qemu.qcow2.gz
INFO Consuming "Install Config" from target directory                                                                                                    
INFO Creating cluster...                                                                            
INFO Waiting up to 30m0s for the Kubernetes API...                                                                                                                   
INFO API v1.12.4+7ab8c74 up                              
INFO Waiting up to 30m0s for the bootstrap-complete event...                      
WARNING RetryWatcher - getting event failed! Re-creating the watcher. Last RV: 2951
INFO Destroying the bootstrap resources...                                                                              
INFO Waiting up to 30m0s for the cluster to initialize...                     
WARNING Cluster operator monitoring is reporting a failure: Failed to rollout the stack. Error: running task Updating Prometheus Operator failed: reconciling Prometheus Operator Deployment failed: updating deployment object failed: timed out waiting for the condition                                             
FATAL failed to initialize the cluster: timed out waiting for the condition

cmd/openshift-install/create.go Outdated Show resolved Hide resolved
cmd/openshift-install/create.go Outdated Show resolved Hide resolved
@cgwalters
Copy link
Member Author

Example output:

$ ~/src/github/cgwalters/homegit/bin/okdinst create osiris
Using libvirt NIC: br-osiris                                                                                                                                                                                      
INFO Fetching OS image: redhat-coreos-maipo-47.312-qemu.qcow2.gz                                    
INFO Consuming "Install Config" from target directory                                                                                                                                                             
INFO Creating cluster...                                 
INFO Waiting up to 30m0s for the Kubernetes API...                                                                                                                                                                
INFO API v1.12.4+897685a up                        
INFO Waiting up to 30m0s for the bootstrap-complete event...                                                                                                                                                      
WARNING RetryWatcher - getting event failed! Re-creating the watcher. Last RV: 2886
INFO Destroying the bootstrap resources...              
INFO Waiting up to 30m0s for the cluster to initialize...                                            
FATAL failed to initialize the cluster: Cluster operator monitoring is reporting a failure: Failed to rollout the stack. Error: running task Updating Prometheus Operator failed: reconciling Prometheus Operator D
eployment failed: updating deployment object failed: timed out waiting for the condition

My libvirt installs are breaking today because there are no workers,
and Prometheus wants one.  This makes it a more obvious
what went wrong without having to dive into the debug log.
@abhinavdahiya
Copy link
Contributor

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Feb 12, 2019
@cgwalters
Copy link
Member Author

@abhinavdahiya Pending — Not mergeable. Needs approved label.

@abhinavdahiya
Copy link
Contributor

/approve

@abhinavdahiya
Copy link
Contributor

/approve

ci-robot has taken away my approval capabilities 😱😱

@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: abhinavdahiya, cgwalters

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 12, 2019
@openshift-merge-robot openshift-merge-robot merged commit 6b4ba7f into openshift:master Feb 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants