New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
monitoring remove chattiness and race condition waits #1405
monitoring remove chattiness and race condition waits #1405
Conversation
/retest |
cmd/monitoring/monitoring.go
Outdated
hostnames = []string{} | ||
for iter, err := m.pipcli.ListVirtualMachineScaleSetPublicIPAddressesComplete(ctx, resourceGroup, "ss-master"); iter.NotDone(); err = iter.Next() { | ||
if err != nil { | ||
return nil, err | ||
time.Sleep(5 * time.Second) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we not lose IP addresses if we do this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might. But we will see less IP in the metrics.
If this returns err - we have a bigger problem. Not sure we want to try catching ARM errors here as it should be part of plugin code
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we're not aligned. these days the fake rp writes _data/containerservice.yaml before (and after) the cluster is created so you will end up here before the IPs exist. If you can, poll locally to be sure the cluster is created (can you read the provisioning state perhaps?) then fetch all the IPs. Adding time.Sleep(5 * time.Second)
without a loop basically achieves nothing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is issue with it now: #1416 Fixing in separate PR.
/retest |
e0eefe1
to
782bf12
Compare
/retest |
782bf12
to
095fcce
Compare
Codecov Report
@@ Coverage Diff @@
## master #1405 +/- ##
==========================================
- Coverage 45.03% 45.01% -0.03%
==========================================
Files 176 176
Lines 13042 13028 -14
==========================================
- Hits 5874 5865 -9
+ Misses 6818 6816 -2
+ Partials 350 347 -3 |
5a90c42
to
a3fd78a
Compare
/hold |
eff0180
to
86339a8
Compare
/retest |
@jim-minter If we could merge this, until we sort out https://github.com/openshift/openshift-azure/pull/1418/files . and when done we can move to monitor provisioning state as now it's a little bit broken. |
cmd/monitoring/monitoring.go
Outdated
time.Sleep(10 * time.Second) | ||
} else if iter.Value().IPAddress != nil { | ||
hostnames = append(hostnames, *iter.Value().IPAddress) | ||
break |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think this break should be here
cmd/monitoring/monitoring.go
Outdated
break | ||
} | ||
} | ||
if err == nil && len(hostnames) >= 4 { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"4" is very ugly
86339a8
to
70c4e8c
Compare
force pushed |
/hold cancel |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jim-minter, mjudeikis The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/retest Please review the full test history for this PR and help us cut down flakes. |
2 similar comments
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
fix: #1398
fix: #1399
remove chattiness and adds waits for race conditions