Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Merged by Bors] - improve platform stability #1497

Closed
wants to merge 119 commits into from

Conversation

sehz
Copy link
Contributor

@sehz sehz commented Aug 25, 2021

fixes #1490 and probably #1480 due to failure in health during installation.

  • Add minikube/k3 for K8 Test
  • Remove unneeded delay for tests
  • Improve K8 and Local installation
  • Simplify SPU health check to improve reliability
  • Improve SC's SPU controller reliability
  • Increase SPG's creation time to allow longer period for PVC provisioning
  • Error recovery for K8 Dispatcher in case of controller API issues
  • More instrumentation for tracing

@sehz sehz marked this pull request as draft August 25, 2021 00:07
@sehz sehz force-pushed the fix_health_check2 branch 3 times, most recently from 325bf89 to 5246216 Compare August 27, 2021 04:36
@sehz sehz linked an issue Aug 27, 2021 that may be closed by this pull request
@sehz sehz marked this pull request as ready for review August 29, 2021 04:12
@sehz
Copy link
Contributor Author

sehz commented Aug 29, 2021

bors r+

bors bot pushed a commit that referenced this pull request Aug 29, 2021
fixes #1490 and probably #1480 due to failure in health during installation.

* Add minikube/k3 for K8 Test
* Remove unneeded delay for tests
* Improve K8 and Local installation
* Simplify SPU health check to improve reliability
* Improve SC's SPU controller reliability
* Increase SPG's creation time to allow longer period for PVC provisioning
* Error recovery for K8 Dispatcher in case of controller API issues
* More instrumentation for tracing
@bors
Copy link

bors bot commented Aug 29, 2021

Pull request successfully merged into master.

Build succeeded:

@bors bors bot changed the title improve platform stability [Merged by Bors] - improve platform stability Aug 29, 2021
@bors bors bot closed this Aug 29, 2021
@sehz sehz deleted the fix_health_check2 branch September 24, 2021 04:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

CI Stability: Zero copy status handling Ci stability: health check failure
1 participant