Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[JUJU-592] LP1960235 worker does not stop #13717

Merged
merged 2 commits into from Feb 10, 2022

Conversation

hmlanigan
Copy link
Member

The unconverted-api-workers manifold was failing to quiesce for model migration or for agent upgrades. The #-container-worker was stopped, but not gone, causing the manifold to stay in "stopping".

Include the abort channel when calling StopAndRemoveWorker for #-container-worker, otherwise it is stopped, but never goes away correctly due to the differences in return errors with StopWorker. More investigation is needed around this. As well as finally writing manifolds to move the unconverted-api-workers into their appropriate locations.

QA steps

The following test is for migration, also a test for upgrade-juju should not require the machine agent to be bounced for success.

# Setup for model migration
$ juju bootstrap localhost destination
$ juju bootstrap localhost source
$ juju add-model moveme

# setup for the error scenario
$ juju deploy ubuntu
$ juju add-unit ubuntu --to lxd:0
# due to an outside issues, the container won't get to a full juju machine.
$ juju remove-unit ubuntu/1
$ juju remove-machine --force 0/lxd/1

# successful migrate
$ juju migrate moveme destination

Bug reference

https://bugs.launchpad.net/juju/+bug/1960235

Fixes LP1960235.  Otherwise the  #-container-watcher worker does not
respond appropriately when the unconverted-api-workers manifold is
stopping for migration or juju upgrades.
These run inside the unconverted-api-workers and their state changes are
not always obvious. Allow for more detail from the runner.
@hmlanigan hmlanigan added bug The PR addresses a bug 2.9 labels Feb 8, 2022
@hmlanigan
Copy link
Member Author

$$merge$$

@jujubot jujubot merged commit 2800d6e into juju:2.9 Feb 10, 2022
@hmlanigan hmlanigan deleted the lp1960235-worker-does-not-stop branch February 10, 2022 15:02
@wallyworld wallyworld mentioned this pull request Feb 17, 2022
jujubot added a commit that referenced this pull request Feb 21, 2022
#13757

Merge 2.9

#13693 [JUJU-416] Consistantly use juju/retry to handle retries 8 (worker/upgradesteps/*)
#13705 Ensure we check trace enabled before heavy operations
#13711 [JUJU-585] Increase SA secret bootstrap time out.
#13715 [JUJU-587] Tweak cpu power so test passes
#13719 [JUJU-575] Fix/nw deploy kubeflow
#13686 [JUJU-416] Consistantly use juju/retry to handle retries 6 (provider/gce/*)
#13716 [JUJU-586] Supply OpenStack FIPs to the instance-polle
#13718 Add Darwin support for uploading Microk8s images
#13722 Fixes Juju to handle microk8s not running
#13724 [JUJU-575] Reset in kubeflow dir;
#13723 [JUJU-256] Add microk8s tests for upgrade model actions
#13727 [JUJU-600] Add log diff debug message for assess_model_migration.py
#13717 [JUJU-592] LP1960235 worker does not stop
#13714 Fix multiwatcher data race
#13729 [JUJU-603] Remove client websocket buffer size stipulation
#13731 Added --no-progress argument to juju download.
#13733 Fix manual machine checks when migrating manual models
#13728 [JUJU-394] Improve enable-ha error message
#13721 Move lxd profile watcher from model cache to apiserver layer

Conflicts due to version bump and removal of actions UUID support.
```
# Conflicts:
# .github/workflows/smoke.yml
# apiserver/allfacades.go
# apiserver/facades/client/action/action_test.go
# apiserver/facades/client/client/client.go
# go.sum
# scripts/win-installer/setup.iss
# snap/snapcraft.yaml
# state/action_test.go
# state/operation_test.go
# version/version.go
# worker/uniter/runner/context/context_test.go
```

## QA steps

See PRs


[JUJU-416]: https://warthogs.atlassian.net/browse/JUJU-416?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-585]: https://warthogs.atlassian.net/browse/JUJU-585?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-587]: https://warthogs.atlassian.net/browse/JUJU-587?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-575]: https://warthogs.atlassian.net/browse/JUJU-575?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-416]: https://warthogs.atlassian.net/browse/JUJU-416?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-586]: https://warthogs.atlassian.net/browse/JUJU-586?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-575]: https://warthogs.atlassian.net/browse/JUJU-575?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-256]: https://warthogs.atlassian.net/browse/JUJU-256?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-600]: https://warthogs.atlassian.net/browse/JUJU-600?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-592]: https://warthogs.atlassian.net/browse/JUJU-592?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-603]: https://warthogs.atlassian.net/browse/JUJU-603?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
[JUJU-394]: https://warthogs.atlassian.net/browse/JUJU-394?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2.9 bug The PR addresses a bug
Projects
None yet
3 participants