Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nodes rolling restart #95

Merged
merged 4 commits into from
Aug 31, 2023
Merged

Nodes rolling restart #95

merged 4 commits into from
Aug 31, 2023

Conversation

Nuttymoon
Copy link
Contributor

Linked issues

Changes

  • node role
    • Add the stop and restart playbooks
    • Add the health-checks playbook that can take 4 parameters and sets 3 vars:
      Parameters:
      - wait_bootstrapped: Wait for the node to be bootstrapped on all chains (default: false)
      - wait_healthy: Wait for the node to be healthy on all Subnets (default: false)
      - avalanchego_health_checks_retries: Number of retries for health checks (default: 60)
      - avalanchego_health_checks_delay: Delay between retries for health checks in seconds (default: 10)
      Outputs:
      - avalanchego_state: State of the avalanchego.service (running/stopped/failed/unknown)
      - avalanchego_is_bootstrapped: Dictionary of bootstrap status on each chain (true/false)
      - avalanchego_is_healthy: Dictionary of healthy status on each Subnet (true/false)
      
  • Add the rolling_restart playbook that makes use of health-checks with wait_bootstrapped and wait_healthy set to true

Additional comments

Note that it might not always make sense to wait for a node to be bootstrapped/healthy on all chains/Subnets.

@Nuttymoon Nuttymoon changed the title 78 rolling restart Nodes rolling restart Aug 28, 2023
@Nuttymoon Nuttymoon merged commit 7b7b871 into main Aug 31, 2023
@Nuttymoon Nuttymoon deleted the 78-rolling-restart branch August 31, 2023 09:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement rolling restart mechanism
2 participants