Bug 1834473: ovnkube: set NB/SB database inactivity probes to 60 seconds #631
Conversation
Multiple northd instances run in active/passive HA mode, where the active northd holds a lock in the database. If the active northd loses connectivity to the database or is killed without releasing the lock, ovsdb-server clears the lock after twice the inactivity probe interval. But if that probe is set to 0 (disabled), the lock is never cleared, and no other northd can ever grab it and continue reconciling NB->SB. Set the DB inactivity probe to a value greater than 0 to ensure that some northd will always eventually become active.
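For illustration, the equivalent change could be made by hand with the standard OVN CLI tools. This is only a sketch: the PR itself applies the setting through the cluster-network-operator, and the `.` row shorthand assumes a single record in each database's Connection table.

```shell
# Set the NB and SB database inactivity probes to 60 seconds (60000 ms).
# ovsdb-server clears a dead client's lock after roughly twice this interval,
# so a standby northd can take over within about two minutes.
ovn-nbctl set connection . inactivity_probe=60000
ovn-sbctl set connection . inactivity_probe=60000
```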
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: danwinship, dcbw. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
/retest Please review the full test history for this PR and help us cut down flakes.
@dcbw: This pull request references Bugzilla bug 1834473, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug.
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@dcbw: All pull requests linked via external trackers have merged: openshift/cluster-network-operator#631. Bugzilla bug 1834473 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
Multiple northds run for HA in active/passive mode where the active
northd holds a lock. If that northd loses connectivity to the database
or is killed without releasing the lock, ovsdb-server will clear
the lock after twice the inactivity probe. But if that probe is set
to 0 (disabled), that will never happen, and a new northd will never
grab the lock and continue reconciling NB->SB.
Set the DB inactivity probe to something greater than 0 to ensure
that a northd will always eventually become active. The value of 60 was
chosen as a reasonable middle-ground between the lock being cleared
and another northd grabbing it (~120s) and the possibility that a loaded
ovsdb-server (many ovn-controller clients) would take more than 30-40
seconds to send/reply to all inactivity probes from clients.
Related: https://bugzilla.redhat.com/show_bug.cgi?id=1828989
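The failover window the commit message cites (~120s) follows directly from the lock-clearing behavior described above. A quick sketch of the arithmetic, assuming the 60000 ms probe value this PR sets:

```shell
probe_ms=60000                         # inactivity probe set by this PR (60s)
failover_s=$(( 2 * probe_ms / 1000 )) # ovsdb-server clears the lock after ~2x the probe
echo "worst-case northd failover: ~${failover_s}s"
```

This is the middle ground the commit message describes: long enough that a loaded ovsdb-server can still answer probes from many ovn-controller clients, short enough that a standby northd takes over in about two minutes.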