Skip to content

Commit

Permalink
bridge.c: prevent controller connects while flow-restore-wait
Browse files Browse the repository at this point in the history
When force-reload-kmod is used, it shows an error when reinstalling
tlvs during "Restoring saved flows" step:
OFPT_ERROR (xid=0x4): NXTTMFC_ALREADY_MAPPED

This is caused by a race condition between the restore script,
which calls ofctl, and the connected controllers both adding back
the same TLVs.

The restore script already sets flow-restore-wait to true while
doing flow restoration, and sets it back to false after it is
done, and this patch utilizes that fact to prevent the TLV race.
It does this by preventing vswitchd from connecting to
controllers in the controller table while it is in a
flow-restore-wait state.

With this patch, when bridge_configure_remotes() calls
bridge_get_controllers(), it first checks if flow-restore-wait
has been set, and if so, it ignores any controllers in the
controller database and sets n_controllers to 0.

This solution does preserve the management service controller
which is added via bridge_ofproto_controller_for_mgmt() after
checking whether we should call bridge_get_controllers()
(and thus n_controllers is properly set to 1, etc)

VMware-BZ: 2195377
Signed-off-by: Zak Whittington <zwhitt.vmware@gmail.com>
Signed-off-by: Ben Pfaff <blp@ovn.org>
  • Loading branch information
zwhittington authored and blp committed Oct 26, 2018
1 parent 555fe6c commit 7ed7342
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion vswitchd/bridge.c
Expand Up @@ -3608,7 +3608,8 @@ bridge_configure_remotes(struct bridge *br,
ofproto_set_extra_in_band_remotes(br->ofproto, managers, n_managers);
}

n_controllers = bridge_get_controllers(br, &controllers);
n_controllers = (ofproto_get_flow_restore_wait() ? 0
: bridge_get_controllers(br, &controllers));

ocs = xmalloc((n_controllers + 1) * sizeof *ocs);
n_ocs = 0;
Expand Down

0 comments on commit 7ed7342

Please sign in to comment.