This repository has been archived by the owner on Jun 20, 2024. It is now read-only.
Inconsistent IPAM entries with same version but with different owning peer #3632
What you expected to happen?
When IPAM entries are updated locally and broadcast to the ring, inconsistencies can arise in the IPAM entries, resulting in conflicts. #1962 broadly addresses this for all cases.
This bug report covers one specific sequence that leaves an IPAM entry inconsistent: both copies have the same version but different owning peers.
What happened?
I am going to describe the problem with logs from the cluster where this issue is seen.
It is a 3-node cluster with de:4f:4d:90:6b:e2, 42:6d:29:3d:b2:d4 and 3e:06:93:0a:8f:fd as peers.
de:4f:4d:90:6b:e2 has the below view of token 10.34.0.0
42:6d:29:3d:b2:d4 has the below view of token 10.34.0.0
This results in the two nodes not connecting to each other due to the conflict.
de:4f:4d:90:6b:e2 fails to connect to 42:6d:29:3d:b2:d4
Similarly, 42:6d:29:3d:b2:d4 fails to connect to de:4f:4d:90:6b:e2
Here is a snip of the sequence of events that led to this inconsistent state.
Snip from the logs of de:4f:4d:90:6b:e2
Snip from the logs of 42:6d:29:3d:b2:d4
de:4f:4d:90:6b:e2 bumped the version of 10.34.0.0 to 5:
DEBU: 2019/04/15 11:29:48.223696 [ring de:4f:4d:90:6b:e2]: Transfer token=10.34.0.0 from=3e:06:93:0a:8f:fd to=de:4f:4d:90:6b:e2 version=5
Simultaneously, 42:6d:29:3d:b2:d4 merged a ring update from 3e:06:93:0a:8f:fd which bumped the version to 5:
DEBU: 2019/04/15 11:29:41.882462 [ring 42:6d:29:3d:b2:d4]: Merge token=10.34.0.0 mine.Peer=3e:06:93:0a:8f:fd theirs.Peer=3e:06:93:0a:8f:fd mine.Version=4 theirs.Version=5
This results in entries with the same version but different owning peers.
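To make the failure mode concrete, here is a minimal sketch of a "higher version wins" merge rule and how the two views above collide under it. The `Entry` type, the `Merge` function and `ErrConflict` are hypothetical names for illustration only; this is an assumption about the general shape of the rule, not Weave's actual ring code.

```go
package main

import (
	"errors"
	"fmt"
)

// Entry is a simplified, hypothetical model of one ring entry.
type Entry struct {
	Token   string
	Peer    string // owning peer
	Version int
}

// ErrConflict is returned when two copies cannot be ordered.
var ErrConflict = errors.New("same version, different owning peer")

// Merge applies an assumed "higher version wins" rule.
// Equal versions with different owners cannot be resolved.
func Merge(mine, theirs Entry) (Entry, error) {
	switch {
	case theirs.Version > mine.Version:
		return theirs, nil
	case theirs.Version < mine.Version:
		return mine, nil
	case theirs.Peer != mine.Peer:
		return mine, ErrConflict
	default:
		return mine, nil
	}
}

func main() {
	const (
		peerDE = "de:4f:4d:90:6b:e2"
		peer42 = "42:6d:29:3d:b2:d4"
		peer3E = "3e:06:93:0a:8f:fd"
		token  = "10.34.0.0"
	)

	// 42:... merges the update from 3e:...; version goes 4 -> 5, owner stays 3e:...
	view42, _ := Merge(
		Entry{Token: token, Peer: peer3E, Version: 4},
		Entry{Token: token, Peer: peer3E, Version: 5},
	)

	// de:... records a transfer of the token to itself, also at version 5.
	viewDE := Entry{Token: token, Peer: peerDE, Version: 5}

	// When the two peers gossip with each other, neither copy wins.
	_, err := Merge(viewDE, view42)
	fmt.Printf("%s sees %+v\n", peerDE, viewDE)
	fmt.Printf("%s sees %+v\n", peer42, view42)
	fmt.Println("merge result:", err)
}
```

Under this assumed rule the version number is the only ordering between copies, so two independent bumps from 4 to 5 leave nothing to break the tie.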
Looking at the logs of peer 3e:06:93:0a:8f:fd to see why it increased the version from 4 to 5, this is what is seen:
DEBU: 2019/04/15 11:29:12.155554 [ring 3e:06:93:0a:8f:fd]: ReportFree token=10.34.0.0 peer=3e:06:93:0a:8f:fd version=5
So ReportFree bumped the version. I am not sure why the version should be bumped when a token is reported to be free.
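The sketch below shows how two unrelated operations can produce the same version number when both increment it. It assumes, based only on the log lines above, that both a ReportFree and a Transfer raise the entry's version by one; the `Entry` type, its methods and the free-address counts are hypothetical, not Weave's real API or data.

```go
package main

import "fmt"

// Entry is a simplified, hypothetical model of one ring entry.
type Entry struct {
	Peer    string // owning peer
	Version int
	Free    int // free addresses reported for the range (illustrative)
}

// ReportFree records a new free count and, as assumed from the logs,
// bumps the version even though ownership does not change.
func (e *Entry) ReportFree(free int) {
	e.Free = free
	e.Version++
}

// Transfer changes the owner and bumps the version.
func (e *Entry) Transfer(to string) {
	e.Peer = to
	e.Version++
}

func main() {
	const (
		peerDE = "de:4f:4d:90:6b:e2"
		peer3E = "3e:06:93:0a:8f:fd"
	)

	// Both peers start from the same entry at version 4.
	start := Entry{Peer: peer3E, Version: 4, Free: 10}

	on3E := start // copy held by 3e:...
	onDE := start // copy held by de:...

	on3E.ReportFree(12)   // 3e:... reports free space: version 4 -> 5
	onDE.Transfer(peerDE) // de:... takes the token: version 4 -> 5

	fmt.Printf("3e view: %+v\n", on3E)
	fmt.Printf("de view: %+v\n", onDE)
	fmt.Println("same version, different owner:",
		on3E.Version == onDE.Version && on3E.Peer != onDE.Peer)
}
```

If the ownership change and the free-space report did not share a single counter, these two updates would not collide in this model. Whether that is a suitable fix for the real ring is an open question here.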
How to reproduce it?
Have a cluster with node A and node B, and keep adding node C and deleting it from the cluster. You will end up with IPAM inconsistencies once in a while.