-
Notifications
You must be signed in to change notification settings - Fork 12.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nacos1.4.1选举异常 #5300
Comments
这里3台挂掉了一台,nacos2被选为主
此时raft协议已经无法工作了,因为还存活的机器数量少于一半
这是因为虽然集群失效了,SofajRaft还是会把上次的集群信息缓存下来 |
https://app.yinxiang.com/fx/60dcf61e-cf73-4343-875e-f4ff0885e7e5
但是后面我把stop掉的节点都启动后,集群还是脑裂状态,整个过程详细笔记可见:https://app.yinxiang.com/fx/60dcf61e-cf73-4343-875e-f4ff0885e7e5 同样的操作,在nacos2.0.0就没脑裂的现象。 |
有日志吗? 看一下alipay-jraft.log 和protocol-raft.log |
|
[root@nacos1 nacos]# cat logs/protocol-raft.log |
raftClusterInfo= 被修改成异常状态了。 这个问题在k8s环境中会出现,目前还在定位问题。 |
这个问题除了在k8s里会出现,在docker-composer中也会出现,我上面的笔记都是docker-composer操作的脑裂必现步骤:https://app.yinxiang.com/fx/60dcf61e-cf73-4343-875e-f4ff0885e7e5 |
@1019822077 您好,您的印象笔记复现记录似乎打不开了,可否维护一下? |
试试这个:https://app.yinxiang.com/fx/60dcf61e-cf73-4343-875e-f4ff0885e7e5 |
Can you use 1.4.2 retry? |
No response from author for a long time, community think the 1.4.2 has solve this problem. If new version has same problem. Please submit new issue let us know. |
首先安装1.4.1
export NACOS_VERSION=1.4.1
git clone --depth 1 https://github.com/nacos-group/nacos-docker.git
cd nacos-docker
docker-compose -f example/cluster-hostname.yaml up
之后集群中leader是nacos3,执行docker stop nacos3,剩下的2个进行选举,nacos2成功为leader.
此时再把nacos2 stop掉,再看nacos1的时候,leader还是保持nacos2。不会自动成leader.
脑裂整个过程笔记:
https://app.yinxiang.com/fx/60dcf61e-cf73-4343-875e-f4ff0885e7e5
https://www.yinxiang.com/everhub/note/60dcf61e-cf73-4343-875e-f4ff0885e7e5
以上过程我在nacos2.0.0上测试,没有问题,当有2个节点的时候,会重新选举,不会脑裂。3个都启动的时候,也不会脑裂。看来1.4.1的选举bug在nacos2.0.0上已解决了
The text was updated successfully, but these errors were encountered: