2

我们有一个大约 11 个节点的 Percona Xtradb 集群。其中一个节点在大约 2 天前崩溃了,但现在即使在捐助者指示 SST 过程已完成并且该节点现在已加入集群之后也无法再次启动。

当我检查无法启动的崩溃节点的日志时,我不断重复看到此错误(以小时为间隔):

[Warning] WSREP: Failed to report last committed [xxxxxx] -4 (Interrupted 
system call)

但在此消息在几个小时后在错误日志中弹出一次之前和之后,唯一记录的行是:

....
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping
[Warning] WSREP: Failed to report last committed [xxxxxx] -4 (Interrupted 
system call)
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping
2015-10-19 11:23:48 9091 [Note] WSREP: (f771e66c, 'tcp://0.0.0.0:4567') address 'tcp://192.168.2.100:4567' pointing to uuid f771e66c is blacklisted, skipping

……

什么可能导致这种情况发生?为什么这个节点不会重新启动?以及如何修复节点、启动它并让它再次加入集群?

4

0 回答 0