其中 interval="3" This is the frequency of read/write cycles, in seconds.读写quorum disk的频率 tko="13" This is the number of cycles a node must miss in order to be declared dead.连续失败多少次则判定此节点死掉
qdisk_timeout = interval x tko
④再来看看RHEL5里cman timeout是怎么去配置的, token This timeout specifies in milliseconds until a token loss is declared after not receiving a token. This is the time spent detecting a failure of a processor in the current configuration. Reforming a new configuration takes about 50 milliseconds in addition to this timeout. The default is 1000 milliseconds. 连续多长时间没有收到token就判定令牌丢失。默认1秒,其中有50ms是生成一个新的配置的时间。 retransmits_before_loss This value identifies how many token retransmits should be attempted before forming a new configuration. If this value is set, retransmit and hold will be automati- cally calculated from retransmits_before_loss and token. The default is 4 retransmissions. 连续丢失几次token,才会生成新的cluster配置(将丢失token的节点踢出集群)。默认4次。 token_retransmit This timeout specifies in milliseconds after how long before receiving a token the token is retransmitted. This will be automatically calculated if token is modi- fied. It is not recommended to alter this value without guidance from the openais community. The default is 238 milliseconds. 重发token的时间间隔,这个值是由上面的token和token_retransmit自动计算的。(1000-50)/4≈238ms
如果出现上面说的丢失心跳token的时候,日志会出现如下报错: openais[3345]: [TOTEM] The token was lost in the OPERATIONAL state.