cacti: SNMP timeout detected [500 ms], ignoring hos - xjc2694 - ChinaUnix Blog

Xiajc - Work Notes (xjc2694.blog.chinaunix.net)


Category: LINUX

2009-07-08 09:18:33

Yesterday I upgraded cacti and spine to 0.8.7e.

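Today I found that several of the servers monitored by cacti had stopped producing graphs, even though querying them manually over SNMP still returned values.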
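A manual check of this kind could be scripted roughly as follows. This is only a sketch: it assumes the Net-SNMP `snmpget` tool is installed, and the SNMP version (2c), the community string `public`, and the sysUpTime OID are placeholders that the original post does not specify. The timeout is set to 0.5 seconds with no retries to mirror the 500 ms reported in the log. The helper names `build_snmpget` and `query` are hypothetical.

```python
import subprocess

def build_snmpget(host, oid="1.3.6.1.2.1.1.3.0", community="public",
                  timeout_s=0.5, retries=0):
    """Build a Net-SNMP snmpget command line (version and community are assumptions)."""
    return ["snmpget", "-v", "2c", "-c", community,
            "-t", str(timeout_s), "-r", str(retries), host, oid]

def query(host, **kwargs):
    """Run snmpget against host; return (succeeded, output text)."""
    proc = subprocess.run(build_snmpget(host, **kwargs),
                          capture_output=True, text=True)
    return proc.returncode == 0, (proc.stdout or proc.stderr).strip()

# Show the command that would be run for one of the affected hosts.
print(" ".join(build_snmpget("10.249.86.146")))
```

Calling `query("10.249.86.146")` would actually execute the command; if it succeeds with the same short timeout, the device itself is answering quickly enough and the problem is more likely on the poller side.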

Checking cacti.log:

07/08/2009 09:15:03 AM - SPINE: Poller[0] Host[7] DS[67] WARNING: SNMP timeout detected [500 ms], ignoring host '10.249.86.146'

07/08/2009 09:15:03 AM - SPINE: Poller[0] Host[7] DS[68] WARNING: SNMP timeout detected [500 ms], ignoring host '10.249.86.146'

07/08/2009 09:15:03 AM - SPINE: Poller[0] Host[10] DS[93] WARNING: SNMP timeout detected [500 ms], ignoring host '10.255.147.80'

07/08/2009 09:15:03 AM - SPINE: Poller[0] Host[10] DS[94] WARNING: SNMP timeout detected [500 ms], ignoring host '10.255.147.80'

07/08/2009 09:15:03 AM - SPINE: Poller[0] Host[10] DS[95] WARNING: SNMP timeout detected [500 ms], ignoring host '10.255.147.80'
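To see at a glance which hosts are affected, warnings like the ones above can be tallied per host. The following is a minimal sketch that assumes the log lines keep exactly the format shown here; the function name `count_timeouts` and the `SAMPLE` list are illustrative, not part of cacti.

```python
import re
from collections import Counter

# Matches the spine warning format seen in the log excerpt above.
TIMEOUT_RE = re.compile(
    r"Host\[(?P<host_id>\d+)\] DS\[(?P<ds_id>\d+)\] WARNING: "
    r"SNMP timeout detected \[(?P<ms>\d+) ms\], ignoring host '(?P<ip>[^']+)'"
)

def count_timeouts(lines):
    """Return a Counter mapping (host_id, ip) to its number of timeout warnings."""
    counts = Counter()
    for line in lines:
        m = TIMEOUT_RE.search(line)
        if m:
            counts[(int(m.group("host_id")), m.group("ip"))] += 1
    return counts

PREFIX = "07/08/2009 09:15:03 AM - SPINE: Poller[0] "
TAIL = " WARNING: SNMP timeout detected [500 ms], ignoring host "
SAMPLE = [
    PREFIX + "Host[7] DS[67]" + TAIL + "'10.249.86.146'",
    PREFIX + "Host[7] DS[68]" + TAIL + "'10.249.86.146'",
    PREFIX + "Host[10] DS[93]" + TAIL + "'10.255.147.80'",
    PREFIX + "Host[10] DS[94]" + TAIL + "'10.255.147.80'",
    PREFIX + "Host[10] DS[95]" + TAIL + "'10.255.147.80'",
]

# Print hosts ordered by how many data sources timed out.
for (host_id, ip), n in count_timeouts(SAMPLE).most_common():
    print(f"Host[{host_id}] {ip}: {n} timeouts")
```

For a real log, the same function can be fed an open file, for example `count_timeouts(open("cacti.log"))`, where the path depends on the installation.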

Solution:

CACTID: Host[...] DS[....] WARNING: SNMP timeout detected [500 ms], ignoring host '........'

For "reasonable" timeouts, this may be related to an snmpbulkwalk issue. To change this, see Settings, Poller and lower the value for The Maximum SNMP OID's Per SNMP Get Request. Start at a value of 10 and increase it again if the poller starts working. Some agents don't have the horsepower to deliver that many OIDs at a time, so the number can be reduced for those older or underpowered devices.
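The effect of this setting can be pictured as splitting a host's OIDs into batches, one batch per get request. The sketch below only illustrates that idea and is not spine's actual code; `batch_oids` is a hypothetical helper and the 25 OIDs are made-up ifInOctets instances.

```python
def batch_oids(oids, max_oids_per_request):
    """Split a list of OIDs into chunks of at most max_oids_per_request."""
    if max_oids_per_request < 1:
        # This sketch simply rejects a limit of 0 or less.
        raise ValueError("max_oids_per_request must be at least 1")
    return [oids[i:i + max_oids_per_request]
            for i in range(0, len(oids), max_oids_per_request)]

# 25 hypothetical ifInOctets instances on a single device.
oids = [f"1.3.6.1.2.1.2.2.1.10.{i}" for i in range(1, 26)]

# Compare how many requests each limit produces and how large each one is.
for limit in (10, 30):
    batches = batch_oids(oids, limit)
    print(limit, len(batches), [len(b) for b in batches])
```

A smaller limit means more round trips but a lighter load per request on the agent; a larger limit means fewer, heavier requests, which a weak agent may not answer within the timeout.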

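I increased The Maximum SNMP OID's Per SNMP Get Request from its default of 10 to 30 (it can be raised further as appropriate). One of the servers went back to normal, but the others still produced no graphs.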

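For the servers that still produced no graphs, I went to Console > Management > Devices, selected each affected server, and raised its Maximum OID's Per Get Request option to 30 (until then it had still been at the default of 10).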

Finally, under Settings I changed both Maximum Threads per Process and Maximum Concurrent Poller Processes to 5; the default for each is 1.

There were no problems with 0.8.7c; it was the upgrade that introduced them.

Also, I found this entry in the spine 0.8.7e changelog:

bug: If host has MAX OID's set to 0, timeouts occur

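Update: over the past couple of days I have noticed that whenever a monitored server is rebooted or shut down, cacti starts reporting this error for it. Manually changing The Maximum SNMP OID's Per SNMP Get Request in the affected server's device settings makes the problem go away, regardless of whether the value is raised or lowered. This looks like a bug in spine 0.8.7e, but I do not want to roll back to an older spine, so I will put up with it and hope the next release fixes it.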
