Greetings,
I've got a Zenoss v2.5 server monitoring a large video encoding farm. Needless to say, these systems are under high bandwidth and CPU utilization the majority of the time.
What I'm running into is that, occasionally, these systems will fail to respond to a standard SNMP request, thereby throwing "SNMP agent down" errors in Zenoss, and generating lots of otherwise unnecessary alerts. Then, the next time the system is polled, it works, and a clear message is also sent (generating even more alerts).
Short of nice-ing the snmpd process down so that it doesn't get completely blocked by the video encoding, what would be the best way to handle this, either via configuring Zenoss, SNMP, or the servers themselves? I don't see an obvious solution to this puzzle..