[195330] in North American Network Operators' Group

home help back first fref pref prev next nref lref last post

Zabbix IT Services feature set

daemon@ATHENA.MIT.EDU (Graham Johnston)
Tue Jul 18 10:33:23 2017

X-Original-To: nanog@nanog.org
From: Graham Johnston <johnstong@westmancom.com>
To: "'nanog@nanog.org'" <nanog@nanog.org>
Date: Tue, 18 Jul 2017 14:33:19 +0000
Errors-To: nanog-bounces@nanog.org

Hi,

We have the Zabbix IT Services (running on Zabbix 3.2) configured for some =
test groups. =A0It usually returns good data but occasionally it seems that=
 one service group or trigger will get stuck in an alerting state and provi=
de an incorrect SLA. =A0This can occur if the trigger has changed to a prob=
lem state and then back to OK but the IT services doesn't reflect that chan=
ge. =A0It will occur where the top level group will show as having 100% pro=
blem time and the sub groups and items either have no problem time or such =
a small amount that it wouldn't indicate 100% problem time.

We have it built with some groups under root, some sub groups and items and=
 the items will have a trigger associated with those items. =A0We followed =
this article to the best of our knowledge: https://www.zabbix.com/documenta=
tion/3.2/manual/it_services
=A0
For Example:=A0
|Data Center=A0
|-Core1
|--Core1 - ICMP - Trigger
|-Core2
|--Core2 - ICMP - Trigger

Each subitem is a child of the item above it. =A0We haven't configured any =
dependencies to any other groups or items.
My question is, has anyone gotten the Zabbix IT Services to work correctly?=
 =A0Is there a trick to getting it to work, some configuration we are doing=
 incorrectly?

Thanks,
Graham



home help back first fref pref prev next nref lref last post