[194684] in North American Network Operators' Group

Re: Arista hardware health and environmental nagios plugin

daemon@ATHENA.MIT.EDU (Piotr Iwanejko)
Mon May 22 03:11:49 2017

X-Original-To: nanog@nanog.org
From: Piotr Iwanejko <piotr.iwanejko@gmail.com>
In-Reply-To: <CAEs2ZiKSSghaocpcRJ2CU9NOW543spfodzeC5VoRuoaNJhfZPQ@mail.gmail.com>
Date: Mon, 22 May 2017 09:10:36 +0200
To: bas <kilobit@gmail.com>
Cc: nanog <nanog@nanog.org>
Errors-To: nanog-bounces+nanog.discuss=bloom-picayune.mit.edu@nanog.org

Hello,

> Message written by bas <kilobit@gmail.com> on 19.05.2017, at 21:34:
>
> I had hoped not to have to re-invent the wheel.

Some custom scripts I use on 7050SX: https://github.com/piwanejko/Arista-monitoring-tools
Nagios checks:

CPU1 temperature          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006001'!'550'!'600'
CPU1 load                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.1'!'70'!'90'
CPU2 temperature          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006002'!'550'!'600'
CPU2 load                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.2'!'70'!'90'
CPU3 temperature          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006003'!'550'!'600'
CPU3 load                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.3'!'70'!'90'
CPU4 temperature          check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006004'!'550'!'600'
CPU4 load                 check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.4'!'70'!'90'
Fan tray 1 status         check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100601111'!''!'1'
Fan tray 2 status         check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100602111'!''!'1'
Fan tray 3 status         check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100603111'!''!'1'
Fan tray 4 status         check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100604111'!''!'1'
Lower board temperature   check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006011'!'500'!'600'
PSU1 fan status           check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711211'!''!'1'
PSU1 in current status    check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711103'!''!'1'
PSU1 in voltage status    check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100711105'!''!'1'
PSU2 fan status           check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721211'!''!'1'
PSU2 in current status    check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721103'!''!'1'
PSU2 in voltage status    check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.5.100721105'!''!'1'
SUP temperature           check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006005'!'550'!'600'
Upper board temperature   check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.99.1.1.1.4.100006009'!'500'!'600'
Uptime                    check_snmp_sw!'2c'!'public'!'.1.3.6.1.2.1.1.3.0'!'@60000:70000'!'60000:'


check_snmp_sw	->	check_snmp -H $HOSTADDRESS$ -P $ARG1$ -C $ARG2$ -o $ARG3$ -w $ARG4$ -c $ARG5$
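
For anyone unsure how the "!"-separated fields in the checks map onto $ARG1$..$ARG5$, here is a rough Python sketch of the substitution. It is only an illustration, not how Nagios itself is implemented: the function names and the 192.0.2.1 address are made up for the example, and it ignores Nagios details such as escaped "\!" characters.

```python
import shlex

# Command template as given in the post; $HOSTADDRESS$ and $ARGn$ are Nagios macros.
TEMPLATE = ("check_snmp -H $HOSTADDRESS$ -P $ARG1$ -C $ARG2$ "
            "-o $ARG3$ -w $ARG4$ -c $ARG5$")


def expand_check_command(check_command, host_address):
    """Substitute a '!'-separated check_command string into TEMPLATE.

    Hypothetical helper: the first field is the command name, the rest
    become $ARG1$, $ARG2$, ... in order.
    """
    _name, *args = check_command.split("!")
    line = TEMPLATE.replace("$HOSTADDRESS$", host_address)
    for i, arg in enumerate(args, start=1):
        line = line.replace("$ARG%d$" % i, arg)
    return line


def to_argv(command_line):
    """Split the expanded line the way a POSIX shell would (quotes removed)."""
    return shlex.split(command_line)


if __name__ == "__main__":
    cpu_load = "check_snmp_sw!2c!'public'!'.1.3.6.1.2.1.25.3.3.1.2.1'!'70'!'90'"
    print(expand_check_command(cpu_load, "192.0.2.1"))
```

For the fan and PSU status checks the fourth field is '', so check_snmp receives an empty warning threshold and only the critical threshold ('1') is set.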

I also wrote a custom script to check disk and memory utilization, but it's too old and too badly written to share.

Best regards,
