[176259] in North American Network Operators' Group
Re: Incident notification
daemon@ATHENA.MIT.EDU (Peter Kristolaitis)
Fri Nov 21 11:12:35 2014
X-Original-To: nanog@nanog.org
Date: Fri, 21 Nov 2014 11:04:03 -0500
From: Peter Kristolaitis <alter3d@alter3d.ca>
To: nanog@nanog.org
In-Reply-To: <BFB0F9A20B1ADF418B7E13F088AB8577165FC20D@IS-EXCH-TEMP.office.is.nl>
Errors-To: nanog-bounces@nanog.org
We use OpsGenie for notifications (and on-call scheduling, etc). There
are other similar options such as PagerDuty, etc, as well.
Notifications can be submitted to the service in a variety of ways
(email, web API, etc), has a variety of integrations with other tools
(Nagios, Pingdom, etc) to aggregate all of your alerts, and there is a
callback mechanism where the user can trigger custom actions right from
the app (for example, I wrote an interface for it such that when we get
an alert, the on-call person can choose to restart the affected service
-- or even reboot the entire VM hosting it -- right from within the
OpsGenie app).
Each user can choose their method of contact (notification to the
smartphone app, SMS, phone call, email, whatever), and on-call schedules
(and exceptions) are easily managed.
It works for us... YMMV. ;)
- Peter
On 11/21/2014 10:52 AM, Thijs Stuurman wrote:
> Nanog list members,
>
> I was looking at some statistic and noticed we are sending out a massive amount of SMS messages from our monitoring systems.
> This left me wondering if there isn't a better (and cheaper) alternative to this, something just as reliant but IP based. We all have smartphones these days anyway.
>
> Therefore my question, what are you using to notify admins of incidents?
>
> Kind regards / Met vriendelijke groet,
>
> Thijs Stuurman
>
>
>
> [IS Logo]
>
>
> ________________________________
>
> IS Group
>
> Wielingenstraat 8
>
> T
>
> +31 (0)299 476 185
>
> info@is.nl<mailto:info@is.nl>
>
> 1441 ZR Purmerend
>
> F
>
> +31 (0)299 476 288
>
> www.is.nl<http://www.is.nl>
>
> ________________________________
>
> IS Group is ISO 9001:2008, ISO/IEC 27001:2005, ISO 20.000-1:2005, ISAE 3402 certified. De datacenters zijn PCI DSS en ISO 14001 compliant.
>
>