fosdem.org

Speakers
	Lindsay Holmwood
Schedule
Day	Saturday
Room	Janson
Start time	16:00
End time	16:45
Duration	00:45
Info
Event type	Podium
Track	Monitoring
Language	English
Media
Video (DIVX)

Starting the sysadmin tools renaissance: Flapjack + cucumber-nagios

Monitoring software is ripe for a renaissance. Now is the time to build new tools and rethink our problems. Leading the charge are two projects: cucumber-nagios and Flapjack.

A systems administrator's role in today's technology landscape has never been so important. It's our responsibility to manage the provisioning and maintenance of massive infrastructures, to anticipate when capacity must grow or shrink, and, increasingly, to make sure our applications scale.

While developer tools have improved tremendously, we sysadmins are still living in the dark ages, other than a few shining beacons of hope such as Puppet. We're still trying to make Nagios scale. We're still writing the same old monitoring checks. Getting statistics out of our applications is tedious and difficult, but increasingly important to scaling.

cucumber-nagios lets you describe how a website should work in natural language, and reports whether it does in the Nagios plugin format. It includes a standard library of website interactions, so you don't have to rewrite the same Nagios checks over and over.
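To make "the Nagios plugin format" concrete, here is a minimal Python sketch of what a plugin has to produce: one status line on standard output (optionally followed by `|` and performance data) and an exit code of 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN). The function name `plugin_result`, the `WEBSITE` prefix and the exact wording of the status line are assumptions made for illustration; this is not cucumber-nagios's own code or its exact output.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def plugin_result(passed, failed):
    """Turn step counts from a scenario run into (exit_code, status_line).

    Hypothetical helper: the mapping from step counts to states is an
    assumption for this sketch, not taken from cucumber-nagios.
    """
    total = passed + failed
    if total == 0:
        code = UNKNOWN      # nothing ran, so the state is unknown
    elif failed:
        code = CRITICAL     # at least one step failed
    else:
        code = OK
    # Human-readable text first, then "|" and performance data.
    line = (
        f"WEBSITE {LABELS[code]} - {passed}/{total} steps passed"
        f" | passed={passed} failed={failed}"
    )
    return code, line
```

A real plugin script would print the returned line and then call `sys.exit()` with the returned code, which is all a Nagios-compatible monitoring system needs to read the result.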

cucumber-nagios can also be used to check SSH logins, filesystem interactions, mail delivery, and Asterisk dialplans. By lowering the barrier to entry for writing fully featured checks, there's no reason not to start testing all of your infrastructure. But as you add more checks to your monitoring system, you're going to notice slowdowns and reliability problems. Enter Flapjack.

Flapjack is a scalable, distributed monitoring system. It speaks the Nagios plugin format natively (so you can use all your existing Nagios checks), and can easily be scaled from 1 server to 1000.
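Reusing existing Nagios checks is possible because the plugin contract is so small: run a command, read its exit code and the first line of its output. The Python sketch below shows the consuming side of that contract. It does not use Flapjack's code or API; `run_plugin` and `FAKE_PLUGIN` are hypothetical names, and the fake plugin is a stand-in so the example runs without any Nagios plugins installed.

```python
import subprocess
import sys

# Exit code to state mapping defined by the Nagios plugin conventions.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}

# Stand-in for a real plugin: prints a status line and exits with 2 (CRITICAL).
FAKE_PLUGIN = [
    sys.executable,
    "-c",
    "import sys; print('DISK CRITICAL - 2% free'); sys.exit(2)",
]


def run_plugin(argv, timeout=10):
    """Run one plugin command and return (state, first_line_of_output)."""
    try:
        proc = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # A plugin that cannot be started or never finishes is reported as UNKNOWN.
        return "UNKNOWN", str(exc)
    state = STATES.get(proc.returncode, "UNKNOWN")
    lines = proc.stdout.splitlines()
    return state, lines[0] if lines else ""
```

Calling `run_plugin(FAKE_PLUGIN)` yields the state and status line of the stand-in check; pointing `argv` at an installed plugin and its arguments would work the same way.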

Flapjack breaks the monitoring lifecycle into several distinct chunks: workers that execute checks, notifiers that notify when checks fail, and an admin interface to manage checks and events.

By breaking the monitoring lifecycle up, it becomes incredibly easy to scale your monitoring system with your infrastructure. Need to monitor more servers? Just add another server to the pool of workers. Need to take down your workers for maintenance? Just spin up another pool, and turn off the old one.
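The split into workers and notifiers can be sketched in a few lines of Python. This is an illustration of the general pattern only, under loud assumptions: it uses in-process queues and threads where a distributed system would use a shared message queue across machines, and none of the names (`worker`, `notifier`, `run_pool`) come from Flapjack.

```python
import queue
import threading

OK, WARNING, CRITICAL = 0, 1, 2


def worker(checks, results):
    """Pull checks off the queue, run them, and publish the results."""
    while True:
        item = checks.get()
        if item is None:        # sentinel: no more work for this worker
            break
        name, check = item
        results.put((name, check()))


def notifier(results, notify):
    """Drain the results queue and notify on every non-OK result."""
    while True:
        try:
            name, (code, output) = results.get_nowait()
        except queue.Empty:
            break
        if code != OK:
            notify(f"{name}: {output}")


def run_pool(checks_to_run, worker_count, notify):
    """Run all checks on a pool of workers, then hand results to the notifier."""
    checks, results = queue.Queue(), queue.Queue()
    for item in checks_to_run:
        checks.put(item)
    for _ in range(worker_count):
        checks.put(None)        # one sentinel per worker
    threads = [
        threading.Thread(target=worker, args=(checks, results))
        for _ in range(worker_count)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    notifier(results, notify)
```

Because workers only share the two queues, scaling up in this sketch means raising `worker_count`; nothing in the check or notification code has to change.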

Other events at the same time:

When	Event	Track	Where
15:15-16:15	Infrastructure round table	Distributions	H.1302
15:30-16:15	Introduction to Qt Designer	KDE	H.2214
15:30-16:30	HTML 5	Mozilla	H.1301
15:30-16:15	Distribution HR management	Distributions	H.1308
15:45-16:15	Lambda + JSR292	Free Java	AY
15:45-16:30	Add plugins to your GNOME apps	GNOME	H.1309
16:00-16:45	OSSEC	Security	Chavanne
16:00-17:30	LPI exam session 2	Certification	Guillissen
16:00-16:15	SIP Communicator: Skype-like conf calls with SIP Communicator	Lightning Talks	Ferrer
16:00-16:45	Show me YOUR code	OpenOffice.org	AW1.120
16:00-17:00	coreboot board porting	Coreboot	AW1.124
16:00-17:00	Drools	JBoss	AW1.105
16:00-16:30	Mirabeau: Creating Personal Media Networks	Jabber+XMPP	H.2213
16:00-16:30	The Ruby Smalltalkification	Ruby+Rails	AW1.126
16:00-17:00	Barebox	Embedded	Lameere
16:15-16:30	Kamailio (OpenSER) 3.0.0: redefinition of SIP server	Lightning Talks	Ferrer
16:15-17:00	Beyond UNIQUE: Exclusion constraints in PostgreSQL 9.0	PostgreSQL	AW1.121
16:15-16:45	bootchart2	Distributions	H.1308
16:30-16:45	asterisk: An introduction to Asterisk Development	Lightning Talks	Ferrer
16:30-17:00	Wizard4j	Free Java	AY
16:30-17:15	Gnome Development Tools	GNOME	H.1309
16:30-17:15	25 good practices in Ruby on Rails development	Ruby+Rails	AW1.126
16:30-17:00	You Got Your XMPP in My Website: Using Strophe.js for Fun and Profit	Jabber+XMPP	H.2213
