The up.time IT Systems Management Blog

Posts Tagged ‘sla’

SLA Tips & Tricks

Monday, January 10th, 2011

up.time SLA Monitoring Video

Welcome to the New Year everyone! I hope you had a a lot of delicious food and some good times over the holidays, or at the very least some good time off. I myself enjoyed it so much that my new year’s resolution is to hit the gym a lot more (and stop stuffing myself with chocolate as well).

With a new year comes great responsibility, and being in IT, we have to make sure we keep our users happy at all times. To keep the user’s expectations in check we have these fancy SLA documents and reports that are supposed to show how good of a job we’re doing to keep everything up and running all the time. Unfortunately there’s a few problems with this; it can take forever (from days to weeks) to create this report, fudge factors may exist in the data, have to get metrics from different systems/plaforms/layers/tiers/(insert_buzz_word_here), and did I mention this takes forever?

So here are a few tips & tricks to avoid falling into a coma from boredom while still gathering all these important metrics for this report.

1. Defining Metrics – Before even creating the SLA document/report, make sure we’ve defined which metrics indicate availability and performance of the service we’re providing. We also have to make sure we can actively monitor the metrics with our monitoring solution as well. Luckily up.time can monitor pretty much any kind of metric with thousands of default metrics collected out-of-the-box along with an extensive plug-in architecture to allow for a near unlimited number.

2. Baselining Current Service Level – Once we know what we need to look for we need to get a baseline for how we’re currently performing before we commit to providing a level of service that we cannot possible achieve. For this we create all the monitors in uptime to gather/threshold on the metrics so we get the stats.

3. Proactive SLA Management – Once we create the SLA in uptime with objectives (SLO’s) that defines the availability and performance, we get instant visibility on the SLA dashboard. We can also set it up to automatically alert us when a severity 1 (SEV1) issue occurs and starts to affect SLA performance. So not only do we get component-level alerting/self-healing, we also get the alerting and self-healing capabilities at the SLA-level as well.

4. Quick SLA Report – For those that are currently creating your SLAs manually, this will be a big one for you. How quickly can you get that SLA report compiled and completed? Well mine is sitting in my inbox since this morning already so that’s pretty sweet, especially since I can now choose when to tell my boss when I’m finished compiling the report.

If any of this sounds interesting to you we’re having an online webinar coming soon where you can come see for yourself. Also, for a quick overview of setting and managing your SLAs with up.time, check out my latest video – click here to watch.

Watching the Watcher… Easy SLA Management

Tuesday, June 2nd, 2009

Last week I was working with a prospect that used a large outsourcer for some of their companies Business Services. Every week they receive a report from this outsourcer showing where they stand in their SLA performance; however, the prospect was having a hard time understanding how they were actually coming up with the numbers.

This is where up.time 5 came into play. One of the powerful features of up.time is the ability to create Service Level Agreements (SLA’s) and Service Level Objectives (SLO’s). So, I assisted the prospect in setting up SLA/SLO’s against these outsourced Business Services/Servers for their own detailed up-to-the-minute stats, trending, and detailed reports. up.time 5 even allows you to back-date the SLA/SLO’s so you can see where you have been over the last X number of weeks/months/years. We also setup Alert Profiles that notify management and line of business owners when SLA’s reach warning and critical levels as defined by the company. The best part – all this was setup and running in less than 60 minutes over a remote session.

[/caption]

SLA Summary Report as generated in up.time 5

up.time 5 SLA Dashboard: Click to Enlarge

Proactive IT Systems Management isn’t that Hard….Really

Wednesday, April 22nd, 2009

A brief introduction:  My name is Dave Mitchell.  I’ve been in the business of delivering enterprise level IT service management for over 15 years.  I call uptime software my home.

I believe that far too many IT Managers are pulling the wool over their employers eyes. You know who you are! The one who runs that dark, mysterious department who’s systems administrators and help desk people run around wearing dirty polo shirts, with a half-empty Starbucks Tall Bold in one hand, and a constantly buzzing Blackberry in the other, screaming “MOVE!” as though the server room is some constantly burning pit of fire that must be put out yesterday. This persona is, in my opinion, a ruse.  A way to ward off prying employees, managers, otherwise evil spirits, that may get in the way of the Sysadmin’s leisure time.  I’m not saying that Mr. Admin isn’t doing his job.  I’m simply suggesting that he could be doing it more efficiently.

So, is this you?  Are you one of those guys who works for a company full of people who don’t understand your job just enough such that you can pull a fast one once in a while?  If so, there’s hope!  It’s not so bad running an open and accessible IT department.  Really.

Here’s a few questions to ask yourself, your department or your IT manager (depending on who you are);

1. Do you have a Service Level Agreement with your customers?  Is it published?  Can you prove you’re hitting it?  Having an SLA is like a contract between your department and your customers (the rest of the company).  It sets their expectations of your group, and allows you to function within them.

2. Is a monitoring system in place?  Does it do more than report up/down status?  Can other employees see system status at a glance?  The ability to report on system status in real time comes in handy when troubleshooting problem, sure. However by allowing other departments visibility into system status you give them the ability to check before opening a ticket. It also gives others a glimpse into that dark, mysterious world!

3. Do you meet with department heads on a regular basis?  I mean every department?  Not just the technical ones?  A brief meeting with Marketing, or Sales management can yield amazingly positive results and prevent potential problems due to lack of communication.

Now,  I’m sure you do all these things and more.  You are not that IT guy, or girl.  You’d be surprised how many of them are out there.  If you know someone at another company who deals with that kind of IT department, please pass this post along.  Tell them Mitchell says there’s hope!  IT isn’t a scary underworld, full of secret rituals and other sorts of geekery.  Sure, we’re proud geeks but that’s no reason (to paraphrase Mr. King) ‘we can’t all just get along’.