The up.time IT Systems Management Blog

Posts Tagged ‘capacity reporting’

Building a Better Capacity Planning Process

Monday, October 15th, 2012

I was recently re-reading a post I wrote back in May of this year entitled “Is your Capacity Planning Evolving to Meet Business Demand”  where I discussed how new technologies represent both challenges and opportunities for IT executives when it comes to capacity planning and the importance it plays in helping businesses grow:

“IT executives need new and more effective capacity planning processes in order to really take advantage of new technologies by optimizing the placement of applications according to criteria such as service level and cost. In addition, capacity planning software and tools can help teams be more effective.

One tactic you might consider as a start is to elevate your capacity planning team. Get it out of the “back room” of IT operations and make it a strategic function. Yes, remove it completely from IT operations and centralize it as a corporate IT function that reports directly to the CIO. This will send an important message to your organization and capacity management will begin to evolve and operate decentralized from technology support groups, such as network, server and storage.”

 

I present some ideas in this post including a high level model, roles and skills that will help you create a new and strategic capacity planning function within your IT organization.

First off, to be clear, the strategy of creating a strategic capacity planning function involves much more than just assigning the job to one or more technologists, giving them some great software (like up.time) which helps produce all kinds of automated reports that show CPU, storage and network capacity trends, and then holding weekly meetings to look at consumption charts. You need to have the team (or individual if you are starting with a team of one) focus on your business from “the big” picture” perspective.

Ideally your new and improved capacity planning process will look something like this:

 

 

This new capacity planning process (ideally a 3 person team or function) should consider assuming the following new roles and developing specific supportive skills including:

 

 

Building a new capacity planning process and supporting it with these roles and all the skill sets you need may not happen overnight. But the shift in the way your teams will start to think and plan for enterprise capacity will have profound and lasting benefits to your business. I hope these ideas help you in your quest of building a better and more strategic capacity planning function.

 

 

Never Run Out of Disk Space Again with Capacity Management

Tuesday, April 3rd, 2012

Get Better Capacity ManagementLast time, we showed how you can quickly determine if you need more hardware based on how much CPU is being used in your data center.  I can hear some of you screaming…”DISK!  What about disk??  How do I know when I will run out of disk??

 

And you know what, you’re absolutely right.  You should be thinking about your disks as part of your capacity planning initiatives.  If you are still unsure why you should care about capacity planning, take the flood in Thailand as an example.  Hard drives have shot up in price since the flood.  If you can foresee when you will run out of disk space (and how long you can live without it), maybe you can avoid paying an arm and a leg for additional capacity when you really don’t need it yet.  How?  Look no further than your trusty up.time Monitoring Station!

IT Capacity Growth Report

See File Capacity Growth Over Time!

 

By using up.time’s File System Capacity Growth Report, you can quickly see how your file systems are filling up.  This single pane of glass report takes the specified period and calculates how the file systems grow/shrink.  For example, if you selected to report on a one-month period, the delta shown will be the rate at which the file systems are growing/shrinking for the month.  Therefore, you can arm yourself with this information and be able to plan when you will need to buy more disks.

How long until YOUR VMware disk space is all gone?

 

 

If you are using VMware, you have additional tools in your arsenal to succeed at capacity management.  up.time can calculate approximately how long it will take to fill up your VMware datastores… automagically!  It tells you, in English, if your datastore will be full in 3 months, 1 week, or whatever the case may be, so you won’t be caught off guard by running out of disk space.  What makes this even cooler, is that we do it agentlessly!  Just add your vCenter or ESX servers to up.time (which takes about 5 seconds) and you will be able to see into your VMware future with the up.time VMware datastore crystal ball!

 

Free Capacity Management Webinar: 3 Simple Steps for Total Control of your IT Capacity.

Key to Capacity Planning is Knowledge

Tuesday, March 20th, 2012


Put your books way.  This is a capacity planning and capacity management quiz.  How many of these questions can you answer?

  1. How many resources are you using across your entire datacenter?
  2. What is using those resources?
  3. What is the utilization trending towards?
  4. Can you add more load to your servers?
  5. When will you run out of capacity?
  6. Will you gain much from virtualization?
  7. Do you know which servers you should virtualize?
  8. If you are already virtualizing, are you using your resources optimally?

 

Times up.  So how did you do? If you don’t have it memorized, I sure hope you have answers to the above at your fingertips, as those are some very typical questions that CIO’s would ask.  Most people view datacenters as cost centers, and for good reason.  Rising energy costs make headlines everyday and, given the slow recovery of the economy, we are constantly being asked to do more with less.  However, how can you do more with less if you’re not even sure of what you already have.

If you didn’t do so well on the test, I’ll let you cheat a little.  Download and install a trial of up.time.  For those of you who use or have heard of up.time, you might only know up.time as a monitoring tool.  However, there’s much more to the up.time solution than you may realize (and it’s already included in the tool): capacity management.  up.time has loads of features to help you perform capacity analysis.  There are many examples and success stories of how up.time can solve your capacity pains but let’s focus on one common scenario.

Example #1:

Your system administrators come to you saying they need to buy “X” number of servers because they are running out of CPU power.  You, as a diligent manager, know that the executives are never pleased with additional budget requests. So, before you put your neck on the line  for more money, you need to quickly verify what your admins said  is true. Fear not, my friend.  Simply pull up the up.time web UI and generate an “Enterprise CPU Utilization Report” for your entire datacenter in about 3 clicks.  Voila!

 

In seconds, you can easily see how much resources your servers are really using.

 

This is just one of the many examples of how you can use up.time for better, faster, and easier capacity planning.  We will be sharing more on how you can use up.time to address your capacity analysis needs over the next few weeks.  Stay tuned!

 

 

 

In fact, join us for a Free Webinar on Capacity Planning and Capacity Management:

- Patrick

 

The Do’s and Dont’s of Capacity Planning

Tuesday, November 9th, 2010

When it comes to capacity planning, it’s easy to be overwhelmed by the amount of raw data that you can potentially analyze in a data center environment. The question then becomes how you are going to take that data and make sense of it in a way that helps shape your decisions when it comes to adding or removing capacity.

When it comes to estimating capacity, you definitely want to get more depth then a simple metric like CPU utilization on a single server.

Here are some Do’s for capacity planning:

1) Ensure strong visibility across all  of your different platforms virtual, physical and cloud – Increasingly applications are being hosted across multiple infrastructure silos. Ensure that when you do your capacity planning, your toolset accounts for all the different platforms you are trying to monitor. The key here is, you want to as easily and rapidly as possible ensure that you can baseline and find trends across all of your infrastructure.
2) Get deeper visibility into the environment than just platform metrics - Platform metrics serve as a good first level of visibility into capacity, that is, obviously if a box is pegged at 100% CPU, this probably means it’s not doing very well from a capacity perspective. Conversely if the box is running only at 50% CPU or 50% memory, this doesn’t nessecarily gaurantee that the middleware or webserver applications running on that platform are not at full capacity or starving for some other kind of resources.  Ensure that you are also monitoring the application context sensitive capacity data. If your middleware is only capable of opening 500 database connections, you might want to know that you’re at 490, despite all other platform metrics being “OK”.
3) Schedule on-going reporting to your various capacity and performance teams – Capacity information is extremely useful, but shouldn’t be thought of as a “one off”. Capacity information should be made readily available to all teams, especially those doing physical consolidations, virtualization initiatives, or any team that is making any decisions on the acquisition of new hardware. Since decisions around these initiatives are being made all the time, scheduled reporting ensures that all stakeholders have the latest information available at all times. Not only that, these stakeholders will continue to be informed, even if you or parts of your team are on vacation.
4) Group your infrastructure in ways that reflect real business processes and application services – You want to be able to capacity plan against all kinds of permutations of servers, services and shared infrastructure stacks. Make sure that the groupings of servers you report on reflect real business services, applications and logical groupings (like application clusters). In this way, you should be able to rapidly report on aggregate capacity utilization and trends across these groupings without having to remake the wheel. Your tooling should enable you to report and access this data for subsets of infrastructure in a very low number of clicks, without doing SQL Queries, table joins or any advanced data warehousing.
5) Ensure visibility into time of day outages for capacity and relate them to SLAs – Sometimes the business linkage between availability, capacity and individual metrics streams can get really blurry in the “fog of war”. At the end of the day, you want to understand if capacity issues are driving outages and affecting SLA performance. You want to have the context over whether “time of day” performance is a driving force in this kind of capacity issue. For instance if 5000 VDI’s boot up in the morning and this saturates your connection broker, we might want to be able to correlate this time of day outage to the impact this is having on the end-user-experience for SLAs.

Here are important don’ts for capacity planning:

1) Ensure that you don’t have a virtual buffet of  different profiling/capacity tools – The last thing you want to do with capacity planning is try to do “screen level integration” of metrics or performance. That is, copying and pasting metrics from 10 different applications and trying to normalize them in a spreadsheet or data warehouse. This makes it impossible for you to aggregate the data quickly and cleanly. It calls into question the methedology, it wastes valuable time, and it produces results that are more likely than not going to be treated as unreliable by everyone involved.
2) Ensure that your capacity reporting isn’t merely a “static snapshot” in time – Often people use capacity planning reports that are static tables with a static count. For instance the number of virtual instances on Dec 10th. This kind of reporting doesn’t give you the type of insight required to understand the “capacity evolution” of the environment. You are much better off getting visibility into the number of virtual instances over time in a graph, or the virtual density over time so that you can see where things are headed and how workloads might be improperly stacked across the infrastructure stack.
3) Don’t hesitate to use your capacity planning initiative as a catalyst – In my experience, capacity planners have lots of data that would be very valuable to the operations/server teams that actually fix the servers.  Sometimes outages are related to a peak capacity issue. Traditionally reports and dashboards from capacity planning tools are much too cumbersome for members of the ops teams to use. If you have the right tools, the same data can be easily displayed and used by all. Having readily available capacity data correlated to outages is something that can be a catalyst for real wold discussions between 2 teams that sometimes find it hard to see “eye to eye”.
4) Don’t hide away your capacity data – Use dashboards and other reporting tools to make your capacity information available to everyone – your ops teams your management. This is essential data, and some teams may not be aware that they need it. How useful would it be for your ops and NOC teams to see an aggregate capacity dashboard up on a big screen? Wouldn’t it be great to see if there are obvious issues and spikes in capacity usage?
5) Don’t necessarily try to run before you can walkTime and time again, I see people attempt to approach their capacity planning initiatives from a perspective of “pie in the sky”. That is, that they hope that they can guestimate workloads, be able to do theoretical “what if” analysis, and immediately be able to re-order their entire data center in one shot to maximize efficiency by 400%. The truth is, that very few products that promise to do the above are easy to rollout, do not require massive amounts of integration, and will not cost you an arm and a leg. All of this, without even taking into consideration that “what if” analysis is of little value to operations or server engineering teams. As a first step towards the road to the panacea of capacity planning data analysis, you need to consider the cost, the rollout time, and the overall impact of any tool you introduce to do capacity planning. Definitely make sure you are walking (getting visibility), jogging (making this visibility and awareness clear across the organization), and then running (hitting higher order capacity planning ideals with solid data). Most importantly make sure you are choosing a tool set that will help you get on that on-ramp as quickly as possible.

Capacity Planning with up.time VideoMy colleague Joel recently recorded a capacity planning video including tips and best practices while demonstrating up.time’s capacity planning capabilities. You can find it on our YouTube channel at http://www.youtube.com/uptimesoftware or on our website – click here to view video.