The up.time IT Systems Management Blog

Posts Tagged ‘vSphere’

Auto-Discovery Improvements in up.time 7.1

Monday, October 1st, 2012

up.time 7.1 is introducing a few new changes to our existing automated discovery process that will allow you to scan and monitor your datacenter more efficiently than in previous versions. We have always strived to be one of the easiest monitoring tools on the market to set up, roll out and maintain. Our users have reported being able to roll out monitoring to as many as 400 servers a day using a combination of our agentless and agent based monitoring. With 7.1 we wanted to push this number even higher to save you even more time and effort. The result is a discovery process that is at minimum, 50% faster, and much easier to use with much less manual work and human error.

For those of you who haven’t tried up.time yet, we support the following auto-discovery methods

  • Network Scan: Provide a subnet range and up.time will scan for any servers, network devices, IPs, etc… that can be added.
  • VMware vCenter Server Inventory Synchronization via up.time vSync: up.time will tap into your vCenter environment and automatically add all attached elements in a matter of seconds. up.time will also keep this inventory in sync, so newly discovered VMs & vSphere servers will have monitoring applied automatically.
  • IBM pSeries Discovery: Scan your HMC or VIO servers to discover all attached frames and LPARs

Using these methods it is very easy to populate up.time with everything in your environment. We also offer utilities that will bulk import systems from a text file if you happen to have a running list of every server in your environment.

With up.time 7.1 we are introducing these changes to the auto-discovery process

  • Faster Network Discovery: In up.time 7, the discovery process on a subnet with 200 elements took on average 4m30s (270 seconds). In up.time 7.1, the same subnet now takes 1m45s (105 seconds). That’s a 62% performance improvement, discovering subnets will now take less than half the time.
  • Faster Bulk Addition of Elements: After the network has been discovered, it took on average about 20 seconds to add each discovered element. It was a manual process that could take quite a while in large environments. In up.time 7.1 any discovered element can now be bulk added using standard credentials, this completely removes the manual overhead time and reduces the average add time to about 3 seconds. After entering credentials once, it will take about 10 minutes for up.time to automatically add all 200 elements while you go refill your coffee.
  • Wizard based discovery: The process of setting up a discovery has been streamlined so that it’s easier to see where you are in the process, and understand how each discovery option works. We have also tied vSync discovery into the auto discovery process to highlight it’s discovery capabilities.

That’s all for now, more information will become available over the coming month as we approach the up.time 7.1 release. I will also be hosting a Sneak Peak Webinar to introduce up.time 7.1 to the world in late October, more details to follow.

Capacity Planning: Do you Know your Virtualized Environment?

Wednesday, May 9th, 2012
Remember, the key to capacity planning is knowledge. You cannot manage something you don’t know.  And to stay on that point, since VMware is probably the most popular virtualization technology available, I want to elaborate on VMware capacity planning.

 

up.time communicates with VMware’s vCenter to get all sorts of data and metrics.  For capacity management, we tie all that information neatly into easily digestible reports.

One of these reports is the vSphere Workload Report.  It enables you to visually see the resource usage at the data center, cluster and ESX server level as well as resource pools, vApps and the virtual machines.  With this knowledge in hand, you can easily determine if you are maxing out your VMware environment or if there’s room to better utilize your resources.

 

Quite a lot has been said about our capabilities on VMware but that is not the only virtualization technology we work with.  Another one that we have coverage for is IBM’s LPARs.  The LPAR Workload Report shows the CPU, Memory, Network I/O & Disk I/O for all the LPARs on a pSeries server. The graphs are stacked so not only can you see how the LPARs compare to each other, you also gain insight into the overall workload on your pSeries server.  This empowers you to accurately adjust the CPU entitlements of the LPARs and keep track of the overall workload over time!

Capacity planning might sound like a difficult task to tackle but through my last few posts, I hope I have shed some light on how up.time can assist in fulfilling your capacity management needs.  If you haven’t already done so, download a free trial of up.time and see what it can do for you!

- Patrick

up.time VMware vSphere 5 Support

Friday, April 27th, 2012

You asked for it, our upcoming release of up.time 7.0 will fully support VMware vSphere 5! uptime software inc. has been a long time VMware partner, launching our first VMware monitoring solution in the early days of ESX in 2006. Since then we have released several major evolutions of our virtualization monitoring platform and are continuing to build on our relationship by fully supporting the latest vSphere edition in up.time 7.0. up.time 6.0 already fully supports vSphere 4.x, with all of the powerful features you are used to continuing to be available:

Smart VMware Monitoring

  • vSync Dynamic Discovery: Keep your monitoring inventory in lock step with any changes to your vSphere environment. The second a new VM is spun up or hosts are moved around the datacenter, up.time’s inventory will update itself instantly so that you know you don’t have any monitoring blind spots in your environment. Monitor your whole vSphere environment agentlessly through your vCenter installation.
  • Power Awareness Intelligence: Power state awareness dashboards bring real time power status information into your global view. Take control of power state changes by alerting your administrators when critical systems are powered down. 
  • VM Sprawl Control: Automatic notifications of new VMs ensure they are compliant with your IT policy. Review weekly sprawl reports to target zombie or over allocated VMs  so you can free up precious disk & memory space to the VMs that really need it.

Deep VMware Capacity Management

  • Easy Capacity Bottleneck Troubleshooting: Instantly find your bottlenecks and drill into the root cause.
  • Global Capacity Reports: How much capacity do I have? How much am I using? How much am I wasting? up.time helps you answer all of these questions so that you can reclaim valuable resources and proactively avoid embarrassing capacity outages.

In addition to all of the other great benefits of up.time, our “single pane of glass” dashboards brings together your virtual, physical and cloud environments into one monitoring toolset.

If you haven’t had a chance to try up.time yet, you can download a free trial from our website and be up and running in 15 minutes.

Dave.

Stop the insanity

Monday, June 22nd, 2009

One of the definitions of insanity is doing the same thing over and over and expecting different results.  In this case, why does this affliction continue to haunt us in IT?  Given the significant advances in technology, specifically in virtualization, all of which are supposed to make our lives easier and more efficient; why haven’t we whipped the beast of IT complexity?  The majority of IT environments are still stuck in a world of break-fix, albeit, perhaps we’re just getting into a faster break-fix mode.

In one recent Gartner article, “Server Virtualization for x86: A Benefits Impact Assessment,” there is a rather telling statement:

“From Gartner surveys and client interactions, we know that, operationally, virtualization appears to be a “wash,” at best — and it actually creates additional costs (people, process development and tools) on a worst-case basis. “

So what are we doing wrong?  One reason is that in daily operations, there isn’t an easy way to prioritize incoming incidents or determine recurring problems.  I would categorize the recurring outages as “death by a thousand cuts.”  This is further exacerbated on teams with a number of sysadmins, where the same problem can be perceived as distinct problems to each sysadmin.  Resource inefficiency is created by having multiple sysadmins solve the same problem over-and-over.

Additionally, in VMware environments, the old traditional metrics of guest CPU, Memory, and I/O are not very useful anymore.  They aren’t good indicators of how guests ‘get along’ during regular compute workloads.  There are a whole new series of VMware specific metrics that are indicators of VM guest contention from a compute, bandwidth, and memory usage point of view.  System’s management tooling needs to understand these new factors to aid in managing a virtual infrastructure, something that traditional ‘Big 4′ tooling just can’t do.  Putting ill-matching VM guests onto the same physical infrastructure is simply asking for incidents and accumulated outages over time.

The right approach should be to stop banging your head against the wall, rather than simply taking two aspirin every day and dealing with the pain.  Instead of waiting for incidents to occur, a more proactive manner of avoiding them should be possible.   With VMware’s launch of vSphere in May a package called Orchestrator is also bundled (this is from their Dunes acquisition of a few years ago).  This is fantastic news for SMBs (and enterprises too) as it means that any installation of VMware vSphere will have runbook automation capabilities.  VMware’s Orchestrator is a very simple drag and drop interface to create (potentially complex) workflows to control your virtual infrastructure.  The latest release of up.time integrates tightly with Orchestrator to add application-level monitoring  and management capabilities and can trigger specific workflows when certain applications are about to exceed SLA objectives or will degrade unless corrective action is taken.   Through an Orchestrator plug-in the up.time API is also exposed, allowing bi-directional communication between Orchestrator and up.time (so you can dynamically add systems, or re-group them on the fly).

So rather than wait for an application to fail and trigger an incident, up.time can take corrective action in advance to complete avoid the incident.  This starts to help us snap out of the break-fix routine that we’re all stuck in.  Let’s take an example of incident avoidance and dynamic infrastructure:

Let’s say that you have an e-commerce application that requires that certain response time thresholds can’t be exceeded and the concurrent user sessions are also a factor.  With up.time, since it’s already a micro-framework that can monitor your entire infrastructure (applications, databases, platforms, networks, etc.), it can trigger actions based on identified thresholds.  If user sessions starts to peak or response time begins to drop, up.time can trigger an Orchestrator workflow to dynamically provision additional VM guests and bring them online into the e-commerce application.   Also, since up.time understands the application, as workload drops over time (e.g. the user peak has dissipated), workflows can then be triggered to de-provision the extra VM guests to avoid sprawl.

There are many more exciting things in this release, but we’ll cover those in another blog post.  I’m also going to cover the exciting capability of bridging private and public cloud with up.time.  What about dynamically provisioning compute capability in Terremark’s cloud or Amazon’s EC2 from the privacy of your own infrastructure and then having these instances monitored under the global purview of up.time?  We can do this, more info next blog post.

Do more with less. Virtualize and save – but plan carefully!

Wednesday, June 10th, 2009

Here’s some more work for you. Here’s some more responsibility. Here’s a shorter deadline. Now do it all with less money, less time, less resources, less, less, less!

It seems as though the more efficient we become, the more constrained we are. The current economic climate doesn’t help either.   Yes, this is the new norm.  So what can you do?

If you’re reading this you’ve probably invested time and money into a virtual infrastructure, or are considering it.  Great!  Virtualized computing environments squeeze every last drop of performance from hardware and, when properly budgeted, can save you thousands in the long run. But don’t expect a free lunch.

Physical to Virtual Consolidation

Consolidation of physical servers to virtual hosts allows you to break the 1 application to 1 server mold.  However the increased density in your server room might create hot spots, especially if you’ve decided on using a blade chassis.

That increased density means you’ll also be pushing your hardware harder. This will likely increase your power requirements, slightly.  Newer hardware is indeed more efficient, and technologies like VMware’s DRS Distributed Power Management allow you to move workloads around to less stressed hosts and power off unused resources. The net effect is a possible overall reduction in power usage, but peak times could actually require more.

An Up Front Expense?

Virtualization is a net new expense. Unless you are starting from scratch, you will need to invest in hardware, and software licensing.  I was recently asked to vet the cost of a 24 host, enterprise level virtual environment.  Assuming a requirement of 10 Terabytes of storage, and going with mid tier hardware I came up with an up-front ballpark cost of USD $225,000.  No small change.  Amortize your projected savings carefully. Is it worth the up-front investment? Luckily you can grow your virtual environment easily as required with little to no negative impact on the existing services.

Implement Standards

Virtualization has made provisioning services a snap. You’ve heard all the marketing buzz — reduced time to market, provision servers in seconds!, etc.  Suddenly that 10T of storage is GONE.  But how?

Sprawl. (Yes, up.time can help you with this!)

Back in the days before virtualization, if you needed more resources you had to justify the expense nine ways from Sunday.  When it finally arrived you’d spend a week staging it.  Then testing and finally implementing it, only to have it completely consumed a few months later!  When you planned your virtual infrastructure you WAY over provisioned it, didn’t you?  You thought ahead 3 years like you  did when you bought a single server for that one application.  However now you’re planning for possibly hundreds of workloads.  Need another machine?  No problem, just clone it and wait a few minutes.   Ever have cash burn a hole in your pocket?  Budgets prevent us from blowing that spare cash.  It’s exactly the same in a virtual environment, except the spare cash is extra CPU cycles, storage and memory.  From simply devising a set of rules for managing the virtual machine life cycle, or implementing tools to manage it, the only way you will realize long-term savings is to ensure you’re only using what you need.  Don’t run your VM environment like that TV salesperson’s famous oven — “Set it, and forget it!”.

If you keep these things in mind when building and managing your VMware vSphere environment, or any other virtual infrastructure, you will absolutely be able to do more, with less. Of course <shamesless plug>, up.time can solve you VMware monitoring needs with it’s deep VMware monitor and reports.

VMware vSphere – Are you ready? We are!

Wednesday, June 3rd, 2009

Unless you live under a rock, you know that VMware recently released vSphere 4.  The highly anticipated upgrade to its virtual infrastructure suite.  The number of feature upgrades and enhancements makes the new version somewhat hard to ignore.  But if you’re like me you tend to shy away from .0 releases.  I usually wait for the real world installations to sort out the bugs and let the developer issue a patch or point release. Let someone else be my guinea pig.  The last thing you want is for an upgrade to nuke your production system.

I am, however, happy to report that our experience with vSphere 4 has been relatively smooth so far.  While I’ve not taken the plunge and upgraded our production environment yet, our lab upgrade from 3.5 to the 4.0 beta, and subsequently the general release went off without a hitch.  This gives me the confidence to at least begin the planning stages of the production system upgrade.

Step one is to make sure our existing systems are at the latest version of Infrastructure 3.5 and fully patched. We start that in a a week or so and I’ll keep you all abreast of the progress.  One thing I don’t have to worry about as we ready our production environment for vSphere is that the up.time monitoring station is waiting for us on the other side.  It’s just waiting for me to play catch up!

So, have you upgraded to vSphere yet?  Tell us about your experience with the process and about vSphere in general. Or even better, if you are monitoring your vSphere infrastructure with up.time we’d love to hear about your experience. You can visit the up.time website for more on vSphere Monitoring or VMware monitors.

VMware Lifecycle Management Question…

Tuesday, May 5th, 2009

I was recently over at the “Offical VMware Virtualization Group” at Linkedin and a person was asking about VM Lifecycle Management and how they could prevent VMsprawl in their datacenter.

I have been working with VMware Virtualization for the past few years and the topic of VM Lifecycle Management has become much more prevalent over the last 6 months. One of the most confusing things is that VMware named their provisioning system “VMware Lifecycle Manager” when it is really all about getting VM’s into the infrastructure. It’s not so much about managing those VM’s, preventing VM sprawl, performance monitoring of ESX/individual VMs, visibility, control and audibility.

VMware did acquire a company called Dunes Technologies which played in the Lifecycle Management game. VMware is now releasing the Dunes product as VMware vCenter Orchestrator. It really concentrates on drag and drop automation and orchestration. This in combination with vSphere will help solve some of the lifecycle management issues; however, there is still the need for a 3rd party to get total control.

Check out the VMware Monitoring page at uptime software and you will see how up.time 5 can greatly assist in getting control back in your virtual infrastructure. Better virtual management and performance, deep workload profiling, VM density optimization reporting, tracking of vMotion, VMsprawl prevention and consolidation planning tools.  All in one easy to install application that not only will help with your virtual machines but your physical boxes as well.

vSphere Makes for Interesting Times

Friday, April 24th, 2009

With the announcement of vSphere on Tuesday by VMware and the associated products that are coming as part of the overall solution, it is a very exciting time for us IT folk.  There are several enabling technologies that are a part of their new ‘cloud operating system’ that will fundamentally change how we implement IT services within the datacenter, and bridge those services between the public and private cloud.  There are two technologies that are really interesting to me, the new version of Orchestrator and VMsafe.

With Orchestrator, truly automated application delivery based on SLA targets can be achieved, and not just for the components running on vmware, but across the service stack.  Orchestrator provides hundreds of out of the box workflows, with 3rd party vendors providing their own workflow packs for Orchestrator.  Creating fully automated stacks across multiple vendors is about to become much easier.  What used to take significant development effort can be accomplished with a drag and drop workflow builder.

The other technology that is very exciting is VMsafe.  Through VMsafe, software vendors will be able to provide technology solutions that are truly application aware.  Through the API they will be able to know exactly what the application workload is doing to the vSphere environment without having to instrument the guest.

Distributed Fault Tolerant workloads, and lifecycle/stage manager/orchestrator all come together to provide a complete solution that will allow commodity servers to provide the flexibility and availability that was once the realm of MVS, but without the expense.

Posted on behalf of Chris Knowles

 

VMware announces vSphere release…

Tuesday, April 21st, 2009

Today, VMware announced the release of vSphere. On the same day uptime announced support for vSphere in the current version of up.time. We have been working with VMware over the past few months to ensure we could support this release immediately. Watching the VMware vSphere webcast this afternoon, it was great to see all the new features (thin provisioning, Distributed Power Management, Fault Tolerance and the many enhancements created in partnership with Cisco) that are now available in this “Cloud Operating System”.

What’s even more exciting is the amount of new products coming out this year from VMware.  vCenter Chargeback, vCenter ConfigControl, vCenter CapacityIQ, vCenter AppSpeed and vCenter Orchestrator just to name a few. The one that I find really exciting is vCenter Orchestrator. Basically, it’s a GUI based workflow tool that interacts with the VMware API, CLI, SSH, 3rd party systems to name a few. For those of you who have been using VMware Lifecycle Manager… you have already seen bits of vCenter Orchestrator. With over 400 pre-built workflows there is a lot of potential automation that will be able to be put in place…stayed tuned.

uptime is committed to supporting the virtualization initiative, so I am truly excited for our coming software releases of up.time over the next 2 quarters. Just this past month, I have been working with customers to assist in implementing a real “Single Pane of Glass” management system for both physical and VIRTUAL servers. Beyond that, I’m busy helping to prevent VM sprawl and assisting in virtualization/consolidation planning using up.time 5.