Requires Free Membership to View
However, fundamental improvements in the realm of data protection still pose challenges in spite of the benefits of encapsulation and abstraction that VMware offers. Even with the advent of VMware virtualization, the backup guy is still the surliest of IT dudes. The most prevalent of these challenges is ensuring data consistency and addressing excessive consumption of VMware's underlying physical resources.
Because VMware can encapsulate physical servers into a handful of large hard disk image files -- virtual machine disk format (VMDK) files -- it's very tempting to think that backing up an entire server should be as easy as backing up these underlying VMDK files (and the handful of associated configuration files, of course).
But in most situations, this isn't the case. Unless the virtual machine (VM) is shut down, backing up a VM in its running state doesn't ensure that all in-flight activity is fully accounted for. In other words, this type of backup doesn't ensure that the data is consistent and, therefore, doesn't ensure that the restored VM contains enough accurate information to declare the restoration of the server as fully successful.
With respect to the challenge of excessive resource consumption, this is a side effect of virtualization. One of the key reasons to virtualize systems using VMware is to concentrate resource consumption onto fewer physical servers, thereby reducing the amount of idle cycles that most IT server infrastructures suffer from. However, in doing so, the unfortunate side effect is the inability to find enough resources to allow data backups to run unhindered.
This is also compounded by the fact that backups hit the most vulnerable points within VMware: Its narrow ability to handle excessive disk and network I/O. In fact, the decision to virtualize or not to virtualize a physical server most times is based on the intensity of disk and/or network I/O present on the physical server. Needless to say, a backup load is one of the worst loads for a VMware server to accommodate.
However, methods do exist to address these issues and provide benefits that, in some cases, are superior to standard physical server backup and recovery. But misunderstandings about each of these methods have ensued, as have the misconceptions around the implementations taken by many third-party backup/recovery products. Indeed, for many administrators, the most effective method to backup and recover VMware still escapes them; it's relegated to the realm of frustration and mystery.
Method 1: Local backup agent within each VM
How it works: This is a traditional approach to backups where the backup software agent is installed inside each VM exactly like would have been on a physical server. As outlined in the diagram below, data flows over the LAN to the backup/recovery infrastructure just as it would have if the agent was installed locally on a physical server.
Click here to see a diagram of a local backup agent installed within each VM.
The advantages of this method include:
The disadvantages of this method include:
Deployment tips
Running data backups simultaneously may work well for physical servers, which will likely have an abundance of idle resources, but for VMware virtual infrastructure, where idle resources are purposely consumed, there's a greater likelihood that multiple backup operations will choke the underlying physical server. Thus, after virtualization, backup schedules should be adjusted so that jobs are staggered through the course of your backup window to avoid excessive overlapping of jobs.
Only allow a single data stream per VM. Since the VM's VMDK files usually reside on a single VMFS volume, it's very easy to overwhelm this underlying file system with a multi-streamed job. So unless the VMDKs can separate themselves on to individual volumes, (RDMs, iSCSI LUNs or separate VMFS volumes) backups should run single-streamed rather than multi-streamed.
Method 2: Backup agent in ESX Service Console
How it works: This method involves installing a backup software agent right within the ESX Service Console and backing up each VM's underlying set of VMDK files as outlined in the diagram below. Because the Service Console is a Red Hat Linux OS, a Linux backup agent can be utilized for this.
Click here to see a diagram of a backup agent installed in the ESX Service Console.
The advantages of this method include:
The disadvantages of this method include:
Deployment tips
In order to ensure application consistency, the VMs should be shut down before the VMDKs are backed up:
You can utilize VCB utilities on ESX Service Console to obtain snapshots of a running VM:
vcbMounter utility:
vcbRestore utility:
If you decide to venture into scripting, you'll find that error-checking and correct back-out is the most difficult aspect of scripting, usually encompassing most of the code.
Method 3: VMware Consolidated Backup (VCB-Proxy)
How it works: This method involves a VMware-developed set of utilities collectively known as VMware Consolidated Backup. This method enables LAN-free backups of VMs from a centralized Windows 2003 proxy server connected to the same SAN volumes as the ESX Server. The data is then presented to the proxy for subsequent backup by a supported third-party backup application. This method is more complicated that the first two methods and involves the following components:
Backup proxy server:
VCB framework:
Backup Software Integration Module:
Click here to see a diagram of the VMware Consolidated Backup with proxy server.
VMware Consolidated Backup with the Proxy Server has the ability to conduct LAN-free file-level backups as well as LAN-free image level backups. However, these two capabilities utilize very different processes to achieve their ends.
VCB file-level backup/restore mounts the VMDK file onto the VCB Proxy Server and follows the subsequent steps:
- The backup job calls the VCB Framework to obtain a snapshot the VM and mounts the VM snapshot from the SAN to C:\mnt on the VCB Proxy Server.
- The directory/files are backed up (full, incremental or differential) using the backup application.
- The backup application calls the VCB Framework to unmount the VM snapshot and take the virtual machine out of snapshot mode.
- Individual files are restored to the original VM over the LAN via a backup agent that's installed within the VM.
Click here to see a diagram of a VCB-Proxy workflow for file-level backup and restore.
VCB image-level backup/restore exports the VMDK file onto the VCB Proxy Server and follows the subsequent steps:
- The backup job calls VCB Framework to obtain a snapshot of the VM and exports the VM snapshot from the SAN to C:\mnt on the VCB Proxy Server.
- The exported image files, including system files, are backed up using the backup application.
- The backup software then calls the VCB Framework to unmount the VM snapshot and take the VM out of snapshot mode.
- Restoration is accomplished by restoring the exported VM image using the backup application to a temporary area accessible to the VMware host such as a temporary location on the either the Proxy Server or the ESX Service Console.
- The VM image is then imported into the desired location on the ESX host.
Click here to see a diagram of a VCB-Proxy workflow for image-level backup and restore.
The advantages of this method include:
The disadvantages of this method include:
Deployment tips
About the author: Ashley D'Costa architects and designs advanced computer solutions and has technical experience with a broad spectrum of IT infrastructures.
This was first published in April 2008

Join the conversationComment
Share
Comments
Results
Contribute to the conversation