IT in Europe

Data dedupe: A natural fit with backup


Will cloud data protection replace tape?

Could the cloud replace tape? Learn how cloud data protection could be a good choice for your organization.

Recent surveys show that today the majority of organizations are relying on disk for data protection and backup, with the minority relying exclusively on tape. And now a new force is emerging: cloud data protection. But as is often the case with any new technology, there's a lot of vendor hype and misinformation about the cloud. The cloud can play a role in data protection, but as Whitehouse says in her column below, "there are slight nuances to its use in these situations."

In Lauren Whitehouse's latest Storage magazine column, learn how the same technologies that made disk so popular—such as data deduplication—are driving the adoption of cloud data protection, and how a cloud tier could be a good choice for some organizations.

Today, only 20% of ESG research respondents rely exclusively on tape (disk-to-tape or D2T) for their on-site backup process. For on-site backup, 80% of the companies surveyed by ESG have made the transition to a disk-assisted (disk-to-disk-to-tape or D2D2T) or disk-based (disk-to-disk or D2D) backup strategy. And among those using on-site disk, 60% eventually move data to tape for long-term retention or to facilitate disaster recovery (Source: ESG Research Report, "Data Protection Market Trends," April 2010).

Though off-site storage media today is dominated by tape, ESG expects this to change in the years ahead. By 2012, the average volume of data stored on off-site disk and third-party cloud storage is predicted to grow and become increasingly used as an alternative to off-site tape. The volume of data kept in third-party cloud storage is expected to increase nearly fourfold in the next two years, while off-site tape is expected to decrease by one-third by 2012 (Source: "Data Protection Market Trends," April 2010).

Never-ending data growth has driven the adoption of disk and data storage efficiency technologies, such as data deduplication, to execute larger and larger backup jobs within a limited window of time.

As IT organizations adopt disk for backup and decrease their reliance on tape media, interest in vaulting data electronically is on the rise. This is especially true for those employing deduplication to optimize network bandwidth and storage capacity. Maintaining data in its optimized state and electronically transferring it within and between sites creates new levels of efficiency. Provided adequate WAN bandwidth, transferring copies off-site is appealing for many companies. That's where the cloud data protection comes in.

The cloud tier

There's a lot of hype (and some confusion) about the cloud. It can play a big role in data protection strategies, but there are slight nuances to its use in these situations. In each case, a copy of data is stored in a cloud container and can be retrieved to facilitate recovery.

  • Backup Software-as-a-Service (SaaS) is when a service provider offers a web-based backup application and cloud-based storage for backup processes. Backup SaaS can leverage pure cloud resources or it can take a hybrid approach and combine some on-premises storage with storage in the cloud.
  • Cloud storage services provide a third-party cloud-based tier of storage for on-premises backup and archive solutions. On-premises, licensed backup products integrate with cloud storage providers' APIs to enable the seamless transfer of data into the cloud tier.
  • Cloud-based disaster recovery combines cloud computing and cloud storage services to enable failover to a cloud-based instance of an on-premises server.

Using the cloud as a tier

Organizations adopting a cloud tier see it as a tier of storage, unlimited in size, that doesn't have to be directly managed and may provide a more predictable cost model than acquiring, installing, configuring, maintaining and provisioning on-premises backup storage. A cloud tier is also a way to maintain remote copies of data for disaster recovery and may provide a better aggregation point for remote office and branch-office backup data than the corporate data center.

Organizations wary of leveraging a cloud tier often cite concerns about data security, access and control. Once corporate data leaves the physical security of a corporate location, protecting the privacy of the data becomes tantamount. IT administrators therefore need reassurance that accessibility to data stored in a cloud tier -- likely defined in a service-level agreement (SLAmeets the expectations of IT as well as those of their business constituents.

There's likely a big recovery time objective (RTO) improvement when recovering individual files or small data sets from the cloud because they can be directly recalled and streamed over the network. The alternative of recalling tape media, and recovering from tape, can take more time. Concerns about meeting RTOs when performing a "bulk" recovery of data are well founded. The time it would take to transfer data for a whole server or site jeopardizes any chance of meeting even the most lax RTOs.

Easing into cloud data protection

Taking advantage of on-demand cloud storage doesn't have to be complicated or disruptive. Many popular data backup applications provide integration with cloud storage services today, including CommVault Systems Inc., EMC Corp., Microsoft, Symantec Corp. and Zmanda. It's as easy as combining current on-premises backup software and bandwidth with a subscription to a supported cloud storage service, and modifying backup policies to direct a copy to a cloud target. But getting that first full backup copy to the cloud could be clumsy. For example, the size of the backup set may be too large to transfer over a low bandwidth connection in a reasonable amount of time. That's why most cloud storage service providers offer a "seeding" service that allows data to be uploaded into the cloud via portable media instead.

The easiest use case for cloud data protection is for remote office and branch-office data. The typical volume of data at remote and branch offices is well-suited to this approach; it's usually transported off site anyway and there's great potential to eliminate on-premises hardware.

About this author: Lauren Whitehouse is an analyst focusing on backup and recovery software and replication solutions at Enterprise Strategy Group, Milford, Mass.

This article was previously published in Storage magazine.

Article 4 of 4

Dig Deeper on Cloud backup

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.