Home > Data Backup News > The pros and cons of data deduplication technology for remote-office data backup
Data Backup News:
EMAIL THIS

The pros and cons of data deduplication technology for remote-office data backup

By Rick Cook
14 Sep 2009 | SearchDataBackup.com

News and trends in the storage industry
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google

Remote offices have always been the weak link in the data backup chain. Tape cartridges have to be changed according to a rotation schedule, sent off site for archival storage, and occasionally tested to make sure the information is being backed up properly. Because remote sites seldom have storage professionals on hand, these jobs fall to non-storage people as a secondary or tertiary job.

One solution is to do away with tape storage entirely and replace it with data deduplication software. After stripping the files down to unique, new or modified data, the files can be transmitted over the Internet to the central office where the IT staff can handle the backups.

"It is an appealing solution because remote tape is a nightmare," said Peter Eicher, product marketing manager for FalconStor Software Inc., a maker of deduplication software.

More on data deduplication technology and remote office backup
A tutorial on remote data backup technology

SMBs are still using tape storage for remote-office data backup

Where and how to use data deduplication technology in disk-based backup

Why global deduplication is important in choosing a dedupe product

Deduplication is automatic once it is set up. It doesn't require intervention at the remote office to make it work. The deduplication system accepts the data from the server, removes redundancies at the block or file level, depending on the system, and forwards it to the data center (or local backup disk) for storage.

If you're backing up to a central office over your network, you need to make sure that you have enough bandwidth to do the job. Although deduplication enormously reduces the amount of data, it still needs a healthy channel to handle the load. How big a load backup to a central office will put on the network varies with the amount of data you are transmitting, and has to be determined by looking at the actual size of the deduplicated files, but typically a DSL connection is the absolute minimum. But if you're backing up to a local disk, network bandwidth isn't a concern.

Software-based data deduplication

Deduplication can be done either by software (with CommVault, FalconStor, Symantec Corp., for example) or by hardware with an appliance such as those from Data Domain. The software can be built into a storage management or backup application (CommVault Simpana and Symantec Veritas NetBackup) or be a standalone product (FalconStor File-interface Deduplication System).

Generally, deduplication by software is cheaper, but it puts a bigger load on the server. Going through the data and combing out the duplicates takes a fair amount of computing power.

When is dedupe software not cheaper? When supporting it requires a major hardware upgrade on the server. Software is also more work for administrators since the software must be kept up to date and the changeover means modifying the backup software configuration, or in some cases, replacing the backup application completely if you choose to use an integrated product like CommVault Simpana or Veritas NetBackup.

Data deduplicating appliances

By contrast, deduplication appliances are typically plug and play. The appliance sits in the remote office and processes the data stream as it is sent from the server before sending it to disk or over the network to the central office.

According to Eicher, a typical deduplication reduces the data by 15:1 or 20:1, depending on the nature of the data and how you back it up. This makes it practical to transmit the data to be backed up over the network to the data center.

Once the data has been deduplicated, it can be encrypted before being transmitted. Eicher said that compressed or encrypted data doesn't deduplicate well because there's little redundancy left in it. The data should be deduplicated first and then compressed and/or encrypted.

Some remote offices don't bother to transfer the data to a central location. They dedupe it and store it locally in a disk-to-disk scheme. Eicher said that between 40% and 50% of remote sites don't bother to send the data off site. Here the advantage of deduplication is that you can store a lot more data in the same amount of disk space. This allows the remote office to keep more backups and hence a longer history. "Even with a small deduplication system, it's easy to keep weeks or even months of data," Eicher said.

About this author: Rick Cook specializes in writing about issues related to storage and storage management.

Tags: Data reduction and deduplicationRemote data protectionVIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google



RELATED CONTENT
Data reduction and deduplication
Texas Tech turns to data deduplication for data backup, disaster recovery
EMC gives Avamar desktop and laptop support
Data backup and recovery news briefs: Dynamic Solutions introduces data deduplication products
Data archiving reduces data backup workload prior to data deduplication
Arkeia takes aim at EMC Avamar with Kadena Systems data deduplication IP buy
Data backup and recovery news briefs: Druvaa Software updates flagship product, releases inSync v3.1
Data backup and recovery vendors dig into deduplication technology, aim for cloud backup
Data backup and recovery news briefs: Data Domain upgrades data deduplication appliances
Using data deduplication with backup applications: Source vs. target dedupe
Quantum launches midrange data deduplication backup appliances

Remote data protection
HP expands laptop and desktop data backup with Data Protector Notebook Extension
Data backup and recovery news briefs: Rackspace unveils cloud-based file storage apps
Terremark acquires managed data backup and recovery provider DS3 DataVaulting
Data backup and recovery news briefs: Dynamic Solutions introduces data deduplication products
Rackspace's Jungle Disk preps cloud data backup Server Edition
Data backup and recovery vendors dig into deduplication technology, aim for cloud backup
Double-Take replication software solves remote-office data backup headache for Lennox International
Data backup news briefs: ProStor Systems ships InfiniVault removable disk backup appliance for SMBs
i365 launches EVault Offsite Replication cloud data backup and disaster recovery service
Is cloud data backup service right for your organization?

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary



Data Backup Security: Tape Encryption & Backup Security
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2008 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts