Home > Data Backup Learning Guides > Data Backup > Data deduplication > Related information > How to recover deduplicated data faster
Learning Guides: Data Backup:
EMAIL THIS
 START   ARCHIVES   SECURITY   SOFTWARE   CDP   DEDUPLICATION   DISK BACKUP   REMOTE BACKUP   TAPE   FINAL EXAM   COPIES   
Data deduplication


Related information
<< PREVIOUS | NEXT >>: CAS and data deduplication: Partners in archiving
 TIPS & NEWSLETTERS TOPICS 

BACKUP AND RECOVERY

How to recover deduplicated data faster


George Crump
02.11.2008
Rating: --- (out of 5)


Data backup technical tips
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


Data Domain, along with others companies such as ExaGrid Systems Inc. and Quantum Corp., are riding the wave of rapid adoption of data deduplication systems as the solution to unmanageable data growth. However, there have been questions raised about how these solutions will affect recovery speed.

Unlike on traditional tape, deduplicated data isn't all stored sequentially and, in fact, it could be spread out all over the disk. So that begs the question: Are you going to see longer recovery windows as a result of using a data deduplication system?

The answer, unfortunately, is it depends. If you're using two-year-old tape technology for your backup and recovery efforts today, then it's safe to say that in almost every circumstance, you'll see an improvement in recovery performance. Even if your tape technology is fairly modern you should see improvements, especially when restoring individual files and folders.

Performance bottlenecks

If you have a device that can deliver data in a recovery mode (this would have to be high-speed disk with a well-tuned virtual tape library), your storage and network infrastructure must be able to deliver that data to its intended target at those speeds. However, this is usually not the case. There's almost always a performance bottleneck that must be overcome or lived with.

It is likely that the bottleneck is going to occur on the network. The deduplicated disk will outperform a Gigabit Ethernet network segment. Even with a recovery to locally attached disk, the write performance of that target disk is a major bottleneck. Disk writes are slower than disk reads, and those writes are going to be coupled with RAID parity recalculation.

Slow or inefficient clean-up routines can also slow recovery time. These routines limit fragmentation, clean up orphaned data segments and perform data integrity checks. All of this requires processing and disk I/O power and can slow down the recovery process. Clean-up process inefficiencies are caused by slow, poor programming of these routines. They also result in clean-up times that are so slow that the routine doesn't finish prior to the next inbound backup job or these processes are running while a recovery attempt is made.

Unlike a tape drive that sits idle until the next backup job, these cleanup routines are almost always working. The important consideration is that these routines typically are active at off-backup times, typically the middle of the business day, also the time when most recovery requests will be made.

Speeding up data recovery performance

Recovery performance can also be greatly impacted when the system has to handle concurrent operations, even outside of the clean-up process, for example, if you're trying to recover data while a backup is still going on. If tape is going to stay in your data protection plan, then one of the most important scenarios to test is recovery performance from the data deduplication system while you're doing a transfer to tape.

It is also important to test your recovery performance while the data deduplication is in a failed state. For example, a drive failure is simple to simulate -- pull a drive and let the rebuild process begin. While that's going on, start a large recovery and see how that impacts performance.

Not all systems suffer under these situations and it is possible to get recovery performance that's equal or close to equal the backup performance. As is always the case, you need to establish your goals for recovery. Then, make sure that the infrastructure and the applications are tuned to be able to handle that data transfer. This may involve upgrading, fine-tuning your network or changing the way your data is laid out on your storage device.

About the author: George Crump, founder of Storage Switzerland, is an independent storage analyst with over 25 years of experience in the storage industry.


Rate this Tip
To rate tips, you must be a member of SearchDataBackup.com.
Register now to start rating these tips. Log in if you are already a member.




Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


<< PREVIOUS | NEXT >>: CAS and data deduplication: Partners in archiving
VIEW ALL IN THIS CATEGORY

RELATED CONTENT
Backup and recovery
How to implement VMware Site Recovery Manager
Data backup strategies: Migrating from tape to disk
Restoring deduped data
Troubleshooting Microsoft Exchange data backup and restores
SharePoint data recovery solutions
Virtual machine backup with CA, HP and Syncsort
How to choose the right tape library
Data deletion or data destruction?
The advantages of tape for data backup
Is online data backup right for your business?

Data reduction and deduplication
Restoring deduped data
CommVault CEO says business good despite earnings slip
IBM quickly integrates FilesX's CDP into Tivoli Storage Manager
Quantum disk revenues double, tape sales decline
Record sales reported for data deduplication products
Quantum adds management app to its data dedupe platform
Remote-office backup and data deduplication
Demystifying VMware data protection: VMware data replication methods
HP prepares double dose of data deduplication
The pros and cons of globally deduplicating data backup appliances

Related information
How to estimate your data deduplication ratio
CAS and data deduplication: Partners in archiving
In-band vs. out-of-band deduplication
Compression, deduplication and encryption: What's the difference?

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts