Home > Data Backup Learning Guides > Data Backup > Data deduplication > Related information > How to estimate your data deduplication ratio
Learning Guides: Data Backup:
EMAIL THIS
 START   ARCHIVES   SECURITY   SOFTWARE   CDP   DEDUPLICATION   DISK BACKUP   REMOTE BACKUP   TAPE   FINAL EXAM   COPIES   
Data deduplication


Related information
<< PREVIOUS | NEXT >>: Compression, deduplication and encryption: What's...
 TIPS & NEWSLETTERS TOPICS 

BACKUP AND RECOVERY

How to estimate your data deduplication ratio


Jerome Wendt
06.26.2007
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn: This tip details the factors that influence the data deduplication ratio (the ratio of data before deduplication to the amount of data after deduplication), so you can estimate the deduplication ratio you can reasonably expect to achieve.

Every data deduplication vendor claims that their product offers a certain ratio of data reduction. However, the actual data deduplication ratio can vary according to many factors, some of which are within a user's control. Below are a few variables.

Redundant data

The more redundant data you have on your servers, the higher the data deduplication ratios you can expect to achieve. If you have primarily Windows servers with similar files and/or databases, you can reasonably expect to achieve higher ratios of data deduplication. If your servers run multiple operating systems and different files and databases, expect lower data deduplication ratios.

Data deduplication information
In-band vs. out-of-band deduplication

The skinny on data deduplication

Getting started with data deduplication

Compression, deduplication and encryption: What's the difference?

Deduplication extends to archives

Special Report: Data Deduplication
Rate of data change

Data deduplication ratios are related to the number of changes occurring to the data. Each percentage increase in data change drops the ratio; the commonly cited 20:1 ratio is based on average data change rates of approximately 5%.

Precompressed data

Data compression is a key component in every vendor's data-reduction algorithm. Vendors base their advertised data-reduction ratios on the premise that compression will reduce already deduplicated data by a factor of 2:1. In a case where data deduplication achieves 15 times, compression could take that ratio up as high as 30:1. However, users with large amounts of data stored in compressed formats such as jpeg, mpeg or zip, aren't likely to realize the extra bump compression provides.

Data retention period

The length of time data is retained affects the data-reduction rate. For example, to achieve a data-reduction ratio of 10 times to 30 times, you may need to retain and deduplicate a single data set over a period of 20 weeks. If you don't have the capacity to store data for that long, the data-reduction rate will be lower.

Frequency of full backups

Full backups give data deduplication software a more comprehensive and granular view into the backup. The more frequently full backups occur, the higher the level of data deduplication you'll achieve. Deduplicating backup software products have a slight edge over disk libraries because they run a full server scan every time they execute a server backup, even though they only back up changes to existing files or new files. In between full backups, disk libraries usually only receive the changes sent as part of the backup software's daily incrementals or differentials.

Check out the complete text of the Storage magazine article, Catching up with deduplication.

Jerome M. Wendt is a storage analyst specializing in open-systems storage and SANs.


Rate this Tip
To rate tips, you must be a member of SearchDataBackup.com.
Register now to start rating these tips. Log in if you are already a member.




Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


<< PREVIOUS | NEXT >>: Compression, deduplication and encryption: What's...
VIEW ALL IN THIS CATEGORY

RELATED CONTENT
Backup and recovery
Data backup options for remote sites
The differences between block-based and file-based data backup
How to implement VMware Site Recovery Manager
Data backup strategies: Migrating from tape to disk
Restoring deduped data
Troubleshooting Microsoft Exchange data backup and restores
SharePoint data recovery solutions
Virtual machine backup with CA, HP and Syncsort
How to choose the right tape library
Data deletion or data destruction?

Related information
How to recover deduplicated data faster
CAS and data deduplication: Partners in archiving
In-band vs. out-of-band deduplication
Compression, deduplication and encryption: What's the difference?

Data reduction and deduplication
Data backup options for remote sites
Restoring deduped data
CommVault CEO says business good despite earnings slip
IBM quickly integrates FilesX's CDP into Tivoli Storage Manager
Quantum disk revenues double, tape sales decline
Record sales reported for data deduplication products
Quantum adds management app to its data dedupe platform
Remote-office backup and data deduplication
Demystifying VMware data protection: VMware data replication methods
HP prepares double dose of data deduplication

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts