This article is part of an Essential Guide, our editor-selected collection of our best articles, videos and other content on this topic. Explore more in this guide:
3. - How deduplication is used today: Read more in this section
- Data backup process angst persists, but dedupe and cloud offer relief
- What types of data yield a high deduplication ratio?
- Software-based deduplication and cloud backup
- Windows Server 2012 deduplication vs. backup software dedupe
- Using deduplication when backing up VMs
- Education group goes all-dedupe for data center, remote data backups
- How centralized backup can ease remote site data protection challenges
- Using data dedupe technology as part of your disaster recovery strategy
- Finding data deduplication solutions for DR
- Deduplication and data lifecycle management
Explore other sections in this guide:
- 1. - Backup deduplication technology today
- 2. - How deduplication is performed
- 4. - Data deduplication challenges
What types of data are not well suited for deduplication? What types of data dedupe well?
There are certain file types and application data that inherently won't deduplicate very effectively. Certain applications, such as Lotus Notes, simply do not yield high deduplication ratios. Structured databases also often yield poor deduplication ratios. Certain rich media file types will actually result in deduplicated output that is the same size or even sometimes larger than the original. Beyond that, anything that has a high change rate will result in low deduplication ratios.
On the flip side, companies often see very high deduplication ratios from applications that have data with a low change rate as well as NAS shares, where there are often significant amounts of redundant data stored.
Virtual server environments often yield the best deduplication ratios. Because so much data between virtual machines (VMs) is actually redundant, many firms see extremely high reduction rates in data when deduplicating VM backups.