What types of data yield a high deduplication ratio?

Rachel Dines

What types of data are not well suited for deduplication? What types of data dedupe well?

There are certain file types and application data that inherently won't deduplicate very effectively. Certain applications, such as Lotus Notes, simply do not yield high deduplication ratios. Structured databases also often yield poor deduplication ratios. Certain rich media file types will actually result in deduplicated output that is the same size or even sometimes larger than the original. Beyond that, anything that has a high change rate will result in low deduplication ratios.

On the flip side, companies often see very high deduplication ratios from applications that have data with a low change rate as well as NAS shares, where there are often significant amounts of redundant data stored.

Virtual server environments often yield the best deduplication ratios. Because so much data between virtual machines (VMs) is actually redundant, many firms see extremely high reduction rates in data when deduplicating VM backups.

View the next item in this Essential Guide: Cloud-based backup and software deduplication explained or view the full guide: Complete guide to backup deduplication

There are Comments. Add yours.

 
TIP: Want to include a code block in your comment? Use <pre> or <code> tags around the desired text. Ex: <code>insert code</code>

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy
Sort by: OldestNewest

Forgot Password?

No problem! Submit your e-mail address below. We'll send you an email containing your password.

Your password has been sent to: