Ask the Expert

What open-source data deduplication software options are available?

A few open-source data deduplication software options are currently available. BackupPC is an open-source backup software package that uses file-level deduplication. It uses a hashing algorithm to identify files that may be identical, then does a binary compare to confirm that they are the same. If they are, it replaces one of them with a link to the other.
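The hash-then-compare approach described above can be sketched in a few lines of Python. This is only an illustration of the general technique, not BackupPC's actual code: the function names `file_digest` and `dedupe_tree`, the choice of SHA-256, the hard link, and the temporary file suffix are all assumptions made for the example.

```python
import filecmp
import hashlib
import os


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def dedupe_tree(root):
    """Replace byte-identical files under root with hard links.

    Returns the number of files that were replaced by a link.
    """
    seen = {}  # digest -> paths already kept for that digest
    replaced = 0
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # walk in a stable order
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            candidates = seen.setdefault(file_digest(path), [])
            for original in candidates:
                if os.path.samefile(original, path):
                    break  # already linked to a kept file
                # A matching hash only suggests a duplicate; confirm byte by byte.
                if filecmp.cmp(original, path, shallow=False):
                    temp = path + ".dedupe-tmp"  # hypothetical temp name
                    os.link(original, temp)
                    os.replace(temp, path)  # swap the link in atomically
                    replaced += 1
                    break
            else:
                # No confirmed duplicate: keep this file as a new original.
                candidates.append(path)
    return replaced
```

Calling `dedupe_tree("/path/to/backups")` on a directory tree would leave one physical copy of each distinct file. Hard links only work within a single file system, so a real tool would need to handle that case.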

Oracle's Solaris ZFS is another popular open-source file system option, and Oracle has recently announced support for data deduplication in it. According to the ZFS "Dedupe FAQ", ZFS eliminates duplicate blocks between files. The FAQ also says that the performance hit should be negligible as long as the deduplication table fits in RAM. However, it does not offer any guidance as to how big that table might grow.
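To show what eliminating duplicate blocks between files means in general terms, here is a toy in-memory sketch in Python. It is not how ZFS is implemented: the `BlockStore` class, its method names, the fixed block size, and the use of SHA-256 as the block key are all assumptions chosen for illustration.

```python
import hashlib


class BlockStore:
    """Toy block-level deduplicating store (illustrative only)."""

    def __init__(self, block_size=4096):
        self.block_size = block_size
        self.blocks = {}  # digest -> block bytes; stands in for the dedup table
        self.files = {}   # file name -> ordered list of block digests

    def write(self, name, data):
        """Split data into fixed-size blocks and store each unique block once."""
        digests = []
        for offset in range(0, len(data), self.block_size):
            block = data[offset:offset + self.block_size]
            digest = hashlib.sha256(block).digest()
            self.blocks.setdefault(digest, block)  # keep the first copy only
            digests.append(digest)
        self.files[name] = digests

    def read(self, name):
        """Reassemble a file from its block digests."""
        return b"".join(self.blocks[d] for d in self.files[name])

    def stored_bytes(self):
        """Total bytes held after deduplication."""
        return sum(len(block) for block in self.blocks.values())
```

In this sketch the table holds one entry per unique block, so it grows with the amount of unique data written rather than with the number of files. That suggests why keeping the table in RAM matters for lookups on every write, although the real sizing depends on details the FAQ does not give.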

This was first published in February 2010