How do you determine which data to store on low-cost disk?

How do you determine which data to store on low-cost disk?

Do you have any method of determining what data should be stored on cheaper media like low-cost disk?

    Requires Free Membership to View

    When you register for SearchDataBackup.com, you’ll also receive targeted emails from my team of award-winning editorial writers. Because your job never seems to get any easier, it’s our goal to keep you up-to-date on the latest backup tips, trends and technologies that will help you get the job done.

    Rich Castagna, Editorial Director

    By submitting your registration information to SearchDataBackup.com you agree to receive email communications from TechTarget and TechTarget partners. We encourage you to read our Privacy Policy which contains important disclosures about how we collect and use your registration and other information. If you reside outside of the United States, by submitting this registration information you consent to having your personal data transferred to and processed in the United States. Your use of SearchDataBackup.com is governed by our Terms of Use. You may contact us at webmaster@TechTarget.com.

At a high level, a rule of thumb would be to keep high value data on your premium disk storage technology, then you could store less frequently accessed, perhaps aging data on lower-cost disk and finally, all remaining data that is kept for archive purposes on nearline, or even offline media such as tape, WORM, optical disk, etc. High value data in this context is, revenue generating or business-critical data.

However, this involves a manual data migration process that many software vendors have been trying to automate for quite some time. This is what has been driving the ILM efforts for some time but unfortunately, the options are still somewhat limited in terms of automation.

One of the obstacles in determining the value of data is that value is not easy to determine for certain types of data. For example, you can't always easily put a dollar value to medical records or government records. Sometimes, the value of data is not calculated in terms of lost revenue but rather, in terms of liability, loss of public confidence, or political embarrassment, which is much harder to quantify.

A simpler way to approach this may be to base your decision on access frequency and time. It begins to make sense to keep frequently accessed data on disk storage that can handle the performance and availability requirements associated with this type of data. Data that becomes less frequently accessed can then be migrated to lower-cost disk (which could, but not necessarily come with fewer costly performance and availability features). When storage decisions are made based on data access frequency, there are a few more options in terms of software products that can help identify the data to be migrated or even help automate the migration (i.e.: HSM solutions).

This was first published in August 2004