It's Time to Rethink Tape Storage - A Brief Overview
Keeping data “just in case” is no longer an option
The death of tape as a corporate storage medium is an exaggeration. The vast majority of organizations still use tape storage to back up data either onsite or off-site. As the volume of data continues to rise exponentially, off-site tape capacity looks to to increase significantly.
Historically, the low cost of tape storage has only encouraged organizations to keep and store all data “just in case.” This has served to unnecessarily bloat IT budgets and increase e-discovery risks and costs when an investigation, litigation or merger/ acquisition occur.
The main problem of a “save-everything” strategy
The problem is that many organizations, even household names, have lost track of the data that they have stored on tape. There are many fundamental questions that cannot be answered quickly or efficiently:
- What content are we currently storing?
- Is it located on-site, off-site or at a storage vendor - and if so, on which tape?
- What data is truly necessary for business continuity or legal purposes versus duplicate or irrelevant data that should be disposed of?
Hidden costs of accessing old data
Some information never expires and may already be in storage, such as proprietary drawings, prototypes or formulas. If this and other types of data must be retained for longer time periods, what is the plan to ensure it remains accessible as current technology becomes obsolete and the operational costs to maintain legacy systems purely for restoration purposes is no longer practical?
Plan your (data) retirement
If an organization doesn’t address these questions via a well-defined process, the default answer for both stored data and legacy systems automatically becomes “keep everything.” On a practical level, information about managing types of content is rarely tracked and communicated via an existing Service Level Agreement (SLA) and the unnecessary expense and risk multiplies. Lack of clarity about what is stored on tape is only one problem. Another important issue is that historical data can be difficult to access from tape storage. Organizations routinely back up and store information, thinking their processes are strong and the data is sound. However, a variety of issues including user error and hardware/software faults can hinder data retrieval, some of which are never discovered until the organization is in reactive crisis mode and scrambling for alternatives.
Get that file! Not as easy as you think
An organization has a duty to preserve relevant data upon reasonable notice of litigation. And many times, it’s only when a company is required to access the data that common problems are encountered, including:
- Backup software failure: Backup software is set up correctly and the process is kicked off. However, the actual backup data itself is never verified, so it doesn't register and the data cannot be located when it is required.
- Storage media failure: This is includes tape drive failure or corrupt or inaccessible tapes. Information written to tape can’t be read because of logical errors in the data. There is a significant difference between data from the last backup versus data from the point of failure.
- Human error: Errors such as accidentally re-initializing a tape or forgetting to enable the append option before starting a backup are common.
- Volume of data and "findability": Sheer volume of data and the ability to find specific content within the corporate memory is a common problem. How do you know if data is lost or missing? For example, when companies merge, the operational, accounting, and client data of both companies needs to continue to be available. The various backup landscapes have to be synced (e.g. proprietary backup systems in Windows environments).
- Aging systems and obsolescence: There is a need to maintain legacy data; converting old static systems to another format or newer technology. Auditors may also request submission of old data records, as in the case of one bank, the submission of 17,000 sets of entries from the 1980's. The tapes were available, but the software and drives were no longer serviceable.
- Disaster: Fire, water damage, mud, extraordinary cold, heat or other natural catastrophes are often the reasons for tapes becoming contaminated, damaged and no longer legible using the standard means.
- Forensically unsound methods: The data may be “readable” by a human, but moving data incorrectly can modify the file or system metadata relied upon for compliance, investigative, and e-discovery purposes. If this results in making the data difficult - if not impossible - to recover, it will not excuse a corporation from its duty to provide the information required. Some organizations are simply gambling that their legacy data will not be an issue and will be accessible and usable if and when they need it.
S.O.S. is at hand
Historically, it has been time consuming, technically difficult, and cost prohibitive to incorporate legacy data into an organization’s overall information life cycle management (ILM) plan. After relying on IT to restore the data, legal would work with IT to analyze the relevant data required to support an investigation or lawsuit. Due to budget and infrastructure limitations, restoring thousands or tens of thousands of tapes was not feasible.
The problem has been addressed by using technology to streamline the entire process. Rather than rely on a false sense of security, many corporations seek expert consultative assistance (It's key to ensure they have proven experience using forensically sound methods and deep expertise in legal, compliance, and IT issues) to help them manage stored data more efficiently and reduce the load on IT personnel and infrastructure.
7 techniques to get your tapes under control
- Data identification, mapping and collection: Locate, preserve and collect business critical and legally relevant data
- Migration: Safely migrate large amounts of data to alternative media or formats
- Media consolidation: Combine incremental or differential backups into one backup
- Media conversion: Seamlessly convert data from one format to another
- Tape cataloging: Catalog to save time, money and resources
- Tape duplication: Reproduce data with ease
- Tape ingestion: Prepare to ingest identified files or tapes into a legal hold or archiving system
The perfect storm of a highly regulated environment, rapidly growing amounts of data, and the need for greater responsiveness and transparency means that organizations need to address the issue of poorly managed tape storage sooner rather than later. It's not in anyone’s interest to leave the huge task of sorting and cataloging data stored on aging tapes to the 11th hour, or when regulators come knocking at the door.