Most avoidable data damage is caused by human action in the first sixty minutes after a failure, the so-called golden hour. Here are a few examples:
- Swapping the wrong disks in the RAID configuration.
- RAID corruption caused by an incorrect RAID controller replacement.
- Forcing disks online.
- Initializing the wrong disks.
- Failed rebuilds.
- Restores from incomplete backups on the same system.
- Performing a Storage vMotion while the storage system is failing.
In all these cases it is essential that all activities are stopped and that no further changes are written to the original configuration of the system or to the individual storage devices themselves. Because a RAID system is the base of every modern storage unit, whether a server, Network Attached Storage (NAS) or a Storage Area Network (SAN), the reason for data loss can be a logical issue, such as user interaction, or physical damage. At first it is not always obvious what the underlying cause of the issue might be.
Often you only have one chance to get the system back up and running and to recover stored data. It is essential to reach out to data recovery experts. Ontrack engineers will help you determine how to proceed and will inform you about the chances, risks and possible next steps.
Ontrack, your RAID data recovery partner
Acknowledged global data recovery leader Ontrack uses advanced techniques to repair, recover and reconstruct RAID data that has become inaccessible.
Our engineers, backed by 35 years of data recovery research, experience and development, have the knowledge and tools to get your data back. You'll know exactly what data is recoverable before committing to the actual recovery. We perform RAID data recoveries on:
- Any RAID level
- Any RAID controller or RAID architecture, including software-defined storage (SDS)
- Any hard drive type, make or model
What are the symptoms of a broken RAID?
Many symptoms can hint at a RAID array failure:
- Error messages
- Beeping sounds / alarms
- (red) light on the server
- Drives making noises
Less obvious symptoms:
- Data corruption
- Database corruption
- Unable to access files
- Logical drives disappearing
- Lost connection to server, volume, share, LUN
- Volume down
- Degraded system speed
- No more access to system
Common causes of RAID data loss.
Each RAID data recovery is unique, but there are a few common causes for data loss.
RAID systems are built to withstand individual drive failure, but once a RAID is running in a degraded state, the workload of the remaining drives increases and so does the risk of additional drive failures. Additional drive failure can lead to failure of the RAID. We have the clean room environment and expertise required to recover data from failed drives, combined with the logical skills and tools to rebuild and repair file systems from failed or partially rebuilt RAID systems.
RAID systems can be affected by loss of power, power cycling and power surges causing loss of data. RAIDs running in a degraded state are especially vulnerable, as power issues can cause out-of-sync drives to be reincorporated into the array. In these cases, our teams have the tools and expertise to map the data and logically rebuild the RAID in order to extract the required data.
Fire, water, dirt and other contaminants from natural disasters can destroy a RAID floor in an instant. Rebuilding file systems from a RAID affected by a natural disaster takes specialized facilities, knowledge and tools to decontaminate the drives and logically rebuild the data. Ontrack has state-of-the-art tools and cleanroom environments to safely recover from physically damaged drives.
Whether by accident or completed with malicious intent, data loss from reformatting, reinstalling or volume overwrite can occur throughout any organization of any size. However, with the knowledge and expertise of Ontrack, even the most damaging human error can be conquered by expert data recovery engineers.
Whether conducted through computer information systems, computer networks, infrastructures, or personal computer devices, unwelcome attempts to steal, expose, alter and/or destroy sensitive information through data breaches and cyber attacks cause significant data loss. Ontrack has the tools and expertise to safely recover your data from a malicious cyber attack.
Ransomware is a form of malware, malicious software that encrypts a victim's files. A ransomware attack prevents an individual from accessing their computer or any data stored on it. This form of attack can result in the computer itself becoming locked, or the data on it being stolen, deleted or encrypted. Ontrack’s expert data recovery engineers can recover data from a ransomware attack.
What to do when experiencing RAID failure?
Before doing anything, answer the following questions:
- What specifically happened, and how did the event unfold?
- What data is lost?
- How critical/valuable is this data?
- Do I have a backup or other means of getting this data back?
- What are the financial implications of this data loss?
The answers to these questions will be crucial to determining the data recovery next steps.
Often you only have one chance to get the system back up and running and to recover stored data, so it is important to gather as much information as possible.
Call for assistance
A free expert consultation will cover...
- An assessment of the situation, the problem, and possible data loss.
- Gaining insight into the hardware, RAID configuration, virtualization software, and backup procedures.
- First aid, advice, and tips to prevent future data loss.
- Determining whether a Remote Data Recovery (RDR) can be performed.
- Remote Data Recovery (RDR) Services - In the event that only one hard drive of a RAID is faulty, RDR is the fastest, easiest, and most cost-effective solution. An RDR recovery takes place via an encrypted internet connection directly to your server.
- Tips and advice on how to shut down the server, pack up the hard drives and transport them safely to our clean rooms.
You can call us around the clock 24/7 for a free consultation. In the event of an emergency, we will take immediate action and try to resolve the problem straight away.
How fast do you need your RAID data back?
We understand the urgent need to recover data and offer service levels to meet your needs.
Standard: 5-15 business days
Priority: Average of 3-7 business days
Emergency: 24/7 until completion
At each service level our engineers analyze your media to determine the condition of the data. As a result, you will be provided an option to receive an online report showing all recoverable files before you decide to proceed with the recovery. After your data is recovered, it's returned on the media of your choice (HDD, CD, DVD, tape, USB drive) or made available for encrypted download from a secure server.
RAID Server Data Recovery options
Whether your device has suffered physical damage or undergone a logical failure, we offer you a range of services to match your specific needs.
Our experts are on-hand and will respond to your issue immediately. From assisting with troubleshooting to onsite recovery, our wide range of expert recovery options will ensure you receive the very best solution to solve your RAID data loss problem.
The best recovery option for any type of RAID system, no matter what data loss situation has been experienced. This recovery service is provided in our state-of-the-art data recovery lab and ISO-5/Class 100 cleanroom environment to ensure the safety of your data.
Remote RAID Data Recovery
Your data is recovered remotely without your media ever leaving your premises. We connect to your system via the Internet to perform a live recovery. Available when the storage device or system is still operational.
If your RAID system cannot leave your premises, our engineers can bring their recovery expertise to you. This option is only available for emergency service. For on-site data recovery, the storage device or system must be operational.
Custom RAID Data Recovery
We offer custom-designed recovery solutions for proprietary RAID systems and/or highly complex enterprise-level systems. Our R&D team will work with your IT staff to create an emergency solution to retrieve your data.
Client Attended Recovery
A specialist version of our in-lab service for highly confidential and sensitive information. Your team of trusted staff can transport the device to our facilities and stay with it during the recovery process, ensuring the data remains guarded. This service is only offered with the appropriate confidentiality agreements and insurance certificates.
RAID recovery by Ontrack at all levels
Data loss can occur at different levels. Because of the vast amount of experience and patented technology gathered by our global team of data recovery experts, Ontrack is able to perform data recovery at all levels.
The first level of data loss begins with the storage medium. This can be a single hard drive or RAID storage with multiple drives (DAS, NAS, SAN). In a recovery for a loss at this level, Ontrack's engineers secure as much RAW data as possible. A copy is made of the working disks and in a clean room our engineers work on the electronics and mechanics to get any defective media operational again, so that the RAW data can be copied. With self-developed and patented tools, the RAID controller is imitated. This can be done even for the complex SAN systems that often lay their own block distribution over a combined RAID system, thus making the lost data visible again.
Most operating systems have a utility that automatically performs repairs. However, this can permanently damage the data. Always work with a copy. With virtual systems, there is additional complexity due to the combination of the virtual file system and the file systems running within the virtual machines.
The third level of data loss can occur within the file itself, such as SQL, Oracle and Exchange database files. These files can be quite complex. The internal structure can become corrupted to the point where the DBMS makes the database unavailable.
Our simple 4-step data recovery process
We ensure that our process is transparent, quick and safe. You'll be informed every step of the way for complete peace of mind.
Contact us 24/7 worldwide to obtain a free data recovery consultation and written price quote.
The entire evaluation process is transparent. Once we receive your device, our engineers recommend the best solution, send a fixed price quote and an overview of service levels and delivery schedules.
With your approval, we recover your data based on your chosen service level. Through our secure portal you can track the status of your recovery and view a list of recoverable files.
Once your data has been recovered, we’ll send it back to you on an encrypted external device via next day delivery free of charge.
Successful RAID Recovery Case Studies
Ontrack uses advanced techniques to repair, recover and reconstruct server data that has become inaccessible.
Ontrack is highly proactive in keeping up with the incredible technological developments in data storage over the last 30 years. Today, we perform over 50,000 data recoveries per year around the world. RAID data recovery is our passion, and we try to involve our customers in it as much as possible. Our experience and expertise have created the world's best data recovery practices to give our valued customers a sense of confidence.
RAID Recovery Tips
- Never replace a failed hard drive with one that was previously part of a RAID system – ensure you completely erase the replacement drive before use.
- If a drive makes unusual mechanical noises, turn it off immediately and seek help.
- Obtain a bit-by-bit image of the hard disks before making any changes to hardware, software, or performing a RAID rebuild.
- Label the disks with their position in a RAID array.
- Do not run recovery programs on suspicious volumes. Volume repair tools, such as CHKDSK, are designed to force file systems to a consistent state. This can be useful for a single-disk system with simple errors. On a RAID system, inconsistency of a volume can usually be traced to problems with the RAID (e.g. out-of-sync disks or a failed rebuild). Using CHKDSK in such scenarios can result in harmful and irreversible changes to the file system.
- Do not run defragmentation programs on suspected failed disks.
- Do not run volume recovery programs in cases of a power failure on a RAID array, when the file system looks suspicious or is un-mountable, or the data is inaccessible after power is restored.
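The tip about obtaining a bit-by-bit image can be illustrated with a minimal Python sketch. It is an illustration only: `image_device`, its parameters and the chunk size are hypothetical names chosen for this example, and the code assumes a source that reads without errors. It has no bad-sector handling, so it is not a substitute for professional imaging tools, and a drive that makes mechanical noises should be powered off rather than read.

```python
import hashlib


def image_device(source_path, image_path, chunk_size=1024 * 1024):
    """Copy source_path to image_path byte for byte.

    Returns (bytes_copied, sha256_hex). The source is opened read-only,
    so nothing is written to the original.
    Assumption: every read succeeds; read errors are not handled here.
    """
    digest = hashlib.sha256()
    copied = 0
    with open(source_path, "rb") as src, open(image_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break  # end of the source reached
            dst.write(chunk)
            digest.update(chunk)
            copied += len(chunk)
    return copied, digest.hexdigest()
```

Recording the returned hash next to the image makes it possible to check later that a working copy still matches the first image taken.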
Recommended RAID Technology Partners
The world's leading technology companies trust Ontrack's expertise for RAID recovery and data recovery.
From LSI MegaRAID to Avago, Broadcom to Adaptec, we collaborate with the world’s leading RAID brands to guarantee the safe, efficient, and secure recovery of your important data.
Most common NAS vendors:
- NETGEAR ReadyNAS
Most common server vendors:
- Supermicro
Most common RAID controllers:
- LSI MegaRAID
- Dell PERC
- Intel RAID
- HP Smart Array
- IBM ServeRAID
RAID recovery for virtual environments
Because virtualized systems comprise a unique marriage of software and hardware components, the successful data recovery of inaccessible or lost data requires superior knowledge of hypervisors, host machines, storage devices, RAID arrays, file structures, applications, and more.
Ontrack's global team has years of experience and training, a host of proprietary tools, and specialized knowledge gained through unique working relationships with virtual machine software vendors and hardware OEMs that recommend our services. Ontrack virtual machine data recovery experts can help with the following:
- VMware ESX and vSAN (VMDK files)
- Microsoft Hyper-V (VHD, VHDX, and AVHDX files)
- Citrix Xen
- Oracle VM
We are the global industry leader in data recovery
Recommended by manufacturers
Advanced information security and certifications
Unrivaled global expertise
Highly valued personal service
Toolset covering all layers
Largest team of specialists
Self-developed & patented tools
Custom build JIT recovery
Proven methods since 1985
50,000 customers per year
Start your RAID data recovery now with a free consultation.
Contact our team of experts for RAID data recovery.
Background information about RAID
What is RAID?
Redundant Array of Independent (originally Inexpensive) Disks (RAID) storage has revolutionized enterprise data storage technology, building in the peace of mind of redundancy (from RAID 1 & above) which can greatly minimize downtime suffered due to individual drive failures.
RAID System Overview
RAID is a term used for computer data storage schemes that spread and/or replicate data among multiple hard disk drives. RAID was designed with two key goals: increased data reliability and increased I/O (input/output) performance. It rests on three key concepts:
- mirroring, the copying of data to more than one disk
- striping, the splitting of data across more than one disk
- error correction, where redundant data is stored to allow problems to be detected and possibly fixed (known as fault tolerance)
RAID System History
RAID is the acronym for Redundant Array of Inexpensive Disks (Redundant Array of Independent Disks). The concept was born at the University of California, Berkeley, where David A. Patterson, Garth Gibson and Randy H. Katz were collaborating to produce operational prototypes of five levels of RAID storage systems. The result of their research has formed the basis of the complex RAID storage systems that exist today. Today IBM holds the intellectual property rights on RAID 5.
The different types of RAID
Ontrack offers data recovery services for all major RAID architectures. This includes RAID levels 0, 00, 1, 10, 1E, 1E0, 2, 3, 4, 5, 50, 5E, 5EE, 6 and 60. We also work with a large number of proprietary RAID arrays.
The continuous development of our software tools ensures that we use the latest state-of-the-art and proprietary techniques to achieve the best possible data recovery. In addition, the research & development team assists our engineers in data recovery when they are confronted with unusual proprietary RAID arrays, through custom tools created especially for the occasion.
We are recommended by most RAID vendors such as HP, Compaq, Dell, Adaptec, IBM, Intel, Promise, LSI Logic, Mylex, Xiotech and Netsan. Each has its own RAID configuration, with a specific data block size, parity size and symmetry.
Our RAID recovery capabilities do not stop at NTFS; our skills also extend to Mac, UNIX, FAT and VMware RAIDs.
The RAID configuration, including the number of disks used, determines the type. As a reminder, RAID (Redundant Array of Independent Disks) is a storage solution that distributes data across multiple small disks that together form a single system. In addition to being less expensive, such a system offers a high level of performance and data security, because RAID tolerates breakdowns better.
Today, there are nearly twenty types of RAID, if not more, including configurations sometimes considered obsolete, among them RAID 2. Of all these RAID configurations, the best known are RAID 0, RAID 1, RAID 10, RAID 2, RAID 3 and 4, RAID 5 and RAID 6.
RAID 0 is a system that uses two or more disks and provides fast access to data. RAID 1 uses at least two disks and writes duplicate information. If one of the disks is damaged, you will find your data on the other. To benefit from the performance of RAID 0 and the security offered by RAID 1, RAID 5 was created. With a good distribution of data, RAID 5 combines speed and fault tolerance. RAID 6 has the same advantages as RAID 5, with the added bonus of surviving two simultaneous drive failures, although its writes are slower because of the extra parity.
RAID 0 is the classic data striping configuration, where data is written across all drives, resulting in faster access. However, this performance carries a risk: if even one disk in a RAID 0 fails, serious data loss can occur. The diagram below shows how the data is distributed across the array.
An example of a data recovery situation: a file occupies data stripes 1 to 4. If drive 2 were to fail and the 2nd stripe were lost, the file would most likely become corrupted. Put another way, if one drive fails, the largest possible intact file would have to be smaller than the combined size of the remaining stripes.
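The striping behaviour described here can be sketched in a few lines of Python. This is a simplified model and not a recovery tool: the function names are hypothetical, disks and blocks are numbered from 0, and the array size of four disks in the usage note is an assumption made for the example.

```python
def locate_block(logical_block, num_disks):
    """Return (disk_index, stripe_index) for a logical block in a RAID 0 set.

    Blocks are dealt out to the disks in round-robin order, so the disk is
    the remainder and the stripe is the quotient of the division.
    """
    if num_disks < 2:
        raise ValueError("RAID 0 needs at least two disks")
    return logical_block % num_disks, logical_block // num_disks


def blocks_lost(file_blocks, failed_disk, num_disks):
    """List the logical blocks of a file that lived on the failed disk."""
    return [
        block
        for block in file_blocks
        if locate_block(block, num_disks)[0] == failed_disk
    ]
```

For a file occupying logical blocks 0 to 3 on a four-disk set, `blocks_lost(range(4), failed_disk=1, num_disks=4)` returns `[1]`: the second block is gone, and with it the integrity of the file.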
RAID 1 is the RAID level that sets up disk mirroring; the data on the primary disk is duplicated onto the other. There are no performance gains for this RAID, but if one drive fails, you will have a backup on the second one.
|RAID 1E||In this layout, data striping is combined with mirroring, by mirroring each written stripe to one of the remaining disks in the array. Usable capacity of a RAID 1E array is 50% of the total capacity of all drives forming the array; if drives of different sizes are used, only the portions equal to the size of smallest member are utilized on each drive. One of the benefits of RAID 1E over usual RAID 1 mirrored pairs is that the performance of random read operations remains above the performance of a single drive even in a degraded array.|
|RAID 2||RAID 2 consists of data striping at the bit level with a dedicated parity drive. This level uses Hamming error detection codes and is intended for use on drives that do not have built-in error detection. For this reason, RAID 2 is not commonly used.|
|RAID 3 and 4||RAID 3 and 4 both use striping with a dedicated parity drive. The difference between the two is that RAID 3 stripes at the byte level while RAID 4 stripes at the block level. RAID 3 is seldom used these days due to the poor performance of byte-level striping. RAID 4 is better with block-level striping but still suffers slower write performance because the parity has to be updated on every write.|
RAID 5 is generally considered to be the best compromise between fault tolerance, speed and cost. The system divides the data in the same way as a RAID 0, but also distributes the parity data on all the hard disks that compose it. Each vendor has its own specific way of distributing parity information on disks, but it will almost always be one of these four ways: left asymmetric, left symmetric, right asymmetric and right symmetric. In the following diagram you can see how the location of the parity data is distributed over the disks.
The direction of parity is simple to identify: as you can see, it "moves" either right or left. In asymmetric RAIDs the data strips ignore parity; they skip it until they reach the next available space. Symmetric RAIDs handle data strips in a slightly more complex way: once the data encounters a parity block, it moves sideways and down to the next stripe set.
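The four layouts can be made concrete with a short Python sketch. It follows the definitions that Linux software RAID uses for these four names, which is an assumption for illustration: a given hardware controller may start its rotation elsewhere or use a proprietary scheme. All function names are hypothetical, and disks, stripes and data blocks are numbered from 0.

```python
LAYOUTS = ("left-asymmetric", "left-symmetric",
           "right-asymmetric", "right-symmetric")


def parity_disk(stripe, num_disks, layout):
    """Disk that holds the parity block of the given stripe."""
    if layout not in LAYOUTS:
        raise ValueError(f"unknown layout: {layout}")
    if layout.startswith("left"):
        # Parity starts on the last disk and moves left on each stripe.
        return (num_disks - 1) - (stripe % num_disks)
    # Parity starts on the first disk and moves right on each stripe.
    return stripe % num_disks


def data_disk(stripe, index, num_disks, layout):
    """Disk that holds the index-th data block of the given stripe."""
    parity = parity_disk(stripe, num_disks, layout)
    if layout.endswith("asymmetric"):
        # Data fills the disks in order and simply skips the parity disk.
        return index if index < parity else index + 1
    # Symmetric: data starts on the disk after parity and wraps around.
    return (parity + 1 + index) % num_disks


def stripe_row(stripe, num_disks, layout):
    """Return one stripe as a list of labels, e.g. ['D0', 'D1', 'D2', 'P']."""
    row = [None] * num_disks
    row[parity_disk(stripe, num_disks, layout)] = "P"
    for index in range(num_disks - 1):
        row[data_disk(stripe, index, num_disks, layout)] = f"D{index}"
    return row
```

Printing `stripe_row(s, 4, layout)` for the first few stripes of each layout gives a quick way to compare a candidate layout against the block order observed on the member disks.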
The E in RAID 5E stands for "extended", as it adds on to or extends the capabilities of RAID 5. The extended spare drive is part of the overall RAID 5E and can be used for input/output operations. The addition of a hot spare drive within RAID 5E helps in distributing I/O load or operations, resulting in better performance than RAID 5.
|RAID 5EE||RAID 5EE is a type of nested RAID level that is similar to RAID level 5E but provides better spare drive features. As with RAID level 5E, it further extends the capabilities of RAID level 5. The extended or additional spare drive is part of the overall RAID 5EE and can be used for input/output operations.|
The RAID 6 system is an extension of RAID 5: it performs the same data distribution and adopts a similar division of parity, but generates an additional parity block for each stripe. This way, even if two disks were to fail simultaneously, the RAID would not suffer data loss. In smaller RAIDs, the likelihood of two hard drives failing simultaneously is lower, but as the size of the RAID array increases, so does the chance of multiple failures.
As for performance, reads are very similar to those of RAID 5, because the data and parity blocks are spread over all disks, but writes are slower, because two parity blocks have to be calculated and updated for every stripe.
|RAID 0+1 and 1+0||
To gain performance and/or additional redundancy, the standard RAID levels can be combined to create hybrid or nested RAID levels. RAID types that provide redundancy are typically combined with RAID 0 to boost performance.
As you can see from the diagrams below, these two levels of RAID are a combination of RAID 0 and RAID 1. The difference between the two is the actual position of the RAID array, shown by the diagrams where the bands are in bold.
RAID 01 is configured so that a RAID 0 stripe set is mirrored onto a second RAID 0 stripe set.
The advantage is that when a drive fails in one of the level 0 arrays, the missing data can be transferred from the other array. However, adding an extra hard drive to one stripe requires you to add an additional hard drive to the other stripes to balance out storage among the arrays.
A disadvantage of this configuration is that it cannot recover from two simultaneous drive failures, unless the drives are from the same data stripe. In the diagram, if drives 1 and 5 failed the RAID could be rebuilt, but if 1 and 4 failed it would result in data loss.
RAID 10 is configured so that the RAID 0 is split across two RAID 1 arrays.
A big advantage of RAID 10 is that all but one drive from each RAID 1 array could fail without any data loss. However, if the failed drive is not replaced, the single working drive in that array becomes a single point of failure for the entire system. If that last drive goes, all data within the array is lost.
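The difference in fault tolerance between the two nestings can be expressed as two small checks. This is a simplified model under stated assumptions: the function names and drive labels are hypothetical and do not correspond to the drive numbers in the diagrams, and a stripe set with any failed member is treated as entirely offline.

```python
def raid10_survives(mirrors, failed):
    """RAID 10 (a stripe across mirrors).

    The array survives while every mirror still has at least one
    working drive.
    """
    failed = set(failed)
    return all(
        any(drive not in failed for drive in mirror)
        for mirror in mirrors
    )


def raid01_survives(stripe_sets, failed):
    """RAID 01 (a mirror of stripes).

    The array survives while at least one stripe set has no failed drive.
    """
    failed = set(failed)
    return any(
        all(drive not in failed for drive in stripe_set)
        for stripe_set in stripe_sets
    )
```

With mirrors `[("a", "b"), ("c", "d")]`, losing `a` and `c` is survivable for RAID 10, while losing `a` and `b` is not. With stripe sets `[("a", "b", "c"), ("d", "e", "f")]`, RAID 01 survives the loss of `a` and `b` but not the loss of `a` and `d`.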
The RAID nesting technique can be used for other RAID levels as well, most commonly on RAID 5 but it can also be applied to other levels like 3 and 6, producing levels such as 50, 51, 60, 61, 30 and 03.
|RAID 50||RAID 50, also called RAID 5+0, combines the straight block-level striping of RAID 0 with the distributed parity of RAID 5. As a RAID 0 array striped across RAID 5 elements, a minimal RAID 50 configuration requires six drives. On the right is an example where three collections of 120 GB RAID 5s are striped together to make 720 GB of total storage space. One drive from each of the RAID 5 sets could fail without loss of data; for example, a RAID 50 configuration including three RAID 5 sets can tolerate a maximum of three simultaneous drive failures (but only one per RAID 5 set). Because the reliability of the system depends on quick replacement of the bad drive so the array can rebuild, it is common to include hot spares that can immediately start rebuilding the array upon failure.|
|RAID 51||RAID 51 is implemented by mirroring or implementing RAID 1 on an entire RAID 5 array in addition to the parity information. It is generally created using software and hardware-based RAID techniques where RAID 1-based mirroring is implemented through an operating system on the hardware-based RAID 5 array. RAID 51 is specifically designed for enhanced backup availability and high fault tolerance capabilities. RAID 51 is considered a parity set of mirrored disks, hence RAID 5 is followed by RAID 1. It can remain operational or protect from data loss even after losing four of the six minimum configured disks.|
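The capacity figures quoted for these levels can be checked with a small calculation. This is a sketch under stated assumptions: the function name and parameters are hypothetical, all drives are taken to be the same size, and the RAID 50 example assumes that each of the three RAID 5 sets consists of three 120 GB drives, which is the reading that yields the 720 GB mentioned in the RAID 50 entry.

```python
def usable_capacity(level, drives, drive_size_gb, raid5_sets=1):
    """Usable capacity in GB for a few common RAID levels.

    Assumption: every drive has the same size. With mixed sizes, only
    the portion equal to the smallest member counts on each drive.
    """
    total = drives * drive_size_gb
    if level == "0":
        return total                           # no redundancy at all
    if level == "1":
        return drive_size_gb                   # every drive holds the same data
    if level in ("1E", "10"):
        return total / 2                       # every block is stored twice
    if level == "5":
        return (drives - 1) * drive_size_gb    # one drive's worth of parity
    if level == "6":
        return (drives - 2) * drive_size_gb    # two drives' worth of parity
    if level == "50":
        if raid5_sets < 1 or drives % raid5_sets:
            raise ValueError("drives must divide evenly into RAID 5 sets")
        per_set = drives // raid5_sets
        return raid5_sets * (per_set - 1) * drive_size_gb
    raise ValueError(f"level not covered by this sketch: {level}")
```

Under the assumption above, `usable_capacity("50", drives=9, drive_size_gb=120, raid5_sets=3)` returns `720`.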
Commonly Used RAID Vocabulary
|RAID: RAID is a technology that supports the use of 2 or more hard drives in various configurations for the purposes of achieving greater performance, reliability and larger volume sizes through the use of consolidating disk resources and parity calculations.|
|Parity: A mathematical calculation that allows drives within a RAID array to fail without the loss of data. The simplest way to show this is the equation A + B = C. You can remove any one of the letters and work out its value from the two remaining. For example, if B were removed so the equation looked like A + ? = C, then B's value can be worked out by moving the A, so B = C - A. This is a simplistic description; to fully understand parity in a RAID sense, knowledge of binary and the logical XOR operation is required.|
|Mirroring: The data from 1 or more hard drives is duplicated onto another physical disk(s).|
|Striping: The method by which data and parity are written across multiple disks. In the example below the data is written across the drives in sequential order until the last drive; it then jumps back to the first and starts a 2nd stripe.|
|Block: A block is the logical space on each disk where the data is written. The amount of space is set by the RAID controller and is most commonly 16 KB to 256 KB. The data fills the space until the limit is reached and then moves on to the next drive, until the last drive, after which it jumps to the start of the next stripe.|
|Left / Right Symmetry: The symmetry in a RAID controls how the data and parity are distributed across the drives. There are four main styles of symmetry; which one is used depends on the RAID vendor. Some companies also make proprietary styles depending on their business needs.|
|Hot Spare: There are a few different methods for dealing with drive failures within a RAID, one is the use of a Hot Spare. It is a spare disk which can be used in place of the failed one.|
|Degraded mode: This happens when a drive in the RAID becomes unreadable. The drive is then considered bad and is withdrawn from the RAID. New data and parity are then written to the remaining drives within the RAID, and if any data is requested from the failed drive, it is worked out from the parity on the others. This degrades the performance of the RAID, hence degraded mode.|
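The parity and degraded-mode entries can be illustrated with XOR, the operation that takes the place of the simplified A + B = C equation in single-parity RAID. The sketch below is a minimal illustration with hypothetical function names and made-up sample data, not a description of any particular controller.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together, byte by byte."""
    if len({len(block) for block in blocks}) != 1:
        raise ValueError("need at least one block, all of equal length")
    return bytes(
        reduce(lambda x, y: x ^ y, column)
        for column in zip(*blocks)
    )


def rebuild_missing(surviving_blocks, parity):
    """Recompute the one missing data block of a stripe.

    XORing the parity with every surviving data block cancels them out
    and leaves the missing block. This is what a degraded array does
    whenever data from the failed drive is requested.
    """
    return xor_blocks(list(surviving_blocks) + [parity])
```

For example, with data blocks `b"RAID"`, `b"5 is"` and `b"fun!"`, the parity is `xor_blocks` of all three, and `rebuild_missing([b"RAID", b"fun!"], parity)` returns `b"5 is"`.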