Redundant Array of Independent Disks (RAID) is a term used to describe computer storage systems that spread or replicate data across multiple drives.
There are two main reasons for RAID storage to work in this way: it increases data reliability and improves I/O (input/output) performance. That said, RAID storage isn't a perfect technology. Data loss can still occur when using it.
In this article, we’ll dive into the workings of RAID and explore its benefits and pitfalls so you can make the right storage choice for your requirements.
A RAID combines physical disks into a single logical unit by using either special hardware or software. Hardware RAID solutions can come in a variety styles, from built onto the motherboard or add in cards, up to large enterprise NAS or SAN servers. With these setups the operating system (OS) is unaware of the technical workings or the RAID. Software solutions are typically implemented within the OS.
RAID is traditionally used on servers, but can be also used on workstations. The latter is especially true in storage-intensive computers such as those used for video and audio editing, where high storage capacities and data transfer speeds are required.
RAID storage is a complex area with plenty of technical terms to get your head around. We’ll explain the most common ones now as we’ll be using some of them later in this article.
Standard RAID configurations are known as levels. While there were originally five of these, there are now more as variations have evolved to include several nested and many non-standard (usually proprietary) levels.
As previously mentioned, mirroring involves the copying of data to more than one disk, striping occurs when data is split across more than one disk, and error correction occurs when redundant data is stored as a means of allowing problems to be detected and possibly fixed (known as fault tolerance). Depending on the system requirements, one or more of these techniques can be used within different RAID setups.
Different levels have their own types of redundancy, so a trade-off usually has to be made between fault tolerance and performance, depending on the application.
The basic RAID levels include:
There are many ways to get more out of your RAID system using non-standard configurations (up to RAID 51 and beyond), mainly as these can be used in tandem with other complex systems for significant efficiency and cost benefits.
As you may have gathered by now, RAID arrays are highly complex. This is often intensified within enterprise IT infrastructures, as RAID systems are used mostly for business critical applications; where availability and efficiency are crucial factors. What’s more, add-on technologies like virtualisation or database applications can spell disaster for a business if the system was to fail.
Modern RAID arrays can also use multiple file systems, like BTRFS or ZFS at the hardware level, with NTFS or HFS layered over the top for application support via virtualisation..
From a data recovery perspective, it would usually be necessary to not only reconstruct the RAID file system and bypass any physical failures, but to also assess any virtualised architecture that may exist. This can often make a recovery attempt extremely complex and time-consuming; however in many cases recoveries can be very successful.
RAID arrays are highly complex, and the challenges they present are intensified when they’re used for business-critical functions within enterprise IT infrastructures, as availability and efficiency are essential.
While useful in some scenarios, add-on technologies like virtualisation and database applications can cause further costly complications in a failing system.
From a data recovery perspective, it would usually be necessary to reconstruct the RAID file system, bypass any physical failures and assess any virtualised architecture. This can make for a complex and time-consuming process, but recovery is possible with the right expertise.
Sometimes a drive in the RAID can become unreadable; the drive is then considered bad and is withdrawn from the RAID. The new data and parity are then written to the remaining drives within the RAID, if any data is requested from the failed drive it is worked out with the parity on the others. This degrades the performance of the RAID.
Unfortunately, all drives can and will fail at some point.
If a failure occurs involving individual drives (assuming it is within a RAID 1 configuration or greater), the faulty drive can just be replaced with a new one and the data storage map can be rebuilt with zero data loss.
However, if a drive failure exceeds the redundancy capacity of the RAID, you should contact a professional RAID data recovery specialist to minimise the chances of complete data loss. It is imperative to ensure that your chosen provider has the tools and expertise to recover from any configuration or data loss situation. You should also assess whether they have direct partnerships with storage vendors and development capabilities for accommodating new or custom configurations.
We’re here to help with data recovery insight and expertise
Want to know more about how RAID works? Check out this Ontrack podcast from 2015 where we interviewed expert data recovery engineer Robin England on the dos and don’ts of using RAID systems. While the technology has evolved since then, there are still some key points worth considering in 2022.
If you’re already using RAID storage and you’ve recently experienced RAID data loss, we can help with that, too. Get in touch with our experts to start your recovery today