The I/O Blender Part 1: Ye Olde Storage I/O Path

May 23, 2012 By Stephen

Back in the day, when data was smaller and servers were machines, I/O followed a predictable pattern. Storage arrays could anticipate requests and RAID was beautiful. Then came virtualization, and with it an end to ye olde storage I/O path.

Ye Olde IO Path — In the good old days, I/O was predictable

Server = HBA = LUN

It was a simpler time back in the 1990s. Each server had a SCSI host bus adapter (HBA) of its own, or maybe two if failover was in order. This card transmitted block I/O requests from the operating system “over the wire” to a hard disk drive or storage array controller. And that wire was dedicated to this one purpose: parallel SCSI or point-to-point Fibre Channel.

The storage array controller had a number of SCSI ports of its own; each was cabled to one of those server HBAs. The storage array took requests from these “front-end” ports and translated them into internal requests. Usually this meant addressing a certain LUN carved from a single RAID set, though some smarter systems included a DRAM cache to accelerate performance.

The “back-end” of the storage array was a simple SCSI connection to a tray of hard disk drives. Most used parallel SCSI or copper FC, dual-ported and daisy-chained from shelf to shelf. The RAID sets were statically mapped to two, five, or perhaps a few more disk drives. And that was that.

Pre-Filling the Cache

The storage array “knew” that any I/O on the first port of controller A belonged to a unique server, and the same for every other port. This allowed the array controller to “learn” the I/O pattern of each port, and thus each server. Smart arrays would begin to predict the next read request to pre-fill the cache with likely data.

Even less-smart arrays got into the game. They could “read around” incoming I/O, fetching the blocks adjacent to each request, and this served fairly well as prefetch. It worked because the array also “knew” which data blocks belonged to a given host: A LUN was a complete and indivisible unit of storage and could be treated as such.
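To make the idea concrete, here is a minimal sketch of per-port read-ahead. It is a toy model rather than any vendor's algorithm: the `PortPrefetcher` class, its `threshold` and `read_ahead` parameters, and the port names are all assumptions made up for illustration. The point is only that when each port maps to one server, tracking state per port is enough to spot a sequential stream.

```python
from collections import defaultdict


class PortPrefetcher:
    """Toy per-port sequential read-ahead detector (hypothetical, illustrative only)."""

    def __init__(self, threshold=2, read_ahead=4):
        self.threshold = threshold          # sequential steps seen before prefetching starts
        self.read_ahead = read_ahead        # number of blocks to pull into cache
        self.last_block = {}                # port -> last block number read
        self.run_length = defaultdict(int)  # port -> length of current sequential run
        self.cache = set()                  # (port, block) pairs currently cached

    def read(self, port, block):
        """Record a read on a port; return True if it was already in cache."""
        hit = (port, block) in self.cache
        # Extend the run if this block directly follows the previous one on this port.
        if self.last_block.get(port) == block - 1:
            self.run_length[port] += 1
        else:
            self.run_length[port] = 0
        self.last_block[port] = block
        # Once the stream looks sequential, pre-fill the cache with the next blocks.
        if self.run_length[port] >= self.threshold:
            for nxt in range(block + 1, block + 1 + self.read_ahead):
                self.cache.add((port, nxt))
        return hit


# Two servers on two ports, interleaved in time but each sequential on its own port.
prefetcher = PortPrefetcher()
trace = [("A0", 10), ("B0", 500), ("A0", 11), ("B0", 501),
         ("A0", 12), ("B0", 502), ("A0", 13), ("B0", 503)]
hits = [prefetcher.read(port, block) for port, block in trace]
```

Because the state is keyed by port, the interleaving of the two servers does not disturb either stream: the fourth read on each port is served from cache.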

Copying and Moving Data

Since each LUN was a logical data set, arrays could copy and move data in a consistent manner. If the array copied an entire LUN as a single atomic operation, the data it contained would be consistent. This was the fundamental concept behind EMC TimeFinder and many other “business continuance volume” (BCV) products.

In fact, in the 1990s and early 2000s, the main challenge in implementing BCVs was creating “consistency groups” of multiple LUNs belonging to the same server or application. Once these groups were established, scripts could be used to pause an application while the storage array initiated data copies or replication.
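The pause-copy-resume pattern those scripts followed can be sketched roughly as below. Everything here is hypothetical: the `App` class, the `quiesced` helper, and the dictionary-backed “LUNs” stand in for a real application and a real array, and no actual BCV product's interface is being shown.

```python
import copy
from contextlib import contextmanager


class App:
    """Hypothetical application that can be told to stop writing."""

    def __init__(self):
        self.paused = False

    def write(self, luns, lun_name, key, value):
        if self.paused:
            raise RuntimeError("application is quiesced")
        luns[lun_name][key] = value


@contextmanager
def quiesced(app):
    """Pause the application for the duration of the block, then resume it."""
    app.paused = True
    try:
        yield
    finally:
        app.paused = False


def snapshot_group(app, luns, group):
    """Copy every LUN in the consistency group while the application is paused."""
    with quiesced(app):
        return {name: copy.deepcopy(luns[name]) for name in group}


# A database spread across a data LUN and a log LUN (made-up contents).
luns = {"data": {}, "log": {}}
app = App()
app.write(luns, "data", "row1", "v1")
app.write(luns, "log", "txn1", "commit row1")
snap = snapshot_group(app, luns, ["data", "log"])
app.write(luns, "data", "row2", "v2")  # lands after the copy, so it is not in snap
```

Copying both LUNs inside a single pause is what keeps the log and the data in step with each other; copying them at different moments would risk a log entry with no matching data, or the reverse.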

Sharing and Not Sharing

Here, for no reason but nostalgia, I present a classic Gadzoox FCL1063TW FC-AL hub!

The advent of Fibre Channel meant that shared access to storage was finally possible. A Fibre Channel SAN allowed multiple servers to access the same front-end ports and even the same LUN. But Fibre Channel’s use of World Wide Names meant that the storage array could still uniquely identify I/O and map it to a single server. Everything still worked in a SAN just as it had in a direct-attached environment.

If a LUN was to be shared, the servers would use SCSI reservations to avoid conflicting writes and stale buffers. A golden age of SAN filesystems dawned around the year 2000, with Fibre Channel poised to be the high-end, high-performance storage interconnect of choice.
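As a rough illustration of how a reservation keeps two servers from stepping on each other, consider the toy model below. It is only loosely patterned on SCSI reserve and release semantics: the `SharedLun` class, the `ReservationConflict` exception, and the server names are assumptions for this sketch, not part of any real SCSI stack.

```python
class ReservationConflict(Exception):
    """Raised when an initiator touches a LUN reserved by someone else."""


class SharedLun:
    """Toy shared LUN with a single exclusive reservation (illustrative only)."""

    def __init__(self):
        self.holder = None  # initiator currently holding the reservation, if any
        self.blocks = {}    # block number -> data

    def reserve(self, initiator):
        if self.holder not in (None, initiator):
            raise ReservationConflict(f"LUN is reserved by {self.holder}")
        self.holder = initiator

    def release(self, initiator):
        # A release from anyone other than the holder is simply ignored.
        if self.holder == initiator:
            self.holder = None

    def write(self, initiator, block, data):
        if self.holder not in (None, initiator):
            raise ReservationConflict(f"LUN is reserved by {self.holder}")
        self.blocks[block] = data


lun = SharedLun()
lun.reserve("server-a")
lun.write("server-a", 0, "written by A")
try:
    lun.write("server-b", 0, "written by B")  # rejected while A holds the LUN
except ReservationConflict:
    pass
lun.release("server-a")
```

While `server-a` holds the reservation, the write from `server-b` is refused and block 0 keeps the data from `server-a`. Once the reservation is released, either server can write again.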

Not all operating systems played nicely in this environment, however. Microsoft Windows was notorious for “assuming” ownership of every LUN it could see. Even worse, Windows would write a disk signature on each, potentially corrupting data belonging to other operating systems. But even this was simple to address in classical Fibre Channel SANs, using zoning in the fabric or “LUN masking” on the array.
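LUN masking boils down to a lookup table keyed by the initiator's World Wide Name. The sketch below assumes a hypothetical `MaskingTable` class and made-up WWNs and LUN numbers; real arrays expose this through their own management tools, which are not modeled here.

```python
class MaskingTable:
    """Toy LUN masking table: which LUNs each initiator WWN may see (illustrative only)."""

    def __init__(self):
        self.allowed = {}  # WWN string -> set of LUN numbers

    def allow(self, wwn, lun):
        self.allowed.setdefault(wwn, set()).add(lun)

    def visible_luns(self, wwn):
        """LUNs presented to this initiator; unknown WWNs see nothing."""
        return sorted(self.allowed.get(wwn, set()))

    def can_access(self, wwn, lun):
        return lun in self.allowed.get(wwn, set())


# Made-up WWNs for a Unix host and a Windows host sharing one array.
UNIX_HOST = "10:00:00:00:c9:00:00:01"
WINDOWS_HOST = "10:00:00:00:c9:00:00:02"

masking = MaskingTable()
masking.allow(UNIX_HOST, 0)
masking.allow(UNIX_HOST, 1)
masking.allow(WINDOWS_HOST, 2)
```

With this table, the Windows host is only ever presented with LUN 2, so it never gets the chance to write a signature onto the Unix host's LUNs 0 and 1.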

Stephen’s Stance

This old-fashioned, predictable storage I/O path was deterministic and decipherable: The server, the switch, and the array all had enough information to do their jobs effectively and efficiently. But server virtualization changes everything, as we will see in the next entry in this series.
