Data storage has always been one of the most conservative areas of enterprise IT. There is little tolerance for risk, and rightly so: Storage is persistent, long-lived, and must be absolutely reliable. Lose a server or network switch and there is the potential for service disruption or transient data corruption, but lose a storage array (and thus the data on it) and there can be serious business consequences.
Perhaps the only area more conservative than storage is data protection (backup and archiving), and for much the same reasons. Data backups are the lifeline of modern businesses. Truly, every company is a digital company today, and without the digits there is no company!
Data storage and data protection also illustrate what my friend Dave McCrory would call data gravity: It is much more difficult to move large volumes of data than it is to move compute and network resources, so compute and network tend to cluster physically wherever the storage is placed.
Today’s typical enterprise storage architecture looks remarkably like what I saw in the first years of my career: Specialized yet simplistic storage arrays at the center, connected with proprietary protocols and networks to a host of compute resources. And these storage arrays tend to rely on the same basic methods to drive reliability: Data is duplicated across multiple hard disk drives in a fixed configuration behind twin controllers, which determine where to place data and arbitrate access.
Storage arrays also offer data services of various sorts. The earliest of these features focused on shared access to data, including cloning and snapshots. Then we saw capacity optimization added, including deduplication, thin provisioning, and compression. With the advent of solid-state storage, much of the focus of mainstream storage development centered on performance through tiering and caching. Now, integration points like VAAI and VVOL have become increasingly important as servers are virtualized.
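To make one of those capacity-optimization services concrete, here is a minimal sketch of content-hash deduplication on fixed-size blocks. The class, block size, and method names are hypothetical for illustration, not any vendor's implementation, which would do this inline in firmware with far more care.

```python
import hashlib

# Toy block store: identical 4 KiB blocks are stored once and shared by hash.
class DedupStore:
    BLOCK_SIZE = 4096  # hypothetical fixed block size

    def __init__(self):
        self.blocks = {}   # SHA-256 digest -> block contents, stored once
        self.volume = []   # the logical volume: an ordered list of digests

    def write(self, data: bytes) -> None:
        # Chop incoming data into fixed-size blocks; keep only unique blocks.
        for off in range(0, len(data), self.BLOCK_SIZE):
            block = data[off:off + self.BLOCK_SIZE]
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)  # duplicate blocks share one copy
            self.volume.append(digest)

    def read(self) -> bytes:
        return b"".join(self.blocks[d] for d in self.volume)


store = DedupStore()
store.write(b"A" * 8192 + b"B" * 4096 + b"A" * 4096)  # four logical blocks, two unique
print(len(store.volume), "logical blocks,", len(store.blocks), "unique blocks stored")
```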
https://www.youtube.com/watch?v=I1wg1DNHbNU
But through it all, mainstream storage remains fairly rudimentary, with architectures and protocols that reflect the 1980s rather than this decade. Most data is still stored on simple RAID sets, and most access still uses the SCSI protocol, emulating a long-gone disk-centric system architecture. Even NAS, the other popular storage paradigm, is stuck with RAID and network protocols designed for Microsoft Windows and Sun UNIX networks in the 1990s.
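For readers who have not looked under the hood of those simple RAID sets, here is a minimal sketch of the XOR-parity idea behind a RAID-5 style layout; the strip sizes and layout are illustrative assumptions, not a real array's geometry.

```python
def xor_strips(strips):
    # Parity (or a rebuilt strip) is just the byte-wise XOR of the inputs.
    out = bytearray(len(strips[0]))
    for strip in strips:
        for i, b in enumerate(strip):
            out[i] ^= b
    return bytes(out)

data_strips = [b"AAAA", b"BBBB", b"CCCC"]  # data spread across three disks
parity = xor_strips(data_strips)           # parity strip written to a fourth disk

# The disk holding the second strip fails: rebuild it from the survivors plus parity.
rebuilt = xor_strips([data_strips[0], data_strips[2], parity])
assert rebuilt == b"BBBB"
```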
That’s pretty much where we are today, but things are finally changing. Over the next few weeks, I’ll be writing about the future of storage, transformed and re-thought. But I’ll also consider the bridge to this future, and ask if we’ll ever get there.
Steve C says
Well said. I think it is important to distinguish between implementations (which can come and go, although best practices emerge and can stabilize an implementation over a decade or more) and key architectural interfaces (over which layer upon layer of ecosystem is built, to the point where replacing the interface becomes hard if not economically infeasible).
SCSI as the interface between server (or disk array controller) and disk was actually a watershed change. Responsibility shifted from the host operating system identifying a physical location on a physical disk (which cylinder the heads seek to, which platter and head, and rotationally which sector) to the disk itself, which simply presented a single linear array of then-512-byte sectors. This block-storage abstraction has been used not only for physical disks for 30 years, but also for a wide range of logical disks.
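As a worked illustration of that shift, here is a minimal sketch using the classic geometry formula the host once had to compute itself; the geometry numbers are made up for the example, not any real drive's.

```python
SECTOR_SIZE = 512  # the "then-512-byte sectors" described above

def chs_to_lba(cylinder: int, head: int, sector: int,
               heads_per_cylinder: int, sectors_per_track: int) -> int:
    # The geometry arithmetic the host OS once performed itself; with SCSI,
    # the drive hides it and simply exposes logical block addresses (LBAs).
    return (cylinder * heads_per_cylinder + head) * sectors_per_track + (sector - 1)

# Made-up geometry for illustration: 16 heads, 63 sectors per track.
lba = chs_to_lba(cylinder=2, head=3, sector=4,
                 heads_per_cylinder=16, sectors_per_track=63)
byte_offset = lba * SECTOR_SIZE  # all later software needs: a flat offset into one linear array
print(lba, byte_offset)          # 2208 1130496
```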
More importantly, an enormous amount of software has been layered over this linear-array-of-sectors model over the decades. It is this layering which makes the SCSI abstraction as timeless as (say) TCP/IP in networking, or as its variable-length counterpart, the linear array of bytes we call the “file”.
As interesting as new ways to access linear arrays of sectors (NVMe over fabric, anyone?) and linear arrays of bytes (objects, anyone?) are, they are simply new access methods for the same age-old abstractions, and for the most part can be slipped in under the many layers of software written over the last 50 years to use those abstractions.
@FStevenChalmers (speaking for self, work at Hewlett Packard)