The Storage Utilization Waterfall: Raw, Usable, and Used

October 1, 2008 By Stephen

Based on Floral Matryoshka by BrokenSphere/Wikimedia Commons

My February 2003 column for Storage magazine focused on the surprising difficulty of measuring storage utilization. I wrote:

“A true measurement of utilization would reflect every layer of usage metrics – from raw disk in a shared array to used storage within files. Raw storage for each new frame of reference is contained within the used storage measured above it, so low utilization is compounded as we move deeper into the stack.”

In that column, I suggested that utilization of any resource was based on just three metrics:

Raw
Usable
Used
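
For a single frame of reference, those three numbers reduce to a couple of simple ratios. Here is a minimal Python sketch of that idea; the `Capacity` class, its property names, and the sample figures are all hypothetical and chosen only for illustration, not taken from the column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Capacity:
    """Raw, usable, and used capacity for ONE frame of reference.

    Units are arbitrary (TB here); the class and field names are
    illustrative assumptions, not an established standard.
    """
    raw: float
    usable: float
    used: float

    def __post_init__(self) -> None:
        # Within one frame of reference: raw >= usable >= used >= 0.
        if not (self.raw >= self.usable >= self.used >= 0):
            raise ValueError("expected raw >= usable >= used >= 0")
        if self.usable == 0:
            raise ValueError("usable capacity must be greater than zero")

    @property
    def usable_of_raw(self) -> float:
        """Fraction of raw capacity that survives overhead (RAID, spares, ...)."""
        return self.usable / self.raw

    @property
    def used_of_usable(self) -> float:
        """Fraction of usable capacity that actually holds something."""
        return self.used / self.usable

    @property
    def used_of_raw(self) -> float:
        """End-to-end utilization for this one frame of reference."""
        return self.used / self.raw


# Hypothetical array: 10 TB raw, 8 TB usable after overhead, 4 TB allocated.
array = Capacity(raw=10.0, usable=8.0, used=4.0)
print(f"usable/raw:  {array.usable_of_raw:.0%}")
print(f"used/usable: {array.used_of_usable:.0%}")
print(f"used/raw:    {array.used_of_raw:.0%}")
```

Which of these ratios gets quoted as "utilization" already changes the answer, even before a second layer enters the picture.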

But this is confounded by the frame of reference being measured. It’s trivially simple to determine the raw, usable, and used capacity for a storage array, server, or database. But what happens when one tries to measure storage utilization all the way through the stack?

When vendors take up this challenge, the discussion tends to get diverted into a cul-de-sac that presents their products most favorably, as was the case with Chuck Hollis’ comparison of his EMC CLARiiON to HP’s and NetApp’s storage products. Was Chuck wrong? Was HP right? Or was it NetApp that had the best utilization? One thing is certain: we’re getting nowhere if we can’t agree on some basic terminology.

Credit Storage Architect Chris Evans with seeing the problem for what it was. He noticed the matryoshka effect and put together a “waterfall” diagram, showing how low utilization is compounded as we move down the stack. He also notes that complexity rises as we move to the right, something I never called out.

We were both onto the same thing, though, and my study of storage utilization (published in the April 2003 issue) supported his suggestion that the raw-to-used ratio might be as poor as 10:1 on average, meaning only about a tenth of the raw capacity actually holds data. At the time, I even put together a similar waterfall chart, but it was never published outside the company I worked for (that I know of).

So I fully and enthusiastically support Chris’ ideas on this topic! Let’s come up with some standard metrics for the various places that storage can be “raw, usable, and used”:

Disk drive units often have excess space (raw), and this is especially true of enterprise flash units
RAID sets definitely follow this pattern
Storage arrays themselves can have unused usable space (as noted by Marc Farley)
Storage virtualization can add another layer of utilization loss
On the host side, we must consider volume managers, which can perform all the functions of an array
Filesystems also have raw, usable, and used space
As do applications that manage storage like databases
Add in capacity management technologies like compression and deduplication to really mess things up
Finally, server virtualization can sit above or below these server variables, and virtual machines themselves often have unused space.

Simply put, there are a lot of places for a few unused bytes to hide. Anyone want to bet that 10:1 is optimistic? And we’re only talking about capacity utilization – there are whole other worlds of power efficiency and performance to consider as well…
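
To see how quickly the losses compound, here is a rough Python sketch of the waterfall. It assumes each layer's remaining capacity becomes the raw capacity of the layer below it, as described in the quote from my column above. The `waterfall` function, the layer list, and every fraction in `STACK` are hypothetical values picked for illustration; they are not measurements from my study or from Chris' diagram.

```python
from typing import List, Tuple


def waterfall(raw_tb: float,
              layers: List[Tuple[str, float]]) -> List[Tuple[str, float, float]]:
    """Return (layer name, capacity in, capacity out) for each layer.

    Each layer passes on only a fraction of the capacity it receives,
    and that remainder becomes the "raw" capacity of the next layer.
    A fraction above 1.0 is allowed so that compression or
    deduplication could be modeled as a layer that adds effective space.
    """
    rows = []
    capacity = raw_tb
    for name, fraction in layers:
        if fraction <= 0:
            raise ValueError(f"{name}: fraction must be positive")
        remaining = capacity * fraction
        rows.append((name, capacity, remaining))
        capacity = remaining
    return rows


# Hypothetical per-layer fractions (capacity out / capacity in).
STACK = [
    ("RAID set", 0.80),
    ("Storage array", 0.75),
    ("Volume manager", 0.80),
    ("Filesystem", 0.60),
    ("Database", 0.50),
]

RAW_TB = 100.0
rows = waterfall(RAW_TB, STACK)
for name, cap_in, cap_out in rows:
    print(f"{name:<15} {cap_in:6.1f} TB -> {cap_out:6.1f} TB")

final_used = rows[-1][2]
ratio = RAW_TB / final_used
print(f"Raw-to-used ratio: {ratio:.1f}:1")
```

With these made-up numbers, no single layer looks alarming on its own, yet only about 14 TB of the original 100 TB ends up holding data. Adding more of the layers listed above only pushes the ratio further toward (or past) 10:1.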
