Scaling Storage In Conventional Arrays

November 19, 2013 By Stephen

Clustering sounds great, but it’s awfully taxing to keep all the nodes consistent!

It is amazing that something as simple-sounding as making an array bigger can be so complex, but scaling storage is notoriously difficult. Our storage protocols just weren’t designed with scaling in mind, and they lack the flexibility needed to dynamically address multiple nodes. Data protection is extremely difficult and data movement is always time-consuming.

This is part of a series on “Scale-Out” storage from Storage Field Day 4.

Three Ways To Scale Storage

Traditionally, there have been three ways to scale storage, each with its own pros and cons:

Scale-up array: Add more storage behind a single controller “head”, which acts as the termination point for client I/O. Since anything behind the head is invisible to the client anyway, scaling up can be non-disruptive. But there are limits to the amount of data a single head can control, and performance can suffer as the front- and back-end controllers, CPU, and memory become saturated.
Scale-out cluster: As the array scales, the controller “head” becomes increasingly critical, leading most vendors to cluster arrays for high availability. Most use a “shared-everything” cluster design, with each controller able to “peek into” every other controller’s RAM and caches, ensuring data consistency. But it’s difficult to maintain this “mind meld” between clustered heads beyond a handful of members.
Scale-out gateway: Recently, many storage vendors have adopted a two-tier architecture with a true scale-out object store on the back end and one or more protocol gateways in the middle. The client talks to the gateway using a conventional protocol like iSCSI or NFS; the gateway handles data distribution; and the back end provides scale, data protection, and consistency.
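To make the gateway idea concrete, here is a minimal Python sketch of the two-tier pattern: a gateway presents one namespace to clients and decides which back-end nodes hold each object. Everything in it is an assumption made for illustration. The `ObjectNode` and `Gateway` classes, the hash-based placement, and the two-copy replication are hypothetical and do not describe any vendor's actual implementation.

```python
import hashlib


class ObjectNode:
    """Hypothetical back-end node; a dict stands in for its object store."""

    def __init__(self, name):
        self.name = name
        self.objects = {}


class Gateway:
    """Hypothetical protocol gateway: clients see one namespace,
    while the gateway decides which back-end nodes hold each object."""

    def __init__(self, nodes, replicas=2):
        self.nodes = nodes
        # Never ask for more copies than there are nodes.
        self.replicas = min(replicas, len(nodes))

    def _placement(self, key):
        # Assumed placement rule: hash the key to pick a starting node,
        # then use the next nodes in order for the extra copies.
        digest = hashlib.sha256(key.encode()).digest()
        start = int.from_bytes(digest[:8], "big") % len(self.nodes)
        return [self.nodes[(start + i) % len(self.nodes)]
                for i in range(self.replicas)]

    def write(self, key, data):
        for node in self._placement(key):
            node.objects[key] = data

    def read(self, key):
        # Any surviving copy will do.
        for node in self._placement(key):
            if key in node.objects:
                return node.objects[key]
        raise KeyError(key)
```

The client only ever calls `write` and `read` on the gateway, which is the point of the design: the back end can grow or change its protection scheme without the client protocol changing. Note that this simple modulo placement would move most objects whenever a node is added, which is one reason real systems use more elaborate placement schemes.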

All three of these traditional scaling architectures have proven their worth in production over the past decade, and all are actively in development by “next-generation” storage vendors.

Four Examples From Storage Field Day 4

At Storage Field Day 4, the delegates heard about scale-out clusters from CloudByte, Overland Storage, and Nimble Storage, all of which can scale up as well as out.

There are many other scale-out cluster solutions, with market leaders like NetApp, EMC Isilon, and Dell’s Compellent and EqualLogic serving as familiar examples.

CloudByte

CloudByte’s scale-out architecture joins multiple nodes into a cluster, sharing storage in the cluster using ZFS. But CloudByte adds a layer of abstraction via what they call a tenant storage machine (TSM), which can be moved from cluster to cluster on demand. In this way, they transcend traditional clustering limitations, but data locality and client access are limited to a single cluster within the larger pool.

Overland Storage

Overland Storage scales out using a gateway driver running on their scale-up SnapScale storage nodes. Client I/O is wide striped across “peer sets” of disks located on nodes throughout the cluster. This novel approach allows the SnapScale cluster to grow while balancing storage across every node in the cluster, though it is not clear to me how they rebalance data as the cluster grows. Client I/O is balanced using round-robin DNS to distribute client connections, a simple but inflexible approach.
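The “simple but inflexible” nature of round-robin DNS is easy to see in a small Python sketch. This is a toy model and not Overland's code: the `round_robin_resolver` helper, the node addresses, and the per-client load figures are all made up for illustration.

```python
from collections import Counter
from itertools import cycle


def round_robin_resolver(addresses):
    """Toy stand-in for round-robin DNS: each lookup returns the next address."""
    rotation = cycle(addresses)

    def resolve():
        return next(rotation)

    return resolve


# Hypothetical node addresses and per-client I/O demand (arbitrary units).
node_addresses = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]
client_loads = [100, 1, 1, 100, 1, 1, 100, 1, 1]

resolve = round_robin_resolver(node_addresses)
connections = Counter()
load = Counter()
for demand in client_loads:
    node = resolve()          # the resolver never looks at the demand
    connections[node] += 1
    load[node] += demand
```

In this contrived case every node ends up with three connections, yet all three heavy clients land on the first node. The resolver spreads connection counts evenly but knows nothing about how busy each client or node is, and it cannot move a client once it has connected.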

Nimble Storage

Nimble Storage also scales up and out by slicing all data and distributing it equally across the storage in a pool of nodes. Data is rebalanced in the background as the cluster grows, using what I would call a “lazy” algorithm to avoid performance impact. Currently, Nimble only supports four arrays in a single pool, but they promise that this number will grow over time. Since Nimble uses the iSCSI protocol exclusively, they rely on a host-side MPIO driver to allow parallel and highly-available client access across nodes.
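A rough Python sketch of what “lazy” background rebalancing could look like follows. It is a guess at the general idea and not Nimble's algorithm: the `place_slices` and `lazy_rebalance_step` functions, the per-step move limit, and the array names are all hypothetical.

```python
def place_slices(num_slices, arrays):
    """Spread slices evenly across the arrays currently in the pool."""
    return {s: arrays[s % len(arrays)] for s in range(num_slices)}


def lazy_rebalance_step(placement, arrays, max_moves=2):
    """Move at most max_moves slices from the fullest array to the emptiest.

    Capping the moves per step is the assumed 'lazy' part: the pool drifts
    toward balance over many small steps instead of one large migration.
    Returns the number of slices moved in this step.
    """
    moves = 0
    while moves < max_moves:
        counts = {a: 0 for a in arrays}
        for owner in placement.values():
            counts[owner] += 1
        fullest = max(arrays, key=lambda a: counts[a])
        emptiest = min(arrays, key=lambda a: counts[a])
        if counts[fullest] - counts[emptiest] <= 1:
            break  # already as balanced as whole slices allow
        victim = next(s for s, a in placement.items() if a == fullest)
        placement[victim] = emptiest
        moves += 1
    return moves


# Start with three arrays, then grow the pool to four.
arrays = ["A", "B", "C"]
placement = place_slices(12, arrays)
arrays.append("D")

# Each call represents one background pass; stop when nothing moves.
while lazy_rebalance_step(placement, arrays) > 0:
    pass
```

Client I/O can continue between passes because only a couple of slices are in flight at any moment. The trade-off is that a newly added array takes a while to carry its full share of the data.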

Avere Systems and Cleversafe

Storage Field Day 4 delegates also learned about the scale-out gateway offering from Avere Systems and Cleversafe, who are working together to deliver such a solution. In the past, Storage Field Day 3 saw the launch of Exablox, which sells an integrated solution that includes both a scale-out object store and a NAS gateway, but Avere and Cleversafe are focused solely on the gateway and object store, respectively. The Cleversafe distributed storage net (dsNet) platform is a massively-scalable object store quite unlike any conventional scale-out storage array. Placing an Avere NAS gateway in front of the dsNet allows high-performance NFS and SMB access to the scalable, distributed object store.

Note that Avere’s gateways can be used in front of any NFS or SMB storage system, so this concept isn’t limited to Cleversafe.

Coho Data

There was one more conventional-protocol scalable storage system at Storage Field Day 4: Coho Data. Coho pushes the scaling work into the network in a rather clever way. I’ll cover that in more detail in the future, but for now here’s their tech presentation so you can see how it works!

Stephen’s Stance

Scaling storage is hard. Really hard. Especially when you’re not able to change the client driver or protocol. So my hat is off to these companies and others who have come up with clever ways to maintain compatibility while scaling out beyond the bounds of a single storage array. Next time I’ll write about another approach: Scaling storage by changing the client or protocol!
