I’ve posted before about the spot where most successful enterprise storage arrays sit: The “fat middle” between performance and capacity. But software is driving a wedge into the SAN, and I see radical architectural change on the horizon. There are so many storage virtualization technologies sneaking into server architecture that it’s impossible to think one won’t take hold. And virtualized storage doesn’t want a “fat middle” array!
This post is part of my Rack Endgame series.
No Demand For Storage Virtualization
Storage virtualization has been around longer than I have been in this industry, but it has never taken hold. As far back as I can remember, companies have been developing and trying to sell a universal platform that would allow storage to be more dynamic, moving from place to place according to application needs. Companies have also been working on distributed storage platforms for decades, applying the principle of data gravity, which says that storage should be close to compute.
Yet, as I discussed in my post, “The Fat Middle: Today’s Enterprise Storage Array”, today’s storage architecture is remarkably simple: Centralized storage systems form the core of the data center, a “Jack of all trades” approach with no consistent virtualization outside the array. Why has storage virtualization or distribution never taken hold?
Perhaps the main reason for today’s lack of storage virtualization is the heterogeneity of data center operating systems and applications. Simply put, there is no single platform that can be targeted by a storage virtualization layer, and there has been no demand for an in-network virtualization platform either. The safest approach has been to let the storage array itself provide what little virtualization there is, along with high availability and data services.
VMware Provides an Opening
But VMware challenges this paradigm. Today’s most widely used “storage array” is actually not an array at all: It’s VMFS, the storage layer of VMware vSphere. It provides all of the functions of a traditional storage array within the hypervisor itself. Yet VMware is in the process of radically transforming this storage layer, and the implications are very exciting! VMware Virtual SAN (colloquially called “VSAN”) brings storage distribution and virtualization to mainstream data centers, reducing the pressure on centralized storage arrays to provide advanced data services.
With VSAN, a vSphere data center no longer needs centralized storage! VSAN supports all of the best features of vSphere (HA, DRS) without SAN or NAS and brings data distribution to the party as well, localizing data close to compute. It even supports automated tiering (caching really, but that’s a semantic battle for a different day). As demonstrated by EVO:RAIL, VSAN gives customers a mainstream SAN-less alternative.
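Since I just called that tiering “caching really”, here’s a minimal sketch of what I mean, in Python rather than anything vendor-specific. The names (FlashReadCache, read, capacity_tier) are purely illustrative and this is not VSAN’s actual implementation: hot blocks are copied into host-local flash on a read miss and evicted when they cool off, while the capacity tier keeps the authoritative copy.

```python
# Purely illustrative: a host-local flash layer that acts as a read cache in
# front of a slower capacity tier. Hypothetical names, not any vendor's API.
from collections import OrderedDict

class FlashReadCache:
    def __init__(self, capacity_tier, max_blocks=1024):
        self.capacity_tier = capacity_tier   # dict-like: block id -> data
        self.flash = OrderedDict()           # LRU set of "hot" blocks
        self.max_blocks = max_blocks

    def read(self, block_id):
        if block_id in self.flash:           # hit: serve from local flash
            self.flash.move_to_end(block_id)
            return self.flash[block_id]
        data = self.capacity_tier[block_id]  # miss: go to the capacity tier
        self.flash[block_id] = data          # "tiering" is really just promotion
        if len(self.flash) > self.max_blocks:
            self.flash.popitem(last=False)   # evict the least recently used block
        return data
```

The point is that nothing is permanently relocated: the capacity tier always holds the authoritative copy, and flash just holds whatever happens to be hot at the moment. That is why I call it caching rather than tiering.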
But it’s not just VMware that’s challenging the status quo. Microsoft is promoting an all-Windows Server data center. Nutanix has made waves with their converged data center solution, and it too boasts a “No SAN” approach. SimpliVity, Atlantis Computing, and Scale Computing are also charging in. Then there are the dark horses of storage virtualization: Avere, PernixData, and Infinio are virtualizing storage today, though they’ve been pretty quiet about the implications of this. It’s a short jump from a caching solution to a real storage hypervisor!
Endgame: Distributed Storage
The goal of all this is accommodation of data physics, specifically the “gravity” of data proposed by Dave McCrory. If data and the applications that use it naturally pull toward each other, then storage shouldn’t be locked into a central, shared architecture. Storage should move to the compute nodes to reduce latency and bring about real scalability of the entire system.
Architecturally, this is what Nutanix, VSAN, and the rest are all about. They move data closer to compute, eliminating the need for an artificial, external shared storage infrastructure yet preserving the enterprise-class data services we’ve all come to rely on. They identify and move data by placing storage intelligence closer to applications (on the same physical hardware if not in the same operating system).
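As a rough illustration of what “storage intelligence closer to applications” means, here’s a simplified placement policy in Python. The names (Host, place_replicas, failures_to_tolerate) are hypothetical, and this is not any vendor’s actual algorithm: it just keeps one copy of a VM’s data on the host where the VM runs and spreads the remaining copies across other hosts for availability.

```python
# Purely illustrative data-locality placement, not any product's algorithm.
from dataclasses import dataclass

@dataclass
class Host:
    name: str
    free_gb: int

def place_replicas(vm_host, hosts, failures_to_tolerate=1):
    """Pick the hosts that will hold copies of a VM's data objects."""
    others = sorted((h for h in hosts if h is not vm_host),
                    key=lambda h: h.free_gb, reverse=True)
    if len(others) < failures_to_tolerate:
        raise RuntimeError("not enough hosts to satisfy the availability policy")
    # Data gravity in action: one replica stays local to the VM for fast reads,
    # while the remaining replicas land on other hosts for fault tolerance.
    return [vm_host] + others[:failures_to_tolerate]
```

Whether a product actually enforces that local copy (and moves it when the VM moves) or simply relies on caching over the interconnect is exactly the distinction raised in the comments below.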
The benefits are too compelling to ignore and have profound implications for the shape of enterprise storage in the future. If we live in a “no SAN” world, we no longer need to build “Jack of all trades” storage systems or rely on “do it all” interconnects like Fibre Channel. The distributed storage world wants specialist storage devices: Ultra-performance flash on PCIe or memory channel and ultra-scale storage on a slower and cheaper bus.
Stephen’s Stance
We were never able to achieve storage virtualization in mainstream enterprise IT because we lacked the ability to identify and move data non-disruptively. This has been solved by caching and distributed storage solutions, and it’s only a matter of time before the legacy need for centralized storage falls away. As the wise man said, maybe not today and maybe not tomorrow, but soon we’ll live in a different world.
Note: These are topics I discuss in my public speaking engagements, including my Truth in Storage seminar with Truth in IT. Check out the schedule for Truth in Storage and come listen to the whole story!
Disclaimer: I work with VMware, PernixData, Infinio, Avere, Nutanix, Atlantis Computing, Scale Computing, SimpliVity, and most other companies in enterprise storage with Foskett Services and Tech Field Day. I don’t think I’m biased, but you can draw your own conclusions.
Mark William Kulacz says
“It provides all of the functions of a traditional storage array within the hypervisor itself.” Well, if you don’t include data integrity protection, hardware/firmware management, background scans, …
That aside, consider that most of the time in hyperconverged solutions, the data being accessed by a VM isn’t on the same physical server as the VM. Most I/O will ultimately go over the interconnect. There is no relationship between the physical server a VM runs on and the server that holds all (or even some) of the data within the VSAN datastore. The SSD read cache is an L2 cache and sits in the disk group “under” the interconnect layer. Not that this is a bad thing – but technically speaking, once the cluster grows large enough, the compute is still not on the same physical node as the storage. The only way to resolve that is to fully mirror data objects (and not allow a mirrored copy to span more than one node), and then limit the compute threads that access that data to running on those nodes.
I have nothing against hyperconvergence. Massive scale-out needs highly scalable ways to disperse computation across many nodes, even allowing for elasticity that stretches into the cloud. But anyone driving toward hyperconvergence should be realistic about what it is and is not – and adopt the architecture when it makes sense.
Yes, I’m with NetApp, and my views and comments are my own and do not represent those of my employer.
joshgant says
I thought that what you are describing here is exactly how Nutanix works. It uses metadata around the I/O to determine if the read/write copy of the vmdk should reside on the server where that VM is running, intelligently reducing the interconnect traffic.
joshgant says
I just confirmed this. VSAN does read/write caching, but Nutanix will move the writable copy to the local server, automatically localizing all I/O for each VM.