• Skip to main content
  • Skip to primary sidebar
  • Home
  • About
    • Stephen Foskett
      • My Publications
        • Urban Forms in Suburbia: The Rise of the Edge City
      • Storage Magazine Columns
      • Whitepapers
      • Multimedia
      • Speaking Engagements
    • Services
    • Disclosures
  • Categories
    • Apple
    • Ask a Pack Rat
    • Computer History
    • Deals
    • Enterprise storage
    • Events
    • Personal
    • Photography
    • Terabyte home
    • Virtual Storage
  • Guides
    • The iPhone Exchange ActiveSync Guide
      • The iPhone Exchange ActiveSync Troubleshooting Guide
    • The iPad Exchange ActiveSync Guide
      • iPad Exchange ActiveSync Troubleshooting Guide
    • Toolbox
      • Power Over Ethernet Calculator
      • EMC Symmetrix WWN Calculator
      • EMC Symmetrix TimeFinder DOS Batch File
    • Linux Logical Volume Manager Walkthrough
  • Calendar

Stephen Foskett, Pack Rat

Understanding the accumulation of data

You are here: Home / Everything / Enterprise storage / Qumulo Launches To Manage Data At Scale

Qumulo Launches To Manage Data At Scale

March 16, 2015 By Stephen 1 Comment

“We’re getting the band back together!” That’s how the pitch must have gone for the founding team at Qumulo, who also happened to create Isilon, now a major product at EMC. But what do you do after creating the most successful scale-out storage platform in history? Add in some advanced data management services and make it a software-only platform!

What’s Qumulo?

CEO Peter Godman is joined by fellow Isilon alums, Aaron Passey (Chief Architect at Isilon and now Qumulo CTO), Neal Fachan (Distinguished Engineer at Isilon and Amazon AWS veteran, now VP of Engineering at Qumulo), and lots of others besides. There are probably some non-Isilon folks in the company, too, but I think it’s safe to say they have lots of talent. I spy Brett Goodwin, Chris Hoffman, and my old boss Peter Bell over there, too.

Qumulo is building a data-aware file system as well as a distributed storage layer
Qumulo is building a data-aware file system as well as a distributed storage layer

Like many of today’s storage startups, Qumulo is focusing on software. Although they are shipping branded appliances, it is pretty clear that the entire product focus is on the software layer. Well, layers: Qumulo de-couples their scale-out data layer from their filesystem, QSFS. We’ll get back to that later, but understand that, for now, this appliance pitch consists of these three components sold together.

Integrated Data Analytics

Let’s start with the most interesting part. Qumulo is delivering a scale-out storage solution with an integrated metadata engine. QSFS provides real-time analytics drawing on a database of file metadata that goes much further than traditional filesystems. It manages capacity and performance data by path, file, and client. And all of this is inside the filesystem itself, not an overlay or secondary application.

Qumulo allows much deeper understanding of performance and capacity. Check out this hotspot tree graph!
Qumulo allows much deeper understanding of performance and capacity. Check out this hotspot tree graph!

For example, QSFS has a fantastic hot spot graph allowing a systems administrator to instantly see which files are really getting pounded and which aren’t in use. It’s like storage resource management for performance, and I love it. It reminds me of my favorite capacity management tools (hi DaisyDisk!) but on a whole other dimension.

IntegratedAnalyticsView
Qumulo’s Integrated Analytics view lets you drill down into capacity or performance by client, directory tree, or file

 

The Qumulo differentiator is that these lovely graphical representations are not scan-based. Rather than just maintaining the directory tree, the Qumulo filesystem maintains all this metadata in near-real time. They wisely decided to allow the metadata database to lag behind real reads and writes rather than having it in the I/O path. It’s like the difference between synchronous and asynchronous replication and is a good trade off – filesystem performance beats instantaneous graphs any day.

QSFS sits on top of a distributed data store, but it could sit on just about anything
QSFS sits on top of a distributed data store, but it could sit on just about anything

The Qumulo Core Software Sandwich

Qumulo does not combine data protection with the filesystem. Rather, they have a “protected store” layer (pstore) that handles scale-out, data placement, and redundancy that sits completely separate from their filesystem. The filesystem itself (QSFS) treats this scale-out block storage layer as a single contiguous address space rather than integrating filesystem semantics with protection. This is totally different from most other scale-out storage systems (including Isilon) which integrate a filesystem with a scale-out object store.

The block store was built from the ground up to reflect the unique aspects of today’s hybrid storage hardware. Qumulo’s solution is “flash-first”, meaning that it leverages the plentiful random I/O capabilities of flash memory to deliver high performance and advanced metadata capabilities. This means that it requires flash, of course, but it still runs on just about any old hardware.

Typical of today’s scale-out systems, Qumulo requires 4 nodes at a minimum to guarantee availability. This might seem like a surprise to traditional storage people, but it’s really a best practice. Unlike Isilon, which relied on InfiniBand, Qumulo is all commodity hardware, using a 10 Gb Ethernet interconnect. I wouldn’t be surprised if RoCE was in there somewhere, though.

The Qumulo appliance is built of industry-standard hardware – 1U Intel servers filled with SSD and HDD. Each node runs Linux with the Qumulo block store and protected store layers. The UI runs on every appliance, so an administrator can connect to any of them and see the entire system.

The initial appliance, called Q0626, contains two 800 GB SSDs and four 6 TB helium HDDs. Each has two 10 GbE ports and I spotted an Arista 10 GbE switch in there, too.

The Future of Qumulo

Theoretically, the QSFS layer could run on top of other block data stores, from a single virtual machine (this is how Qumulo does evaluations and demos) to a real scale-out storage layer. In theory, Qumulo could layer QSFS on just about any storage platform, even if it’s not a scale-out storage system. This presents some really interesting ideas for future branching out.

Let’s imagine Qumulo Core running in a VM or public cloud. It could run in an AWS SSD instance, for example, or on VMware vSphere. I can imagine a hybrid cloud scenario like SimpliVity which leverages both on-premesis physical appliances, virtual machines, and cloud instances.

The Qumulo dashboard puts lots of data right in front of you
The Qumulo dashboard puts lots of data right in front of you, including the REST calls that make everything work

Another nice touch is the UI. The entire web UI is implemented using REST methods that can be exploited by developers or applications. This is even exposed right there in the UI so you can learn what does what just by using the system.

Stephen’s Stance

Qumulo has more than a dozen paying customers already. This is a new trend I’m seeing with storage companies – getting customers and putting systems in production before coming out of stealth. Combine this with the excellent team and funding and we have ourselves a real contender here.

But will the data management tricks be enough? Scale-out storage is hard to do, sure, but it’s becoming much more common. And quite a few companies are rumbling about their data management features. Some are even stepping into information management, which might prove a bridge too far for storage nerds to grok.

I suspect Qumulo will head in a very different direction than many other storage companies. Rather than building a scale-out storage solution, Qumulo Core could become a competitor for the likes of Maxta, Springpath, and VMware VSAN. After all, who wants to shift atoms these days?

You should also read:

  • Dave Henry: Qumulo Comes Out of Stealth Offering Data-Aware Scale-Out NAS
  • Martin Glassborow: Scale-Out of Two?

You might also want to read these other posts...

  • Electric Car Over the Internet: My Experience Buying…
  • GPS Time Rollover Failures Keep Happening (But…
  • What You See and What You Get When You Follow Me
  • Liberate Wi-Fi Smart Bulbs and Switches with Tasmota!
  • Introducing Rabbit: I Bought a Cloud!

Filed Under: Enterprise storage, Everything Tagged With: Aaron Passey, Arista, Brett Goodwin, Chris Hoffman, data management, Dave Henry, Isilon, Martin Glassborow, Maxta, Metadata, Neal Fachan, Peter Bell, Peter Godman, QSFS, Qumulo, Qumulo Core, scale-out, Springpath, SRM, storage management, vSAN

Primary Sidebar

The same thing can be identified by many different terms, and the same term may mean many different things.

Douglas John Foskett

Subscribe via Email

Subscribe via email and you will receive my latest blog posts in your inbox. No ads or spam, just the same great content you find on my site!
 New posts (daily)
 Where's Stephen? (weekly)

Download My Book


Download my free e-book:
Essential Enterprise Storage Concepts!

Recent Posts

Electric Car Over the Internet: My Experience Buying From Vroom

November 28, 2020

Powering Rabbits: The Mean Well LRS-350-12 Power Supply

October 18, 2020

Tortoise or Hare? Nvidia Jetson TK1

September 22, 2020

Running Rabbits: More About My Cloud NUCs

September 21, 2020

Introducing Rabbit: I Bought a Cloud!

September 10, 2020

Remove ROM To Use LSI SAS Cards in HPE Servers

August 23, 2020

Test Your Wi-Fi with iPerf for iOS

July 9, 2020

Liberate Wi-Fi Smart Bulbs and Switches with Tasmota!

May 29, 2020

What You See and What You Get When You Follow Me

May 28, 2019

GPS Time Rollover Failures Keep Happening (But They’re Almost Done)

April 6, 2019

Symbolic Links

    Featured Posts

    10 Mysteries The Lost Finale Definitively Settled

    May 25, 2010

    Why Big Disk Drives Require Data Integrity Checking

    December 19, 2014

    How to Get Me to Write about Your Company or Product

    March 15, 2012

    How To Sync Your iPad With Your Exchange Server

    April 3, 2010

    How Will Cisco Recover From The Consumer Strategy Blunder?

    January 2, 2013

    Deduplication Coming to Primary Storage

    September 16, 2008

    Making a Case For (and Against) Software-Defined Storage

    January 9, 2014

    Virtual Machine Mobility: Of What, and to Where and in What State?

    January 16, 2012

    Not All 802.11n Networks Are Alike

    July 2, 2011

    Nimble Storage Rolls Out an All-Flash Array

    February 24, 2016

    Copyright © 2021 · Log in