Zero Page Reclaim: Savior of Thin Provisioning?

January 4, 2011 By Stephen 4 Comments

In the previous post, I talked about how the Drobo uses metadata monitoring to solve the telephone game and make de-allocation possible. But that approach is challenging in complex enterprise environments. Instead, most enterprise arrays use a complex chain of semaphores to interpret signals from the connected hosts about the capacity that can be un-provisioned.

On the storage side, arrays can only use the information they have to de-allocate: The data that’s stored on them. They don’t know what application is using it, what file system it is. They don’t know anything at all.

But, somewhere along the line, someone had a big idea and said, “wait a second, what if we look for pages that are all zeros?” We’ll talk about pages a bit later, but for now, let’s talk about zeros. A zero is kind of a smoke signal coming up from over the hills that says, “there’s nothing valuable here.”

So the storage array watches for pages that are all zero and reclaims them. As protection against making a stupid mistake (what if you actually wanted to write all zeros?), anybody who asks for a page that has been reclaimed just gets all zeros back.

Most of the major vendors support this kind of zero page reclaim. This is good stuff. I don’t want to sound too critical of them because I appreciate them implementing at least this.

The problem is that there’s not a lot of ability to actually have those zeros be written. Almost no operating system writes zeros to deleted space. If they actually wrote pages of zeros, thin provisioning would work great.

So what do the storage vendors do? They come up with utilities that write zeros!

NetApp has SnapDrive, which zeros out empty space so that the Filer can go and recover that space. You run it whenever you want to run it. Eventually the storage array notices that you’ve zeroed out that space and it recovers it. Compellent and Symantec’s Veritas Storage Foundation have something like that, too. You can also force it using the SDelete command, and you can configure it using VMware ESX.

Zero page reclaim is pretty straightforward. It doesn’t take a lot of computing power – It’s not like you’re watching the file system for changes or anything. All you’re doing is occasionally going through and deleting pages full of zeros. So, you can post-process it, kind of like de-duplication.

There are quite a few issues with zero page reclaim, though:

Things aren’t writing zeros
Most of these implementations are page-based, which looks like a problem
Theoretically, this drives more IO through the system, not less

This last is the biggest problem, really. In most cases IO performance is a bigger issue than capacity in enterprise storage. If I could give you all the capacity you could possibly want or all the performance you could possibly want, most people would pick performance. It used to be capacity, but now it’s all about performance. If infrastructure folks could get one for free and had to pay for the other, they would definitely pay for performance.

And zero page reclaim, the way that it’s implemented with SDelete or with eagerzeroedthick, is driving tons of IO. Basically, a delete is the same as a write because you have to write all these zeros over the bus. But there’s a way around that, too. And that’s the topic for the next piece in this series.

You might also want to read these other posts...

Comments

Tom says

January 4, 2011 at 6:45 pm

Please comment where/how one can find how to use sdelete with ESX…and where/how/if one can do this de-allocation with an MSA 2012i G1?? Thank you, Tom
the storage anarchist says

January 5, 2011 at 12:50 pm

Interesting side note: 3PAR made a big deal about their custom ASIC that scans for zeros. On VMAX, we use the Tachyon chip instead of custom hardware to do line-speed zero detect.

How, you ask?

Well, we have the Tachyon create a T10-DIF for every received block, which is used to protect he integrity of the data all the way to the physical drive writes (and back). Checking for zeros thus requires only checking the DIF to see if it matches the known DIF af an all-zero block!

So, when it comes to zero page detect, VMAX don’t NEED no stinkin’ ASICs 🙂
Bill Plein says

January 5, 2011 at 8:57 pm

3PAR’s ASIC does much more than zero-detect, which is a feature that was un-used in the ASIC for years. By the way, isn’t the Tachyon FC chip considered an ASIC? So, you do need an ASIC.
sfoskett says

January 5, 2011 at 9:49 pm

That’s a really clever way to do it. I salute whoever had that bright idea!

GPS Time Rollover Failures Keep Happening (But They’re Almost Done)

This is week “1111111111” in the GPS system. Tomorrow morning it will roll over to week “0000000000”. How well will various systems handle this change? Not well, judging by what we’ve seen so far!

Ranting and Raving About the 2018 iPad Pro

I remain enthusiastic about the iPad Pro, despite getting a scratched screen and my concerns about durability. It’s a worthy successor to the original and offers enough improvements that I’d recommend the upgrade for just about anyone who uses their iPad for serious work. It’s still not yet a laptop replacement, but this is due more to a lack of desktop-class software for iOS than anything in Apple’s control.

From Kipling’s Dirigibles to the Jet Age

May 13, 2012

I don’t get much chance to read for pleasure, but two things I’ve been reading recently spurred my imagination. After reliving the advent of modern transportation in the solid non-fiction Jet Age by Sam Howe Verhovek, I stumbled upon two pieces of speculative fiction from an unlikely source that predated everything presented there.

The Rack Endgame: A New Storage Architecture For the Data Center

September 3, 2014

Top-of-rack flash and bottom-of-rack disk makes a ton of sense in a world of virtualized, distributed storage. It fits with enterprise paradigms yet delivers real architectural change that could “move the needle” in a way that no centralized shared storage system ever will. SAN and NAS aren’t going away immediately, but this new storage architecture will be an attractive next-generation direction!

Replacing Google Reader With Feedbin and Reeder

May 5, 2013

I am an avid Google Reader user, so I’m thoroughly annoyed by Google’s decision to kill it as of July 1. But there’s no stopping the tide, so I’ve made the move to Feedbin as a Reader replacement as of today. It’s a slick, snappy web application with a committed developer and, critically, support for Reeder, my favorite offline RSS reading application. Let’s hope this works!

Ten Terrible Apple Products

June 14, 2012

I’m often accused of being an Apple fanboy. While it’s true that I love my vast selection of fruity products from Cupertino, I’m not blind when the company makes mistakes. In fact, I think Apple’s mistakes are as enlightening as their successes: They reveal a company that is fallible, sometimes learning but often allowing the junk to rot far longer than other companies would.

We Live in the Future: Robotic Cat Litter Boxes!

May 8, 2010

This post is a bit of a break from my usual gadget-fest, but the object in question isn’t that far off: It requires electricity, costs more than average humans can justify, and simplifies a task we’ve all been doing fine up until now. That’s right: An overly-expensive electric cat litter box. Predictably, I love it.

Here’s Something Your Raspberry Pi Can’t Do: Gigabit Ethernet and SATA in the Olimex A20-OLinuXIno-LIME2

May 25, 2016

I’ve really enjoyed experimenting with the Raspberry Pi, and have even deployed a few as UNIX servers in my home and office network. The quad-core performance of the latest Pi models is awesome, but serious I/O limitations remain. With just one USB 2.0 interface shared for all network and storage operations, you aren’t going to […]

The Fat Middle: Today’s Enterprise Storage Array

August 31, 2014

Ask any project manager if it’s possible to deliver something that is fast, good, and cheap, and they’ll laugh. The phenomenon known as the Iron Triangle limits just about everything in the world from meeting all three conflicting requirements. Yet, for the last two decades, enterprise storage array vendors have been trying to deliver just this. How’s that working out?

Free as in Coffee – Thoughts on the State of OpenStack

May 2, 2016

Last week I headed to Austin, Texas to attend the semi-annual OpenStack Summit there. Along with the usual socializing, I was looking to understand the current state of the technology: What does OpenStack really mean these days, and where is it going? Let’s start with “free”. As “the Internet” is quick to point out, this critical word has multiple […]

You might also want to read these other posts...

Reader Interactions

Comments

Leave a Reply