What is WRITE_SAME? Green Eggs and Ham!

January 5, 2011 by Stephen

One of the sticky wickets holding back thin provisioning is the need to communicate when capacity is no longer needed. Enterprise storage arrays can reclaim zeroed pages, but writing all those zeros can really fill up an I/O queue. This is where WRITE_SAME comes into the picture.

This is a really terrible name. It’s all-capital letters and has an underscore in the middle of it. We sound like engineers.

But WRITE_SAME is an interesting idea. Imagine you wanted to delete a terabyte of data on a storage system with zero page reclaim. You’d have to write a terabyte of zeros. Well, that’s a lot of I/O. You’re basically pouring zeros across your PCI bus, HBA, network, and array.

Instead, imagine we could just say, “You know that page of zeros that I just wrote? Can you please write that a million more times for me? Hey, thanks a lot.”

You could do it in one command. That’s what WRITE_SAME is. It’s a SCSI command that carries a single block of data along with a starting address and a block count, and says, “This block that I just sent you, can you please write it again, and again, and again? Can you please write it a thousand times? Can you please write it over here, over there?” I sound like Dr. Seuss: You can write it in a car. You can write it at the bar. You can write it on a bike. You can write it with a pike.
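For the engineers in the room, here is a rough sketch of what that one command looks like on the wire. It packs a WRITE SAME(16) command descriptor block in Python, going by my reading of the SCSI block command set: operation code 0x93, a flags byte, an 8-byte starting block address, a 4-byte block count, then group number and control bytes. The function name `build_write_same_16` is made up for illustration, and this only builds the bytes; it doesn’t send anything to a device.

```python
import struct

WRITE_SAME_16 = 0x93  # SCSI operation code for WRITE SAME(16)
UNMAP_BIT = 0x08      # byte 1, bit 3: ask the target to unmap instead of write


def build_write_same_16(lba: int, num_blocks: int, unmap: bool = False) -> bytes:
    """Pack a 16-byte WRITE SAME(16) CDB (hypothetical helper, illustration only)."""
    if not 0 <= lba < 2**64:
        raise ValueError("starting LBA must fit in 64 bits")
    # A count of zero has a special meaning on some targets, so this sketch rejects it.
    if not 1 <= num_blocks < 2**32:
        raise ValueError("block count must be between 1 and 2**32 - 1")
    flags = UNMAP_BIT if unmap else 0
    # Big-endian: opcode, flags, 8-byte LBA, 4-byte block count, group number, control
    return struct.pack(">BBQIBB", WRITE_SAME_16, flags, lba, num_blocks, 0, 0)


# One 512-byte block of zeros is the entire data payload, however many blocks get written.
payload = bytes(512)
cdb = build_write_same_16(lba=0, num_blocks=1_000_000)
```

The point is the ratio: 16 bytes of command plus one block of data stands in for a million blocks of writes. Real targets may cap how many blocks a single command can cover, so a big range could take more than one command.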

This conserves I/O, and is a really good thing. WRITE_SAME makes zero page reclaim that much more effective. Now if only we had a system that would actually use this command!

It’s popular with array vendors, because all they have to do is say, “Hey look, I already support zero page reclaim. It’s up to you guys up there in the stack to implement the rest of this problem. It’s not our problem. It’s your problem.”

As an aside, consider that, if you’re an array vendor, anything that reduces the use of disk capacity is a problem for you. So they may not all be that eager to have this work, I think, but I’m sure they’ll come around.

But imagine if you did this to an un-thin array. Or imagine the array didn’t reclaim zero pages on ingest and instead relied on post-processing. You could end up writing a terabyte of zeros on the back end of your storage system, or 10 terabytes or 100 terabytes, only to reclaim it later that day, later in the week, or later in the month. And what if your system didn’t support zero page reclaim at all? Suddenly, you’re flooded with I/O requests on the storage-array side. So, basically, you’re conserving I/O across the host and the network, but you’re potentially generating massive I/O on the storage side, which is kind of a problem.
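To put rough numbers on that imbalance, here is a back-of-the-envelope sketch in Python. Everything in it is an assumption for illustration: 512-byte blocks, a per-command limit equal to the largest count a 32-bit field can hold (real arrays may advertise a much lower limit), and a worst-case array that physically writes every block it is asked to repeat. The function `write_same_traffic` is hypothetical.

```python
def write_same_traffic(total_bytes: int, block_size: int = 512,
                       max_blocks_per_cmd: int = 2**32 - 1):
    """Estimate commands, host-side payload bytes, and worst-case back-end bytes."""
    blocks = total_bytes // block_size
    # Ceiling division: how many commands are needed to cover every block
    commands = -(-blocks // max_blocks_per_cmd)
    # Each command carries exactly one block of data across the host and network
    wire_bytes = commands * block_size
    # Worst case: a non-thin array writes every single block to disk
    backend_bytes = blocks * block_size
    return commands, wire_bytes, backend_bytes


one_tib = 2**40
commands, wire_bytes, backend_bytes = write_same_traffic(one_tib)
print(commands, wire_bytes, backend_bytes)
```

Under those assumptions, a terabyte of zeros costs the host a single command and 512 bytes of payload, while the back end may still have to write the full terabyte.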

So, there are some issues here with this as well. But, we’re getting there.

Comments

the storage anarchist says

January 6, 2011 at 12:19 pm

The suspense is killing me!

WRITE_SAME itself is really NOT for Zero Page Reclaim…at least, it’s not an efficient approach since it can actually write the zeros on targets that aren’t “thin” (and “thin” is virtually transparent to hosts, file systems and applications). The real utility of WRITE_SAME is to reduce SAN traffic, something very helpful (for example) when VMware needs to re-initialize a VMDK for reuse (which is why it is now part of VAAI).

I know you’re probably setting up for the next page to discuss WRITE_SAME (UNMAP) and the new UNMAP commands – two capabilities that can be advertised by the targets so that the host software can specifically say “I don’t need these blocks anymore.”

Leaving your blog audience with the impression that WRITE_SAME(0x0000) is a useful approach to space reclamation is a bit misleading. Such cliffhangers work in live presentations where there are mere seconds until the next slide…here, not so much (IMHO).
