<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	>

<channel>
	<title>Stephen Foskett, Pack Rat &#187; Data Domain Archives  &#8211; Stephen Foskett, Pack Rat</title>
	<atom:link href="http://blog.fosketts.net/tag/data-domain/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.fosketts.net</link>
	<description>Understanding the accumulation of data</description>
	<lastBuildDate>Fri, 10 Feb 2012 17:40:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>My 2009 IT Industry Predictions</title>
		<link>http://blog.fosketts.net/2009/12/24/2009-industry-predictions/</link>
		<comments>http://blog.fosketts.net/2009/12/24/2009-industry-predictions/#comments</comments>
		<pubDate>Thu, 24 Dec 2009 14:00:08 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[Computer History]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Everything]]></category>
		<category><![CDATA[Personal]]></category>
		<category><![CDATA[Terabyte home]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[.Mac]]></category>
		<category><![CDATA[Alan Atkinson]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Atmos]]></category>
		<category><![CDATA[Avatar]]></category>
		<category><![CDATA[Bing]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[cloud storage]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[Dave Donatelli]]></category>
		<category><![CDATA[Drobo]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[FAST]]></category>
		<category><![CDATA[FCoE]]></category>
		<category><![CDATA[GDrive]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Interop]]></category>
		<category><![CDATA[Iomega]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[NetApp]]></category>
		<category><![CDATA[Nirvanix]]></category>
		<category><![CDATA[OnTap]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[predictions]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[recession]]></category>
		<category><![CDATA[Snow Leopard]]></category>
		<category><![CDATA[SNW]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[Storage Decisions]]></category>
		<category><![CDATA[Sun]]></category>
		<category><![CDATA[switch]]></category>
		<category><![CDATA[Tech Field Day]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Windows 7]]></category>
		<category><![CDATA[Xiotech]]></category>
		<category><![CDATA[ZFS]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=2567</guid>
		<description><![CDATA[Predictions are perilous: Get it right and you look like a mere trend-watcher; get it wrong and you look like a fool. So I'm doing something different this year: I'm going to make predictions for 2009 now that it's over, and reflect on just how smart I am (not) to have made them.]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.fosketts.net/wp-content/uploads/2009/12/Lightbulb.jpg" ><img style=' display: block; margin-right: auto; margin-left: auto;'  class="aligncenter size-full wp-image-2569" title="Lightbulb" src="http://blog.fosketts.net/wp-content/uploads/2009/12/Lightbulb.jpg" alt="" width="425" height="425" /></a></p>
<p>It&#8217;s that time again, when everyone who thinks they&#8217;re a pundit (that would be everyone with a blog or Twitter account) has to make predictions for the coming year. But predictions are perilous: Get it right and you look like a mere trend-watcher; get it wrong and you look like a fool. It&#8217;s such a hassle! So I&#8217;m doing something different this year: <strong>I&#8217;m going to make predictions for 2009 now that it&#8217;s over</strong>, and reflect on just how smart I am (not) to have made them. Or something.<span id="more-2567"></span></p>
<h3>What I Would Have Gotten Right</h3>
<p>I definitely could have predicted a lot of what happened in 2009. I mean, <strong>these were slam dunks!</strong></p>
<ol>
<li><strong>Twitter rocks the world</strong> &#8211; I wasn&#8217;t early to Twitter, but I spent the early part of 2009 <a href="http://blog.fosketts.net/2008/12/05/storage-twitter/"  target="_blank">evangelizing</a> its benefits to companies and co-workers alike. Considering how common Twitter is today, it&#8217;s hard to believe how roundly criticized and misunderstood it was this time last year. Yet here we are, on the verge of 2010, and Twitter has seeped onto our business cards, presentation templates, and web sites. I might not have predicted how stable (!) Twitter got by the end of the year, though.</li>
<li><strong>Apple&#8217;s Macs and iPhones rule</strong> &#8211; I switched to <a href="http://blog.fosketts.net/series/iPhone/"  target="_blank">the iPhone</a> and <a href="http://blog.fosketts.net/series/MacBook-Pro/"  target="_blank">the Mac</a> in 2007 and 2008, respectively, but it looks like I wasn&#8217;t much of an iconoclast after all: By November, half of the <a href="http://gestaltit.com/field-day/"  target="_blank">Tech Field Day</a> delegates were using MacBooks, and the Windows and Blackberry holdouts have started vocally defending their operating system choice. Pretty much like Mac folks used to do way back in 2008.</li>
<li><strong>The recession is a serious pain</strong> &#8211; Companies put the brakes on spending and hiring, many even shifting both into reverse in 2009. This came as no surprise to humans capable of thought. The impact on enterprise IT companies was similarly predictable: Although most were able to survive, the impact of 2009 will continue to be felt for years. I might have predicted it would be worse, though I&#8217;m glad to say I would have been wrong.</li>
<li><strong>EMC, NetApp, HDS, HP, and IBM continue to quibble</strong> &#8211; Surprise: Big company bloggers spend way too much time criticizing each other&#8217;s products and actions and way too little time talking about the true value of their own products.</li>
</ol>
<p>Non-IT slam-dunk predictions: Obama was reviled by the right; the war in Afghanistan continues; people do stupid stuff in the name of reality shows.</p>
<h3>What I Probably Could Have Predicted</h3>
<p>Although some details would likely have been missed, <strong>I think I would have seen these coming</strong>.</p>
<ol>
<li><strong>Cloud compute and storage hits the enterprise</strong> &#8211; I was a believer in the cloud this time last year, and <a href="http://blog.fosketts.net/2009/04/02/changing-times-demand-focus/"  target="_blank">I bet my future on it</a> by taking a position at enterprise cloud storage provider Nirvanix in March. I would have predicted that enterprise buyers would be putting serious thought into buying cloud products, but the scope has surprised me. We&#8217;re talking enough petabytes that the non-cloud players felt compelled to strike back with the private cloud pitch. Awesome!</li>
<li><strong>Sun and Data Domain were acquired</strong> &#8211; My money would have been on Dell, IBM, or HP as buyers for this pair, but EMC wouldn&#8217;t have been outside my guesses. Still, Oracle buying Sun and vocally committing to keep it going, SPARC and all, would never have come to mind. But I wouldn&#8217;t have guessed against it either, so I&#8217;ll give myself a point here!</li>
<li><strong>Cisco and EMC buddy up</strong> &#8211; I&#8217;ve long thought an outright merger of these two was in the cards, but even the recession couldn&#8217;t make the financials work. A partnership would have been on the list, and <a href="http://thestoragearchitect.com/2009/11/03/enterprise-computing-vmware-cisco-and-emc-join-forces-to-create/"  target="_blank">Acadia</a> came as no surprise to anyone.</li>
<li><strong>Cloud outages and data loss</strong> &#8211; I definitely could have predicted that high-profile cloud services would fall over throughout the year, and that some would lose data. Not all are enterprise-grade, after all. But the outages at Google, Rackspace, and Amazon, and Microsoft&#8217;s Danger data loss, surprised me. Don&#8217;t those guys have their acts together?</li>
<li><strong>IT conferences falter</strong> &#8211; I spoke at Interop in 2009, but it lacked the 20,000-strong crowd it once had. Storage Decisions and Storage Networking World managed to fill their halls, but the old-school IT conference has lost its luster. Although VMworld remains strong, attendance was definitely off.</li>
<li><strong>FCoE and SSD are still starting</strong> &#8211; I&#8217;ve been lukewarm on <a href="http://blog.fosketts.net/tag/FCoE/"  target="_blank">Fibre Channel over Ethernet</a> and <a href="http://blog.fosketts.net/tag/ssd/"  target="_blank">Solid State Drives</a>, but I&#8217;m a bit surprised that storage vendors didn&#8217;t push them harder in 2009. I might have guessed there would have been more customer uptake to match the buzz.</li>
<li><strong>SMB storage is hot</strong> &#8211; There&#8217;s a hole in the storage market between $1,000 and $20,000, and companies like <a href="http://blog.fosketts.net/series/Drobo/"  target="_blank">Drobo</a> and <a href="http://blog.fosketts.net/series/Iomega/"  target="_blank">Iomega</a> are rushing in to fill it. Now that ESX has solid iSCSI support, I expect a world of innovation here. (Oops, that sounds kind of like a 2010 prediction!)</li>
</ol>
<p>Also in the predictable category: Goldman Sachs and Bank of America thrived while others fell; Ford is the strongest of the remaining US automakers; Boeing finally got the 787 off the ground.</p>
<h3>What I Never Would Have Guessed</h3>
<p>I&#8217;m not perfect, even in retrospect. Some of the tech news from 2009 was just <strong>completely off the wall</strong>.</p>
<ol>
<li><strong>Microsoft Bing: This time for sure!</strong> &#8211; Seriously, Microsoft should stick to in-house thinking instead of trying to copy its rivals. Yet somehow, miraculously, Bing appeared and did not suck. In fact, I&#8217;m hearing regular (non-techie) folks around town talking about using the search engine. I&#8217;ve even used it! Could they actually have a winner?</li>
<li><strong>Windows 7 rocks</strong> &#8211; Really? Seriously? Could Microsoft have come up with a solid replacement for Windows XP?</li>
<li><strong>Ship it!</strong> &#8211; It&#8217;s not even 2010, and enterprise storage buyers can go out and purchase <a rel="nofollow" href="http://storagebod.typepad.com/storagebods_blog/2009/08/duke-nukem-forever-ontap-8.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+StoragebodsBlog+%28Storagebod%27s+Blog%29&amp;utm_content=Google+Reader"  target="_blank">NetApp&#8217;s OnTap 8</a>, <a href="http://gestaltit.com/all/tech/storage/bas/emcs-fast-1-action/"  target="_blank">EMC&#8217;s FAST</a>, <a href="http://gestaltit.com/featured/top/gestalt/emc-rules-atmos-compute/"  target="_blank">EMC Atmos Compute</a>, and unicorn tears. Well, maybe not unicorn tears.</li>
<li><strong>Still no GDrive</strong> &#8211; Seemingly every company has a cloud storage platform, from Amazon to Rackspace, Nirvanix to EMC, so why not Google? Could GDrive join Duke Nukem Forever as the most famous vaporware of the decade?</li>
<li><strong>The executive shuffle</strong> &#8211; <a href="http://gestaltit.com/featured/top/devang/dave-donatellis-move-emc-hp/"  target="_blank">Dave Donatelli</a> was supposed to lead EMC, not HP. <a href="http://gestaltit.com/featured/top/stephen/alan-atkinson-wysdm-emc-xiotech/"  target="_blank">Alan Atkinson</a> was supposed to launch another startup, not take over Xiotech. At least <a href="http://gestaltit.com/all/tech/storage/stephen/netapp-shows-ceo-succession-work/"  target="_blank">NetApp was gentle</a>.</li>
<li><strong>Mac OS X (still) lacks iSCSI and ZFS</strong> &#8211; Come on, Cupertino, what&#8217;s wrong with you guys? I&#8217;ve been hyping ZFS for years, and iSCSI is commonplace. Yet <a href="http://blog.fosketts.net/2009/06/09/snow-leopard-storage/"  target="_blank">Snow Leopard is stingy</a> with both. Makes me want to hiss like one of those blue folks in Avatar.</li>
<li><strong>Gestalt IT is a success</strong> &#8211; On a personal note, Gestalt IT didn&#8217;t even exist this time last year, and now we have <a href="http://gestaltit.com"  target="_blank">a successful IT infrastructure blog</a> and <a href="http://gestaltit.com/field-day/"  target="_blank">social media event</a>. Amazing!</li>
</ol>
<p>Other total shockers: Everyone loves Michael Jackson again; digital Beatles tunes are available everywhere but iTunes; Obama&#8217;s Nobel Peace Prize arrives 10 years early.</p>
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2009. |
<a href="http://blog.fosketts.net/2009/12/24/2009-industry-predictions/">My 2009 IT Industry Predictions</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/apple/" title="View all posts in Apple" rel="category tag">Apple</a>, <a href="http://blog.fosketts.net/category/everything/computerhistory/" title="View all posts in Computer History" rel="category tag">Computer History</a>, <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>, <a href="http://blog.fosketts.net/category/everything/" title="View all posts in Everything" rel="category tag">Everything</a>, <a href="http://blog.fosketts.net/category/everything/personal/" title="View all posts in Personal" rel="category tag">Personal</a>, <a href="http://blog.fosketts.net/category/everything/terabytehome/" title="View all posts in Terabyte home" rel="category tag">Terabyte home</a>, <a href="http://blog.fosketts.net/category/everything/virtualstorage/" title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2009/12/24/2009-industry-predictions/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Deduplication Coming to Primary Storage</title>
		<link>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/</link>
		<comments>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/#comments</comments>
		<pubDate>Tue, 16 Sep 2008 19:28:37 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Computer History]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Features]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[Atari]]></category>
		<category><![CDATA[Byte]]></category>
		<category><![CDATA[capacity optimization]]></category>
		<category><![CDATA[CAS]]></category>
		<category><![CDATA[Centera]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[data deduplication]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[DR-DOS]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[FilePool]]></category>
		<category><![CDATA[greenBytes]]></category>
		<category><![CDATA[Huffman coding]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[NetApp]]></category>
		<category><![CDATA[Riverbed]]></category>
		<category><![CDATA[single-instance storage]]></category>
		<category><![CDATA[Stacker]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[VTL]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=626</guid>
		<description><![CDATA[Although deduplication of storage is nothing new, with Data Domain and others making hay with the technique for years, it has never been ready for prime time - reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.]]></description>
			<content:encoded><![CDATA[<p style="padding-left: 30px;"><em>This is a follow-up to my story, <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">De-Duplication Goes Mainstream</a></em></p>
<p>Although deduplication of storage is nothing new, with Data Domain and others making hay with the technique for years, it has never been ready for prime time &#8211; reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.</p>
<h3>Stone Knives and Bear Skins</h3>
<p>We have all been here before, of course. Back at the dawn of the personal computer era, data compression was a hot topic of conversation. I recall being so impressed by an article in <a rel="nofollow" href="http://en.wikipedia.org/wiki/Byte_(magazine)"  target="_blank">Byte</a> (1986:5, p99) outlining <a rel="nofollow" href="http://en.wikipedia.org/wiki/Huffman_coding"  target="_blank">Huffman coding</a> that I tried cooking up an implementation in Atari BASIC. Lossless compression has a magical pull to the geek in many of us &#8211; redundant data just <em>wants</em> to be eliminated!</p>
<div id="attachment_630" class="wp-caption alignright" style="width: 254px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" ><img class="size-full wp-image-630 " title="Stacker" src="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" alt="Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0" width="244" height="254" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0</p></div>
<p>Companies soon applied <a href="http://www.zisman.ca/Articles/1993/DOS6.html"  target="_blank">compression to primary storage</a>, especially the limited storage in personal computers. <a rel="nofollow" href="http://en.wikipedia.org/wiki/Stac_Electronics#Microsoft_lawsuit"  target="_blank">Stacker</a> was a hit after 1990, until Microsoft built a workalike, called DoubleSpace, into DOS 6.0 in 1993, leading to a historic lawsuit. I personally used the ADDSTOR disk compression built into DR-DOS 6.0 to stretch two more years out of the 20 MB MFM hard drive in my AT&amp;T PC6300 at <a href="http://wpi.edu"  target="_blank">WPI</a>.</p>
<p>But something funny happened in the late 1990s: Compression began to lose its luster. Compressing data always takes quite a bit of CPU power, but this was offset somewhat by the truncated data transfers and more-efficient file system layout afforded in early PCs. But as disks got larger and faster, using precious CPU time to save space seemed less and less compelling. Today, although nearly every operating system includes built-in compression of files, folders, or perhaps disks, these features are rarely used. And compression was never popular in the performance-sensitive enterprise space.</p>
<h3>Deduplication Has a Nice Ring</h3>
<p>Although traditional fine-grained compression has not been very successful in the enterprise, its lanky cousin, single-instance storage, has long found niche jobs. Applications from databases to email systems to file servers have long had the ability to recognize requests to store the exact same file or record, and to store just a single instance in this case. Even file systems have the ability to do single-instance storage through the use of links, though this is initiated by the user rather than in an automated fashion.</p>
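<p>To make the idea concrete, here is a minimal Python sketch of single-instance storage. It is an illustration only: the <code>SingleInstanceStore</code> class, its method names, and the choice of SHA-256 as the fingerprint are assumptions made for this example, not a description of how any particular product works.</p>
<pre>
```python
import hashlib


class SingleInstanceStore:
    """Toy whole-object store: identical content is kept only once."""

    def __init__(self):
        self.blobs = {}  # digest -> bytes (the one physical copy)
        self.names = {}  # logical name -> digest

    def put(self, name, data):
        # Fingerprint the whole object; identical objects share a digest.
        digest = hashlib.sha256(data).hexdigest()
        # Keep the bytes only if this content has not been seen before.
        self.blobs.setdefault(digest, data)
        self.names[name] = digest
        return digest

    def get(self, name):
        return self.blobs[self.names[name]]

    def physical_bytes(self):
        # Space actually consumed, counting each unique object once.
        return sum(len(blob) for blob in self.blobs.values())
```
</pre>
<p>Storing the same attachment under two names consumes the space of one copy, but change a single byte and the whole object is stored again. That limitation is what the chunk-based approaches described next try to address.</p>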
<p>In the late 1990s, FilePool began developing a <a rel="nofollow" href="http://en.wikipedia.org/wiki/Content-addressable_storage"  target="_blank">content-addressable storage</a> device, which was acquired by EMC in 2001. This device, later known as the Centera, was one of a number of storage platforms targeted at the archiving market introduced this decade. At the same time, <a rel="nofollow" href="http://en.wikipedia.org/wiki/Virtual_tape_library"  target="_blank">virtual tape libraries</a> made the jump from the mainframe to open systems. Both devices, being outside the critical path of performance but offering massive capacity, were well-suited to implement advanced <a rel="nofollow" href="http://en.wikipedia.org/wiki/Capacity_optimization"  target="_blank">capacity optimization</a> technologies that combined the concepts of compression with single-instance storage. Thus was created the modern world of data deduplication.</p>
<p>What we think of as deduplication is neither fish nor fowl: It assesses larger &#8220;chunks&#8221; of data than compression technologies, delivering greater capacity savings and potentially reducing performance impact, but is more flexible than single-instancing, recognizing the similarities within files or objects.</p>
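<p>A rough sketch of the chunk-level idea in Python follows. The assumptions are loud ones: the <code>ChunkStore</code> class is hypothetical, the chunks are fixed-size (real products may use variable-size chunking), and the 8-byte chunk size is absurdly small so the example is easy to follow.</p>
<pre>
```python
import hashlib

CHUNK_SIZE = 8  # hypothetical value, far smaller than anything practical


def split_chunks(data, size=CHUNK_SIZE):
    """Cut data into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


class ChunkStore:
    """Toy chunk-level deduplicating store."""

    def __init__(self):
        self.chunks = {}   # digest -> chunk bytes, stored once
        self.recipes = {}  # object name -> ordered list of chunk digests

    def put(self, name, data):
        recipe = []
        for chunk in split_chunks(data):
            digest = hashlib.sha256(chunk).hexdigest()
            self.chunks.setdefault(digest, chunk)  # new chunks only
            recipe.append(digest)
        self.recipes[name] = recipe

    def get(self, name):
        # Rebuild the object by following its recipe.
        return b"".join(self.chunks[d] for d in self.recipes[name])

    def physical_bytes(self):
        return sum(len(chunk) for chunk in self.chunks.values())
```
</pre>
<p>Store the 24-byte objects <code>b"AAAAAAAABBBBBBBBCCCCCCCC"</code> and <code>b"AAAAAAAABBBBBBBBDDDDDDDD"</code> and the store holds 32 bytes of unique chunks rather than 48, because the first two chunks are shared. Single-instancing would have kept both objects whole.</p>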
<p>But it is still maddeningly difficult to scale deduplication while maintaining performance. Rather than fight to maintain reasonable write throughput, most deduplication products have switched to post-processing, deferring their work to quieter times.</p>
<h3>It&#8217;s Not Just for Breakfast</h3>
<p>Regardless of their methods or underlying technology, however, no deduplication vendor has stood up to support challenging low-latency or high-throughput production applications. <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">NetApp was the first to raise the issue of support for production applications</a>, but although they tout the technology for VMware, they haven&#8217;t exactly been shouting from the rooftops to get their A-SIS deduplication technology deployed in other high-I/O applications. And I haven&#8217;t seen Hifn&#8217;s card yet.</p>
<p>Yesterday, I mentioned that greenBytes was adding deduplication to their ZFS-based storage array for primary data. And now <a href="http://www.theregister.co.uk/2008/09/16/deduplicating_primary_storage/"  target="_blank">Riverbed has fired another shot</a> over the bow, repurposing their (deduplicating) WAN accelerator product for primary (file) storage. They might be able to pull it off, too, since they have a long list of customers who are already enjoying the technology in production. It&#8217;s not a stretch to suggest that Riverbed&#8217;s appliances can scale to handle production data loads. Although it&#8217;s file-only, I can imagine quite a few scenarios where this tech could really yield benefits. Could we come full-circle, with deduplication finally reaching the enterprise storage world?</p>
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2008. |
<a href="http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/">Deduplication Coming to Primary Storage</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/computerhistory/" title="View all posts in Computer History" rel="category tag">Computer History</a>, <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>, <a href="http://blog.fosketts.net/category/features/" title="View all posts in Features" rel="category tag">Features</a>, <a href="http://blog.fosketts.net/category/everything/virtualstorage/" title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>greenBytes Embraces and Extends ZFS</title>
		<link>http://blog.fosketts.net/2008/09/15/greenbytes-embraces-extends-zfs/</link>
		<comments>http://blog.fosketts.net/2008/09/15/greenbytes-embraces-extends-zfs/#comments</comments>
		<pubDate>Mon, 15 Sep 2008 15:13:29 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[CDP]]></category>
		<category><![CDATA[Copan]]></category>
		<category><![CDATA[data deduplication]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[greenBytes]]></category>
		<category><![CDATA[MAID]]></category>
		<category><![CDATA[snapshot]]></category>
		<category><![CDATA[spin-down]]></category>
		<category><![CDATA[Sun]]></category>
		<category><![CDATA[thin provisioning]]></category>
		<category><![CDATA[Thumper]]></category>
		<category><![CDATA[ZFS]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=622</guid>
		<description><![CDATA[I&#8217;ve long hollered that ZFS is a real storage revolution in the making, but recognized that it still had a way to go before replacing UFS, HFS+, and most volume managers. Well, a little Rhode Island company called greenBytes comes out of stealth today to announce that they&#8217;re doing just that &#8211; taking the solid [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve long hollered that <a href="http://blog.fosketts.net/2008/02/27/zfs-super-file-system/"  target="_self">ZFS is a real storage revolution in the making</a>, but recognized that it still had a way to go before replacing UFS, HFS+, and most volume managers. Well, a little Rhode Island company called <a href="http://www.green-bytes.com/"  target="_blank">greenBytes comes out of stealth today</a> to announce that they&#8217;re doing just that &#8211; taking the solid ZFS core and adding some serious enterprise storage features to it. And they&#8217;re rolling the lot into a multi-protocol storage array using commodity (<a href="http://www.sun.com/servers/x64/x4500/"  target="_blank">Sun Thumper</a>) hardware. These guys have cooked up a seriously interesting entrant in the storage market, though I can&#8217;t say much for the <a rel="nofollow" href="http://en.wikipedia.org/wiki/CamelCase"  target="_blank">decapitated camel-case spelling</a> of their (<a href="http://greenbytes.de/"  target="_blank">already in use</a>) name!</p>
<p><span id="more-622"></span><strong>Spun Down</strong></p>
<p>Although <a rel="nofollow" href="http://en.wikipedia.org/wiki/ZFS#Features"  target="_blank">ZFS&#8217; universal storage pool with non-RAID</a> is a great concept, it stands in the way of at least one (sometimes) desirable storage technique: disk spin-down. Put simply, since every disk contains metadata, all disks must always be spinning. This issue is by no means a ZFS-only problem, though &#8211; certain vendors tout the (laughable) greenness of their storage systems, while hoping that the average user won&#8217;t notice the truth: That a disk simply cannot spin down while any part of it is in use. This means that tacking spin-down onto a regular storage array is like painting it a different color: There is no benefit whatsoever to the average user. Sure, a few non-provisioned drives might spin down, but what are you doing buying a lot of non-provisioned drives anyway?</p>
<p>The solution has always been right in front of everyone: Develop <a href="http://blog.fosketts.net/2008/09/14/turning-the-page-on-raid/"  target="_self">a new type of non-RAID</a> with enough intelligence to allow drives to spin down when not used. This is what <a href="http://www.copansystems.com/index.php?"  target="_blank">COPAN Systems</a> did with their <a rel="nofollow" href="http://en.wikipedia.org/wiki/Massive_array_of_idle_disks"  target="_blank">MAID</a> technology: Invent an entirely new storage array, with integrated data protection and management techniques that allow <em>alive but not active</em> drives to spin down. Spin-down is not MAID any more than a bicycle is a Ducati.</p>
<p>Let&#8217;s make one thing clear: It&#8217;s <em>really hard</em> to reduce the power demands of storage devices. Disks guzzle watts like few other data center devices, and enterprise storage uses lots of disks. Lots of vendors are looking to hop onto the green storage bandwagon, and they all seem to realize that bringing some <a href="http://storageio.com/blog/?p=72"  target="_blank">intelligence to power management by enabling spin-down</a> is an open door. But it&#8217;s awfully hard to maintain performance and data protection when disks are spinning up and down all the time.</p>
<p>One element of the greenBytes story is the way in which they have tweaked ZFS to allow disks to spin down. They limit the metadata updates to just a few disks, so the others can be idled when nothing is accessing them. The company suggests scheduling this for off hours to minimize latency as drives are brought back online, an approach that is less than optimal from an energy perspective but demonstrates that they understand just how difficult this problem is to crack. The core is there, however: They have integrated the data protection and storage management elements to make spin-down practical.</p>
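<p>To make the metadata point concrete, here is a minimal Python sketch of a spin-down policy. It is a toy model under stated assumptions, not greenBytes&#8217; implementation: the <code>Disk</code> class, the <code>holds_metadata</code> flag, and the ten-minute idle threshold are all hypothetical names and values chosen for illustration.</p>

```python
from dataclasses import dataclass


@dataclass
class Disk:
    name: str
    holds_metadata: bool  # assumption: metadata updates are pinned to a few disks
    idle_seconds: float   # time since the last read or write touched this disk


def spin_down_candidates(disks, idle_threshold=600.0):
    """Return the names of disks this toy policy would allow to spin down.

    A disk qualifies only if it receives no metadata updates and has been
    idle for at least idle_threshold seconds (a hypothetical default).
    """
    return [
        d.name
        for d in disks
        if not d.holds_metadata and d.idle_seconds >= idle_threshold
    ]


pool = [
    Disk("disk0", holds_metadata=True, idle_seconds=5.0),
    Disk("disk1", holds_metadata=False, idle_seconds=3600.0),
    Disk("disk2", holds_metadata=False, idle_seconds=30.0),
    Disk("disk3", holds_metadata=False, idle_seconds=900.0),
]

# Same hardware, but with metadata spread across every disk.
metadata_everywhere = [Disk(d.name, True, d.idle_seconds) for d in pool]

print(spin_down_candidates(pool))
print(spin_down_candidates(metadata_everywhere))
```

<p>With metadata confined to <code>disk0</code>, the two long-idle data disks become candidates. Spread metadata across every disk and the candidate list is empty, which is exactly the problem described above.</p>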
<p><strong>Compressed</strong></p>
<p>Another major storage industry theme of the last few years is deduplication of data. An advanced (or devolved, depending on your perspective) form of compression, deduplication allows a storage array to store duplicate data more efficiently, reducing the amount of capacity required for some applications. <a href="http://www.datadomain.com/"  target="_blank">Data Domain</a> is top-of-mind in this space, but just about everyone now offers some form of deduplication technology.</p>
<p>One major roadblock on the way to deduplication (or compression) nirvana is performance. Simply put, it&#8217;s <em>really, really hard</em> to process data on the fly without affecting performance, especially as data scales up to the multi-terabyte range or as systems scale out to include multiple devices. One approach to tackling this issue is post-processing dedupe, which accepts incoming data in the normal way but goes back and processes it later to remove duplicates. This is the method <a href="http://netapp.com"  target="_blank">NetApp</a> uses, and they have leveraged it to become <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">the first vendor to support deduplication of production applications</a>.</p>
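<p>The difference between inline and post-process deduplication can be sketched in a few lines of Python. This is an illustration only, assuming fixed-size blocks identified by SHA-256 digests; the class names, the four-byte block size, and the sample data are hypothetical and do not describe any vendor&#8217;s actual design.</p>

```python
import hashlib

BLOCK_SIZE = 4  # unrealistically small, so the sample data stays readable


def split_blocks(data, size=BLOCK_SIZE):
    """Cut a byte string into fixed-size blocks."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def fingerprint(block):
    """Identify a block by the SHA-256 digest of its contents."""
    return hashlib.sha256(block).hexdigest()


class InlineStore:
    """Deduplicates on the write path: every block is hashed before it lands."""

    def __init__(self):
        self.unique = {}  # digest -> block contents, each stored once
        self.layout = []  # ordered digests that describe the logical data

    def write(self, data):
        for block in split_blocks(data):
            digest = fingerprint(block)
            self.unique.setdefault(digest, block)
            self.layout.append(digest)

    def stored_blocks(self):
        return len(self.unique)


class PostProcessStore(InlineStore):
    """Accepts writes untouched, then removes duplicates in a later pass."""

    def __init__(self):
        super().__init__()
        self.pending = []  # blocks written but not yet deduplicated

    def write(self, data):
        # No hashing here, so the write path does no extra work.
        self.pending.extend(split_blocks(data))

    def dedupe(self):
        for block in self.pending:
            digest = fingerprint(block)
            self.unique.setdefault(digest, block)
            self.layout.append(digest)
        self.pending = []

    def stored_blocks(self):
        return len(self.pending) + len(self.unique)


data = b"AAAABBBBAAAAAAAA"  # four blocks, only two of them distinct

inline = InlineStore()
inline.write(data)

post = PostProcessStore()
post.write(data)
before = post.stored_blocks()
post.dedupe()
after = post.stored_blocks()

print(inline.stored_blocks(), before, after)  # prints: 2 4 2
```

<p>Both stores end up holding two blocks, but the inline store pays the hashing cost on every write, while the post-process store temporarily needs room for all four blocks until its cleanup pass runs. That is the performance-versus-capacity tradeoff in miniature.</p>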
<p>Predictably, deduplication is another technology integrated into greenBytes&#8217; &#8220;ZFS+&#8221; technology. They claim that they can handle inline compression at wire speed, and they claim inline deduplication as well. It&#8217;s not yet clear exactly where the company draws the line between compression and deduplication, or just what kind of performance their inline technology will yield, but it&#8217;s certainly nice to see this tech integrated with ZFS!</p>
<p><strong>Thin is In (the House!)</strong></p>
<p>greenBytes gets closer to enterprise storage bingo by adding <a href="http://blog.fosketts.net/2008/09/02/3pars-thin-un-provisioning-is-slightly-less-bad/"  target="_self">thin provisioning</a> to the mix. Actually, as the company&#8217;s CTO was quick to point out, they had to offer virtual or thin provisioning to enable the rest of the system to function. When your storage is sliced and diced by their Cypress array, the only way to present storage is with a wink and a promise of capacity to spare. Thankfully this is not the core of their pitch, however.</p>
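<p>For readers new to the idea, thin provisioning can be sketched as volumes that advertise a large virtual size but draw physical blocks from a shared pool only on first write. The Python below is a minimal illustration under that assumption; <code>Pool</code>, <code>ThinVolume</code>, and the block counts are hypothetical and are not taken from the Cypress array.</p>

```python
class Pool:
    """Shared physical capacity, counted in blocks."""

    def __init__(self, physical_blocks):
        self.free = physical_blocks

    def allocate(self):
        if self.free == 0:
            raise RuntimeError("pool exhausted: promises exceeded physical capacity")
        self.free -= 1


class ThinVolume:
    """Presents virtual_blocks to the host but allocates only on first write."""

    def __init__(self, pool, virtual_blocks):
        self.pool = pool
        self.virtual_blocks = virtual_blocks  # the size the host is told about
        self.mapped = {}  # logical block number -> data, filled lazily

    def write(self, lbn, data):
        if lbn not in range(self.virtual_blocks):
            raise IndexError("write beyond the advertised volume size")
        if lbn not in self.mapped:
            self.pool.allocate()  # physical space is consumed only now
        self.mapped[lbn] = data

    def read(self, lbn):
        # Blocks that were never written read back as zeros.
        return self.mapped.get(lbn, b"\x00")


pool = Pool(physical_blocks=10)
vol_a = ThinVolume(pool, virtual_blocks=100)
vol_b = ThinVolume(pool, virtual_blocks=100)

vol_a.write(0, b"x")
vol_a.write(0, b"y")   # overwrite: no new allocation
vol_b.write(42, b"z")

print(pool.free)  # prints: 8
```

<p>Two volumes promise 200 blocks between them while only 10 exist, and just 2 have actually been consumed. The wink and the promise hold up until the pool runs dry, at which point the sketch raises an error.</p>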
<p>The company also promises snapshots and CDP replication, all leveraging ZFS at the core. All they need to add is tier-0 solid state storage to get five chips in a row without even <a rel="nofollow" href="http://en.wikipedia.org/wiki/Bingo_(U.S.)"  target="_blank">using the free space</a>! Although greenBytes is currently using Sun&#8217;s Thumper chassis for their Cypress array, their core technology is the ZFS+ software, and I expect we might see it packaged quite differently in the future. This is a software company, not an array vendor.</p>
<p>All considered, greenBytes has thoroughly broken the link between physical and logical storage, and I applaud them for it. This is exactly the kind of storage revolution the industry needs right now.</p>
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2008. |
<a href="http://blog.fosketts.net/2008/09/15/greenbytes-embraces-extends-zfs/">greenBytes Embraces and Extends ZFS</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>, <a href="http://blog.fosketts.net/category/everything/virtualstorage/" title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2008/09/15/greenbytes-embraces-extends-zfs/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

