<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	>

<channel>
	<title>Stephen Foskett, Pack Rat &#187; single-instance storage Archives  &#8211; Stephen Foskett, Pack Rat</title>
	<atom:link href="http://blog.fosketts.net/tag/single-instance-storage/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.fosketts.net</link>
	<description>Understanding the accumulation of data</description>
	<lastBuildDate>Fri, 10 Feb 2012 17:40:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>I Can Finally Talk About Windows Storage Server 2008!</title>
		<link>http://blog.fosketts.net/2009/05/05/windows-storage-server-2008/</link>
		<comments>http://blog.fosketts.net/2009/05/05/windows-storage-server-2008/#comments</comments>
		<pubDate>Tue, 05 May 2009 20:48:18 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[ActiveX]]></category>
		<category><![CDATA[clustering]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[dual-active]]></category>
		<category><![CDATA[Firefox]]></category>
		<category><![CDATA[Gestalt IT]]></category>
		<category><![CDATA[Internet Explorer]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[MVP]]></category>
		<category><![CDATA[NDA]]></category>
		<category><![CDATA[primary storage]]></category>
		<category><![CDATA[RDP]]></category>
		<category><![CDATA[remote administration]]></category>
		<category><![CDATA[single-instance storage]]></category>
		<category><![CDATA[SIS]]></category>
		<category><![CDATA[SMB]]></category>
		<category><![CDATA[SMB 2.0]]></category>
		<category><![CDATA[Storage Decisions]]></category>
		<category><![CDATA[TechNet]]></category>
		<category><![CDATA[VDS]]></category>
		<category><![CDATA[Windows Server 2008]]></category>
		<category><![CDATA[Windows Server 2008 R2]]></category>
		<category><![CDATA[Windows Server 2008 Service Pack 2]]></category>
		<category><![CDATA[Windows Storage Server 2008]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=1832</guid>
		<description><![CDATA[I don&#8217;t usually &#8220;do&#8221; NDAs. It&#8217;s just too hard to figure out what I&#8217;m allowed to say and what I should keep quiet. I prefer to get free and open information, but will settle for embargoed briefings if it means I can get some time to think before reporting. So my Microsoft connection is a [...]]]></description>
			<content:encoded><![CDATA[<p><strong>I don&#8217;t usually &#8220;do&#8221; NDAs</strong>. It&#8217;s just too hard to figure out what I&#8217;m allowed to say and what I should keep quiet. I prefer to get free and open information, but will settle for embargoed briefings if it means I can get some time to think before reporting. So my Microsoft connection is a major anomaly, and I&#8217;ve been sitting on my hands trying not to spill the beans&#8230;<span id="more-1832"></span></p>
<p>One of the great things about being a Microsoft MVP is the access I get to Microsoft software and staff. As I mentioned in my post about the <a href="http://blog.fosketts.net/2009/03/06/10-cool-storage-2009-microsoft-mvp-summit/" >10 cool storage features from the 2009 Microsoft MVP Summit</a>, I was able to preview a lot of what Microsoft is doing with their Server software and storage features. And the best part is that the Microsoft product teams are keenly interested in our feedback and suggestions. I&#8217;m told, for example, that the awesome iSCSI Quick Connect feature in the new Windows iSCSI initiator software was developed based on my feedback!</p>
<p>As I note on my <a href="http://gestaltit.com/tech/stephen/windows-storage-server-2008/"  target="_blank">Windows Storage Server 2008 preview</a> on Gestalt IT, Microsoft has always kept WSS close to the vest. It&#8217;s only available to OEMs, not retail customers, and has never even been shared with TechNet or MSDN subscribers in the past. So I was really pleased when <strong>Microsoft gave the File System Storage MVPs access to a beta version of WSS 2008</strong> so we could get a feel for all of the new features. I&#8217;ve also had some great conversations this week with the Microsoft product managers responsible for it.</p>
<p>What&#8217;s exciting about Windows Storage Server 2008?</p>
<ol>
<li>It includes all of the <a href="http://blog.fosketts.net/2008/07/31/windows-server-2008-changes-storage/"  target="_blank">storage enhancements in Windows Server 2008</a>, including <strong>SMB 2.0</strong> for much, much faster file serving over higher-latency links, SMfS, FSRM, enhanced VDS, and failover clustering.</li>
<li>WSS is the only way to get access to Microsoft&#8217;s <strong>iSCSI target software</strong>. It&#8217;s been improved in many ways from the prior releases, but its support for what Microsoft calls <strong>dual-active clustering</strong> is probably its most notable feature: You can&#8217;t share the same active LUN between cluster members, but each can have its own active LUNs and they can all fail over in the event that one member goes down.</li>
<li>The included <strong>single-instance storage (SIS)</strong> file-based deduplication has been much improved, scaling to 128 volumes per server and millions of files. It&#8217;s still not as effective capacity-wise as block-level deduplication (which I&#8217;d love to see, hint hint), but the performance is solid enough to use it for <strong>primary storage with production applications</strong>.</li>
<li>Probably the coolest feature exclusive to Windows Storage Server 2008 is its new <strong>browser-based remote administration capability</strong>. Just point your browser to the Storage Server machine (for example, &#8220;http://wss/desktop&#8221;) and you&#8217;ll get a full ActiveX version of RDP. Don&#8217;t use Internet Exploder? Firefox and Linux users will get a Java-based RDP instead! I will cover this feature more in the future, but let&#8217;s just say that <strong>every operating system should offer this</strong>!</li>
</ol>
<p>Want to try Windows Storage Server for yourself? Breaking from the past, Microsoft will soon (like next week!) allow TechNet subscribers to <strong>download the full install</strong>. OEMs have a <a href="http://microsoft.download-ss.com/default.asp"  target="_blank">sekrit back-door site</a> to try it out, too.</p>
<p>One more thing&#8230; <strong>This will be the last release of Windows Storage Server</strong>. There won&#8217;t even be a special Storage Server version of Server 2008 Service Pack 2! Starting now, Storage Server is just an optional feature of Windows Server. Purchasing and production use will still be limited to storage OEMs, but Microsoft has finally reconciled Storage Server with the rest of the Windows Server world. I imagine that most OEMs will release Service Pack 2 updates for their Storage Server customers shortly, and that future versions of the product will come closer to the base Server versions than WSS 2008. Although I can&#8217;t share what I know, I will say that <strong>Microsoft is continuing active development</strong> on their iSCSI target, single-instance storage, and other Storage Server features. I imagine that <a href="http://blog.fosketts.net/2008/08/19/windows-7-server-windows-server-2008-r2/"  target="_blank">Windows Server 2008 R2</a> will support storage systems in the very near future!</p>
<p>On a personal note, reading <a href="http://blogs.technet.com/storageserver/archive/2008/06/09/a-brief-history-of-windows-storage-server-releases.aspx" >A Brief History of Windows Storage Server Releases</a> from the <a href="http://blogs.technet.com/StorageServer/"  target="_blank">Microsoft Storage Server blog</a> reminded me of the original unveiling of Windows Storage Server at Storage Decisions Chicago in June, 2003. The company loaded us all on one of those lake cruise boats with some celebrity impersonators, chocolate &#8220;Oscar&#8221;-style statues, an open bar, and a band. Good times ensued!</p>
<blockquote><p>For more details, check out my Gestalt IT piece, <a href="http://gestaltit.com/tech/stephen/windows-storage-server-2008/"  target="_blank">Windows Storage Server-Based Systems Step Into 2008</a></p>
<p>Microsoft is detailing the new version of Windows Storage Server 2008 in a <a rel="nofollow" href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?EventID=1032410705"  target="_blank">webcast Thursday at 8 AM Pacific</a>. You should also check out the <a href="http://blogs.technet.com/StorageServer/"  target="_blank">Microsoft Storage Server blog</a>.</p>
</blockquote>
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2009. |
<a href="http://blog.fosketts.net/2009/05/05/windows-storage-server-2008/">I Can Finally Talk About Windows Storage Server 2008!</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2009/05/05/windows-storage-server-2008/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Deduplication Coming to Primary Storage</title>
		<link>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/</link>
		<comments>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/#comments</comments>
		<pubDate>Tue, 16 Sep 2008 19:28:37 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Computer History]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Features]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[Atari]]></category>
		<category><![CDATA[Byte]]></category>
		<category><![CDATA[capacity optimization]]></category>
		<category><![CDATA[CAS]]></category>
		<category><![CDATA[Centera]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[data deduplication]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[DR-DOS]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[FilePool]]></category>
		<category><![CDATA[greenBytes]]></category>
		<category><![CDATA[Huffman coding]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[NetApp]]></category>
		<category><![CDATA[Riverbed]]></category>
		<category><![CDATA[single-instance storage]]></category>
		<category><![CDATA[Stacker]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[VTL]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=626</guid>
		<description><![CDATA[Although deduplication of storage is nothing new, with Data Domain and others making hay with the technique for years, it has never been ready for prime time - reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.]]></description>
			<content:encoded><![CDATA[<p style="padding-left: 30px;"><em>This is a follow-up to my story, <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">De-Duplication Goes Mainstream</a></em></p>
<p>Although deduplication of storage is nothing new, with Data Domain and others making hay with the technique for years, it has never been ready for prime time &#8211; reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.</p>
<h3>Stone Knives and Bear Skins</h3>
<p>We have all been here before, of course. Back at the dawn of the personal computer era, data compression was a hot topic of conversation. I recall being so impressed by an article in <a rel="nofollow" href="http://en.wikipedia.org/wiki/Byte_(magazine)"  target="_blank">Byte</a> (1986:5, p99) outlining <a rel="nofollow" href="http://en.wikipedia.org/wiki/Huffman_coding"  target="_blank">Huffman coding</a> that I tried cooking up an implementation in Atari BASIC. Lossless compression has a magical pull to the geek in many of us &#8211; redundant data just <em>wants</em> to be eliminated!</p>
<div id="attachment_630" class="wp-caption alignright" style="width: 254px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" ><img class="size-full wp-image-630 " title="Stacker" src="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" alt="Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0" width="244" height="254" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0</p></div>
<p>Companies soon applied <a href="http://www.zisman.ca/Articles/1993/DOS6.html"  target="_blank">compression to primary storage</a>, especially the limited storage in personal computers. <a rel="nofollow" href="http://en.wikipedia.org/wiki/Stac_Electronics#Microsoft_lawsuit"  target="_blank">Stacker</a> was a hit after 1990, until Microsoft built a workalike, called DoubleSpace, into DOS 6.0 in 1993, leading to a historic lawsuit. I personally used the ADDSTOR disk compression built into DR-DOS 6.0 to stretch two more years out of the 20 MB MFM hard drive in my AT&amp;T PC6300 at <a href="http://wpi.edu"  target="_blank">WPI</a>.</p>
<p>But something funny happened in the late 1990s: Compression began to lose its luster. Compressing data always takes quite a bit of CPU power, but this was offset somewhat by the truncated data transfers and more-efficient file system layout afforded in early PCs. But as disks got larger and faster, using precious CPU time to save space seemed less and less compelling. Today, although nearly every operating system includes built-in compression of files, folders, or perhaps disks, these features are rarely used. And compression was never popular in the performance-sensitive enterprise space.</p>
<h3><strong>Deduplication Has a Nice Ring</strong></h3>
<p>Although traditional fine-grained compression has not been very successful in the enterprise, its lanky cousin, single-instance storage, has long found niche jobs. Applications from databases to email systems to file servers have long had the ability to recognize requests to store the exact same file or record and to store just a single instance in that case. Even file systems can do single-instance storage through the use of links, though this is initiated by the user rather than automated.</p>
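<p>Here is a minimal sketch of that file-level idea, assuming a content hash (SHA-256 in this case) is what detects exact duplicates. The <code>SingleInstanceStore</code> class and its method names are hypothetical, made up for illustration, and do not describe how any product mentioned here actually works:</p>
<pre>
```python
import hashlib


class SingleInstanceStore:
    """Toy file-level single-instance store (hypothetical, illustration only)."""

    def __init__(self):
        self.blobs = {}  # content digest -> bytes, each distinct content kept once
        self.names = {}  # file name -> content digest

    def put(self, name, data):
        # Identical content always produces the same digest, so a second
        # copy only adds another name that points at the existing bytes.
        digest = hashlib.sha256(data).hexdigest()
        self.blobs.setdefault(digest, data)
        self.names[name] = digest
        return digest

    def get(self, name):
        # Follow the name to its digest, then the digest to the stored bytes.
        return self.blobs[self.names[name]]

    def stored_bytes(self):
        # Capacity actually consumed, counting each distinct content once.
        return sum(len(blob) for blob in self.blobs.values())
```
</pre>
<p>Storing the same bytes under two names keeps only one copy, but change a single byte and a second full copy is stored. That all-or-nothing behavior is why single-instancing tends to save less capacity than finer-grained approaches.</p>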
<p>In the late 1990s, FilePool began developing a <a rel="nofollow" href="http://en.wikipedia.org/wiki/Content-addressable_storage"  target="_blank">content-addressable storage</a> device, which was acquired by EMC in 2001. This device, later known as the Centera, was one of a number of storage platforms targeted at the archiving market introduced this decade. At the same time, <a rel="nofollow" href="http://en.wikipedia.org/wiki/Virtual_tape_library"  target="_blank">virtual tape libraries</a> made the jump from the mainframe to open systems. Both devices, being outside the critical path of performance but offering massive capacity, were well-suited to implement advanced <a rel="nofollow" href="http://en.wikipedia.org/wiki/Capacity_optimization"  target="_blank">capacity optimization</a> technologies that combined the concepts of compression with single-instance storage. Thus was created the modern world of data deduplication.</p>
<p>What we think of as deduplication is neither fish nor fowl: It assesses larger &#8220;chunks&#8221; of data than compression technologies, delivering greater capacity savings and potentially reducing performance impact, but is more flexible than single-instancing, recognizing the similarities within files or objects.</p>
<p>But it is still maddeningly difficult to scale deduplication while maintaining performance. Rather than fight to maintain reasonable write throughput, most deduplication products have switched to post-processing, deferring their work to quieter times.</p>
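<p>To make the &#8220;chunks&#8221; idea concrete, here is a rough sketch that assumes fixed-size chunks and a SHA-256 fingerprint per chunk. Both choices, along with the names <code>dedupe</code> and <code>rebuild</code>, are assumptions for illustration only; real products may chunk and index data quite differently:</p>
<pre>
```python
import hashlib

CHUNK_SIZE = 8  # bytes; unrealistically small so the example is easy to follow


def dedupe(data, chunk_size=CHUNK_SIZE):
    """Split data into fixed-size chunks and keep one copy of each distinct chunk.

    Returns (chunks, recipe): chunks maps digest to bytes, and recipe is the
    ordered list of digests needed to rebuild the original data.
    """
    chunks = {}
    recipe = []
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        chunks.setdefault(digest, chunk)  # store each distinct chunk only once
        recipe.append(digest)
    return chunks, recipe


def rebuild(chunks, recipe):
    """Reassemble the original bytes by following the recipe in order."""
    return b"".join(chunks[digest] for digest in recipe)
```
</pre>
<p>Unlike whole-file single-instancing, this finds repeated regions inside one object, and every write pays for a hash and an index lookup. A post-processing design would run the same loop later, over data that has already landed on disk.</p>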
<h3><strong>It&#8217;s Not Just for Breakfast</strong></h3>
<p>Regardless of their methods or underlying technology, however, no deduplication vendor has stood up to support challenging low-latency or high-throughput production applications. <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">NetApp was the first to raise the issue of support for production applications</a>, but although they tout the technology for VMware, they haven&#8217;t exactly been shouting from the rooftops to get their A-SIS deduplication technology deployed in other high-I/O applications. And I haven&#8217;t seen Hifn&#8217;s card yet.</p>
<p>Yesterday, I mentioned that greenBytes was adding deduplication to their ZFS-based storage array for primary data. And now <a href="http://www.theregister.co.uk/2008/09/16/deduplicating_primary_storage/"  target="_blank">Riverbed has fired another shot</a> over the bow, repurposing their (deduplicating) WAN accelerator product for primary (file) storage. They might be able to pull it off, too, since they have a long list of customers who are already enjoying the technology in production. It&#8217;s not a stretch to suggest that Riverbed&#8217;s appliances can scale to handle production data loads. Although it&#8217;s file-only, I can imagine quite a few scenarios where this tech could really yield benefits. Could we come full-circle, with deduplication finally reaching the enterprise storage world?</p>
<hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2008. |
<a href="http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/">Deduplication Coming to Primary Storage</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/computerhistory/" title="View all posts in Computer History" rel="category tag">Computer History</a>, <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>, <a href="http://blog.fosketts.net/category/features/" title="View all posts in Features" rel="category tag">Features</a>, <a href="http://blog.fosketts.net/category/everything/virtualstorage/" title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

