<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:series="http://unfoldingneurons.com/"
	>

<channel>
	<title>Stephen Foskett, Pack Rat &#187; Atari Archives  &#8211; Stephen Foskett, Pack Rat</title>
	<atom:link href="http://blog.fosketts.net/tag/atari/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.fosketts.net</link>
	<description>Understanding the accumulation of data</description>
	<lastBuildDate>Fri, 10 Feb 2012 17:40:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
			<item>
		<title>Deduplication Coming to Primary Storage</title>
		<link>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/</link>
		<comments>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/#comments</comments>
		<pubDate>Tue, 16 Sep 2008 19:28:37 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Computer History]]></category>
		<category><![CDATA[Enterprise storage]]></category>
		<category><![CDATA[Features]]></category>
		<category><![CDATA[Virtual Storage]]></category>
		<category><![CDATA[Atari]]></category>
		<category><![CDATA[Byte]]></category>
		<category><![CDATA[capacity optimization]]></category>
		<category><![CDATA[CAS]]></category>
		<category><![CDATA[Centera]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[data deduplication]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[DR-DOS]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[FilePool]]></category>
		<category><![CDATA[greenBytes]]></category>
		<category><![CDATA[Huffman coding]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[NetApp]]></category>
		<category><![CDATA[Riverbed]]></category>
		<category><![CDATA[single-instance storage]]></category>
		<category><![CDATA[Stacker]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[VTL]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/?p=626</guid>
		<description><![CDATA[Although deduplication of storage is nothing new, with Data Domain and other making hay with the technique for years, it has never been ready for prime time - reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.]]></description>
			<content:encoded><![CDATA[<p style="padding-left: 30px;"><em>This is a follow-up to my story, <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">De-Duplication Goes Mainstream</a></em></p>
<p>Although deduplication of storage is nothing new, with Data Domain and other making hay with the technique for years, it has never been ready for prime time &#8211; reduction of active primary storage applications like email and databases. Instead, deduplication has been relegated to second- or third-tier status, deduplicating archives and backup data. But change is in the air, and deduplication vendors are starting to bustle towards the bright lights of primary storage.</p>
<h3>Stone Knives and Bear Skins</h3>
<p>We have all been here before, of course. Back at the dawn of the personal computer era, data compression was a hot topic of conversation. I recall being so impressed by an article in <a rel="nofollow" href="http://en.wikipedia.org/wiki/Byte_(magazine)"  target="_blank">Byte</a> (1986:5, p99) outlining <a rel="nofollow" href="http://en.wikipedia.org/wiki/Huffman_coding"  target="_blank">Huffman coding</a> that I tried cooking up an implementation in Atari BASIC. Lossless compression has a magical pull to the geek in many of us &#8211; redundant data just <em>wants</em> to be eliminated!</p>
<div id="attachment_630" class="wp-caption alignright" style="width: 254px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" ><img class="size-full wp-image-630 " title="Stacker" src="http://blog.fosketts.net/wp-content/uploads/2008/09/sc0003b3d4.png" alt="Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0" width="244" height="254" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Stacker dominated the disk compression world - until Microsoft introduced DOS 6.0</p></div>
<p>Companies soon applied <a href="http://www.zisman.ca/Articles/1993/DOS6.html"  target="_blank">compression to primary storage</a>, especially the limited storage in personal computers. <a rel="nofollow" href="http://en.wikipedia.org/wiki/Stac_Electronics#Microsoft_lawsuit"  target="_blank">Stacker</a> was a hit after 1990, until Microsoft built a workalike, called DoubleSpace, into DOS 6.0 in 1993, leading to a historical lawsuit. I personally used the ADDSTOR disk compression built into DR-DOS 6.0 to stretch two more years out of the 20 MB MFM hard drive in my AT&amp;T PC6300 at <a href="http://wpi.edu"  target="_blank">WPI</a>.</p>
<p>But something funny happened in the late 1990s: Compression began to lose its luster. Compressing data always takes quite a bit of CPU power, but this was offset somewhat by the truncated data transfers and more-efficient file system layout afforded in early PCs. But as disks got larger and faster, using precious CPU time to save space seemed less and less compelling. Today, although nearly every operating system includes built-in compression of files, folders, or perhaps disks, these features are rarely used. And compression was never popular in the performance-sensitive enterprise space.</p>
<h3><strong>Deduplication Has a Nice Ring</strong></h3>
<p>Although traditional fine-grained compression has not been very successful in the enterprise, its lanky cousin, single-instance storage, has long found niche jobs. Applications from databases to email systems to file servers have long had the ability to recognize to requests to store the exact same file or record, and to store just a single instance in this case. Even file systems have the ability to do single instance storage through the use of links, though this is initiated by the user rather than in an automated fashion.</p>
<p>In the late 1990s, FilePool began developing a <a rel="nofollow" href="http://en.wikipedia.org/wiki/Content-addressable_storage"  target="_blank">content-addressable storage</a> device, which was acquired by EMC in 2001. This device, later known as the Centera, was one of a number of storage platforms targeted at the archiving market introduced this decade. At the same time, <a rel="nofollow" href="http://en.wikipedia.org/wiki/Virtual_tape_library"  target="_blank">virtual tape libraries</a> made the jump from the mainframe to open systems. Both devices, being outside the critical path of performance but offering massive capacity, were well-suited to implement advanced <a rel="nofollow" href="http://en.wikipedia.org/wiki/Capacity_optimization"  target="_blank">capacity optimization</a> technologies that combined the concepts of compression with single-instance storage. Thus was created the modern world of data deduplication.</p>
<p>What we think of as deduplication is neither fish nor fowl: It assesses larger &#8220;chunks&#8221; of data than compression technologies, delivering greater capacity savings and potentially reducing performance impact, but is more flexible than single-instancing, recognizing the similarities within files or objects.</p>
<p>But it is still maddeningly difficult to scale deduplication while maintaining performance. Rather than fight to maintain reasonable write throughput, most deduplication products have switched to post-processing, deferring their work to quieter times.</p>
<h3><strong>It&#8217;s Not Just for Breakfast</strong></h3>
<p>Regardless of their methods or underlying technology, no deduplication vendor has stood up to support challenging low-latency or high-throughput production applications, however. <a href="http://blog.fosketts.net/2008/03/12/de-duplication-goes-mainstream/"  target="_self">NetApp was the first to raise the issue of support for production applications</a>, but although they tout the technology for VMware, they haven&#8217;t exactly been shouting from the rooftops to get their A-SIS deduplication technology deployed in other high-I/O applications. And I haven&#8217;t seen Hifn&#8217;s card yet.</p>
<p>Yesterday, I mentioned that greenBytes was adding deduplication to their ZFS-based storage array for primary data. And now <a href="http://www.theregister.co.uk/2008/09/16/deduplicating_primary_storage/"  target="_blank">Riverbed has fired another shot</a> over the bow, repurposing their (deduplicating) WAN accelerator product for primary (file) storage. They might be able to pull it off, too, since they have a long list of customers who are already enjoying the technology in production. It&#8217;s not a stretch to suggest that Riverbed&#8217;s appliances can scale to handle production data loads. Although it&#8217;s file-only, I can imagine quite a few scenarios where this tech could really yield benefits. Could we come full-circle, with deduplication finally reaching the enterprise storage world?</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://blog.fosketts.net/2008/09/25/deduplication-ready-prime-time/"  rel="bookmark" class="crp_title">Is Deduplication Ready for Prime Time?</a></li><li><a href="http://blog.fosketts.net/2011/09/22/data-reduction-condensed-version/"  rel="bookmark" class="crp_title">Data Reduction: the Condensed Version</a></li><li><a href="http://blog.fosketts.net/2008/09/15/greenbytes-embraces-extends-zfs/"  rel="bookmark" class="crp_title">greenBytes Embraces and Extends ZFS</a></li><li><a href="http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/"  rel="bookmark" class="crp_title">Compression, Encryption, Deduplication, and Replication: Strange Bedfellows</a></li><li><a href="http://blog.fosketts.net/2011/05/27/storage-decisions-chicago/"  rel="bookmark" class="crp_title">Storage Decisions Chicago: All About Capacity Optimization</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2008. |
<a href="http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/">Deduplication Coming to Primary Storage</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/computerhistory/" title="View all posts in Computer History" rel="category tag">Computer History</a>, <a href="http://blog.fosketts.net/category/everything/enterprisestorage/" title="View all posts in Enterprise storage" rel="category tag">Enterprise storage</a>, <a href="http://blog.fosketts.net/category/features/" title="View all posts in Features" rel="category tag">Features</a>, <a href="http://blog.fosketts.net/category/everything/virtualstorage/" title="View all posts in Virtual Storage" rel="category tag">Virtual Storage</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2008/09/16/deduplication-primary-storage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>These computers are not junk!</title>
		<link>http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/</link>
		<comments>http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/#comments</comments>
		<pubDate>Fri, 22 Jun 2007 01:45:10 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[Computer History]]></category>
		<category><![CDATA[Everything]]></category>
		<category><![CDATA[Terabyte home]]></category>
		<category><![CDATA[3Com]]></category>
		<category><![CDATA[Atari]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[Linksys]]></category>
		<category><![CDATA[TiVo]]></category>

		<guid isPermaLink="false">http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/</guid>
		<description><![CDATA[Marc Farley challenged us storage geeks to wave our junk flags and reveal just how many computers we have. Well, I&#8217;m really not sure&#8230; Let&#8217;s see &#8211; in order of usefulness&#8230; Networked: Dell XPS M1210 laptop (killer!) with 120GB internal and 500 GB Western Digital My Book backup drive TiVo Series 3 with 250 GB [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.equallogic.com/blog/default.aspx?id=3212"  target="_blank">Marc Farley</a> challenged us storage geeks to wave our junk flags and reveal just how many computers we have.  Well, I&#8217;m really not sure&#8230;  Let&#8217;s see &#8211; in order of usefulness&#8230;</p>
<p>Networked:</p>
<ol>
<li><a rel="nofollow" href="http://www.dell.com/content/products/productdetails.aspx/xpsnb_m1210?c=us&amp;l=en&amp;s=dhs&amp;cs=19"  target="_blank">Dell XPS M1210</a> laptop (killer!) with 120GB internal and 500 GB <a href="http://www.wdc.com/en/products/products.asp?driveid=276&amp;language=en"  target="_blank">Western Digital My Book</a> backup drive</li>
<li><a href="http://dynamic.tivo.com/2.0.boxdetails.asp?box=series3HDDVR"  target="_blank">TiVo Series 3</a> with 250 GB (yeah yeah not yet upgraded)</li>
<li>TiVo Series 2 with 140 GB</li>
<li>Homebrew <a href="http://www.anandtech.com/cpuchipsets/showdoc.html?i=1622"  target="_blank">Celeron 4</a> desktop with 320 GB</li>
<li>Wife&#8217;s <a rel="nofollow" href="http://h10025.www1.hp.com/ewfrf/wc/genericDocument?docname=c00846711&amp;cc=us&amp;lc=en&amp;dlc=en&amp;dlc=en&amp;lang=en"  target="_blank">Compaq laptop</a> with 100 GB</li>
<li>Sa-weet <a rel="nofollow" href="http://www.google.com/url?sa=t&amp;ct=res&amp;cd=2&amp;url=http%3A%2F%2Fwww.hp.com%2Fhpinfo%2Fnewsroom%2Fpress_kits%2F2006%2Fipgconf%2Fds_c6180aio.pdf&amp;ei=uQl8Rs7WOomKgQKJlcGCAw&amp;usg=AFQjCNFumeKR7rVXNrtbU0-xWSJ6Nkq1Cw&amp;sig2=LOqtXGnrkaC_pDjQIPJozA"  target="_blank">HP Photosmart</a> all-in-one with no storage (but it&#8217;s networked!)</li>
<li><a rel="nofollow" href="http://en.wikipedia.org/wiki/WRT54G"  target="_blank">Linksys 54G</a> running <a href="http://www.polarcloud.com/tomato"  target="_blank">Tomato</a> (16 MB flasher!!)</li>
<li>2 <a rel="nofollow" href="http://en.wikipedia.org/wiki/3Com_Audrey"  target="_blank">3Com Audreys</a> (16 MB flash bay-bee!)</li>
<li>Old junker laptop with 10 GB</li>
<li><a rel="nofollow" href="http://en.wikipedia.org/wiki/Virgin_Webplayer"  target="_blank">Virgin Webplayer</a> Internet appliance with 64 MB disk-on-chip</li>
<li><a href="http://www.gweep.net/~sfoskett/linux/p3010.html"  target="_blank">Toshiba Portege</a> booting from a 4 GB CompactFlash disk</li>
<li>Homebrew <a rel="nofollow" href="http://en.wikipedia.org/wiki/AMD_K6"  target="_blank">AMD K6</a> system with 20 GB</li>
</ol>
<p>So my network has up to 11 devices on it&#8230;  Interesting!</p>
<p>Now for the rest of the machines:</p>
<ol>
<li>40 GB <a rel="nofollow" href="http://en.wikipedia.org/wiki/IPod"  target="_blank">iPod</a></li>
<li>Nomad Jukebox with 20 GB</li>
<li><a href="http://atari-ste.anvil-soft.com/html/factsmste.htm"  target="_blank">Atari MegaSTE</a> with 40 MB</li>
<li><a href="http://www.old-computers.com/museum/computer.asp?c=25"  target="_blank">Atari 1040STFM</a> with a floppy</li>
<li><a href="http://www.atarimuseum.com/computers/8BITS/XL/800xl/800xl.htm"  target="_blank">Atari 800XL</a> with a floppy</li>
<li>Oldest junkiest <a rel="nofollow" href="http://findarticles.com/p/articles/mi_m1563/is_n6_v9/ai_10737669"  target="_blank">Dell 382SX-20 laptop</a> with 20 MB</li>
<li><a href="http://www.old-computers.com/MUSEUM/computer.asp?c=531"  target="_blank">AT&amp;T PC6300</a> with 20 MB (my first PC)</li>
<li>Broken <a href="http://lowendmac.com/compact/se.shtml"  target="_blank">Mac SE</a></li>
</ol>
<p>There&#8217;s at least a terabyte and a half right there&#8230; Plus my collection of odd hard disks &#8211; just today I was marveling at the fact that I have 50 MB, 500 MB, 5 GB, 50 GB, and 500 GB hard disks!</p>
<p>Yeah, we&#8217;re nuts.</p>
<div id="crp_related"><h3>You might also want to read these other posts...</h3><ul><li><a href="http://blog.fosketts.net/2008/06/12/switch-or-how-the-mac-finally-won-me-over/"  rel="bookmark" class="crp_title">Switch! or How the Mac (Finally) Won Me Over</a></li><li><a href="http://blog.fosketts.net/2007/07/24/tivo-hd-arrives/"  rel="bookmark" class="crp_title">TiVo HD Arrives</a></li><li><a href="http://blog.fosketts.net/2007/07/20/where-is-linux-in-storage/"  rel="bookmark" class="crp_title">Where is Linux in Storage?</a></li><li><a href="http://blog.fosketts.net/2007/06/21/my-terabyte-house/"  rel="bookmark" class="crp_title">My terabyte house</a></li><li><a href="http://blog.fosketts.net/2007/06/20/what-good-is-the-tivo-series-3-aspect-button/"  rel="bookmark" class="crp_title">What good is the TiVo Series 3 Aspect button?</a></li></ul></div><script src="http://feeds.feedburner.com/~s/sfoskett?i=http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/" type="text/javascript" charset="utf-8"></script><hr />
<p><small>© sfoskett for <a href="http://blog.fosketts.net">Stephen Foskett, Pack Rat</a>, 2007. |
<a href="http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/">These computers are not junk!</a>
<br/>
This post was categorized as <a href="http://blog.fosketts.net/category/everything/computerhistory/" title="View all posts in Computer History" rel="category tag">Computer History</a>, <a href="http://blog.fosketts.net/category/everything/" title="View all posts in Everything" rel="category tag">Everything</a>, <a href="http://blog.fosketts.net/category/everything/terabytehome/" title="View all posts in Terabyte home" rel="category tag">Terabyte home</a>. Each of my categories has its own feed if you'd like to filter out or focus on posts like this.<br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://blog.fosketts.net/2007/06/21/these-computers-are-not-junk/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

