<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:series="http://unfoldingneurons.com/"
		>
<channel>
	<title>Comments on: Compression, Encryption, Deduplication, and Replication: Strange Bedfellows</title>
	<atom:link href="http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/</link>
	<description>Understanding the accumulation of data</description>
	<lastBuildDate>Fri, 19 Mar 2010 00:14:36 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
		<item>
		<title>By: Jered Floyd</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13988</link>
		<dc:creator>Jered Floyd</dc:creator>
		<pubDate>Fri, 27 Feb 2009 09:28:41 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13988</guid>
		<description>As David says, the right way to do this is deduplicate (or, at least, segment for deduplication), compress, encrypt, replicate (,wash, rinse, repeat).  --rsyncable totally works, but it&#039;s a bit of a hack... it&#039;s doing non-optimal segmentation for deduplication, and of course doesn&#039;t help unless you reset your encryption cipher on the same boundaries as well.  As you say, this makes the encryption somewhat less secure -- again, you ought to be doing your replication over an encrypted channel anyhow.&lt;br&gt;&lt;br&gt;At Permabit (&lt;a href=&quot;http://www.permabit.com&quot; rel=&quot;nofollow&quot;&gt;http://www.permabit.com&lt;/a&gt;), we incorporates all of these technologies in our Enterprise Archive product in this order for maximum benefit.   As data is being written an in-line process breaks files up in to variable-sized segments for optimal deduplication.  Then these segments are (optionally) compressed, (optionally) encrypted, deduplicated, and written to disk.  These compressed, encrypted chunks can be replicated, which is also done over an encrypted channel to eliminate traffic analysis.  This provides the best of all worlds.&lt;br&gt;&lt;br&gt;Regards,&lt;br&gt;  Jered Floyd&lt;br&gt;  CTO, Permabit</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->As David says, the right way to do this is deduplicate (or, at least, segment for deduplication), compress, encrypt, replicate (,wash, rinse, repeat).  &#8211;rsyncable totally works, but it&#39;s a bit of a hack&#8230; it&#39;s doing non-optimal segmentation for deduplication, and of course doesn&#39;t help unless you reset your encryption cipher on the same boundaries as well.  As you say, this makes the encryption somewhat less secure &#8212; again, you ought to be doing your replication over an encrypted channel anyhow.</p>
<p>At Permabit (<a href="http://www.permabit.com"  rel="nofollow">http://www.permabit.com</a>), we incorporates all of these technologies in our Enterprise Archive product in this order for maximum benefit.   As data is being written an in-line process breaks files up in to variable-sized segments for optimal deduplication.  Then these segments are (optionally) compressed, (optionally) encrypted, deduplicated, and written to disk.  These compressed, encrypted chunks can be replicated, which is also done over an encrypted channel to eliminate traffic analysis.  This provides the best of all worlds.</p>
<p>Regards,<br />  Jered Floyd<br />  CTO, Permabit<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jered Floyd</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13184</link>
		<dc:creator>Jered Floyd</dc:creator>
		<pubDate>Fri, 27 Feb 2009 04:28:41 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13184</guid>
		<description>As David says, the right way to do this is deduplicate (or, at least, segment for deduplication), compress, encrypt, replicate (,wash, rinse, repeat).  --rsyncable totally works, but it&#039;s a bit of a hack... it&#039;s doing non-optimal segmentation for deduplication, and of course doesn&#039;t help unless you reset your encryption cipher on the same boundaries as well.  As you say, this makes the encryption somewhat less secure -- again, you ought to be doing your replication over an encrypted channel anyhow.&lt;br&gt;&lt;br&gt;At Permabit (&lt;a href=&quot;http://www.permabit.com&quot; rel=&quot;nofollow&quot;&gt;http://www.permabit.com&lt;/a&gt;), we incorporates all of these technologies in our Enterprise Archive product in this order for maximum benefit.   As data is being written an in-line process breaks files up in to variable-sized segments for optimal deduplication.  Then these segments are (optionally) compressed, (optionally) encrypted, deduplicated, and written to disk.  These compressed, encrypted chunks can be replicated, which is also done over an encrypted channel to eliminate traffic analysis.  This provides the best of all worlds.&lt;br&gt;&lt;br&gt;Regards,&lt;br&gt;  Jered Floyd&lt;br&gt;  CTO, Permabit</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->As David says, the right way to do this is deduplicate (or, at least, segment for deduplication), compress, encrypt, replicate (,wash, rinse, repeat).  &#8211;rsyncable totally works, but it&#39;s a bit of a hack&#8230; it&#39;s doing non-optimal segmentation for deduplication, and of course doesn&#39;t help unless you reset your encryption cipher on the same boundaries as well.  As you say, this makes the encryption somewhat less secure &#8212; again, you ought to be doing your replication over an encrypted channel anyhow.</p>
<p>At Permabit (<a href="http://www.permabit.com"  rel="nofollow">http://www.permabit.com</a>), we incorporates all of these technologies in our Enterprise Archive product in this order for maximum benefit.   As data is being written an in-line process breaks files up in to variable-sized segments for optimal deduplication.  Then these segments are (optionally) compressed, (optionally) encrypted, deduplicated, and written to disk.  These compressed, encrypted chunks can be replicated, which is also done over an encrypted channel to eliminate traffic analysis.  This provides the best of all worlds.</p>
<p>Regards,<br />  Jered Floyd<br />  CTO, Permabit<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pete Steege</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13136</link>
		<dc:creator>Pete Steege</dc:creator>
		<pubDate>Tue, 10 Feb 2009 15:45:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13136</guid>
		<description>Hi Stephen,&lt;br&gt;Encryption seems to be evolving as a multi-level requirement.  Encryption of data at the end of the line with self-encrypting drives covers data at rest and avoids the dedupe issue. &lt;br&gt;&lt;br&gt;It&#039;s trickier, as you say, for encrpting data on the move.  When do you see a viable solution standardized?</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->Hi Stephen,<br />Encryption seems to be evolving as a multi-level requirement.  Encryption of data at the end of the line with self-encrypting drives covers data at rest and avoids the dedupe issue. </p>
<p>It&#39;s trickier, as you say, for encrpting data on the move.  When do you see a viable solution standardized?<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sfoskett</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13111</link>
		<dc:creator>sfoskett</dc:creator>
		<pubDate>Fri, 06 Feb 2009 09:06:37 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13111</guid>
		<description>That&#039;s the traditional way of doing it. But I&#039;m excited by the idea of doing deduplication AFTER compression and encryption using gzip-rsyncable and rsyncrypto!</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->That&#39;s the traditional way of doing it. But I&#39;m excited by the idea of doing deduplication AFTER compression and encryption using gzip-rsyncable and rsyncrypto!<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Slik</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13110</link>
		<dc:creator>David Slik</dc:creator>
		<pubDate>Fri, 06 Feb 2009 09:03:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13110</guid>
		<description>Compression, encryption, de-duplication, and replication can all coexist, you just need to do it in the right order.&lt;br&gt;&lt;br&gt;You first de-duplicate, then you compress, then you encrypt, and last of all, you replicate.&lt;br&gt;&lt;br&gt;And, of course, if you really care about your data, you take a has of it at the beginning so you can verify that after all those machinations, you&#039;re still getting your original data back at the end of the day.</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->Compression, encryption, de-duplication, and replication can all coexist, you just need to do it in the right order.</p>
<p>You first de-duplicate, then you compress, then you encrypt, and last of all, you replicate.</p>
<p>And, of course, if you really care about your data, you take a has of it at the beginning so you can verify that after all those machinations, you&#39;re still getting your original data back at the end of the day.<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sfoskett</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13105</link>
		<dc:creator>sfoskett</dc:creator>
		<pubDate>Fri, 06 Feb 2009 01:06:37 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13105</guid>
		<description>That&#039;s the traditional way of doing it. But I&#039;m excited by the idea of doing deduplication AFTER compression and encryption using gzip-rsyncable and rsyncrypto!</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->That&#39;s the traditional way of doing it. But I&#39;m excited by the idea of doing deduplication AFTER compression and encryption using gzip-rsyncable and rsyncrypto!<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Slik</title>
		<link>http://blog.fosketts.net/2009/02/05/compression-encryption-deduplication-replication/comment-page-1/#comment-13104</link>
		<dc:creator>David Slik</dc:creator>
		<pubDate>Fri, 06 Feb 2009 01:03:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.fosketts.net/?p=1396#comment-13104</guid>
		<description>Compression, encryption, de-duplication, and replication can all coexist, you just need to do it in the right order.&lt;br&gt;&lt;br&gt;You first de-duplicate, then you compress, then you encrypt, and last of all, you replicate.&lt;br&gt;&lt;br&gt;And, of course, if you really care about your data, you take a has of it at the beginning so you can verify that after all those machinations, you&#039;re still getting your original data back at the end of the day.</description>
		<content:encoded><![CDATA[<p><!-- google_ad_section_start -->Compression, encryption, de-duplication, and replication can all coexist, you just need to do it in the right order.</p>
<p>You first de-duplicate, then you compress, then you encrypt, and last of all, you replicate.</p>
<p>And, of course, if you really care about your data, you take a has of it at the beginning so you can verify that after all those machinations, you&#39;re still getting your original data back at the end of the day.<!-- google_ad_section_end --></p>
]]></content:encoded>
	</item>
</channel>
</rss>
