<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for All Things Hadoop</title>
	<atom:link href="http://allthingshadoop.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://allthingshadoop.com</link>
	<description>Scalable &#38; Distributed Computing for noobs, nerds and the elite Hadooper and Hadooperette.</description>
	<lastBuildDate>Sat, 21 May 2011 15:01:17 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>Comment on Faster Datanodes with less wait io using df instead of du by Edward Capriolo</title>
		<link>http://allthingshadoop.com/2011/05/20/faster-datanodes-with-less-wait-io-using-df-instead-of-du/#comment-151</link>
		<dc:creator><![CDATA[Edward Capriolo]]></dc:creator>
		<pubDate>Sat, 21 May 2011 15:01:17 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=446#comment-151</guid>
		<description><![CDATA[Good job! I mentioned this on the Hadoop mailing list, and someone pointed out that you can tune your VFS cache so it caches more inodes and less data (though I already have a 7 GB VFS cache). It is nice to see that someone else noticed this is a problem.]]></description>
		<content:encoded><![CDATA[<p>Good job! I mentioned this on the Hadoop mailing list, and someone pointed out that you can tune your VFS cache so it caches more inodes and less data (though I already have a 7 GB VFS cache). It is nice to see that someone else noticed this is a problem.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Cloudera, Yahoo and the Apache Hadoop Community Security Branch Release Update by Allen Wittenauer</title>
		<link>http://allthingshadoop.com/2011/05/05/cloudera-yahoo-and-the-apache-hadoop-community-security-branch-release-update/#comment-146</link>
		<dc:creator><![CDATA[Allen Wittenauer]]></dc:creator>
		<pubDate>Fri, 06 May 2011 04:36:24 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=423#comment-146</guid>
		<description><![CDATA[FWIW, LinkedIn runs base Apache 0.20.2 with 3-4 patches. These patches are fixes to the capacity scheduler and non-Linux portability (Mac OS X and Solaris). The &quot;huge team&quot; that builds and supports our production code base is me, either writing new code or grabbing patches from JIRA with some occasional help from the authors of those patches.  Now that Jakob is an employee, our next internal release might have two people that support it.  Two is not exactly huge. :D  (Most of the other work that LinkedIn does with Hadoop is on the periphery--Azkaban, Pig, etc.)

Anyway, it was important to me that LI run what Apache runs for a variety of reasons, the big one being to keep our options open.  In retrospect, this was a great decision given the incompatibilities that are now flooding into the ecosystem between these forks and trunk.  It will be interesting to see how the various vendors force upgrades on users to deal with them.

(I&#039;m working on dropping the phrase &quot;Hadoop distribution&quot; from my vocabulary.  They are forks, no matter how much marketing wants to say otherwise).

The other thing to keep in mind is that if one has been paying attention, the tension is not new.  These battle lines were drawn a long, long time ago.]]></description>
		<content:encoded><![CDATA[<p>FWIW, LinkedIn runs base Apache 0.20.2 with 3-4 patches. These patches are fixes to the capacity scheduler and non-Linux portability (Mac OS X and Solaris). The &#8220;huge team&#8221; that builds and supports our production code base is me, either writing new code or grabbing patches from JIRA with some occasional help from the authors of those patches.  Now that Jakob is an employee, our next internal release might have two people that support it.  Two is not exactly huge. <img src='http://s0.wp.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />   (Most of the other work that LinkedIn does with Hadoop is on the periphery&#8211;Azkaban, Pig, etc.)</p>
<p>Anyway, it was important to me that LI run what Apache runs for a variety of reasons, the big one being to keep our options open.  In retrospect, this was a great decision given the incompatibilities that are now flooding into the ecosystem between these forks and trunk.  It will be interesting to see how the various vendors force upgrades on users to deal with them.</p>
<p>(I&#8217;m working on dropping the phrase &#8220;Hadoop distribution&#8221; from my vocabulary.  They are forks, no matter how much marketing wants to say otherwise).</p>
<p>The other thing to keep in mind is that if one has been paying attention, the tension is not new.  These battle lines were drawn a long, long time ago.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Hadoop Streaming Made Simple using Joins and Keys with Python by [repost] Medialets Architecture &#8211; Defeating the Daunting Mobile Device Data Deluge » New IT Farmer</title>
		<link>http://allthingshadoop.com/2010/12/16/simple-hadoop-streaming-tutorial-using-joins-and-keys-with-python/#comment-141</link>
		<dc:creator><![CDATA[[repost] Medialets Architecture &#8211; Defeating the Daunting Mobile Device Data Deluge » New IT Farmer]]></dc:creator>
		<pubDate>Fri, 11 Mar 2011 13:48:21 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=355#comment-141</guid>
		<description><![CDATA[[...] Python &#8211; a lot of MapReduce is being moved into Python using Python streaming. [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Python &#8211; a lot of MapReduce is being moved into Python using Python streaming. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Hadoop Streaming Made Simple using Joins and Keys with Python by Quora</title>
		<link>http://allthingshadoop.com/2010/12/16/simple-hadoop-streaming-tutorial-using-joins-and-keys-with-python/#comment-130</link>
		<dc:creator><![CDATA[Quora]]></dc:creator>
		<pubDate>Thu, 06 Jan 2011 04:55:21 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=355#comment-130</guid>
		<description><![CDATA[&lt;strong&gt;What issues do developers generally face with Map-Reduce, the programming model, when they use Hadoop?...&lt;/strong&gt;

There are challenges specifically to MapReduce and then other challenges using Hadoop (which implements MapReduce) for developers, generally speaking. On the MapReduce side it is an entirely different thought process you need to have when attacking pro...]]></description>
		<content:encoded><![CDATA[<p><strong>What issues do developers generally face with Map-Reduce, the programming model, when they use Hadoop?&#8230;</strong></p>
<p>There are challenges specifically to MapReduce and then other challenges using Hadoop (which implements MapReduce) for developers, generally speaking. On the MapReduce side it is an entirely different thought process you need to have when attacking pro&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on NoSQL HBase and Hadoop with Todd Lipcon from Cloudera by NoSQL Daily &#8211; Thu Oct 14 &#8250; PHP App Engine</title>
		<link>http://allthingshadoop.com/2010/09/06/nosql-hbase-hadoop-todd-lipcon-cloudera/#comment-107</link>
		<dc:creator><![CDATA[NoSQL Daily &#8211; Thu Oct 14 &#8250; PHP App Engine]]></dc:creator>
		<pubDate>Thu, 14 Oct 2010 08:15:43 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=330#comment-107</guid>
		<description><![CDATA[[...] NoSQL HBase and Hadoop with Todd Lipcon from Cloudera &#171; All Things Hadoop [...]]]></description>
		<content:encoded><![CDATA[<p>[...] NoSQL HBase and Hadoop with Todd Lipcon from Cloudera &laquo; All Things Hadoop [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Running Hadoop MapReduce With Cassandra NoSQL by NoSQL Daily &#8211; Mon Sep 27 &#8250; PHP App Engine</title>
		<link>http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/#comment-104</link>
		<dc:creator><![CDATA[NoSQL Daily &#8211; Mon Sep 27 &#8250; PHP App Engine]]></dc:creator>
		<pubDate>Mon, 27 Sep 2010 08:15:50 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=185#comment-104</guid>
		<description><![CDATA[[...] Running Hadoop MapReduce With Cassandra NoSQL &#171; All Things Hadoop [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Running Hadoop MapReduce With Cassandra NoSQL &laquo; All Things Hadoop [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Podcast by Some podcasts &#171; Data story</title>
		<link>http://allthingshadoop.com/podcast/#comment-98</link>
		<dc:creator><![CDATA[Some podcasts &#171; Data story]]></dc:creator>
		<pubDate>Wed, 15 Sep 2010 11:35:41 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?page_id=144#comment-98</guid>
		<description><![CDATA[[...] Hadoop related podcasts: [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Hadoop related podcasts: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on NoSQL HBase and Hadoop with Todd Lipcon from Cloudera by NoSQL Daily &#8211; Tue Sep 7 &#8250; PHP App Engine</title>
		<link>http://allthingshadoop.com/2010/09/06/nosql-hbase-hadoop-todd-lipcon-cloudera/#comment-93</link>
		<dc:creator><![CDATA[NoSQL Daily &#8211; Tue Sep 7 &#8250; PHP App Engine]]></dc:creator>
		<pubDate>Tue, 07 Sep 2010 17:08:53 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=330#comment-93</guid>
		<description><![CDATA[[...] NoSQL HBase and Hadoop with Todd Lipcon from Cloudera &#171; All Thin&#8230; - &lt;4&gt; [...]]]></description>
		<content:encoded><![CDATA[<p>[...] NoSQL HBase and Hadoop with Todd Lipcon from Cloudera &laquo; All Thin&#8230; &#8211; &lt;4&gt; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Running Hadoop MapReduce With Cassandra NoSQL by Cassandra and Hadoop &#124; Mawazo</title>
		<link>http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/#comment-87</link>
		<dc:creator><![CDATA[Cassandra and Hadoop &#124; Mawazo]]></dc:creator>
		<pubDate>Mon, 02 Aug 2010 05:42:46 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=185#comment-87</guid>
		<description><![CDATA[[...] http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/ [...]]]></description>
		<content:encoded><![CDATA[<p>[...] <a href="http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/" rel="nofollow">http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Hadoop Cluster Setup, SSH Key Authentication by charmalloc</title>
		<link>http://allthingshadoop.com/2010/04/20/hadoop-cluster-setup-ssh-key-authentication/#comment-84</link>
		<dc:creator><![CDATA[charmalloc]]></dc:creator>
		<pubDate>Wed, 21 Jul 2010 16:30:43 +0000</pubDate>
		<guid isPermaLink="false">http://allthingshadoop.com/?p=133#comment-84</guid>
		<description><![CDATA[If you have not already posted to the user group, you should: http://hadoop.apache.org/mailing_lists.html]]></description>
		<content:encoded><![CDATA[<p>If you have not already posted to the user group, you should: <a href="http://hadoop.apache.org/mailing_lists.html" rel="nofollow">http://hadoop.apache.org/mailing_lists.html</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

