<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Sorting large files faster with a shell script</title>
	<atom:link href="http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/feed" rel="self" type="application/rss+xml" />
	<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html</link>
	<description>Learn the UNIX/Linux command line</description>
	<lastBuildDate>Mon, 06 Jun 2011 01:35:32 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Daniel</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-41294</link>
		<dc:creator>Daniel</dc:creator>
		<pubDate>Thu, 17 Jun 2010 20:03:19 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-41294</guid>
		<description>Oops, that sort line should be

sort $file  (remove the -o $file)</description>
		<content:encoded><![CDATA[<p>Oops, that sort line should be</p>
<p>sort $file  (remove the -o $file)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-41293</link>
		<dc:creator>Daniel</dc:creator>
		<pubDate>Thu, 17 Jun 2010 20:02:02 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-41293</guid>
		<description>This script runs into problems if the file is not large enough to be split as requested.

I added the following lines to my copy:

if [ $size -lt 10485760 ]
then
   sort -o $file $file
   exit
fi

While the above script works great on large files, I just wanted to take the easy way out with

for FILE in $FILES; do distsort.sh $FILE &gt; $FILE.new
done</description>
		<content:encoded><![CDATA[<p>This script runs into problems if the file is not large enough to be split as requested.</p>
<p>I added the following lines to my copy:</p>
<p>if [ $size -lt 10485760 ]<br />
then<br />
   sort -o $file $file<br />
   exit<br />
fi</p>
<p>While the above script works great on large files, I just wanted to take the easy way out with</p>
<p>for FILE in $FILES; do distsort.sh $FILE &gt; $FILE.new<br />
done</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joel</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-30189</link>
		<dc:creator>Joel</dc:creator>
		<pubDate>Mon, 27 Apr 2009 23:57:28 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-30189</guid>
		<description>Thank you! I just applied this to ~80 million lines of data (700MB in bz2), sorted in 40 minutes on 16 machines.</description>
		<content:encoded><![CDATA[<p>Thank you! I just applied this to ~80 million lines of data (700MB in bz2), sorted in 40 minutes on 16 machines.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: admin</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-2366</link>
		<dc:creator>admin</dc:creator>
		<pubDate>Tue, 22 Jan 2008 02:56:29 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-2366</guid>
		<description>Good points, guys. I think if you have multiple processors, the -C option makes sense. However, since compression is also CPU intensive, tests would need to be done to see whether it makes more sense to use -C or to just fire up an additional local sort process.</description>
		<content:encoded><![CDATA[<p>Good points, guys. I think if you have multiple processors, the -C option makes sense. However, since compression is also CPU intensive, tests would need to be done to see whether it makes more sense to use -C or to just fire up an additional local sort process.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yves Junqueira</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-2359</link>
		<dc:creator>Yves Junqueira</dc:creator>
		<pubDate>Tue, 22 Jan 2008 01:59:09 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-2359</guid>
		<description>@Brandon,

Good point. Even if &#039;sort&#039; is CPU-bound and SSH would be competing for processor cycles with the local &#039;sort&#039; process, that may not be a problem in the end, since the local sort probably finishes much sooner than the others, so there is local CPU power to spare.

And if you have SMP, ssh -C makes even more sense.</description>
		<content:encoded><![CDATA[<p>@Brandon,</p>
<p>Good point. Even if &#8216;sort&#8217; is CPU-bound and SSH would be competing for processor cycles with the local &#8216;sort&#8217; process, that may not be a problem in the end, since the local sort probably finishes much sooner than the others, so there is local CPU power to spare.</p>
<p>And if you have SMP, ssh -C makes even more sense.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brandon</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-2167</link>
		<dc:creator>Brandon</dc:creator>
		<pubDate>Sat, 19 Jan 2008 14:43:29 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-2167</guid>
		<description>You may get even better performance if you transmit your data compressed (use the -C option to ssh).</description>
		<content:encoded><![CDATA[<p>You may get even better performance if you transmit your data compressed (use the -C option to ssh).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pages tagged "real"</title>
		<link>http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html/comment-page-1#comment-2146</link>
		<dc:creator>Pages tagged "real"</dc:creator>
		<pubDate>Sat, 19 Jan 2008 04:16:50 +0000</pubDate>
		<guid isPermaLink="false">http://bashcurescancer.com/sorting-large-files-faster-with-a-shell-script.html#comment-2146</guid>
		<description>[...] Sorting large files faster with a shell script. xmiszBubblez bookmarked on 01/18/08, saved by 1 others [...]</description>
		<content:encoded><![CDATA[<p>[...] Sorting large files faster with a shell script. xmiszBubblez bookmarked on 01/18/08, saved by 1 others [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>


