<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Advances in Crawling the Web</title>
	<atom:link href="http://www.semclubhouse.com/advances-in-crawling-the-web/feed" rel="self" type="application/rss+xml" />
	<link>http://www.semclubhouse.com/advances-in-crawling-the-web/</link>
	<description></description>
	<pubDate>Sat, 11 Oct 2008 21:30:51 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Links of the Week March 7th, 2008 &#124; .eduGuru</title>
		<link>http://www.semclubhouse.com/advances-in-crawling-the-web/#comment-64</link>
		<dc:creator>Links of the Week March 7th, 2008 &#124; .eduGuru</dc:creator>
		<pubDate>Fri, 07 Mar 2008 16:38:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.semclubhouse.com/advances-in-crawling-the-web.html#comment-64</guid>
		<description>[...] Advances in Crawling the Web - There are 3 major parts to what a search engine does. What are some of the problems facing crawling programs? [...]</description>
		<content:encoded><![CDATA[<p>[...] Advances in Crawling the Web - There are 3 major parts to what a search engine does. What are some of the problems facing crawling programs? [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill</title>
		<link>http://www.semclubhouse.com/advances-in-crawling-the-web/#comment-63</link>
		<dc:creator>Bill</dc:creator>
		<pubDate>Tue, 04 Mar 2008 15:13:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.semclubhouse.com/advances-in-crawling-the-web.html#comment-63</guid>
		<description>You're welcome, James.

Funny, taking down your own site.  Guess it was a good thing that it wasn't someone elses. Thanks for sharing that story. I like the old usenet posts which talk about the early days of crawlers, and people trying to figure out why others were spending so much time grabbing pages from their sites.

There are a lot of elements to what makes a search engine work that go on behind the scenes, so it's great when someone provides as much depth and detail as the Texas A&#38;M researchers did in their paper.</description>
		<content:encoded><![CDATA[<p>You&#8217;re welcome, James.</p>
<p>Funny, taking down your own site.  Guess it was a good thing that it wasn&#8217;t someone elses. Thanks for sharing that story. I like the old usenet posts which talk about the early days of crawlers, and people trying to figure out why others were spending so much time grabbing pages from their sites.</p>
<p>There are a lot of elements to what makes a search engine work that go on behind the scenes, so it&#8217;s great when someone provides as much depth and detail as the Texas A&amp;M researchers did in their paper.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://www.semclubhouse.com/advances-in-crawling-the-web/#comment-62</link>
		<dc:creator>James</dc:creator>
		<pubDate>Mon, 03 Mar 2008 14:12:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.semclubhouse.com/advances-in-crawling-the-web.html#comment-62</guid>
		<description>I built myself a simple crawler and learnt most of this along the way... The problems you com up against are really interesting.
I like the politeness one....for my first attempt I took down my own site by using multi-curl....ha ha ahh the lessons we learn.

Nice article, thanks....</description>
		<content:encoded><![CDATA[<p>I built myself a simple crawler and learnt most of this along the way&#8230; The problems you com up against are really interesting.<br />
I like the politeness one&#8230;.for my first attempt I took down my own site by using multi-curl&#8230;.ha ha ahh the lessons we learn.</p>
<p>Nice article, thanks&#8230;.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
