<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Is Slurp Behaving Badly?</title>
	<atom:link href="http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/</link>
	<description>SEO and Marketing Blog</description>
	<lastBuildDate>Sat, 20 Mar 2010 13:43:49 -0700</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Keith</title>
		<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/comment-page-1/#comment-15854</link>
		<dc:creator>Keith</dc:creator>
		<pubDate>Thu, 25 Jan 2007 22:22:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.bruceclay.com/blog-test/2007/01/is-slurp-behaving-badly/#comment-15854</guid>
		<description>&lt;p&gt;We&#039;ve also noticed Inktomi behaving badly (ignoring robots.txt and MORE.)&lt;/p&gt;
&lt;p&gt;As of today, we&#039;ve banned Inktomi, Yahoo, Slurp, and all their IPs permanently from hundreds of servers and thousands of domains.&lt;/p&gt;
&lt;p&gt;I don&#039;t know what they&#039;re doing, or who is in control -- but they are acting like the WORST of what we see operating on the Internet.
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>We&#8217;ve also noticed Inktomi behaving badly (ignoring robots.txt and MORE.)</p>
<p>As of today, we&#8217;ve banned Inktomi, Yahoo, Slurp, and all their IPs permanently from hundreds of servers and thousands of domains.</p>
<p>I don&#8217;t know what they&#8217;re doing, or who is in control &#8212; but they are acting like the WORST of what we see operating on the Internet.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: matt</title>
		<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/comment-page-1/#comment-15853</link>
		<dc:creator>matt</dc:creator>
		<pubDate>Fri, 19 Jan 2007 08:22:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.bruceclay.com/blog-test/2007/01/is-slurp-behaving-badly/#comment-15853</guid>
		<description>don&#039;t know much about this, but in trying to figure out who/what inktomi is i found your post.
they pop up (from sunnyvale, ca) as a regular visitor to my and many other bloggers i&#039;ve spoken to. we guessed everything from gov&#039;t spying to simple marketing (which is what it&#039;s beginning to look like)
also noticed with yahoo past couple months that my sign-in page always flashes &amp; duplicates itself as i&#039;m signing in... an odd event that never happened before
</description>
		<content:encoded><![CDATA[<p>don&#8217;t know much about this, but in trying to figure out who/what inktomi is i found your post.<br />
they pop up (from sunnyvale, ca) as a regular visitor to my and many other bloggers i&#8217;ve spoken to. we guessed everything from gov&#8217;t spying to simple marketing (which is what it&#8217;s beginning to look like)<br />
also noticed with yahoo past couple months that my sign-in page always flashes &#038; duplicates itself as i&#8217;m signing in&#8230; an odd event that never happened before</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andy Redfern</title>
		<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/comment-page-1/#comment-15852</link>
		<dc:creator>Andy Redfern</dc:creator>
		<pubDate>Wed, 17 Jan 2007 22:23:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.bruceclay.com/blog-test/2007/01/is-slurp-behaving-badly/#comment-15852</guid>
		<description>&lt;p&gt;I have seen the similar strange behaviour with Yahoo! but also I have noticed that slurp is spidering a couple of my URL&#039;s a large number of times. &lt;/p&gt;
&lt;p&gt;Anyone else noticing this?
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I have seen the similar strange behaviour with Yahoo! but also I have noticed that slurp is spidering a couple of my URL&#8217;s a large number of times. </p>
<p>Anyone else noticing this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Derrick Wheeler</title>
		<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/comment-page-1/#comment-15851</link>
		<dc:creator>Derrick Wheeler</dc:creator>
		<pubDate>Wed, 17 Jan 2007 21:57:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.bruceclay.com/blog-test/2007/01/is-slurp-behaving-badly/#comment-15851</guid>
		<description>One more thing...
I&#039;ve seen instances of Google also &quot;partially indexing&quot; pages that are excluded using robots.txt files and using the text from an inbound link as the &quot;Title&quot;
</description>
		<content:encoded><![CDATA[<p>One more thing&#8230;<br />
I&#8217;ve seen instances of Google also &#8220;partially indexing&#8221; pages that are excluded using robots.txt files and using the text from an inbound link as the &#8220;Title&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Derrick Wheeler</title>
		<link>http://www.bruceclay.com/blog/2007/01/is-slurp-behaving-badly/comment-page-1/#comment-15850</link>
		<dc:creator>Derrick Wheeler</dc:creator>
		<pubDate>Wed, 17 Jan 2007 21:49:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.bruceclay.com/blog-test/2007/01/is-slurp-behaving-badly/#comment-15850</guid>
		<description>&lt;p&gt;It could also be a case of other websites linking to Will&#039;s URLs with extra query strings in an attempt to generate duplicate content indexed.  Yahoo! might only be crawling the URLs with the extra query strings because someone else is linking to them.  Yahoo discovers the URLs through links and then makes a request for the URL.  A question for Yahoo is... At what point does Yahoo cross reference the URLs it is requesting with the exclusions listed in the robots.txt files? On a related note, I have seen recent examples of URLs that are excluded using the robots.txt file being &quot;partially indexed.&quot;  Meaning Yahoo discovered the URL but didnt actually index any content from the URL.  In cases like this, I&#039;ve noticed that Yahoo uses the anchor text of an inbound link as the &quot;Title&quot; of the link in the search results.  Danny Sullivan discusses this on his Search Engine Land website.
&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>It could also be a case of other websites linking to Will&#8217;s URLs with extra query strings in an attempt to generate duplicate content indexed.  Yahoo! might only be crawling the URLs with the extra query strings because someone else is linking to them.  Yahoo discovers the URLs through links and then makes a request for the URL.  A question for Yahoo is&#8230; At what point does Yahoo cross reference the URLs it is requesting with the exclusions listed in the robots.txt files? On a related note, I have seen recent examples of URLs that are excluded using the robots.txt file being &#8220;partially indexed.&#8221;  Meaning Yahoo discovered the URL but didnt actually index any content from the URL.  In cases like this, I&#8217;ve noticed that Yahoo uses the anchor text of an inbound link as the &#8220;Title&#8221; of the link in the search results.  Danny Sullivan discusses this on his Search Engine Land website.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
