<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"><channel><title>Disqus - Latest Comments for WillMacc</title><link>http://disqus.com/people/76fd4434b76b132df33b17a23337bc41/</link><description></description><language>en</language><lastBuildDate>Tue, 22 May 2007 14:41:33 -0000</lastBuildDate><item><title>Re: Scraping Starts from the Very First Post</title><link>http://plagiarismtoday.disqus.com/scraping_starts_from_the_very_first_post/#comment-1346945</link><description>The moving of the blog and all that came after really wasn't to see who was scraping; but it did provide somewhat confirmation that probably a lot of those still visiting the site and the site feed was more than likely bots that are still habitually visiting the site/feed daily.&lt;br&gt;I understand that people don't regularly check RSS readers - as in Google Feed Fetcher - but I would dare to guess that's only a small percentage of the visits.&lt;br&gt;Moving the blog also provided me a chance to see who/what visits the site; as where with my &lt;a href="http://WordPress.com" rel="nofollow"&gt;WordPress.com&lt;/a&gt; blog, I could only see hits and not the actual visit information.&lt;br&gt;Since the move; probably 10 crawlers have been shotdown from scraping the content of the blog, BUT, scraping a blog like mine isn't that big of a deal since it's the information on the blog that's important. So, if my content gets scrapped and ends up up another blog - fine; the information is still valid and people still get to see who's doing what with what and whom.. :)&lt;br&gt;&lt;br&gt;Thanks,&lt;br&gt;WillMacc</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">WillMacc</dc:creator><pubDate>Tue, 22 May 2007 14:30:53 -0000</pubDate></item><item><title>Re: Scraping Starts from the Very First Post</title><link>http://plagiarismtoday.disqus.com/scraping_starts_from_the_very_first_post/#comment-1346942</link><description>Also... :)&lt;br&gt;A lot of "pseudo" feeders are attached and monitor other ping services.&lt;br&gt;I've seen countless visits from known crawlers with "bad intentions" hit the site as soon as a ping is sent out.&lt;br&gt;If you have a blog hosted on your own domain, you can issue a ping (to only one service - say; pingomatic) and then sit back and watch who starts hitting the site.&lt;br&gt;You'll see quickly a boat load of crawlers come and a lot of them will not appear as crawlers, but as regular user-agents. If you follow the trends of the crawlers/visitors after a ping, you'll probably start noticing some visitors will not pull any graphics on the blog; or only pull one hit as where most visitors will have line upon line of various content, items, and graphics that's embedded into the blog themes and within the articles.&lt;br&gt;Those that do that are Usually bots and not legit users, but having said that, you'll have to be careful and pick out the rss readers from the bots and crawlers.&lt;br&gt;&lt;br&gt;Thanks,&lt;br&gt;WillMacc</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">WillMacc</dc:creator><pubDate>Tue, 22 May 2007 14:41:33 -0000</pubDate></item></channel></rss>