Finding “SlurpConfirm404″ in Your Logs? Here’s Why.
by
Anne P. Mitchell, Esq. - 5 Comments, Last updated
06/21/2010
|
Previous Article « RIAA Says Copying Music to Computer for Personal Use OK, Washington Post and Blogs Have it Wrong
Read Next Article » RIAA President Cary Sherman Says Very Clearly “We Are Not Interested in Going After People for Personal Use Copying” - Listen Here
| Tweet |
|
If you are finding “SlurpConfirm404″, “SlurpConfirm404.htm”, “SlurpConfirm404.html” or “SlurpConfirm404.php” in your log files, and can’t figure out why, you’re not alone. Here’s what that SlurpConfirm404 is all about.
First, Yahoo Slurp is what Yahoo calls their Yahoo Web Crawler - their website indexing engine that crawls around the world wide web, indexing (cataloguing) all of the websites, and all of the web pages on those websites.
Now, many search engines and other web indexers, including Google and Yahoo, are interested in knowing what happens when someone comes to your website and tries to find a page on your site which doesn’t actually exist. What sort of error or message does your website return? Typically this error is error number 404 - page not found, and these search engines want to know how your site handles these. They want to see a proper “404 - not found” response for this, and want to make sure that your site is not returning such other response.
When the Yahoo Slurp web crawler wants to test your site to see what your site does with a query for a non-existant page (which should return some sort of “404 - page not found” error), it asks for a page on your site called “SlurpConfirm404″, on the assumption that you won’t have such a page, and so it will be a good test of what your site returns for such a request. In other words, it’s Yahoo’s Slurp’s way of confirming a 404 response - hence “SlurpConfirm404″.
Sharing is Caring! Please help us get the word out by sharing this article on Facebook, Twitter, or Google+!
| Tweet |
|
Previous Article « RIAA Says Copying Music to Computer for Personal Use OK, Washington Post and Blogs Have it Wrong
Read Next Article » RIAA President Cary Sherman Says Very Clearly “We Are Not Interested in Going After People for Personal Use Copying” - Listen Here
You May Also Like:
Why Use the + When Searching on the Internet?
What Do Women, Thongs, and Search Engines Have in Common?
Anatomy of Link Exchanges for Hire - 2008 Style
Yahoo Messenger Phishing Scam Takes Victims to Yahoo Photos Website
Microsoft Spying on Internet Explorer Users Google Searches in Order to Enhance Bing Search Engine
For additional similar stories check out our archives on Yahoo
Well, it does hundreds of hits with different non existent pages… Does it expect to have diffetent 404 page ? Doesn’t make much sense to me…
The most popular I see are:
/SlurpConfirm404/robocopwebring/NonFramesHome.htm
/SlurpConfirm404/baystars.htm
/SlurpConfirm404/animalprints/Vacation_Sick_Time/clpa.htm
/SlurpConfirm404/islam/circuses.htm
Islam circuses? Are you sure it is !Yahoo?
Comment by Common Interview — 6/21/2010 @ 12:16 am
I’ve just added an entry to my Fail2Ban filters to block any address that 404’s on SlurpConfirm404.
Yahoo’s web crawler ignores my robots.txt so I’m going to block that crap.
Comment by Dougie — 6/18/2010 @ 9:26 am
Big Thanks, I was very concerned. Slurp sounds more like a slug than a web crawler. So I should instead be pleased that Yahoo is having a good look around my website.
Regards The Pink Bin Lady
Comment by Susan Hofgartner — 6/17/2010 @ 11:11 am
Gah, so yahoo like doing what seems to be just another wave of spam traffic to my site..
The IP does belong to Yahoo (in my logs).
Thanks Yahoo, for screwing with my logs!
Comment by Are — 6/17/2010 @ 5:54 am
If this is true, why isn’t it just a single hit? Why do I have hundreds of attempts to obtain /slurpconfirm/randomwebpagehere.html?
Thanks for adding wasted effort to my server, yahoo.
Comment by joe — 10/30/2008 @ 7:04 am