Finding “SlurpConfirm404” in Your Logs? Here’s Why.

If you are finding “SlurpConfirm404”, “SlurpConfirm404.htm”, “SlurpConfirm404.html” or “SlurpConfirm404.php” in your log files, and can’t figure out why, you’re not alone. Here’s what that SlurpConfirm404 is all about.

First, Yahoo Slurp is the name Yahoo gives its web crawler: the indexing engine that crawls the World Wide Web, indexing (cataloguing) websites and the web pages on those websites.

Now, many search engines and other web indexers, including Google and Yahoo, are interested in knowing what happens when someone comes to your website and requests a page that doesn’t actually exist. What sort of error or message does your website return? The correct response is HTTP status 404, “page not found”, and these search engines want to know whether your site handles such requests that way. They want to see a proper “404 not found” response, and want to make sure that your site is not returning some other response, such as a normal “200 OK” page or a redirect to your home page.

When the Yahoo Slurp web crawler wants to see what your site does with a request for a nonexistent page (which should return some sort of “404 page not found” error), it asks for a page on your site called “SlurpConfirm404”, on the assumption that you won’t have such a page, so the request makes a good test of what your site returns. In other words, it’s Slurp’s way of confirming a 404 response, hence “SlurpConfirm404”.
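If you want to see what your own site returns for such a request, the check can be sketched in a few lines of Python. This is only an illustration of the idea, not Yahoo’s actual test: the function names and the “ok” / “soft-404” / “other” labels are made up for this example, and treating 410 (“gone”) as an acceptable answer is an assumption.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=10.0):
    """Return the final HTTP status code the server sends for `url`.

    urllib follows redirects, so a site that redirects missing pages
    to its home page will report the home page's status (usually 200).
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx and 5xx responses are raised as HTTPError; the code is what we want.
        return err.code


def classify_missing_page_status(status):
    """Label the status a site returned for a page that should not exist.

    The labels are hypothetical names chosen for this sketch.
    """
    if status in (404, 410):
        return "ok"        # a proper "not found" (or "gone") response
    if 200 <= status < 300:
        return "soft-404"  # a missing page answered as if it existed
    return "other"         # e.g. a 403 or a 5xx server error
```

To try it, call `classify_missing_page_status(fetch_status("https://example.com/SlurpConfirm404"))`, replacing the placeholder `example.com` with your own domain. A result of “ok” means your site answers the way the crawler hopes.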

6 thoughts on “Finding “SlurpConfirm404” in Your Logs? Here’s Why.”

  1. # Now in my .htaccess :
    SetEnvIf Request_URI "/SlurpConfirm404$" bad_bot

    Order Allow,Deny
    Allow from all
    Deny from env=bad_bot

    # Bye-bye Yahoo_Slurp

  2. Well, it does hundreds of hits with different nonexistent pages… Does it expect to get a different 404 page each time? Doesn’t make much sense to me…

    The most popular I see are:

    /SlurpConfirm404/robocopwebring/NonFramesHome.htm
    /SlurpConfirm404/baystars.htm
    /SlurpConfirm404/animalprints/Vacation_Sick_Time/clpa.htm
    /SlurpConfirm404/islam/circuses.htm

    Islam circuses? Are you sure it is !Yahoo?

  3. I’ve just added an entry to my Fail2Ban filters to block any address that 404’s on SlurpConfirm404.

    Yahoo’s web crawler ignores my robots.txt so I’m going to block that crap.

  4. Big Thanks, I was very concerned. Slurp sounds more like a slug than a web crawler. So I should instead be pleased that Yahoo is having a good look around my website.
    Regards The Pink Bin Lady

  5. Gah, so Yahoo likes doing what seems to be just another wave of spam traffic to my site..

    The IP does belong to Yahoo (in my logs).

    Thanks Yahoo, for screwing with my logs!

  6. If this is true, why isn’t it just a single hit? Why do I have hundreds of attempts to obtain /slurpconfirm/randomwebpagehere.html?

    Thanks for adding wasted effort to my server, yahoo.
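For readers who, like several commenters above, want to know how many of these probes they are receiving and from which addresses, here is a minimal Python sketch that tallies them. It assumes your access log uses the Apache Common or Combined Log Format; the function name `count_slurp_probes` and the case-insensitive match on “slurpconfirm404” are choices made for this example.

```python
import re
from collections import Counter

# Start of a Common/Combined Log Format line:
# client address, identity, user, [timestamp], then "METHOD PATH ...
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] "(?P<method>\S+) (?P<path>\S+)'
)


def count_slurp_probes(lines):
    """Count requests whose path mentions SlurpConfirm404, per client address."""
    hits = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match and "slurpconfirm404" in match.group("path").lower():
            hits[match.group("ip")] += 1
    return hits
```

Passing an open log file works, since the function only needs an iterable of lines, for example `count_slurp_probes(open("access.log"))`, where `access.log` stands in for wherever your server writes its log. You can then look up the addresses with the highest counts to confirm whether they really belong to Yahoo.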
