Google’s Custom Search Engine May Not Return Some Of Your Pages

I will start this post by asking a couple of questions:

  1. Are you using Google’s Custom Search Engine or AdSense for Search for your web site?
  2. Are you sure it’s returning ALL of your pages or content related to a particular search?
  3. Are you absolutely sure?

Okay, that’s three questions actually, but never mind the details. :)

Seriously, if you offer Google’s Custom Search Engine on your web site or blog, or if you are in the AdSense program and use the AdSense for Search feature, as so many bloggers do nowadays, you may want to reconsider, at least until Google stops “hiding” or “ignoring” some of your content and pages.

I have not offered Google’s Custom Search Engine on any of my web sites since 2005, and I don’t intend to anytime soon. Maybe a real example will help you understand how this could affect you and your web site:

If I search Google for [site:gidforums.com auto add slashes], i.e. restricting the search to only my web site, GIDForums, I expect it to return this page, but according to Google’s Custom Search Engine, that page doesn’t even exist! Look:

Returned Google SERP for [site:gidforums.com auto add slashes]

Notice that it’s not listed at all; not filtered, and not even stuck inside their infamous “supplemental results”.

But it’s an old page! In fact, it is over 4 years old. Google indexed it very soon after it was written and referred people to it for at least a couple of years. So why did they remove it? What happened?

If you look at the page closely, you’ll see that there’s absolutely nothing wrong with it, and although I am not proud of the content today, it’s still a web page that exists on my web site nevertheless, and “out there” for any decent search engine to find.

As you can imagine, I am now always asking myself, ‘How many other pages are “missing” like this?’
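
If you want to put a number on that question for your own site, here is a rough sketch in Python. It does not talk to Google at all. It assumes you already have two lists: the URLs you know exist (from your sitemap or database, say) and the URLs that a site search actually returned for you. The function names `normalize` and `missing_pages` are my own invention for this illustration, and the normalization rules (ignore the scheme, a leading "www." and a trailing slash) are assumptions you may need to adjust for your site.

```python
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Reduce a URL to host + path + query so equivalent URLs compare equal.

    Assumption for this sketch: scheme, a leading "www.", a trailing slash
    and any #fragment do not matter when deciding whether two URLs are the
    same page.
    """
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.rstrip("/") or "/"
    query = f"?{parts.query}" if parts.query else ""
    return f"{host}{path}{query}"


def missing_pages(known_urls, returned_urls):
    """Return the known URLs that do not appear among the returned URLs."""
    returned = {normalize(u) for u in returned_urls}
    return sorted(u for u in known_urls if normalize(u) not in returned)
```

Feed it your full list of pages and whatever the search engine gave back, and anything it returns is a page your visitors cannot find through that search box.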

It’s one thing for Google to filter out pages on their regular search engine. That’s their ranking algorithm kicking in, and it’s their prerogative to decide what is relevant and what ultimately appears in their results pages. It’s quite something else when they filter out your own pages even though the search is already restricted to just your site, and then claim that it’s a site search engine!

I am fortunate because GIDForums uses a popular forum script that comes with a search engine of its own, one that can manage user searches relatively well. So, if you are a blogger, for example, and you are using WordPress to power your web site, please don’t replace the default WordPress site search form with Google’s - that would just be too stupid! :P

If you must use a third party site-search feature, I’d suggest using anything but Google, at least until they improve on the service.

Being curious, I decided to search other popular search engines like Yahoo and MSN. Here’s what Yahoo returns for my query. And what MSN returns. Both got it right, fortunately. :)

In case you don’t think this is such a huge problem for you, think about this: how many of your readers are leaving your site every day convinced that a certain story, article or product does not exist on your web site, just because the Google Custom Search Engine said so?

4 Comments »

Quite an interesting topic. I was not aware of this. I will have to look deeper into it now that you say it’s something likely to happen. In the meantime I have also tagged this story on Digg, just to see if there are any other bloggers out there who have encountered this same problem.

Thank you for the info.

Comment by Nihal — June 9, 2007 @ 4:17 pm

Thanks for the info. When I tested it out I did get the results I thought I would but have changed to the WP Search widget anyhow. I tend to trust this type of information when you provide it.

I’m not thrilled with the WP Search widget, just because you can’t configure it. It would be nice if it had options similar to the Google one.

Comment by cableguy — June 9, 2007 @ 8:43 pm

I just need one mistake like that to convince me that it’s not suitable as a SITE search engine. It may not be a problem for you now, because you just started your site, but what if you have been blogging for the last 5 years, and today you can’t find something you wrote 4 years ago with the Custom Search Engine that’s on your blog? At least you will know that you actually wrote it, and you will probably use whatever other methods are available to you to look for it, but what about your readers? What will they think? We all know that some information remains relevant for a long time.

I am still a Google fan; there is absolutely no doubt about it. :) But Google’s Custom Search Engine feature is the one thing that they are not doing right yet.

Nihal:
Thank you for tagging the story to Digg. I have never used Digg before, or any of those other social bookmarking sites, but I am sure that it is very helpful.

Comment by J de Silva — June 9, 2007 @ 8:54 pm

I have had the same concern since using the Google CSE on my web site. Once the search engine rank honeymoon was over I couldn’t find even 50% of my pages using Google. I am now looking into paying $50-$100 for a third party program. I simply don’t want a single page to be missed by a visitor to my site. Can’t think of a better way to kill my rank than that.

Jim

Comment by Jim Green — December 6, 2007 @ 3:04 am


Theme designed by J de Silva exclusively for GIDBlog.com.