Google adds a third index for site search

Posted by Michael Martinez on May 12, 2008 in Intermediate SEO

Matt Cutts mentioned on his blog that Google Custom Search Engines now have a special site search index.

This seems like a major step in the right direction as site search has been a real issue for me. Site search is the third navigation tool that every Web site designer (and SEO) MUST take into consideration. You have your on-site navigation, your HTML sitemap, and site search. On larger sites, people are more likely to use site search than either on-site navigation or HTML sitemaps, so you need really good site search.

Most Web site operators don’t have the resources to install and maintain a fully indexing site search resource (although I personally feel that Google Search Appliance is an adequate solution for those people who can afford it). The rest of us have to rely upon third-party search solutions, and there are still a few alternatives to the major search engines. Unfortunately, most third-party solution providers whose tools I have tested entail strict limits (5,000 pages of free indexing being one of those limits).

I think moderately sized sites can benefit from the third-party site search tools but when you work with volume you need industrial-strength site search. It behooves a major search engine to provide that capacity as its own index benefits from crawling and indexing large sites.

I’m not saying that Ask, Google, Live, and Yahoo! should include every page on the Web in their Main Web search indexes — I’m just saying they should acknowledge that their users often use their systems as site search tools. So I don’t have a problem with Google Custom Search Engine using a third index just for site search. In fact, that might make their service more scalable for both them and me.

The less work I have to do as a Webmaster to make my site searchable while giving my visitors access to the content they want, the better off both my visitors and I tend to be. When I run out of time for updating indexes or site search maintenance functions, my site search offers stale content to my visitors. That’s not a good thing. I’ve watched my site search statistics shoot through the roof as people look for new content they know is there but it’s not appearing in site search.

Reducing the number of steps a Webmaster has to go through to get site content indexed is therefore a major plus on Google’s part. But having only just learned about this enhancement to the service I haven’t had time to test it. There is more to site search than just inclusion. You also need to be concerned with relevance. In fact, in site search relevance is even more important than in general Web search. In general Web search people usually have choices so if they don’t like the first result they can try another. In site search when relevance is low the second result tends to be just as bad as the first.

Even the Google Custom Search Engine has typically had severe relevance problems (it doesn’t offer nearly as much diversity as Web search does and that degrades the quality of the user search experience). I created one of the first SEO search engine Custom Search Engine tools soon after Google announced their CSE service. To this day its results tend to favor certain sites over others, diminishing its usefulness.

Site search needs to pay as much attention to clustering and diversity as main Web search. However, it also needs to be configurable. Where major search engines really let down Web site operators is in not allowing us to set priorities for site search. I feel that Google, Yahoo!, and Live all provide platforms (their Webmaster consoles) that can be expanded to provide for some site search optimization. Site search is not competitive — you cannot invade my site search and I cannot invade yours. It would be more helpful to everyone involved if the search engines relaxed some of their filters for site search.

I’m willing to take on some administrative responsibility in order to offer improved site search and network search. Network search, unlike site search, does have to allow for some competitiveness and it could be much more easily abused than site search. Nonetheless, I feel there are enough diverse networks out there to justify some development in network search. I’ve never found a third-party tool that even came close to meeting my needs for network search.

I don’t mind sharing the burden of deciding what should be indexed and how to manage in-site and in-network relevance. I’m not asking for a back door into Main Web Search. What I’m asking for is an improved user experience and more options for Webmasters to help leverage major search engines into their content. In fact, I would be happy to leverage new, previously unknown search engines into my content if they didn’t force me to manually reindex, limit my indexing, or require that I run their ads or show their results on their pages.

Give me a solution where we can both benefit, where my users can find what they want quickly, and where I can adapt my relevance settings to changes in consumer sentiment. That may be a lot to ask but it’s an important part of search technology that has been underdeveloped.

Google’s CSE site search index is, as I said above, a step in the right direction. But more steps are needed and there is a great opportunity here for someone to step forward and carve out a new search engine if the major search services don’t rise to the challenge and meet the need.

3 Comments on Google adds a third index for site search

By jchen on May 13, 2008 at 3:53 pm

I thought about providing site search service to address this concern of stale content. So, I started the sitesearch.JiansNet.com project. As a programmer, I made the search engine myself.

I do have couple of customers signed up with my site search service. However, what I find difficult to do is to market this thing. Most potential customers just want a free site search tool, but mine is a bit money per month. So, I pretty much gave up that idea sometime ago. I just can’t run a site search service for free.

So, I converted my website to a FAQ system, as I think that’s kind of fun to play with my search technology.

Looking at this article you posted, it seems there is still a need for site search service. But, I am still not sure whether economically it is feasible for any venture like mine to go down that path again.

I would love to hear back from you for any thoughts. As I love search engine technology, but got frustrated with Google and the big guys taking all out of this market, Even the site search, Google takes it using their CSE. But as far as I know, it is a “hack”. Since it is just off their main index, so, that’s why content could be stale, as they don’t crawl sites that often except the busy sites.

Anyway, let me know your thoughts if you would like, I welcome any thoughts on the site search stuff.

Thanks,

Jian

By Michael Martinez on May 14, 2008 at 8:07 am

Jian, there is a HUGE need for site search, although the SEO community does a very poor job of promoting the tools and solutions that are available for site search.

Although I cannot endorse your tool or give you a detailed marketing plan, you may be able to leverage some context-sensitive advertising into the tool to help defray your costs. Ultimately, you’ll need investors to help you set up the resources to provide site search for a large number of customers.

You cannot create a viable site search service through shared or virtual hosting services, if that is what you are doing. You would need a data center at some point (in the best of situations you would need many data centers to keep up with the demand).

Site search can potentially be larger than Web search because right now most people are using Web search for site search.

It just needs a business model that can overcome the challenges. There is a lot of room for growth in this area.

By jchen on May 14, 2008 at 11:16 pm

Thanks Michael for your comment and suggestion. That’s very insightful and helpful!

Based on your feedback and what I have experienced, I think I will just go with a FAQ system approach. That way, I can get revenue from content (e.g, ads). Like you said, site search is something that needs huge investment. Probably not doable without venture capital. However, I am against the VC approach, since most VCs are not in line with the founder’s vision and they tend to rush the service/product development.

Comment

Log in or Register to post a comment.

More

Read more posts by Michael Martinez

About the Author

Michael Martinez is the Director of Search Strategies for Visible Technologies, Inc. A former moderator at SEO forums such as JimWorld an Spider-food, Michael has been active in search engine optimization since 1998 and Web site design and promotion since 1996. Michael was a regular contributor to Suite101 (1998-2003) and SEOmoz (2006).

Brief hiatus Meta tags DO matter — get it right for a change