 |
The detailed information for this article was researched in December, 2003. While
we try to keep it up to date, product information changes rapidly. Please verify
specific product facts before relying on the information below. And if you see
something that we missed, or is inaccurate or out-of-date, please help us let
others know by emailing us at articles@alderconsulting.com.
Introduction
Let’s say you have a website with lot of detailed and valuable information spread across many pages. Your users want to find things on the site based on such a plethora of terms that it’s hard to keep track. Your budget doesn’t quite allow the latest million-dollar content management system, but you’d like to allow your users to search the information on your site. Is there hope?
There is. There are a number of hosted search tools that allow users to search your information on your site, with minimal effort from you. This article will provide an overview of the top free services and those under $250 a year, and help you think through which service is right for you.
Do You Really Need A Search?
Before you start looking for the right engine, it’s important to consider whether a search will actually help your users find things on your site. If your site is quite small, or has a smattering of information on various topics, a search may actually distract them from finding the information they need – or that you want them to find.
Many users will go directly to the search function, if it exists, without much consideration of the site navigation. If their search doesn’t return any results for their topic, they are likely to leave the site with the assumption that it doesn’t have what they need – without noticing a link for it that may be in the navigation or on the homepage. That being said, a search function can be very useful for sites with very task oriented users (continually coming to the site to find, say, Form 10C45Z, or guidelines for the Cuiverlicz protocol), comprehensive information on one or many topics, or listings of jobs, services, events, or the like.
About Searching with Hosted Tools
For an example of they type of site search we’re talking about, take a look at www.compumentor.org. Enter a search term (say, “networking”) in the search box on the upper left and click “Search”. You’re taken to a page that shows all the pages on the site that match your search. While the look of this new page is different from the rest of the site in this example, that’s doesn’t have to be true – many hosted tools will allow you to customize the search results page so that it fits right in. Compumentor is using Atomz, a free hosted search tool. While we have no special knowledge, this search feature is likely free to them and took no more than an hour or two to set up.
How do these tools work? The key factor that makes it an hosted tool is that the search engine itself doesn’t reside on your site or your server. Instead, the search engine might be in a different state (or country!). In order to be able to respond to search requests, the search engine relies on an “index” that it has created of your site in advance. This index contains, in essence, a record of every word on every page. The index is created with a program called a “spider”, which “crawls” your site over the internet – it starts at a URL you provide and clicks on every link it can find. This index is stored on the search service’s computers.
To allow your users to search, you add a small piece of HTML code to your website that produces a text box for the search terms and search button. It also contains a link back to the search engine. When your users enter a search term, it’s sent to the remote search engine. The engine searches the index and then shows the user a list of pages that match their terms. This list of results, called a “results page” is displayed from the service’s computers, but when the user clicks to view a page from the results, they are returned to your site - hopefully without realizing they left!
The hosted search tools don’t require any installation other than the addition of the piece of HTML code to your web page or pages (this code is generally provided by the service and can simply be copied and pasted). For this reason, they can be quickly setup by anyone with a basic knowledge of HTML.
Are These Search Tools Right for You?
Hosted search tools have a lot of advantages – primarily, they provide a reliable search for minimal effort and cost. But they’re not the best solution for every situation. There are two other types of search implementations that you may want to consider:
- Installed search engines. If you have access to install software on your server (i.e. your website is running on a physical computer in your building, or you’re paying a hosting company hundreds of dollars a month), you may want to consider installed search engine software. There are a number of open source engines that are free to those who know what to do with them. In general, they are considerably more complicated to set up than an HOSTED tool, but offer more flexibility in how the search functions and appears to the user.
- Database searches. If some of the information that appears on your website is stored in a database, a custom-built search of your database might be appropriate. This is useful when you want to search within a very limited and specific set of information – like last names or book titles. As opposed to finding every occurrence of these words in all pages, as an indexed search would, a database search can be designed to provide very specific results – like a member page or a book description. A database search will need to be coded specifically for your database by an experienced programmer.
Considering The Search Tools
There are a lot of hosted search tools out there. A number of considerations that can help you figure out which one is the right one for you.
- Page Limits. Most search tools price based on how many web pages are in your site. The more pages, the more it costs per year. This article focuses on tools for sites of about 1000 pages or less.
- Indexing Frequency. As discussed above, the search tools “index” your site at periodic intervals. The index that is created is matched up against search terms to determine what results to provide. If your site is rarely updated, then it doesn’t matter how frequently your site is indexed. If you have a lot of new information each week, however, your users won’t be able to find the new information unless the site is indexed weekly. Most tools allow anywhere from monthly to daily indexing. Some allow you to schedule periodic indexing, while others require you to manually kickoff the index process each time (by going to the search tool’s website).
- Reliable Company. As with anything else, it’s important to look at how well established the search tool is to ensure that it won’t be out of business tomorrow. Look at how long the company behind it has been in business, who else is using the tool, and how well known it is.
- Template Customization. As mentioned above, items that match a user’s search are displayed on a “search results” page. This page is physically located on the search tool’s server. However, many tools allow you to customize this page – some so extensively, using full HTML, that the results page can precisely match the look and the navigation of the rest of the site. Others are quite inflexible and require you to use particular formats or colors.
- Document Indexing. If your site includes a lot of resources that are posted as documents (such as PDF or Word documents), make sure your search tool can index these documents.
- Login. If your site is secured by a password, make sure the search tool can handle a username and password.
- Quality of Search. There are a number of different algorithms used by the tools to match a user’s search phrase to the pages that are returned. It’s difficult to talk about “quality” of these algorithms in the abstract, as it depends a lot on the site content and structure. The best way to judge is to set up the free trials on your site and see what you think of the results. The search tools vary considerably in their flexibility, however, with some providing notable benefits over others. Consider factors such as:
- Exclusion of Repeated text: If your site has a lot of text that is repeated on every page, such as extensive navigation or footers, this text can throw off the search results substantially. When someone searches on a word included in, say, a navigational drop-down, every page with this drop-down on it will be returned. Some tools have the ability to exclude this repeated text.
- Sophisticated Matching: There are a number of ways that search engines make it easier for users to find results. Some have spell-checks. Others support “stemming” (finding grammatical derivations of the search term – for instance, finding “walk” and “walking” if you enter “walked”), “stopwords” (removing common words like “and” or “the” from search terms – generally accepted to improve the quality of most searches), or even “synonyms” (finding conceptually related terms – for instance, finding “nonprofit” if you enter “not-for-profit” or “Africa” if you enter “Nigeria”).
- Ability to Affect Weighting: Some tools will allow you to give particular emphasis to particular page text like metadata descriptions (very useful if you want really high quality results and are willing to put a lot of time into descriptions) or to define “best bets” – pages that should always come up first for particular search terms.
- Quality of Result Display. The display of search results should be understandable and succinct, and, ideally match the look and feel of the rest of your site. This consists of three main factors:
- Results Listing. The listing for each page should allow the user to determine if it will meet their needs. Some engines will show the metadata description for each page (which is created in the HTML itself), which is a big advantage if you have created descriptions. Some show an excerpt, which show how the search terms appear on the page. Some engines show only the first hundred or so words on the page – which can be useless in determining if the page has the information for which you’re looking.
- Template Customization. As per the above.
- Tool Branding and Ads. Many tools require you to include a link to them, their logo, or even ads. While the branding like links and logos may detract a bit from a highly polished website, it likely won’t affect your user’s experience too much. Think carefully, however, whether subjecting your users to ads is worth any minimal advantages.
- Support. These search tools are generally easy to setup and maintain, and shouldn’t need a lot of support from the company. However, if you feel like you will need some help, make sure support is offered. The tools offer different levels of support ranging from no support at all to 24x7 phone and email support.
- Usage Reports. It’s good practice to periodically check to see what your users are searching for. Even the most minimal reporting tools offered show this information; more complex reporting tools also display how many results were returned for each search, top searches for which no results were found, and the like.
Comparing the Search Tools
Finally, to the part you have likely been waiting for: a comparison of the tools themselves. The descriptions below give an overview of the key strengths and features of each major competitive hosted (a list of those we didn’t find competitive follow). The stars on the descriptions below mark those we consider the best values.
For more information, see the detailed Search Tool Comparison spreadsheet. This spreadsheet rates each tool by the factors above, to allow easy comparison and sorting. We have also set up a Search Comparison Page, which allows you to run searches of the Alder Consulting site using the free trials for all these tools to allow you to compare.
- **Master.com (www.master.com/texis/master/app ) Despite the alarmingly unfriendly interface on their website, Master.com provides a solid free search tool contender which can handle very large sites (up to 5000 pages). Indexing can be scheduled weekly, and the template can be fully customized. The result descriptions show only the first text of the page (rather than excerpts or descriptions). No ads, but a large logo and byline (which can be removed for $40/ year).
- **SpiderLine (www.spiderline.com). SpiderLine has a number of packages, starting with a Bronze package at $50/ year for 500 pages and 6 indexes a month, which offer great value for the money. Their Silver package is $150/ year for 1000 pages and 33 indexes a month. Both packages are very flexible: full HTML customization, support for many document types, can login to a secure site, excludes repeated words, has many options to control search weightings, and includes extensive reports. They also include a powerful feature: the ability to define custom synonyms (to show results for “nonprofit” if youenter “not-for-profit”) for instance.
- **Google (www.google.com/services/websearch.html). The basic free Google search is just not very competitive with the other free services. Google will only index you when it indexes all the rest of the web – which results in 30- 45 days between indexes. If you have a small, low traffic site, it may never index all your pages at all. In addition, there is very little customization for the search results page, which includes logo and advertisements, and there is no reporting at all. If you have a large site that is extensively linked to by other sites, the Google engine may provide better quality results than other tools, however. Google has a separate University Search plan, which allows customization and reporting, which is available to “any accredited university or educational institution, or any organization at an accredited educational institution worldwide.” Update: As of September 2004, Google has opened up the University Search plan to all 501C3 nonprofits. While it could still be 30-45 days between indexes for a low traffic site, this new plan offers substantially more customization and reporting.
- FreeFind (www.freefind.com). Freefind provides both a free service, which supports about 2000 pages, and a paid service (1000 pages for $228/ year). Both plans allow daily indexing which supports PDF documents, ability to login to secure sites, fully HTML customizable search results template, and a nice description of result pages. The major downsides of the free service are the substantial ads and FreeFind branding on the search page, and back-of-pack (read: slow) service for support and indexing.
- PicoSearch (www.picosearch.com). Picosearch’s free plan doesn’t compare well to Atomz or Master.com (can’t schedule indexing, minimal customization, and lots of ads). Their professional plan could be useful for the right site, however. It supports 3000 pages for $249/ year, daily scheduled indexing, full HTML customization, support for many document types, can login to a secure site, excludes repeated words, and has a number of options to control search weightings. For some reason, however, the search assumes an “OR” " between multiple search terms instead of the standard "AND" (i.e. searching on “control panel” will find all pages with either the work “control” or the word “panel”)
- WhatUSeek (http://sitelevel.whatuseek.com). WhatUSeek has a free plan (1000 pages, manual indexing only, logo and ads). They also have plans that allow 250 pages for $54/ year, 500 pages for $108/ year, and 1000 pages for $216/year. All plans allow full HTML customization, document indexing, login, and control of search weightings including “best bets”. The paid plans allow scheduled indexing 3 times a week. None of the plans, however, support Boolean searches (OR, NOT, phrases, etc.).
- Atomz (www.atomz.com/search). Atomz provides the best of the free search tools for most sites. They provide weekly scheduled indexing, a high quality search, and a fully HTML customizable search results template for sites that are 500 pages or less. Your search result page must display a relatively sizable Atomz logo. It doesn’t index documents (PDFs, for instance), and cannot login to a secure site. This may not be the best choice if your site may exceed 500 pages at some point: as opposed to all the other free services, which provide a feasible upgrade to a paid service, Atomz’s (enterprise quality) paid services start at about $5000/ year. Update: As of September 2004, Atomz is no longer providing an ad-free search tool. Bummer. You either need to show Atomz’s ads, or purchase their paid service at about $5000/year.
Need More Options?
There are actually a lot more hosted search tools out there. Some were eliminated because they were quite uncompetitive in terms of features for the price; one (SpaceSurfer) because it has ads which actually float over your search results. If you’re curious, here’s some more:
For More Information
Search Tool Comparison Spreadsheet (View now)
A more detailed spreadsheet, also put together by Alder Consulting, which allows you to compare search tools by specific features.
Search Comparison Page (View now)
An Alder Consulting webpage which allows you to run searches against the Alder Consulting site using the free trial version of all of these search tools.
SearchTools.com (www.searchtools.com)
A great site with extensive lists of search tools (both hosted and others), reviews, and articles about search engine tools and concepts.
|
 |
|