| A web search engine is a software program designed to search for data on the Web. The search results are normally provided in the form of a list and are usually called hits. The data may comprise web pages, images, data and other types of files. Some search tools also collect information available in databanks or open directories. Unlike Internet directories that are maintained by human editors, search engines operate automatically or are a mix of algorithmic and human input.
Web search tools work by storing information about many web pages which they retrieve from the INTERNET. These pages are retrieved by An Internet crawler, or differently called a spider. It is an automated Web browser that follows every link it discovers. The content of each page is then analyzed to determine how to index it. Words, for example, are taken from titles, headings or special fields called meta tags. Data about web pages are saved and stored in an index catalogue for further use in queries. Some search engines, such as Google, save and store the whole or part of the source page (known as a cache) as well as data about web pages, whereas others, such as AltaVista, save and store every word of every page they find. This cached page always holds the initial search text, since it is the one that was actually indexed. Thus, it can be very useful because it comprises information that may no longer be available elsewhere.
When an Internet user types search terms in the search field, the software program checks its index and shows a list of the most suitable web pages according to its criteria, commonly with a short summary containing the document's title and at times excerpts from the text. Some search engines provide an advanced feature called proximity search that allows users to determine the length between search words.
The usefulness of a search engine rests on the relevance of the result set it gives back. Since there can be millions of web pages containing a certain word or phrase, some pages may be more relevant and popular than others. Most search engines use methods to grade the results to show the "best" results first.
The way a search engine shows web pages is search engine-specific. The methods also change with time, since the use of Internet services alters and new techniques emerge. |