Edited/Distributed by HURINet - The Human Rights Information Network
---------------------------------------------------------------------
## author : gagliajn@NETCOM.COM
## date : 07.07.99
---------------------------------------------------------------------
July 7, 1999
By The Associated Press
Internet search engines are not keeping pace with
the explosive growth of the Web.
A study found that search engines -- which enable
a computer user to find information by typing in a
word or combination of words -- cover a
diminishing fraction of Web pages and take a long
time to list new sites.
The most comprehensive engine, Northern Light,
covers only about one-sixth of the Internet pages
that search engines can reach, the study found.
That is down from one-third for the best engine a
year and a half ago.
Northern Light is closely followed by Snap and
Altavista. Hotbot, which led with 34 percent
coverage in the previous study, was down to 11
percent.
The study also found that it takes more than six
months on average for a new Web page to make it
into a search engine's listings.
The study of 11 search engines was conducted by
computer scientists Steve Lawrence and C. Lee
Giles at the NEC Research Institute in Princeton,
N.J. It was published in Thursday's issue of the
journal Nature.
Lawrence and Giles estimated that as of February,
the searchable Web consisted of 800 million pages
containing more than 6 trillion characters. Their
December 1997 survey put the number of pages at
about 320 million. By comparison, the 532 miles of
shelves in the Library of Congress contain an
estimated 20 trillion characters.
Search engines use computers called ``spiders''
that continuously surf the Web. They save each
page they visit, then follow the links on the page
to find other pages. When a user types in a word,
the engine looks in its index to see which pages
contain it. A page that's not listed in the index
will not be found.
The spiders are more likely to find pages that
have more links going to them from other pages.
Lawrence said that may make it hard for new sites
to make it into search engine listings.
Lawrence also said search engines may be lagging
because their databases become more expensive as
they grow, without necessarily creating more
advertising profits.
Marc Krellenstein, chief technical officer at
Northern Light, said he believes the fraction
covered by the search engines is actually larger,
because he considers the study's estimate of the
size of the Web a bit high.
Krellenstein also questioned whether search
engines need to concern themselves with every new
site. He said they focus on good sites that are of
more interest to users.
``The fact that the Web is growing so fast doesn't
mean the quality part is growing as fast,'' he
said.
=====================================================================
(2)
Internet Search Engine Comparison
By The Associated Press
Internet search engines and the percentage of the searchable
Web that they cover:
Northern Light: 16.0
Snap: 15.5
Altavista: 15.5
HotBot: 11.3
Microsoft: 8.5
Infoseek: 8.0
Google: 7.8
Yahoo: 7.4
Excite: 5.6
Lycos: 2.5
Euroseek: 2.2
Source: Computer scientists Steve Lawrence and C. Lee Giles
at NEC Research Institute in Princeton, N.J.
====================================================================
(3) Some Tips for Using Search Engines
By The Associated Press
Computer scientist Steve Lawrence recommends Web
surfers looking for something well known, like
weather forecasts, use engines like Google or
DirectHit that rank sites according to their
popularity.
``These tend to be good for finding and ranking
highly information which is well known and has
been around for a while,'' he said.
If you are looking for something very specific or
obscure -- say, Victorian rocking chairs -- he
recommends searching engines like Altavista or
Hotbot and making the query as specific as
possible.
----------------------------------
Send mail for the 'huridocs-tech' list to 'huridocs-tech@hrea.org'.
Mail administrative requests to 'majordomo@hrea.org'.
For additional assistance, send mail to: 'owner-huridocs-tech@hrea.org'.
Archives of previous messages posted to the list can be found at:
http://www.human-rights.net/huridocs-tech.
[Reply to this message] [Start a new topic] [Date Index] [Thread Index] [Author Index] [Subject Index] [List Home Page] [HREA Home Page]