The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

WWW::Search::WebCrawler - class for searching WebCrawler

SYNOPSIS

  use WWW::Search;
  my $oSearch = new WWW::Search('WebCrawler');
  my $sQuery = WWW::Search::escape_query("+sushi restaurant +Columbus Ohio");
  $oSearch->native_query($sQuery);
  while (my $oResult = $oSearch->next_result())
    print $oResult->url, "\n";

DESCRIPTION

This class is a WebCrawler specialization of WWW::Search. It handles making and interpreting WebCrawler searches http://www.WebCrawler.com.

This class exports no public interface; all interaction should be done through WWW::Search objects.

SEE ALSO

To make new back-ends, see WWW::Search.

HOW DOES IT WORK?

native_setup_search is called (from WWW::Search::setup_search) before we do anything. It initializes our private variables (which all begin with underscore) and sets up a URL to the first results page in {_next_url}.

native_retrieve_some is called (from WWW::Search::retrieve_some) whenever more hits are needed. It calls WWW::Search::http_request to fetch the page specified by {_next_url}. It then parses this page, appending any search hits it finds to {cache}. If it finds a ``next'' button in the text, it sets {_next_url} to point to the page for the next set of results, otherwise it sets it to undef to indicate we''re done.

BUGS

Please tell the author if you find any!

TESTING

This module adheres to the WWW::Search test suite mechanism. See $TEST_CASES below.

AUTHOR

As of 1998-03-16, WWW::Search::WebCrawler is maintained by Martin Thurn (MartinThurn@iname.com).

WWW::Search::WebCrawler was originally written by Martin Thurn based on WWW::Search::HotBot.

LEGALESE

THIS SOFTWARE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.

VERSION HISTORY

If it's not listed here, then it wasn't a meaningful or released version.

2.02, 1999-10-05

now uses hash_to_cgi_string()

2.01, 1999-07-13

1.13, 1999-03-29

Remove extraneous HTML from description (thanks to Jim Smyser jsmyser@bigfoot.com)

1.11, 1998-10-09

Now uses split_lines function

1.9

1998-08-20: New format of www.webcrawler.com output.

1.7

\n changed to \012 for MacPerl compatibility

1.5

1998-05-29: New format of www.webcrawler.com output.

1.3

First publicly-released version.