潘蓝兰

NAME

Novel::Robot::Parser - get novel / bbs content from website

小说站点解析引擎

INIT

site

support download novel/thread from website 支持小说/论坛站点下载

%SITE_DOM_NAME in Parser.pm 站点列表

support txt file 支持处理txt

txt : 指定解析txt文件

support raw file 支持处理序列化数据

raw : 指定解析经MessagePack压缩的二进制文件

new

init funtion, need set sitename,or url 初始化解析模块,需指定站点名称或网址

   #site name : 直接指定站点
   my $parser = Novel::Robot::Parser->new( site => 'jjwxc' );
    
   #url : 通过url自动检测站点
   my $url = 'http://www.jjwxc.net/onebook.php?novelid=2456';
   my $parser = Novel::Robot::Parser->new( site => $url );

NOVEL FUNCTION

get_item_ref

get novel / forum thread data hash ref

获取小说/贴子内容,返回一个hash引用

   my $r = $parser->get_item_ref($url, %opt);

get_item_info

get novel index info / forum thread first page info

获取小说/贴子首页信息

   my $r = $parser->get_item_info($url);

BOARD FUNCTION

writer -> multi books, forum -> multi threads

get_board_ref

get writer / board info, 获取版块信息

   my $r = $parser->get_board_ref($url, %opt);

QUERY FUNCTION

get_query_ref

query info, 获取查询结果

    my $query_type = '作者';
    my $query_keyword = '顾漫';

    my ($info, $items_ref) = $parser->get_query_ref( $query_keyword, 
        query_type => $query_type );