Package | Description |
---|---|
com.yishuifengxiao.common.crawler | |
com.yishuifengxiao.common.crawler.macther |
Modifier and Type | Method and Description |
---|---|
MatcherRule |
CrawlerBuilder.contentPageRule()
获取内容页地址规则
多个规则之间用半角逗号隔开 |
Modifier and Type | Method and Description |
---|---|
Set<MatcherRule> |
CrawlerBuilder.linkRules()
获取链接提取规则
|
Modifier and Type | Method and Description |
---|---|
CrawlerBuilder |
CrawlerBuilder.addLinkRule(MatcherRule linkRule)
增加链接提取规则
|
CrawlerBuilder |
CrawlerBuilder.contentPageRule(MatcherRule contentPageRule)
设置内容页地址规则
多个规则之间用半角逗号隔开 |
Modifier and Type | Method and Description |
---|---|
CrawlerBuilder |
CrawlerBuilder.addLinkRules(Set<MatcherRule> linkRules)
增加链接提取规则
|
CrawlerBuilder |
CrawlerBuilder.setLinkRules(Set<MatcherRule> linkRules)
清空原始链接提取规则后设置链接提取规则
|
Modifier and Type | Method and Description |
---|---|
PathMatcher |
MatcherFactory.getMatcher(MatcherRule matcherRule)
根据链接匹配规则生成对应的匹配器
|
Copyright © 2020 Pivotal Software, Inc.. All rights reserved.