Modifier and Type | Method and Description |
---|---|
ContentRule |
CrawlerBuilder.content()
获取内容页地址规则
|
Modifier and Type | Method and Description |
---|---|
CrawlerBuilder |
CrawlerBuilder.content(ContentRule content)
设置内容解析规则
|
static SimulatorData |
Crawler.testMatcher(String url,
SiteRule siteRule,
ContentRule content)
内容匹配测试
|
static SimulatorData |
Crawler.testMatcher(String url,
SiteRule siteRule,
ContentRule content,
Downloader downloader)
内容匹配测试
|
Modifier and Type | Method and Description |
---|---|
void |
ContentExtract.extract(ContentRule contentRule,
List<ExtractRule> rules,
Page page)
从网页内容里解析出所有符合要求的数据
|
void |
ContentExtractDecorator.extract(ContentRule contentRule,
List<ExtractRule> rules,
Page page) |
Modifier and Type | Method and Description |
---|---|
void |
SimpleContentExtract.extract(ContentRule contentRule,
List<ExtractRule> rules,
Page page) |
Modifier and Type | Method and Description |
---|---|
SimulatorData |
Simulator.match(String url,
SiteRule siteRule,
ContentRule content,
Downloader downloader)
测试内容匹配
|
SimulatorData |
SimpleSimulator.match(String url,
SiteRule siteRule,
ContentRule content,
Downloader downloader) |
Copyright © 2020 Pivotal Software, Inc.. All rights reserved.