public abstract class AbstractTextAnalyzer extends AbstractAnalyzer
Copyright (c) 2020 xsx All Rights Reserved. x-easypdf-pdfbox is licensed under Mulan PSL v2. You can use this software according to the terms and conditions of the Mulan PSL v2. You may obtain a copy of Mulan PSL v2 at: http://license.coscl.org.cn/MulanPSL2 THIS SOFTWARE IS PROVIDED ON AN "AS IS" BASIS, WITHOUT WARRANTIES OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO NON-INFRINGEMENT, MERCHANTABILITY OR FIT FOR A PARTICULAR PURPOSE. See the Mulan PSL v2 for more details.
Modifier and Type | Class and Description |
---|---|
protected static class |
AbstractTextAnalyzer.DefaultTextStripper
文本剥离器
|
Modifier and Type | Field and Description |
---|---|
protected Set<TextInfo> |
infoSet
文本信息列表
|
document, log
Constructor and Description |
---|
AbstractTextAnalyzer(Document document)
有参构造
|
Modifier and Type | Method and Description |
---|---|
abstract int |
getCharacterCount(int pageIndex)
获取字符数
|
abstract void |
processText(int pageIndex)
处理文本
|
getDocument
public AbstractTextAnalyzer(Document document)
document
- 文档Copyright © 2024. All rights reserved.