Class LargeBlockSameTagLevelToContentFilter
java.lang.Object
com.kohlschutter.boilerpipe.filters.heuristics.LargeBlockSameTagLevelToContentFilter
- All Implemented Interfaces:
BoilerpipeFilter
Marks all blocks as content that:
- are on the same tag-level as very likely main content (usually the level of the largest block)
- have a significant number of words, currently: at least 100
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionbooleanprocess(TextDocument doc) Processes the given documentdoc.
-
Field Details
-
INSTANCE
-
-
Constructor Details
-
LargeBlockSameTagLevelToContentFilter
private LargeBlockSameTagLevelToContentFilter()
-
-
Method Details
-
process
Description copied from interface:BoilerpipeFilterProcesses the given documentdoc.- Specified by:
processin interfaceBoilerpipeFilter- Parameters:
doc- TheTextDocumentthat is to be processed.- Returns:
trueif changes have been made to theTextDocument.- Throws:
BoilerpipeProcessingException
-