Class IgnoreBlocksAfterContentFilter

java.lang.Object
com.kohlschutter.boilerpipe.filters.english.HeuristicFilterBase
com.kohlschutter.boilerpipe.filters.english.IgnoreBlocksAfterContentFilter
All Implemented Interfaces:
BoilerpipeFilter

public final class IgnoreBlocksAfterContentFilter extends HeuristicFilterBase implements BoilerpipeFilter
Marks all blocks as "non-content" that occur after blocks that have been marked DefaultLabels.INDICATES_END_OF_TEXT. These marks are ignored unless a minimum number of words in content blocks occur before this mark (default: 60). This can be used in conjunction with an upstream TerminatingBlocksFinder.
See Also: