Class MinFulltextWordsFilter
- java.lang.Object
  - com.kohlschutter.boilerpipe.filters.english.HeuristicFilterBase
    - com.kohlschutter.boilerpipe.filters.english.MinFulltextWordsFilter

All Implemented Interfaces:
BoilerpipeFilter
public final class MinFulltextWordsFilter extends HeuristicFilterBase implements BoilerpipeFilter
Keeps only those content blocks which contain at least k full-text words (measured by HeuristicFilterBase.getNumFullTextWords(TextBlock)). k is 30 by default.
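The threshold rule described above can be illustrated with a minimal, self-contained sketch. It does not use the boilerpipe classes: `SketchBlock` and `MinFulltextWordsSketch` are hypothetical stand-ins, and the per-block full-text word count is supplied directly rather than computed by HeuristicFilterBase.getNumFullTextWords(TextBlock). The sketch also assumes that a block falling below the threshold is demoted to non-content rather than deleted.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a text block; not boilerpipe's TextBlock class.
final class SketchBlock {
    final String label;
    final int numFullTextWords; // assumed to be precomputed for this sketch
    boolean isContent;

    SketchBlock(String label, int numFullTextWords, boolean isContent) {
        this.label = label;
        this.numFullTextWords = numFullTextWords;
        this.isContent = isContent;
    }
}

// Hypothetical sketch of the rule "keep content blocks with at least k full-text words".
final class MinFulltextWordsSketch {
    private final int minWords;

    MinFulltextWordsSketch(int minWords) {
        this.minWords = minWords;
    }

    // Demotes content blocks below the threshold; returns true if any block changed.
    boolean process(List<SketchBlock> blocks) {
        boolean changed = false;
        for (SketchBlock b : blocks) {
            if (b.isContent && b.numFullTextWords < minWords) {
                b.isContent = false;
                changed = true;
            }
        }
        return changed;
    }

    public static void main(String[] args) {
        List<SketchBlock> blocks = Arrays.asList(
            new SketchBlock("nav", 4, true),
            new SketchBlock("article", 120, true),
            new SketchBlock("footer", 12, false));
        // 30 mirrors the documented default value of k.
        boolean changed = new MinFulltextWordsSketch(30).process(blocks);
        System.out.println("changed = " + changed);
        for (SketchBlock b : blocks) {
            System.out.println(b.label + " -> content: " + b.isContent);
        }
    }
}
```

Blocks that are already non-content are left untouched, so running the sketch a second time over the same list reports no changes.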
-
Field Summary
Fields (Modifier and Type, Field):
- static MinFulltextWordsFilter DEFAULT_INSTANCE
- private int minWords
-
Constructor Summary
Constructors (Constructor, Description):
- MinFulltextWordsFilter(int minWords)
-
Method Summary
Methods (Modifier and Type, Method, Description):
- static MinFulltextWordsFilter getDefaultInstance()
- boolean process(TextDocument doc): Processes the given document doc.
Methods inherited from class com.kohlschutter.boilerpipe.filters.english.HeuristicFilterBase
getNumFullTextWords, getNumFullTextWords
-
Field Detail
-
DEFAULT_INSTANCE
public static final MinFulltextWordsFilter DEFAULT_INSTANCE
-
minWords
private final int minWords
-
Method Detail
-
getDefaultInstance
public static MinFulltextWordsFilter getDefaultInstance()
-
process
public boolean process(TextDocument doc) throws BoilerpipeProcessingException
Description copied from interface: BoilerpipeFilter
Processes the given document doc.
Specified by:
process in interface BoilerpipeFilter
Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException
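The boolean return value lets a caller tell whether a filter actually modified the document, which is useful when several filters run in sequence. The following is a minimal sketch of that idea only. `SketchFilter` and `FilterChainSketch` are hypothetical stand-ins, not part of boilerpipe, and the "document" is simplified to a list of per-block word counts from which short entries are removed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for BoilerpipeFilter; works on word counts, not a TextDocument.
interface SketchFilter {
    boolean process(List<Integer> wordCounts);
}

final class FilterChainSketch {
    // Runs every filter in order. '|=' does not short-circuit, so each filter
    // executes even if an earlier one already reported a change.
    static boolean runAll(List<Integer> doc, List<SketchFilter> filters) {
        boolean changed = false;
        for (SketchFilter f : filters) {
            changed |= f.process(doc);
        }
        return changed;
    }

    // Stand-in filter: drops entries below minWords and reports whether any were dropped.
    static SketchFilter minWordsFilter(int minWords) {
        return doc -> doc.removeIf(n -> n < minWords);
    }

    public static void main(String[] args) {
        List<Integer> doc = new ArrayList<>(Arrays.asList(4, 120, 12, 45));
        boolean changed = runAll(doc, Arrays.asList(minWordsFilter(30)));
        System.out.println("changed = " + changed + ", remaining = " + doc);
    }
}
```

Accumulating the result with `|=` rather than `||` is a deliberate choice in this sketch: it guarantees that every filter in the chain gets to run.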