Class KeepEverythingWithMinKWordsExtractor
java.lang.Object
com.kohlschutter.boilerpipe.extractors.ExtractorBase
com.kohlschutter.boilerpipe.extractors.KeepEverythingWithMinKWordsExtractor
All Implemented Interfaces:
BoilerpipeExtractor, BoilerpipeFilter
A full-text extractor which extracts the largest text component of a page. For news articles, it
may perform better than the DefaultExtractor, but usually worse than ArticleExtractor.
Field Summary
Fields: filter
Constructor Summary
Constructors: KeepEverythingWithMinKWordsExtractor(int kMin)
Method Summary
Modifier and Type    Method                       Description
boolean              process(TextDocument doc)    Processes the given document doc.
Field Details
filter
Constructor Details
KeepEverythingWithMinKWordsExtractor
public KeepEverythingWithMinKWordsExtractor(int kMin)
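The class name and the kMin parameter suggest that text blocks with fewer than kMin words are discarded. The following is a minimal, self-contained sketch of that idea only. MinWordsSketch, countWords and keepWithMinWords are hypothetical names, not part of boilerpipe, and the sketch assumes that words are whitespace-separated tokens and that the threshold is inclusive.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration; this is not boilerpipe code.
final class MinWordsSketch {

    /** Counts whitespace-separated tokens in a block of text (assumed word definition). */
    static int countWords(String block) {
        String trimmed = block.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    /** Returns the blocks with at least kMin words, preserving their original order. */
    static List<String> keepWithMinWords(List<String> blocks, int kMin) {
        List<String> kept = new ArrayList<>();
        for (String block : blocks) {
            if (countWords(block) >= kMin) {
                kept.add(block);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> blocks = List.of(
                "Home | About",
                "This paragraph has clearly more than five words in it.",
                "Copyright 2024");
        // With kMin = 5 only the middle block passes the threshold.
        System.out.println(keepWithMinWords(blocks, 5));
    }
}
```

A larger kMin drops more short blocks such as navigation links and footers, at the risk of also dropping short but legitimate paragraphs.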
Method Details
process
Description copied from interface: BoilerpipeFilter
Processes the given document doc.
Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException
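The return-value contract above (true only if the document was changed) can be illustrated with a small stand-alone sketch. The types below (ProcessContractSketch, Block, Filter, dropBlank) are hypothetical and are not boilerpipe classes. They only mimic the documented behaviour of a filter that mutates a document in place and reports whether it did so.

```java
import java.util.List;

// Hypothetical stand-ins for illustration; these are not boilerpipe types.
final class ProcessContractSketch {

    /** A mutable text block carrying a content flag. */
    static final class Block {
        final String text;
        boolean isContent = true;

        Block(String text) {
            this.text = text;
        }
    }

    /** Mirrors the documented contract: return true only if the document was modified. */
    interface Filter {
        boolean process(List<Block> doc);
    }

    /** Marks blank blocks as non-content and reports whether any flag was flipped. */
    static Filter dropBlank() {
        return doc -> {
            boolean changed = false;
            for (Block b : doc) {
                if (b.isContent && b.text.trim().isEmpty()) {
                    b.isContent = false;
                    changed = true;
                }
            }
            return changed;
        };
    }
}
```

Under this contract, running the same filter a second time on an already processed document returns false, which lets a caller detect when a chain of filters has nothing left to do.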