Class LargestContentExtractor
java.lang.Object
com.kohlschutter.boilerpipe.extractors.ExtractorBase
com.kohlschutter.boilerpipe.extractors.LargestContentExtractor
- All Implemented Interfaces:
BoilerpipeExtractor,BoilerpipeFilter
A full-text extractor which extracts the largest text component of a page. For news articles, it
may perform better than the
DefaultExtractor, but usually worse than
ArticleExtractor.-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionstatic LargestContentExtractorReturns the singleton instance forLargestContentExtractor.booleanprocess(TextDocument doc) Processes the given documentdoc.
-
Field Details
-
INSTANCE
-
-
Constructor Details
-
LargestContentExtractor
private LargestContentExtractor()
-
-
Method Details
-
getInstance
Returns the singleton instance forLargestContentExtractor. -
process
Description copied from interface:BoilerpipeFilterProcesses the given documentdoc.- Parameters:
doc- TheTextDocumentthat is to be processed.- Returns:
trueif changes have been made to theTextDocument.- Throws:
BoilerpipeProcessingException
-