Interface BoilerpipeFilter
- All Known Subinterfaces:
BoilerpipeExtractor
- All Known Implementing Classes:
AddPrecedingLabelsFilter, ArticleExtractor, ArticleMetadataFilter, ArticleSentencesExtractor, BlockProximityFusion, BoilerplateBlockFilter, CanolaExtractor, ContentFusion, DefaultExtractor, DensityRulesClassifier, DocumentTitleMatchClassifier, ExpandTitleToContentFilter, ExtractorBase, IgnoreBlocksAfterContentFilter, IgnoreBlocksAfterContentFromEndFilter, InvertedFilter, KeepEverythingExtractor, KeepEverythingWithMinKWordsExtractor, KeepLargestBlockFilter, KeepLargestFulltextBlockFilter, LabelFusion, LabelToBoilerplateFilter, LabelToContentFilter, LargeBlockSameTagLevelToContentFilter, LargestContentExtractor, ListAtEndFilter, MarkEverythingBoilerplateFilter, MarkEverythingContentFilter, MinClauseWordsFilter, MinFulltextWordsFilter, MinWordsFilter, NumWordsRulesClassifier, NumWordsRulesExtractor, PrintDebugFilter, SimpleBlockFusionProcessor, SplitParagraphBlocksFilter, SurroundingToContentFilter, TerminatingBlocksFinder, TrailingHeadlineToBoilerplateFilter
public interface BoilerpipeFilter
A generic BoilerpipeFilter. Takes a TextDocument and processes it somehow.
Method Summary
Modifier and Type: boolean
Method: process(TextDocument doc)
Description: Processes the given document doc.
Method Details
process
Processes the given document doc.
- Parameters:
doc - The TextDocument that is to be processed.
- Returns:
true if changes have been made to the TextDocument.
- Throws:
BoilerpipeProcessingException
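Example (illustrative sketch, not part of the library):
The contract above can be shown with a small custom filter and a helper that chains several filters. To keep the example compilable on its own, TextDocument, BoilerpipeProcessingException and BoilerpipeFilter are redefined here as heavily simplified local stand-ins; they are assumptions made for illustration and do not reflect the real classes' fields or methods. DropShortBlocksFilter and applyAll are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

public class FilterSketch {

    // Simplified stand-in for the library's checked exception (assumption).
    static class BoilerpipeProcessingException extends Exception {
        BoilerpipeProcessingException(String message) {
            super(message);
        }
    }

    // Simplified stand-in for TextDocument (assumption): a mutable list of
    // text blocks and nothing else.
    static class TextDocument {
        final List<String> blocks;

        TextDocument(List<String> blocks) {
            this.blocks = new ArrayList<>(blocks);
        }
    }

    // Mirrors the documented contract: process the document and return true
    // if it was changed.
    interface BoilerpipeFilter {
        boolean process(TextDocument doc) throws BoilerpipeProcessingException;
    }

    // Hypothetical filter: removes every block that has fewer than minWords
    // whitespace-separated tokens.
    static class DropShortBlocksFilter implements BoilerpipeFilter {
        private final int minWords;

        DropShortBlocksFilter(int minWords) {
            this.minWords = minWords;
        }

        @Override
        public boolean process(TextDocument doc) throws BoilerpipeProcessingException {
            if (doc == null) {
                throw new BoilerpipeProcessingException("doc must not be null");
            }
            // removeIf returns true only if at least one block was removed,
            // which matches the "true if changes have been made" contract.
            return doc.blocks.removeIf(b -> b.trim().split("\\s+").length < minWords);
        }
    }

    // Hypothetical helper: runs the filters in order and reports whether any
    // of them changed the document.
    static boolean applyAll(TextDocument doc, List<BoilerpipeFilter> filters)
            throws BoilerpipeProcessingException {
        boolean changed = false;
        for (BoilerpipeFilter filter : filters) {
            changed |= filter.process(doc); // every filter runs; results are OR-ed
        }
        return changed;
    }
}
```

Because the return value reports whether the document changed, a caller can chain filters and skip follow-up work when applyAll returns false.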