Interface BoilerpipeFilter

All Known Subinterfaces:
BoilerpipeExtractor
All Known Implementing Classes:
AddPrecedingLabelsFilter, ArticleExtractor, ArticleMetadataFilter, ArticleSentencesExtractor, BlockProximityFusion, BoilerplateBlockFilter, CanolaExtractor, ContentFusion, DefaultExtractor, DensityRulesClassifier, DocumentTitleMatchClassifier, ExpandTitleToContentFilter, ExtractorBase, IgnoreBlocksAfterContentFilter, IgnoreBlocksAfterContentFromEndFilter, InvertedFilter, KeepEverythingExtractor, KeepEverythingWithMinKWordsExtractor, KeepLargestBlockFilter, KeepLargestFulltextBlockFilter, LabelFusion, LabelToBoilerplateFilter, LabelToContentFilter, LargeBlockSameTagLevelToContentFilter, LargestContentExtractor, ListAtEndFilter, MarkEverythingBoilerplateFilter, MarkEverythingContentFilter, MinClauseWordsFilter, MinFulltextWordsFilter, MinWordsFilter, NumWordsRulesClassifier, NumWordsRulesExtractor, PrintDebugFilter, SimpleBlockFusionProcessor, SplitParagraphBlocksFilter, SurroundingToContentFilter, TerminatingBlocksFinder, TrailingHeadlineToBoilerplateFilter

public interface BoilerpipeFilter
A generic BoilerpipeFilter. Takes a TextDocument and processes it somehow.
  • Method Summary

    Modifier and Type
    Method
    Description
    boolean
    Processes the given document doc.