Package com.kohlschutter.boilerpipe
Interface BoilerpipeFilter
All Known Subinterfaces:
BoilerpipeExtractor
All Known Implementing Classes:
AddPrecedingLabelsFilter, ArticleExtractor, ArticleMetadataFilter, ArticleSentencesExtractor, BlockProximityFusion, BoilerplateBlockFilter, CanolaExtractor, ContentFusion, DefaultExtractor, DensityRulesClassifier, DocumentTitleMatchClassifier, ExpandTitleToContentFilter, ExtractorBase, IgnoreBlocksAfterContentFilter, IgnoreBlocksAfterContentFromEndFilter, InvertedFilter, KeepEverythingExtractor, KeepEverythingWithMinKWordsExtractor, KeepLargestBlockFilter, KeepLargestFulltextBlockFilter, LabelFusion, LabelToBoilerplateFilter, LabelToContentFilter, LargeBlockSameTagLevelToContentFilter, LargestContentExtractor, ListAtEndFilter, MarkEverythingBoilerplateFilter, MarkEverythingContentFilter, MinClauseWordsFilter, MinFulltextWordsFilter, MinWordsFilter, NumWordsRulesClassifier, NumWordsRulesExtractor, PrintDebugFilter, SimpleBlockFusionProcessor, SplitParagraphBlocksFilter, SurroundingToContentFilter, TerminatingBlocksFinder, TrailingHeadlineToBoilerplateFilter
public interface BoilerpipeFilter
A generic BoilerpipeFilter. Takes a TextDocument and processes it somehow.
Method Summary
Modifier and Type    Method                       Description
boolean              process(TextDocument doc)    Processes the given document doc.
Method Details
process
Processes the given document doc.
Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException
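
The contract described above (process a document, return true only if something changed) can be illustrated with a minimal sketch. To keep the example runnable without the library on the classpath, it does not use the real boilerpipe classes: SketchFilter, SketchDocument, SketchBlock, SketchProcessingException and MinWordsSketchFilter are hypothetical, simplified stand-ins invented for this illustration, and the idea that a document is a list of blocks carrying a content flag is an assumption of the sketch rather than something stated on this page.

```java
import java.util.ArrayList;
import java.util.List;

// Entry point: applies the same filter twice to show the meaning of the return value.
class FilterSketch {
    public static void main(String[] args) throws SketchProcessingException {
        SketchDocument doc = new SketchDocument();
        doc.blocks.add(new SketchBlock("Home About Contact", true));
        doc.blocks.add(new SketchBlock(
                "This paragraph has clearly more than five words in it.", true));

        SketchFilter filter = new MinWordsSketchFilter(5);
        System.out.println(filter.process(doc)); // prints true: the short block was demoted
        System.out.println(filter.process(doc)); // prints false: nothing left to change
    }
}

// Hypothetical stand-in for BoilerpipeProcessingException (a checked exception here).
class SketchProcessingException extends Exception {
    SketchProcessingException(String message) {
        super(message);
    }
}

// Hypothetical stand-in for a block of text with a content/boilerplate flag.
class SketchBlock {
    final String text;
    boolean isContent;

    SketchBlock(String text, boolean isContent) {
        this.text = text;
        this.isContent = isContent;
    }

    // Counts whitespace-separated tokens; an empty or blank block has zero words.
    int numWords() {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }
}

// Hypothetical stand-in for TextDocument: just an ordered list of blocks.
class SketchDocument {
    final List<SketchBlock> blocks = new ArrayList<>();
}

// Mirrors the shape of BoilerpipeFilter: one method, boolean result, checked exception.
interface SketchFilter {
    boolean process(SketchDocument doc) throws SketchProcessingException;
}

// Marks content blocks with fewer than minWords words as boilerplate.
class MinWordsSketchFilter implements SketchFilter {
    private final int minWords;

    MinWordsSketchFilter(int minWords) {
        this.minWords = minWords;
    }

    @Override
    public boolean process(SketchDocument doc) throws SketchProcessingException {
        if (doc == null) {
            throw new SketchProcessingException("document must not be null");
        }
        boolean changed = false;
        for (SketchBlock block : doc.blocks) {
            if (block.isContent && block.numWords() < minWords) {
                block.isContent = false; // demote to boilerplate
                changed = true;          // report that the document was modified
            }
        }
        return changed;
    }
}
```

Returning false on the second call is what makes the boolean useful: a caller that chains several filters can use it to tell whether another pass could still have any effect.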