Class LargeBlockSameTagLevelToContentFilter

  • All Implemented Interfaces:
    BoilerpipeFilter

    public final class LargeBlockSameTagLevelToContentFilter
    extends java.lang.Object
    implements BoilerpipeFilter
    Marks all blocks as content that:
    1. are on the same tag-level as very likely main content (usually the level of the largest block)
    2. have a significant number of words, currently: at least 100