Class CommonTagActions

java.lang.Object
com.kohlschutter.boilerpipe.sax.CommonTagActions

public abstract class CommonTagActions extends Object
Defines an action that is to be performed whenever a particular tag occurs during HTML parsing.
  • Field Details

    • TA_IGNORABLE_ELEMENT

      public static final TagAction TA_IGNORABLE_ELEMENT
      Marks this tag as "ignorable", i.e. all its inner content is silently skipped.
    • TA_ANCHOR_TEXT

      public static final TagAction TA_ANCHOR_TEXT
      Marks this tag as "anchor" (this should usually only be set for the <A> tag). Anchor tags may not be nested. There is a bug in certain versions of NekoHTML which still allows nested tags. If boilerpipe encounters such nestings, a SAXException is thrown.
    • TA_BODY

      public static final TagAction TA_BODY
      Marks this tag the body element (this should usually only be set for the <BODY> tag).
    • TA_INLINE_WHITESPACE

      public static final TagAction TA_INLINE_WHITESPACE
      Marks this tag a simple "inline" element, which generates whitespace, but no new block.
    • TA_INLINE

      @Deprecated public static final TagAction TA_INLINE
      Deprecated.
    • TA_INLINE_NO_WHITESPACE

      public static final TagAction TA_INLINE_NO_WHITESPACE
      Marks this tag a simple "inline" element, which neither generates whitespace, nor a new block.
    • PAT_FONT_SIZE

      private static final Pattern PAT_FONT_SIZE
    • TA_BLOCK_LEVEL

      public static final TagAction TA_BLOCK_LEVEL
      Explicitly marks this tag a simple "block-level" element, which always generates whitespace
    • TA_FONT

      public static final TagAction TA_FONT
      Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size.
  • Constructor Details

    • CommonTagActions

      private CommonTagActions()