Package com.kohlschutter.boilerpipe.sax
Class CommonTagActions
java.lang.Object
com.kohlschutter.boilerpipe.sax.CommonTagActions
Defines an action that is to be performed whenever a particular tag occurs during HTML parsing.
-
Nested Class Summary
Nested Classes
Modifier and Type | Class | Description
static final class | | CommonTagActions for block-level elements, which triggers some LabelAction on the generated TextBlock.
static final class | |
static final class | |
Field Summary
Fields
Modifier and Type | Field | Description
private static final Pattern | PAT_FONT_SIZE |
static final TagAction | TA_ANCHOR_TEXT | Marks this tag as "anchor" (this should usually only be set for the <A> tag).
static final TagAction | TA_BLOCK_LEVEL | Explicitly marks this tag as a simple "block-level" element, which always generates whitespace.
static final TagAction | TA_BODY | Marks this tag as the body element (this should usually only be set for the <BODY> tag).
static final TagAction | TA_FONT | Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size.
static final TagAction | TA_IGNORABLE_ELEMENT | Marks this tag as "ignorable", i.e., all its inner content is silently skipped.
static final TagAction | TA_INLINE | Deprecated. Use TA_INLINE_WHITESPACE instead.
static final TagAction | TA_INLINE_NO_WHITESPACE | Marks this tag as a simple "inline" element, which generates neither whitespace nor a new block.
static final TagAction | TA_INLINE_WHITESPACE | Marks this tag as a simple "inline" element, which generates whitespace, but no new block.
Constructor Summary
Constructors -
Method Summary
-
Field Details
-
TA_IGNORABLE_ELEMENT
Marks this tag as "ignorable", i.e., all its inner content is silently skipped.
TA_ANCHOR_TEXT
Marks this tag as "anchor" (this should usually only be set for the <A> tag). Anchor tags may not be nested. A bug in certain versions of NekoHTML still allows nested anchor tags; if boilerpipe encounters such nesting, a SAXException is thrown.
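The nesting rule above can be illustrated with a minimal sketch. The class AnchorNestingGuard and its methods are hypothetical stand-ins written for this example; they are not part of boilerpipe and do not reproduce its implementation. The sketch only assumes that a nested opening anchor is reported as an org.xml.sax.SAXException, as the description states.

```java
import org.xml.sax.SAXException;

/** Hypothetical stand-in (not boilerpipe API): rejects nested anchor tags. */
final class AnchorNestingGuard {
    // Number of currently open <A> elements; this sketch never lets it exceed 1.
    private int depth = 0;

    /** Call when an opening <A> tag is seen. */
    void start() throws SAXException {
        if (depth > 0) {
            // A second <A> opened before the first one was closed.
            throw new SAXException("Nested <A> elements are not supported");
        }
        depth++;
    }

    /** Call when a closing </A> tag is seen. */
    void end() {
        if (depth > 0) {
            depth--;
        }
    }

    /** Returns true while an anchor element is open. */
    boolean inAnchor() {
        return depth > 0;
    }
}
```

A caller would invoke start() and end() from its own SAX content handler; the failed nested start() leaves the outer anchor open, so the handler can still close it normally.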
TA_BODY
Marks this tag as the body element (this should usually only be set for the <BODY> tag).
TA_INLINE_WHITESPACE
Marks this tag as a simple "inline" element, which generates whitespace, but no new block.
TA_INLINE
Deprecated. Use TA_INLINE_WHITESPACE instead.
TA_INLINE_NO_WHITESPACE
Marks this tag as a simple "inline" element, which generates neither whitespace nor a new block.
PAT_FONT_SIZE
-
TA_BLOCK_LEVEL
Explicitly marks this tag as a simple "block-level" element, which always generates whitespace.
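To make the difference between the inline and block-level actions concrete, here is a small self-contained sketch. TagKindDemo, its Kind enum, and all method names are hypothetical and exist only for this illustration. The sketch assumes that a block-level tag closes the current text block and starts a new one, which is inferred from the contrast with the inline descriptions rather than stated explicitly.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical model (not boilerpipe API) of the three whitespace behaviours. */
final class TagKindDemo {
    enum Kind { INLINE_WHITESPACE, INLINE_NO_WHITESPACE, BLOCK_LEVEL }

    private final List<String> blocks = new ArrayList<>();
    private final StringBuilder current = new StringBuilder();

    /** Appends character data to the block under construction. */
    void text(String s) {
        current.append(s);
    }

    /** Applies the effect of a tag boundary (start or end) of the given kind. */
    void tagBoundary(Kind kind) {
        switch (kind) {
            case INLINE_WHITESPACE:
                appendSpace();   // whitespace, but no new block
                break;
            case INLINE_NO_WHITESPACE:
                break;           // neither whitespace nor a new block
            case BLOCK_LEVEL:
                flush();         // assumed: closes the current block
                break;
        }
    }

    /** Closes the last block and returns all non-empty blocks seen so far. */
    List<String> finish() {
        flush();
        return new ArrayList<>(blocks);
    }

    private void appendSpace() {
        int n = current.length();
        if (n > 0 && current.charAt(n - 1) != ' ') {
            current.append(' ');
        }
    }

    private void flush() {
        String t = current.toString().trim();
        if (!t.isEmpty()) {
            blocks.add(t);
        }
        current.setLength(0);
    }
}
```

Feeding "Hello", an INLINE_NO_WHITESPACE boundary, "World", an INLINE_WHITESPACE boundary, "again", a BLOCK_LEVEL boundary, and "Next" yields the two blocks "HelloWorld again" and "Next".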
TA_FONT
Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size.
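One way to track absolute and relative font sizes is a stack of effective sizes, sketched below. FontSizeTracker is a hypothetical class written for this example, not boilerpipe's implementation. Two details are assumptions: the regular expression (PAT_FONT_SIZE is undocumented on this page, so the pattern here is only a plausible guess at a sign followed by one digit) and the default size of 3, which is the conventional HTML base font size.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch (not boilerpipe API): tracks nested <FONT size=...> values. */
final class FontSizeTracker {
    // Assumed format: optional sign, then a single digit, e.g. "5", "+1", "-2".
    private static final Pattern SIZE = Pattern.compile("([+-]?)([0-9])");
    // Assumed default: the conventional HTML base font size.
    private static final int DEFAULT_SIZE = 3;

    private final Deque<Integer> stack = new ArrayDeque<>();

    /** Effective size of the innermost open <FONT> element, or the default. */
    int current() {
        return stack.isEmpty() ? DEFAULT_SIZE : stack.peek();
    }

    /** Call on an opening <FONT> tag; sizeAttr may be null or unparseable. */
    void start(String sizeAttr) {
        int size = current(); // inherit when the attribute is absent or invalid
        if (sizeAttr != null) {
            Matcher m = SIZE.matcher(sizeAttr.trim());
            if (m.matches()) {
                int val = Integer.parseInt(m.group(2));
                String sign = m.group(1);
                if (sign.isEmpty()) {
                    size = val;             // absolute size
                } else if (sign.equals("+")) {
                    size = current() + val; // relative increase
                } else {
                    size = current() - val; // relative decrease
                }
            }
        }
        stack.push(size);
    }

    /** Call on a closing </FONT> tag. */
    void end() {
        if (!stack.isEmpty()) {
            stack.pop();
        }
    }
}
```

Pushing the inherited size for tags without a usable size attribute keeps start() and end() calls balanced, so closing a tag always restores the size of its parent.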
-
-
Constructor Details
-
CommonTagActions
private CommonTagActions()
-