HTML and XML Parsers.
Tidy's HTML parser corrects many conditions and enforces certain user preferences during the parsing process. The XML parser produces a tree of nodes useful to Tidy but also suitable for use in other XML processing applications.
Go to the source code of this file.
Functions | |
| TY_PRIVATE Bool | TY_❪CheckNodeIntegrity❫ (Node *node) |
| Is used to perform a node integrity check recursively after parsing an HTML or XML document. | |
| TY_PRIVATE void | TY_❪CoerceNode❫ (TidyDocImpl *doc, Node *node, TidyTagId tid, Bool obsolete, Bool expected) |
| Transforms a given node to another element, for example, from a p to a br. | |
| TY_PRIVATE Node * | TY_❪DiscardElement❫ (TidyDocImpl *doc, Node *element) |
| Remove node from markup tree and discard it. | |
| TY_PRIVATE Node * | TY_❪DropEmptyElements❫ (TidyDocImpl *doc, Node *node) |
| Trims a tree of empty elements recursively, returning the next node. | |
| TY_PRIVATE void | TY_❪InsertNodeAfterElement❫ (Node *element, Node *node) |
| Insert node into markup tree after element. | |
| TY_PRIVATE void | TY_❪InsertNodeAtEnd❫ (Node *element, Node *node) |
| Insert node into markup tree as the last element of content of element. | |
| TY_PRIVATE void | TY_❪InsertNodeAtStart❫ (Node *element, Node *node) |
| Insert node into markup tree as the firt element of content of element. | |
| TY_PRIVATE void | TY_❪InsertNodeBeforeElement❫ (Node *element, Node *node) |
| Insert node into markup tree before element. | |
| TY_PRIVATE Bool | TY_❪IsBlank❫ (Lexer *lexer, Node *node) |
| Indicates whether or not a text node is blank, meaning that it consists of nothing, or a single space. | |
| TY_PRIVATE Bool | TY_❪IsJavaScript❫ (Node *node) |
| Indicates whether or not a node is declared as containing javascript code. | |
| TY_PRIVATE Bool | TY_❪IsNewNode❫ (Node *node) |
| Used to check if a node uses CM_NEW, which determines how attributes without values should be printed. | |
| TY_PRIVATE void | TY_❪ParseDocument❫ (TidyDocImpl *doc) |
| Parses a document after lexing using the HTML parser. | |
| TY_PRIVATE void | TY_❪ParseXMLDocument❫ (TidyDocImpl *doc) |
| Parses a document after lexing using the XML parser. | |
| TY_PRIVATE Node * | TY_❪RemoveNode❫ (Node *node) |
| Extract a node and its children from a markup tree. | |
| TY_PRIVATE Bool | TY_❪TextNodeEndWithSpace❫ (Lexer *lexer, Node *node) |
| Indicates whether or not a text node ends with a space or newline. | |
| TY_PRIVATE Node * | TY_❪TrimEmptyElement❫ (TidyDocImpl *doc, Node *element) |
| Trims a single, empty element, returning the next node. | |
| TY_PRIVATE Bool | TY_❪XMLPreserveWhiteSpace❫ (TidyDocImpl *doc, Node *element) |
| Indicates whether or not whitespace is to be preserved in XHTML/XML documents. | |