Home | Trees | Indices | Help |
|
---|
|
|
|||
HTMLParser An html5lib HTML parser with lxml as tree. |
|||
XHTMLParser An html5lib XHTML Parser with lxml as tree. |
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|
|||
xhtml_parser = XHTMLParser()
|
|||
html_parser = <lxml.html.html5parser.HTMLParser object>
|
|||
__package__ =
|
|
Parses several HTML elements, returning a list of elements. The first item in the list may be a string. If no_leading_text is true, then it will be an error if there is leading text, and it will always be a list of only elements. If |
Parses a single HTML element; it is an error if there is more than one element, or if anything but whitespace precedes or follows the element. If create_parent is true (or is a tag name) then a parent node will be created to encapsulate the HTML in a single element. In this case, leading or trailing text is allowed. |
Parse the html, returning a single element/document. This tries to minimally parse the chunk of text, without knowing if it is a fragment or a document. base_url will set the document's base_url attribute (and the tree's docinfo.URL) |
Home | Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0.1 on Sat Jun 3 17:42:30 2017 | http://epydoc.sourceforge.net |