Package lxml :: Module etree :: Class HTMLParser |
|
object -> _BaseParser -> HTMLParser
The HTML parser. This parser allows reading HTML into a normal XML tree. By default, it can read broken (non well-formed) HTML, depending on the capabilities of libxml2. Use the 'recover' option to switch this off.
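
The recovery behaviour described above can be illustrated with a minimal sketch. The sample markup and variable names are made up for illustration, and the exact shape of the repaired tree depends on the installed libxml2 version.

```python
from lxml import etree

# Deliberately broken HTML: unclosed <p> and <b> tags, no closing </html>.
broken_html = "<html><body><p>First paragraph<p>Second <b>bold</body>"

# With default settings (recover=True) the parser repairs what it can.
parser = etree.HTMLParser()
root = etree.fromstring(broken_html, parser)

# The result is an ordinary element tree that supports the usual API.
paragraphs = root.findall(".//p")
bold = root.find(".//b")

print(root.tag)
print(len(paragraphs))
print(etree.tostring(root, encoding="unicode"))
```
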
Available boolean keyword arguments:

* recover - try hard to parse through broken HTML (default: True)
* no_network - prevent network access (default: True)
* remove_blank_text - discard empty text nodes
* remove_comments - discard comments
* remove_pis - discard processing instructions
* compact - save memory for short text content (default: True)
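
As a hedged sketch of how the keyword arguments listed above might be combined, the following passes two of them to the constructor. The input string is hypothetical, and the argument names are the ones listed on this page; other lxml versions may accept a different set.

```python
from lxml import etree

# Hypothetical input containing a comment and whitespace-only text.
html = "<html><body><!-- note --><p>Hello</p>   <p>World</p></body></html>"

# Drop comments and whitespace-only text nodes while parsing.
parser = etree.HTMLParser(remove_comments=True, remove_blank_text=True)
root = etree.fromstring(html, parser)

# No comment nodes should survive parsing with remove_comments=True.
comments = root.xpath("//comment()")

# Element text content is unaffected by these options.
texts = [p.text for p in root.iter("p")]

print(comments)
print(texts)
```

Creating a separate parser per configuration, as here, keeps each set of options explicit at the call site.
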
Note that you should avoid sharing parsers between threads for performance reasons.

Method Summary | |
---|---|
x.__init__(...) initializes x; see x.__class__.__doc__ for signature | |
T.__new__(S, ...) -> a new object with type S, a subtype of T | |
Inherited from object | |
x.__delattr__('name') <==> del x.name | |
x.__getattribute__('name') <==> x.name | |
x.__hash__() <==> hash(x) | |
helper for pickle | |
helper for pickle | |
x.__repr__() <==> repr(x) | |
x.__setattr__('name', value) <==> x.name = value | |
x.__str__() <==> str(x) |
Class Variable Summary | |
---|---|
PyCObject |
__pyx_vtable__ = <PyCObject object at 0x401cb9c8>
|
Method Details |
---|
__init__(...)
x.__init__(...) initializes x; see x.__class__.__doc__ for signature
|
__new__(T, S, ...)
T.__new__(S, ...) -> a new object with type S, a subtype of T
|
Class Variable Details |
---|
__pyx_vtable__
|
Generated by Epydoc 2.1 on Sat Aug 18 12:44:27 2007 | http://epydoc.sf.net |