mirror of
https://git.FreeBSD.org/ports.git
synced 2024-12-24 04:33:24 +00:00
ffefb4d23d
PyTidyLib is a Python package that wraps the HTML Tidy library. This allows you, from Python code, to "fix" invalid (X)HTML markup. Some of the library's many capabilities include: * Clean up unclosed tags and unescaped characters such as ampersands * Output HTML 4 or XHTML, strict or transitional, and add missing doctypes * Convert named entities to numeric entities, which can then be used in XML documents without an HTML doctype. * Clean up HTML from programs such as Word (to an extent) * Indent the output, including proper (i.e. no) indenting for pre elements, which some (X)HTML indenting code overlooks. WWW: http://countergram.com/open-source/pytidylib
14 lines
685 B
Plaintext
14 lines
685 B
Plaintext
PyTidyLib is a Python package that wraps the HTML Tidy library. This allows
|
|
you, from Python code, to "fix" invalid (X)HTML markup. Some of the library's
|
|
many capabilities include:
|
|
|
|
* Clean up unclosed tags and unescaped characters such as ampersands
|
|
* Output HTML 4 or XHTML, strict or transitional, and add missing doctypes
|
|
* Convert named entities to numeric entities, which can then be used in XML
|
|
documents without an HTML doctype.
|
|
* Clean up HTML from programs such as Word (to an extent)
|
|
* Indent the output, including proper (i.e. no) indenting for pre elements,
|
|
which some (X)HTML indenting code overlooks.
|
|
|
|
WWW: http://countergram.com/open-source/pytidylib
|