pytidylib

Python wrapper for HTML Tidy (tidylib) on Python 2 and 3

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Environment
- Other Environment
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language
- Python
- Python :: 3
Topic

Project description

PyTidyLib is a Python package that wraps the HTML Tidy library. This allows you, from Python code, to “fix” invalid (X)HTML markup. Some of the library’s many capabilities include:

Clean up unclosed tags and unescaped characters such as ampersands
Output HTML 4 or XHTML, strict or transitional, and add missing doctypes
Convert named entities to numeric entities, which can then be used in XML documents without an HTML doctype.
Clean up HTML from programs such as Word (to an extent)
Indent the output, including proper (i.e. no) indenting for pre elements, which some (X)HTML indenting code overlooks.

Version usage

Windows: 0.2.0 and later
Python 3: Tests pass on 0.2.3
tidylib itself is not actively updated and may have problems with newer HTML

Small example of use

The following code cleans up an invalid HTML document and sets an option:

from tidylib import tidy_document
document, errors = tidy_document('''<p>f&otilde;o <img src="bar.jpg">''',
  options={'numeric-entities':1})
print document
print errors

Docs

Documentation is shipped with the source distribution and is available at the PyTidyLib web page.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Environment
- Other Environment
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Programming Language
- Python
- Python :: 3
Topic

Release history Release notifications | RSS feed

0.3.2

Nov 16, 2016

0.3.1

Sep 29, 2016

0.3.0

Sep 22, 2016

This version

0.2.4

Dec 20, 2014

0.2.3

Jul 20, 2014

0.2.2

Jul 14, 2014

0.2.1

Nov 18, 2009

0.2.0

Nov 6, 2009

0.1.2

Aug 15, 2009

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytidylib-0.2.4.tar.gz (86.7 kB view hashes)

Uploaded Dec 20, 2014 Source

Hashes for pytidylib-0.2.4.tar.gz

Hashes for pytidylib-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`0af07bd8ebd256af70ca925ada9337faf16d85b3072624f975136a5134150ab6`
MD5	`2a28267370c9409b592cdb786649cb25`
BLAKE2b-256	`b4a0b70cf2b7b4ee1f9d8fa0f1b4abbbac081a2638a580dabf29b8fb554d5fc1`