sanitize alternatives and similar packages
Based on the "Web Content Extracting" category
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest. Visit our partner's website for more details.
Do you think we are missing an alternative of sanitize or a related project?
sanitize is a Python module for making sure various things (e.g. HTML) are safe to use.
It was originally written by Mark Pilgrim and is distributed under the BSD license.
>>> from sanitize import HTML >>> HTML('<b>hello') '<b>hello</b>' >>> HTML('<img>') '<img />' >>> HTML(("<b><b><b>hello") ... ) '<b><b><b>hello</b></b></b>' >>> HTML('<img src="foo"/') '' >>> HTML('<input type="checkbox" checked>') '<input type="checkbox" checked="checked" />' >>> # dangerous tags (a small sample) ... >>> HTML('safe<applet code="foo.class" codebase="http://example.com/"></applet> <b>description</b>') 'safe <b>description</b>' >>> HTML('safe<frameset rows="*"><frame src="http://example.com/"></frameset> <b>description</b>') 'safe <b>description</b>' >>> # bad protocols (a small sample) >>> HTML('<a href="java' + chr(1) + 'script:foo">bar</a>') '<a href="#foo">bar</a>' >>> HTML('<a href="vbscript:foo">bar</a>') '<a href="#foo">bar</a>' >>>
To see more usage examples see
python-sanitize is available on pypi
So easily install it by
pip install sanitize
$ easy_install sanitize
Another way is by cloning
python-sanitize's git repository
$ git clone git://github.com/Alir3z4/python-sanitize.git
Then install it by running
$ python setup.py install
To run unit tests:
$ python setup.py test
Sanitize is distributed under BSD license.
*Note that all licence references and agreements mentioned in the sanitize README section above are relevant to that project's source code only.