HTML packages

Showing projects tagged as Text Processing and HTML

  • Pattern

    9.1 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Jinja2

    9.0 8.4 L3 Python
    A very fast and expressive template engine.
  • Sphinx

    8.6 9.8 L2 Python
    The Sphinx documentation generator
  • WeasyPrint

    8.3 9.5 L1 Python
    The awesome document factory
  • Python-Markdown

    7.7 7.1 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sumy

    7.4 5.4 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • lxml

    6.9 6.6 L2 Python
    The lxml XML toolkit for Python
  • markdown2

    6.8 9.1 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • xhtml2pdf

    6.7 2.7 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • aeneas

    6.4 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • Scrapely

    6.2 0.0 HTML
    A pure-python HTML screen-scraping library
  • html5lib

    5.1 0.0 L2 Python
    Standards-compliant library for parsing and serializing HTML documents and fragments in Python
  • trafilatura

    4.6 9.2 Python
    Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
  • selectolax

    4.2 5.8 Cython
    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
  • MarkupSafe

    4.1 4.8 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • mistletoe

    3.9 0.0 Python
    A fast, extensible and spec-compliant Markdown parser in pure Python.
  • opengraph

    2.9 0.0 L5 Python
    A python module to parse the Open Graph Protocol
  • htmldate

    1.7 7.1 Python
    Fast and robust date extraction from web pages, with Python or on the command-line
  • Template Render Engine

    0.9 0.0 L4 Python
    Template Render Engine
  • Data Extractor

    0.9 0.0 Python
    Combine XPath, CSS Selectors and JSONPath for Web data extracting.