HTML packages

Showing projects tagged as Text Processing and HTML

  • Jinja2

    9.0 8.2 L3 Python
    A very fast and expressive template engine.
  • Pattern

    8.8 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Sphinx

    8.7 9.9 L2 Python
    The Sphinx documentation generator
  • WeasyPrint

    8.5 9.5 L1 Python
    The awesome document factory
  • Python-Markdown

    7.7 7.5 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sumy

    7.4 6.3 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • trafilatura

    7.3 8.5 Python
    Python package and command-line tool for gathering text and metadata from the web: crawling, scraping, and extraction, with output as CSV, JSON, HTML, MD, TXT, or XML
  • lxml

    7.0 9.4 L2 Python
    The lxml XML toolkit for Python
  • xhtml2pdf

    6.8 7.1 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • markdown2

    6.7 8.4 Python
    A fast and complete implementation of Markdown in Python
  • aeneas

    6.5 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • Scrapely

    6.1 0.0 HTML
    A pure-Python HTML screen-scraping library
  • html5lib

    5.3 4.1 L2 Python
    Standards-compliant library for parsing and serializing HTML documents and fragments in Python
  • selectolax

    4.8 8.5 Cython
    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
  • mistletoe

    4.4 5.8 Python
    A fast, extensible and spec-compliant Markdown parser in pure Python.
  • MarkupSafe

    4.3 8.3 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • opengraph

    3.0 0.0 L5 Python
    A Python module to parse the Open Graph Protocol
  • htmldate

    2.1 6.9 Python
    Fast and robust date extraction from web pages, from Python or on the command line
  • Data Extractor

    1.0 7.2 Python
    Combine XPath, CSS selectors, and JSONPath for web data extraction.
  • Template Render Engine

    0.9 2.4 L4 Python
    Template Render Engine