Text Processing packages

Showing projects tagged as HTML, HTTP, and Text Processing

  • Pattern

    9.0 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Jinja2

    9.0 8.1 L3 Python
    A very fast and expressive template engine.
  • Sphinx

    8.6 9.7 L2 Python
    The Sphinx documentation generator
  • WeasyPrint

    8.3 8.6 L1 Python
    The awesome document factory
  • Python-Markdown

    7.7 5.3 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • Scrapely

    6.2 0.0 HTML
    A pure-Python HTML screen-scraping library
  • trafilatura

    5.9 8.0 Python
    Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
  • selectolax

    4.2 7.1 Cython
    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
  • MarkupSafe

    4.0 0.0 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • htmldate

    1.8 0.0 Python
    Fast and robust date extraction from web pages, with Python or on the command line
  • Template Render Engine

    0.9 0.0 L4 Python
    Template Render Engine