Utilities packages

Showing projects tagged as Text Processing, HTML, and Utilities

  • Sphinx

    8.7 9.9 L2 Python
    The Sphinx documentation generator
  • trafilatura

    7.0 9.0 Python
    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  • xhtml2pdf

    6.7 6.6 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • markdown2

    6.7 8.6 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • aeneas

    6.4 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • Data Extractor

    1.0 5.4 Python
    Combine XPath, CSS Selectors and JSONPath for Web data extracting.