XML packages

Showing projects tagged as Text Processing, Utilities, HTML, and XML

  • xhtml2pdf

    6.7 7.5 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • trafilatura

    6.7 8.7 Python
    Python package and command-line tool for gathering text on the Web: web crawling and scraping, plus extraction of text, metadata, and comments
  • aeneas

    6.4 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • Data Extractor

    0.9 6.0 Python
    Combines XPath, CSS selectors, and JSONPath for web data extraction.