HTML packages

Showing projects tagged as Markup, XML, and HTML

  • trafilatura

    7.4 8.1 Python
    Python and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, and output as CSV, JSON, HTML, MD, TXT, or XML
  • lxml

    7.0 9.4 L2 Python
    The lxml XML toolkit for Python
  • xhtml2pdf

    6.7 7.0 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • aeneas

    6.5 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)