Selected Tags

Click on a tag to remove it

More Tags

Click on a tag to add it and filter down

XML packages

Showing projects tagged as Text Processing, Utilities, Markup, and XML

  • trafilatura

    7.0 9.0 Python
    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  • xhtml2pdf

    6.7 7.5 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • aeneas

    6.4 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)