Selected Tags

Click on a tag to remove it

XML packages

Showing projects tagged as HTML, Linguistic, and XML

  • trafilatura

    7.0 9.0 Python
    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  • aeneas

    6.4 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)