Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter downXML packages
Showing projects tagged as HTML and XML
-
trafilatura
7.5 7.2 PythonPython & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
aeneas
6.5 0.0 L3 Pythonaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) -
plutoprint
3.4 9.4 PythonA Python Library for Generating PDFs and Images from HTML, powered by PlutoBook -
GoBeautifulSoup
0.3 3.3 PythonGoBeautifulSoup is a high-performance HTML/XML parsing library that provides a 100% compatible API with BeautifulSoup4, but powered by Go for dramatically improved performance. It's designed as a drop-in replacement for BeautifulSoup4 with significant speed improvements.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.