Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter downText Processing packages
Showing projects tagged as Scientific, Utilities, and Text Processing
-
trafilatura
7.1 9.0 PythonPython & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
pdftabextract
6.5 0.0 L3 PythonA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. -
aeneas
6.4 0.0 L3 Pythonaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) -
quepy
5.6 0.0 L5 PythonA python framework to transform natural language questions to queries in a database query language. -
PatZilla
2.2 5.4 PythonPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources. -
Kotori
2.0 1.8 PythonA flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.