Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter down-
Scientific
-
Linguistic
-
Information Analysis
-
Utilities
-
Internet
-
Natural Language Processing
-
Markup
-
Artificial Intelligence
-
HTML
-
WWW
-
HTTP
-
Human Machine Interfaces
-
Visualization
-
Database
-
Protocol Translator
-
Web Content Extracting
-
Interface Engine
-
Science And Data Analysis
-
System
-
Application Frameworks
-
Archiving
-
Specific Formats Processing
-
WSGI
-
XML
-
Data Mining
-
Education
-
Multimedia
-
Python
-
Opendata
-
Pyramid
-
Scraping
-
Web Scraping
-
HTTP Servers
-
Communications
Engineering packages
Showing projects tagged as Text Processing and Engineering
-
Pattern
9.1 0.0 L2 PythonWeb mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. -
coala
8.1 0.0 L4 Pythoncoala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use. -
TextDistance
6.9 5.1 PythonCompute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. -
pdftabextract
6.5 0.0 L3 PythonA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. -
aeneas
6.3 0.0 L3 Pythonaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) -
quepy
5.7 0.0 L5 PythonA python framework to transform natural language questions to queries in a database query language. -
pymorphy2
4.8 0.0 PythonMorphological analyzer / inflection engine for Russian and Ukrainian languages. -
trafilatura
4.4 7.2 PythonPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments -
PatZilla
1.9 4.8 PythonPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources. -
Kotori
1.8 3.6 PythonA flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple. -
htmldate
1.6 7.8 PythonFast and robust date extraction from web pages, with Python or on the command-line -
pntl
0.9 2.0 PythonPractical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-of-speech (POS) tags, chunking (CHK), name entity recognition (NER), semantic role labeling (SRL) and syntactic parsing (PSG) with skip-gram all in Python and still more features will be added. The website give is for downlarding Senna tool
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.