Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter down-
Internet
-
Engineering
-
JSON
-
General
-
HTTP
-
HTML
-
Linguistic
-
Scientific
-
WWW
-
Web Content Extracting
-
Information Analysis
-
Markup
-
XML
-
System
-
Command-line Tools
-
Python
-
Productivity Tools
-
Natural Language Processing
-
Human Machine Interfaces
-
Protocol Translator
-
Printing
-
Parser
-
Database
-
Specific Formats Processing
-
Networking
-
Indexing
-
Interface Engine
-
Documentation
-
Visualization
-
WSGI
-
Data Mining
-
Education
-
Multimedia
-
Web Crawling
-
Opendata
-
Security
-
Science And Data Analysis
-
Communications
-
Application Frameworks
-
Type Hints
-
Archiving
-
Terminals
-
OCR
-
RESTful API
-
Markdown
-
Pyramid
-
HTTP Servers
-
PDF
Text Processing packages
Showing projects tagged as Utilities and Text Processing
-
httpie
9.7 5.6 L3 Python🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. -
HTTP Prompt
8.6 0.0 L4 PythonAn interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie -
PyMuPDF
7.9 9.7 PythonPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. -
Lark
7.7 8.0 PythonLark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity. -
trafilatura
6.9 9.0 PythonPython & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
python-readability
6.7 0.0 Pythonfast python port of arc90's readability tool, updated to match latest readability.js! -
pdftabextract
6.5 0.0 L3 PythonA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. -
aeneas
6.4 0.0 L3 Pythonaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) -
quepy
5.5 0.0 L5 PythonA python framework to transform natural language questions to queries in a database query language. -
Data Profiler
5.2 5.0 PythonWhat's in your data? Extract schema, statistics and entities from datasets -
Goose3
4.2 4.9 HTMLA Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html -
inscriptis -- HTML to text conversion library, command line client and Web service
2.8 8.5 PythonA python based HTML to text conversion library, command line client and Web service. -
AnyAscii
2.7 0.0 KotlinUnicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET -
json-streamer
2.5 2.4 PythonA fast streaming JSON parser for Python that generates SAX-like events using yajl -
PatZilla
2.1 5.4 PythonPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources. -
Kotori
2.0 4.9 PythonA flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple. -
odin-slides
1.8 7.8 PythonThis is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained Transformer (GPT) of your choice. Leveraging the capabilities of Large Language Models (LLM), odin-slides enables you to turn the lengthiest Word documents into well organized presentations.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.