42 Text Processing packages and projects
-
Lark
7.7 7.7 PythonLark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity. -
pyparsing
4.9 7.3 PythonPython library for creating PEG parsers [Moved to: https://github.com/pyparsing/pyparsing] -
jellyfish
5.9 6.9 Jupyter Notebook🪼 a python library for doing approximate and phonetic matching of strings. -
TextDistance
6.9 7.0 Python📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. -
Construct
4.5 7.6 PythonConstruct: Declarative data structures for python that allow symmetric parsing and building -
python-user-agents
5.4 0.0 L4 PythonA Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings. -
python-nameparser
4.2 4.3 L2 PythonA simple Python module for parsing human names into their individual components -
Levenshtein
5.0 0.0 L1 CThe Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity -
json-streamer
2.6 2.4 PythonA fast streaming JSON parser for Python that generates SAX-like events using yajl -
msgspec
- 8.9 PythonA fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML -
Data Profiler
5.1 7.0 PythonWhat's in your data? Extract schema, statistics and entities from datasets -
Efficient keyword mining with regular expressions
1.9 3.5 PythonEfficient string matching with regular expressions -
AnyAscii
2.6 4.7 KotlinUnicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Promo
www.influxdata.com
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.