Description
trrex provides a pure Python function that turns a set of keywords (strings) into a single efficient regular expression. You can then use that pattern for operations such as replacing and extracting keywords. The package's name comes from the internal trie used to build the regular expression (trie to regex). It is fast and integrates with pandas, spaCy, and others.
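To illustrate the trie-to-regex idea described above, here is a minimal sketch that uses only the standard library. It is not the package's own implementation: the names `build_trie`, `trie_to_pattern`, and `make_pattern` are hypothetical and chosen for this example, and wrapping the result in `\b` word boundaries is an assumption.

```python
import re


def build_trie(words):
    """Insert each keyword into a nested-dict trie; the key '' marks a word end."""
    root = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node[""] = True
    return root


def trie_to_pattern(node):
    """Recursively turn a trie node into a regex fragment."""
    ends_here = "" in node
    branches = [
        re.escape(ch) + trie_to_pattern(node[ch])
        for ch in sorted(key for key in node if key != "")
    ]
    if not branches:
        return ""
    if len(branches) == 1 and not ends_here:
        # A single continuation needs no grouping.
        return branches[0]
    group = "(?:" + "|".join(branches) + ")"
    # If a keyword ends at this node, everything after it is optional.
    return group + "?" if ends_here else group


def make_pattern(words):
    """Hypothetical helper: build a word-bounded pattern for the keywords."""
    return r"\b" + trie_to_pattern(build_trie(words)) + r"\b"


pattern = make_pattern(["baby", "bat", "bad"])
print(pattern)
print(re.findall(pattern, "The baby was scared by the bad bat."))
print(re.sub(pattern, "X", "a bad bat"))
```

Because shared prefixes are factored out once (`ba` in this case), the regex engine does not have to retry each keyword from the start, which is the usual motivation for building the pattern from a trie rather than joining the keywords with `|`.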
Alternatives to "Efficient keyword mining with regular expressions" and similar packages
Based on the "Text Processing" category.
- pydantic: Data validation using Python type hints.
- fuzzywuzzy: Fuzzy string matching in Python.
- pypinyin (汉字拼音转换工具, Python 版): Chinese character to pinyin conversion tool for Python.
- sqlparse: A non-validating SQL parser module for Python.
- Lark: A parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
- Pygments: A generic syntax highlighter.
- phonenumbers: Python port of Google's libphonenumber.
- ftfy: Fixes mojibake and other glitches in Unicode text, after the fact.
- TextDistance: Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.
- chardet: Python character encoding detector.
- jellyfish: A Python library for doing approximate and phonetic matching of strings.
- shortuuid: A generator library for concise, unambiguous and URL-safe UUIDs.
- python-user-agents: A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
- python-slugify: Returns Unicode slugs.
- Data Profiler: What's in your data? Extract schema, statistics and entities from datasets.
- Levenshtein: A Python C extension module with functions for fast computation of Levenshtein distance and string similarity.
- pyparsing: Python library for creating PEG parsers [Moved to: https://github.com/pyparsing/pyparsing].
- pyfiglet: An implementation of figlet written in Python.
- xpinyin: Translates Chinese hanzi to pinyin (拼音) in Python.
- Construct: Declarative data structures for Python that allow symmetric parsing and building.
- python-nameparser: A simple Python module for parsing human names into their individual components.
- ijson: Iterative JSON parser with Pythonic interface.
- awesome-slugify: Python flexible slugify function.
- Charset Normalizer: Truly universal encoding detector in pure Python.
- unicode-slugify: A slugifier that works in Unicode.
- pangu.py: Paranoid text spacing in Python.
- json-streamer: A fast streaming JSON parser for Python that generates SAX-like events using yajl.
- uniout: Never see escaped bytes in output.
- nider: Python package to add text to images, textures and different backgrounds.
- HaikunatorPY: Generate Heroku-like random names to use in your Python applications.
- simplematch: Minimal, super readable string pattern matching for Python.
- json2xml: JSON to XML converter in Python 3.
- Python Left-Right Parser: Python parser.
- Atoma: Atom, RSS and JSON feed parser for Python 3.
- unidecode: ASCII transliterations of Unicode text.
- difflib: (Python standard library) Helpers for computing deltas.