Efficient keyword mining with regular expressions

Efficient string matching with regular expressions

Documentation: trrex.readthedocs.io


Popularity: 2.0 (Growing)

Activity: 7.7

Stars: 145

Watchers: 3

Forks: 6

Last commit: 12 days ago

Description

This package provides a pure Python function that represents a set of keywords (strings) as a single efficient regular expression. With that expression you can perform operations such as replacing and extracting keywords. The package's name comes from the internal trie used to build the regular expression (trie to regex). It is fast and integrates with pandas, spaCy, and other libraries.
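The trie-to-regex idea described above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the package's own implementation: the helper names `build_trie`, `trie_to_pattern`, and `keywords_to_regex` are hypothetical, and wrapping the result in `\b` word boundaries is an assumption made here for the example.

```python
import re


def build_trie(words):
    """Build a nested-dict trie; the empty-string key marks the end of a word."""
    trie = {}
    for word in words:
        node = trie
        for ch in word:
            node = node.setdefault(ch, {})
        node[""] = {}  # end-of-word marker
    return trie


def trie_to_pattern(node):
    """Turn a trie node into a regex fragment that shares common prefixes."""
    is_end = "" in node
    branches = [
        re.escape(ch) + trie_to_pattern(node[ch])
        for ch in sorted(key for key in node if key != "")
    ]
    if not branches:
        return ""
    if len(branches) == 1 and not is_end:
        return branches[0]
    group = "(?:" + "|".join(branches) + ")"
    # If a keyword ends here, the remaining suffixes are optional.
    return group + "?" if is_end else group


def keywords_to_regex(words):
    """Compile one regex matching any keyword as a whole word (assumed \\b anchors)."""
    return re.compile(r"\b" + trie_to_pattern(build_trie(words)) + r"\b")


pattern = keywords_to_regex(["baby", "bat", "bad"])
print(pattern.pattern)  # \bba(?:by|d|t)\b
print(pattern.findall("The baby was scared by the bad bat"))  # ['baby', 'bad', 'bat']
print(pattern.sub("***", "The bad bat"))  # The *** ***
```

Because sibling branches of a trie never start with the same character, the regex engine can reject a non-matching alternative after one character instead of retrying every keyword from scratch, which is where the speed-up over a plain `word1|word2|...` alternation comes from.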

Programming language: Python

License: MIT License

Tags: Text Processing, Natural Language Processing, Python, Data Science, Regular Expression, Algorithms

Efficient keyword mining with regular expressions alternatives and similar packages

Based on the "Text Processing" category.

MarkItDown (popularity 9.9, activity 9.1): Python tool for converting files and office documents to Markdown.

mem0 (popularity 9.7, activity 9.8): Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.


Docling (popularity 9.7, activity 9.7): Get your documents ready for gen AI

pydantic (popularity 9.5, activity 9.7): Data validation using Python type hints

fuzzywuzzy (popularity 8.7, activity 0.0, code quality L4): DISCONTINUED. Fuzzy String Matching in Python

汉字拼音转换工具（Python 版） [Hanzi-to-pinyin conversion tool, Python version] (popularity 8.0, activity 7.2): Converts Chinese characters to pinyin (pypinyin)

Lark (popularity 7.9, activity 7.2): Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

sqlparse (popularity 7.6, activity 6.4, code quality L4): A non-validating SQL parser module for Python

Pygments (popularity 7.3, activity not listed): A generic syntax highlighter.

phonenumbers (popularity 7.2, activity 8.5, code quality L4): Python port of Google's libphonenumber

ftfy (popularity 7.1, activity 8.5, code quality L4): Fixes mojibake and other glitches in Unicode text, after the fact.

TextDistance (popularity 7.0, activity 4.1): 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

PLY (popularity 6.9, activity 0.0, code quality L2): Python Lex-Yacc

RenderCV (popularity 6.6, activity 9.2): Version-control CVs/resumes as source code

msgspec (popularity 6.5, activity 6.9): A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML

pyparsing (popularity 6.3, activity 7.3): DISCONTINUED. Python library for creating PEG parsers

chardet (popularity 6.2, activity 3.9, code quality L4): Python character encoding detector

jellyfish (popularity 5.9, activity 6.2): 🪼 a python library for doing approximate and phonetic matching of strings.

shortuuid (popularity 5.8, activity 3.7, code quality L5): A generator library for concise, unambiguous and URL-safe UUIDs.

typeguard (popularity 5.4, activity 7.8): Run-time type checker for Python

python-user-agents (popularity 5.4, activity 0.0, code quality L4): A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

Data Profiler (popularity 5.3, activity 5.5): What's in your data? Extract schema, statistics and entities from datasets

pyfiglet (popularity 5.2, activity 5.5, code quality L3): An implementation of figlet written in Python

python-slugify (popularity 5.2, activity 0.0, code quality L4): Returns unicode slugs

Levenshtein (popularity 5.0, activity 0.0, code quality L1): The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

Mirascope (popularity 4.8, activity 9.9): LLM abstractions that aren't obstructions

Construct (popularity 4.7, activity 2.7): Declarative data structures for python that allow symmetric parsing and building

xpinyin (popularity 4.6, activity 5.9, code quality L4): Translate Chinese hanzi to pinyin (拼音) in Python

python-nameparser (popularity 4.2, activity 3.3, code quality L2): A simple Python module for parsing human names into their individual components

ijson (popularity 4.0, activity 0.3): DISCONTINUED. Iterative JSON parser with Pythonic interface

Charset Normalizer (popularity 3.9, activity 9.1): Truly universal encoding detector in pure Python

awesome-slugify (popularity 3.4, activity 0.0, code quality L5): Python flexible slugify function

unicode-slugify (popularity 3.0, activity 0.0, code quality L4): A slugifier that works in unicode

AnyAscii (popularity 2.9, activity 6.5): Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET

json-streamer (popularity 2.7, activity 2.3): A fast streaming JSON parser for Python that generates SAX-like events using yajl

pangu.py (popularity 2.7, activity 1.9, code quality L5): Paranoid text spacing in Python

simplematch (popularity 2.3, activity 5.1): Minimal, super readable string pattern matching for python.

uniout (popularity 2.3, activity 1.8, code quality L5): Never see escaped bytes in output.

nider (popularity 2.2, activity 0.0): Python package to add text to images, textures and different backgrounds

HaikunatorPY (popularity 2.1, activity 0.0, code quality L5): Generate Heroku-like random names to use in your python applications

json2xml (popularity 2.0, activity 7.6): json to xml converter in python3

Python Left-Right Parser (popularity 2.0, activity 3.3, code quality L4): Python Parser

Atoma (popularity 1.9, activity 0.0): Atom, RSS and JSON feed parser for Python 3

LLMWorkbook (popularity 0.7, activity 8.2): Effortlessly harness the power of LLMs on Excel and DataFrames—seamless, smart, and efficient!

GoBeautifulSoup (popularity 0.3, activity 3.3): GoBeautifulSoup is a high-performance HTML/XML parsing library that provides a 100% compatible API with BeautifulSoup4, but powered by Go for dramatically improved performance. It's designed as a drop-in replacement for BeautifulSoup4 with significant speed improvements.

unidecode (scores not listed): ASCII transliterations of Unicode text.

difflib (scores not listed): (Python standard library) Helpers for computing deltas.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.


Awesome Python is part of the LibHunt network.
