35 Text Processing packages and projects

  • fuzzywuzzy

    9.0 0.4 L4 Python
    Fuzzy String Matching in Python
  • pydantic

    8.6 9.0 Python
    Data parsing and validation using Python type hints
  • Chinese Character to Pinyin Conversion Tool (Python version)

    7.6 6.1 Python
    Converts Chinese characters to pinyin (pypinyin)
  • Pygments

    7.3 -
    A generic syntax highlighter.
  • phonenumbers

    7.1 7.7 L4 Python
    Python port of Google's libphonenumber
  • sqlparse

    7.1 5.2 L4 Python
    A non-validating SQL parser module for Python
  • ftfy

    7.0 8.2 L4 Python
    Fixes mojibake and other glitches in Unicode text, after the fact.
  • Lark

    6.8 9.2 Python
    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
  • PLY

    6.6 0.0 L2 Python
    Python Lex-Yacc
  • TextDistance

    6.3 4.0 Python
    Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
  • chardet

    5.9 6.1 L4 Python
    Python character encoding detector
  • jellyfish

    5.6 4.0 Python
    🎐 a python library for doing approximate and phonetic matching of strings.
  • shortuuid

    5.4 1.8 L5 Python
    A generator library for concise, unambiguous and URL-safe UUIDs.
  • python-user-agents

    5.3 2.6 L4 Python
    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
  • pyparsing

    4.9 7.3 Python
    Python library for creating PEG parsers [Moved to: https://github.com/pyparsing/pyparsing]
  • python-slugify

    4.7 4.8 L4 Python
    Returns unicode slugs
  • Levenshtein

    4.7 2.7 L1 C
    The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
  • xpinyin

    4.5 5.7 L4 Python
    Translate Chinese hanzi to pinyin (拼音) in Python
  • pyfiglet

    4.4 1.9 L3 Python
    An implementation of figlet written in Python
  • Construct

    4.3 8.9 Python
    Construct: Declarative data structures for python that allow symmetric parsing and building
  • ijson

    4.0 0.3 Python
    Iterative JSON parser with Pythonic interface
  • python-nameparser

    3.7 0.4 L2 Python
    A simple Python module for parsing human names into their individual components
  • awesome-slugify

    3.4 0.0 L5 Python
    Python flexible slugify function
  • unicode-slugify

    2.9 0.0 L4 Python
    A slugifier that works in unicode
  • json-streamer

    2.4 0.0 Python
    A fast streaming JSON parser for Python that generates SAX-like events using yajl
  • uniout

    2.3 0.0 L5 Python
    Never see escaped bytes in output.
  • pangu.py

    2.3 0.0 L5 Python
    Paranoid text spacing in Python
  • HaikunatorPY

    1.9 0.0 L5 Python
    Generate Heroku-like random names to use in your python applications
  • Charset Normalizer

    1.8 4.3 Python
    🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.
  • simplematch

    1.7 7.3 Python
    Minimal, super readable string pattern matching for python.
  • nider

    1.7 2.0 Python
    Python package to add text to images, textures and different backgrounds
  • Python Left-Right Parser

    1.7 1.5 L4 Python
    Python Parser
  • Atoma

    1.5 1.2 Python
    Atom, RSS and JSON feed parser for Python 3
  • difflib

    -
    (Python standard library) Helpers for computing deltas.
  • unidecode

    -
    ASCII transliterations of Unicode text.
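
As a rough illustration of what "helpers for computing deltas" means for the difflib entry above, here is a minimal sketch using only the standard library. The function names `similarity`, `closest`, and `line_diff` are hypothetical wrappers chosen for this example; only the `difflib` calls inside them are part of Python itself.

```python
import difflib


def similarity(a: str, b: str) -> float:
    """Return a ratio between 0 and 1 describing how alike two strings are."""
    return difflib.SequenceMatcher(None, a, b).ratio()


def closest(word: str, candidates: list[str]) -> list[str]:
    """Return up to three candidates that resemble `word`, best match first."""
    return difflib.get_close_matches(word, candidates, n=3, cutoff=0.6)


def line_diff(old: str, new: str) -> str:
    """Return a unified diff between two multi-line strings."""
    diff = difflib.unified_diff(
        old.splitlines(),
        new.splitlines(),
        fromfile="old",
        tofile="new",
        lineterm="",  # input lines carry no newline, so do not add one to headers
    )
    return "\n".join(diff)


if __name__ == "__main__":
    print(similarity("fuzzywuzzy", "fuzzy"))
    print(closest("appel", ["ape", "apple", "peach", "puppy"]))
    print(line_diff("a\nb", "a\nc"))
```

For heavier fuzzy matching or edit-distance workloads, the dedicated packages in the list (for example Levenshtein or TextDistance) may be a better fit than these stdlib helpers.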
