35 Text Processing packages and projects

  • fuzzywuzzy

    8.9 0.8 L4 Python
    Fuzzy String Matching.
  • pydantic

    8.2 8.9 Python
    Data parsing using Python type hinting
  • Hanzi-to-pinyin conversion tool (Python version)

    7.6 6.3 Python
    Hanzi-to-pinyin conversion tool, Python version (pypinyin).
  • Pygments

    7.3 -
    A generic syntax highlighter.
  • phonenumbers

    7.1 7.4 L4 Python
    Parsing, formatting, storing and validating international phone numbers.
  • sqlparse

    7.1 7.3 L4 Python
    A non-validating SQL parser.
  • ftfy

    7.0 4.9 L4 Python
    Makes Unicode text less broken and more consistent automagically.
  • Lark

    6.6 9.3 Python
    A modern parsing library for Python that implements Earley and LALR(1) parsing behind an easy interface.
  • PLY

    6.5 3.0 L2 Python
    Implementation of lex and yacc parsing tools for Python
  • TextDistance

    6.2 6.1 Python
    Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
  • binary-parser

    5.9 2.8 Haskell
    A highly-efficient but limited parser API specialised for bytestrings
  • chardet

    5.8 1.7 L4 Python
    Python 2/3 compatible character encoding detector.
  • jellyfish

    5.5 6.4 Python
    A Python library for approximate and phonetic matching of strings.
  • shortuuid

    5.4 3.5 L5 Python
    A generator library for concise, unambiguous and URL-safe UUIDs.
  • python-user-agents

    5.3 4.5 L4 Python
    Browser user agent parser.
  • Levenshtein

    4.9 0.0 L1 C
    Fast computation of Levenshtein distance and string similarity.
  • python-slugify

    4.6 3.6 L4 Python
    A Python slugify library that translates Unicode to ASCII.
  • xpinyin

    4.5 1.1 L4 Python
    A library to translate Chinese hanzi (漢字) to pinyin (拼音).
  • Construct

    4.3 5.7 Python
    Declarative data structures for Python that allow symmetric parsing and building.
  • pyfiglet

    4.2 4.1 L3 Python
    An implementation of figlet written in Python.
  • ijson

    4.0 0.3 Python
    Iterative JSON parser with Pythonic interface
  • python-nameparser

    3.6 3.2 L2 Python
    Parsing human names into their individual components.
  • awesome-slugify

    3.3 0.0 L5 Python
    A Python slugify library that can preserve Unicode.
  • unicode-slugify

    3.0 0.0 L4 Python
    A slugifier that generates Unicode slugs, with Django as a dependency.
  • json-streamer

    2.3 0.0 Python
    A fast streaming JSON parser for Python that generates SAX-like events using yajl
  • uniout

    2.3 0.0 L5 Python
    Print readable chars instead of the escaped string.
  • pangu.py

    2.2 0.0 L5 Python
    Spacing texts for CJK and alphanumerics.
  • HaikunatorPY

    1.9 0.0 L5 Python
    Heroku-like random name generator for Python.
  • nider

    1.8 5.3 Python
    Python package to add text to images, textures and different backgrounds
  • Python Left-Right Parser

    1.8 0.4 L4 Python
    Python Parser
  • Charset Normalizer

    1.6 0.4 Python
    Like Chardet: a package for character encoding and language detection.
  • Atoma

    1.5 2.6 Python
    Atom, RSS and JSON feed parser for Python 3
  • unidecode

    - -
    ASCII transliterations of Unicode text.
  • difflib

    - -
    (Python standard library) Helpers for computing deltas.
  • pyparsing

    - - Python
    A general purpose framework for generating parsers.
