Text Processing packages

Showing projects tagged as Text Processing

  • Jieba

    9.8 0.0 L5 Python
    Jieba Chinese word segmentation
  • httpie

    9.6 9.5 L3 Python
    As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. https://twitter.com/httpie
  • gensim

    9.5 8.8 L3 Python
    Topic Modelling for Humans
  • MkDocs

    9.4 8.2 L5 Python
    Project documentation with Markdown.
  • Pattern

    9.1 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • pydantic

    9.1 9.0 Python
    Data parsing and validation using Python type hints
  • Jinja2

    9.0 8.5 L3 Python
    A very fast and expressive template engine.
  • fuzzywuzzy

    8.9 1.4 L4 Python
    Fuzzy String Matching in Python
  • TextBlob

    8.9 1.6 L3 Python
    Simple, Pythonic text processing: sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
  • HTTP Prompt

    8.7 3.2 L4 Python
    An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie
  • Stanza

    8.5 9.7 Python
    Official Stanford NLP Python Library for Many Human Languages
  • Sphinx

    8.5 9.8 L2 Python
    Main repository for the Sphinx documentation builder
  • PDFMiner

    8.4 0.0 L3 Python
    Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • WeasyPrint

    8.2 9.7 L1 Python
    The awesome document factory
  • coala

    8.1 0.0 L4 Python
    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.
  • xmltodict

    8.1 0.0 L4 Python
    Python module that makes working with XML feel like you are working with JSON
  • Chinese character to pinyin conversion tool (Python version)

    7.9 6.6 Python
    Converts Chinese characters to pinyin (pypinyin)
  • Python-Markdown

    7.6 7.0 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sqlparse

    7.4 4.1 L4 Python
    A non-validating SQL parser module for Python
  • sumy

    7.3 6.3 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • Pygments

    7.3 -
    A generic syntax highlighter.
  • phonenumbers

    7.2 8.4 L4 Python
    Python port of Google's libphonenumber
  • Lark

    7.2 9.0 Python
    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
  • ftfy

    7.0 6.0 L4 Python
    Fixes mojibake and other glitches in Unicode text, after the fact.
  • asciimatics

    7.0 7.9 L2 Python
    A cross-platform package for curses-like operations, plus higher-level APIs and widgets to create text UIs and ASCII art animations.
  • percol

    7.0 0.0 L4 Python
    Adds a flavor of interactive filtering to the traditional pipe concept of the UNIX shell.
  • TextDistance

    6.9 3.6 Python
    Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
  • markdown2

    6.8 8.5 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • xhtml2pdf

    6.8 8.4 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • lxml

    6.8 8.8 L2 Python
    The lxml XML toolkit for Python