Text Processing packages

Showing projects tagged as Text Processing

  • httpie

    9.9 9.3 L3 Python
    As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. https://twitter.com/httpie
  • Jieba

    9.8 0.0 L5 Python
    结巴中文分词
  • gensim

    9.5 9.0 L3 Python
    Topic Modelling for Humans
  • MkDocs

    9.4 8.7 L5 Python
    Project documentation with Markdown.
  • Pattern

    9.2 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Jinja2

    9.0 8.4 L3 Python
    A very fast and expressive template engine.
  • TextBlob

    8.9 4.4 L3 Python
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
  • fuzzywuzzy

    8.9 2.2 L4 Python
    Fuzzy String Matching in Python
  • HTTP Prompt

    8.8 5.2 L4 Python
    An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie
  • pydantic

    8.8 9.1 Python
    Data parsing and validation using Python type hints
  • Sphinx

    8.5 9.9 L2 Python
    Main repository for the Sphinx documentation builder
  • Stanza

    8.5 9.5 Python
    Official Stanford NLP Python Library for Many Human Languages
  • PDFMiner

    8.4 0.0 L3 Python
    Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • coala

    8.2 4.3 L4 Python
    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.
  • WeasyPrint

    8.1 9.5 L1 Python
    The awesome document factory
  • xmltodict

    8.0 0.0 L4 Python
    Python module that makes working with XML feel like you are working with JSON
  • 汉字拼音转换工具(Python 版)

    7.7 7.3 Python
    汉字转拼音(pypinyin)
  • Python-Markdown

    7.5 6.0 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sqlparse

    7.3 5.5 L4 Python
    A non-validating SQL parser module for Python
  • Pygments

    7.3 -
    A generic syntax highlighter.
  • sumy

    7.2 2.8 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • phonenumbers

    7.2 8.4 L4 Python
    Python port of Google's libphonenumber
  • percol

    7.1 0.0 L4 Python
    adds flavor of interactive filtering to the traditional pipe concept of UNIX shell
  • Lark

    7.0 9.4 Python
    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
  • ftfy

    7.0 7.4 L4 Python
    Fixes mojibake and other glitches in Unicode text, after the fact.
  • asciimatics

    6.9 7.8 L2 Python
    A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations
  • markdown2

    6.8 5.8 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • lxml

    6.7 8.7 L2 Python
    The lxml XML toolkit for Python
  • TextDistance

    6.7 4.0 Python
    Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
  • xhtml2pdf

    6.7 0.6 L1 Python
    A library for converting HTML into PDFs using ReportLab