Text Processing packages

Showing projects tagged as Text Processing

  • httpie

    9.9 8.4 L3 Python
    As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. https://twitter.com/httpie
  • Jieba

    9.8 0.0 L5 Python
    结巴中文分词
  • gensim

    9.5 8.9 L3 Python
    Topic Modelling for Humans
  • MkDocs

    9.4 8.5 L5 Python
    Project documentation with Markdown.
  • Pattern

    9.2 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Jinja2

    9.0 8.6 L3 Python
    A very fast and expressive template engine.
  • fuzzywuzzy

    9.0 0.2 L4 Python
    Fuzzy String Matching in Python
  • TextBlob

    9.0 5.1 L3 Python
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
  • HTTP Prompt

    8.8 3.8 L4 Python
    An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie
  • pydantic

    8.6 9.2 Python
    Data parsing and validation using Python type hints
  • PDFMiner

    8.5 0.0 L3 Python
    Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • Stanza

    8.5 9.5 Python
    Official Stanford NLP Python Library for Many Human Languages
  • Sphinx

    8.5 9.9 L2 Python
    Main repository for the Sphinx documentation builder
  • coala

    8.2 4.9 L4 Python
    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.
  • WeasyPrint

    8.1 9.5 L1 Python
    The awesome document factory
  • xmltodict

    8.0 0.0 L4 Python
    Python module that makes working with XML feel like you are working with JSON
  • 汉字拼音转换工具(Python 版)

    7.6 6.4 Python
    汉字转拼音(pypinyin)
  • Python-Markdown

    7.4 6.4 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sumy

    7.3 2.8 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • Pygments

    7.3 -
    A generic syntax highlighter.
  • sqlparse

    7.2 4.9 L4 Python
    A non-validating SQL parser module for Python
  • phonenumbers

    7.1 7.7 L4 Python
    Python port of Google's libphonenumber
  • percol

    7.1 0.0 L4 Python
    adds flavor of interactive filtering to the traditional pipe concept of UNIX shell
  • ftfy

    7.0 8.1 L4 Python
    Fixes mojibake and other glitches in Unicode text, after the fact.
  • asciimatics

    6.9 7.7 L2 Python
    A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations
  • xhtml2pdf

    6.8 7.1 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • markdown2

    6.8 5.6 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • Lark

    6.8 9.1 Python
    Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
  • lxml

    6.7 7.5 L2 Python
    The lxml XML toolkit for Python
  • python-readability

    6.6 0.9 HTML
    fast python port of arc90's readability tool, updated to match latest readability.js!