Text Processing packages

Showing projects tagged as Natural Language Processing and Text Processing

  • Jieba

    9.8 0.0 L5 Python
    结巴中文分词
  • Pattern

    9.0 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • TextBlob

    8.8 7.0 L3 Python
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
  • Stanza

    8.5 9.8 Python
    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
  • PyMuPDF

    7.5 9.8 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • polyglot

    6.5 0.0 Python
    Multilingual text (NLP) processing toolkit
  • langid.py

    6.4 0.0 L3 Python
    Stand-alone language identification system
  • textacy

    6.3 6.1 L3 Python
    NLP, before and after spaCy
  • awesome-embedding-models

    6.0 0.0 Jupyter Notebook
    A curated list of awesome embedding models tutorials, projects and communities.
  • quepy

    5.6 0.0 L5 Python
    A python framework to transform natural language questions to queries in a database query language.
  • Hazm

    4.9 9.0 Python
    Persian NLP Toolkit
  • IEPY

    4.8 0.0 L5 Python
    Information Extraction in Python
  • Lineflow

    2.2 1.0 Python
    :zap:A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
  • Simplemma

    1.9 6.0 Python
    Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
  • Efficient keyword mining with regular expressions

    1.8 3.5 Python
    Efficient string matching with regular expressions
  • odin-slides

    1.6 7.8 Python
    This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained Transformer (GPT) of your choice. Leveraging the capabilities of Large Language Models (LLM), odin-slides enables you to turn the lengthiest Word documents into well organized presentations.
  • py3langid

    1.1 0.0 Python
    Faster, modernized fork of the language identification tool langid.py
  • pntl

    0.9 2.0 Python
    DISCONTINUED. Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-of-speech (POS) tags, chunking (CHK), name entity recognition (NER), semantic role labeling (SRL) and syntactic parsing (PSG) with skip-gram all in Python and still more features will be added. The website give is for downlarding Senna tool