10

8

6

4

2


9.9

4.5

9.8
0.0

9.7

9.3

9.4

8.3

9.0
0.0

8.8

5.8

24 Natural Language Processing packages and projects

  • funNLP

    9.9 4.5 Python
    中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报告、自然语言生成、NLU太难了系列、自动对联数据及机器人、用户名黑名单列表、罪名法务名词及分类模型、微信公众号语料、cs224n深度学习自然语言处理课程、中文手写汉字识别、中文自然语言处理 语料/数据集、变量命名神器、分词语料库+代码、任务型对话英文数据集、ASR 语音数据集 + 基于深度学习的中文语音识别系统、笑声检测器、Microsoft多语言数字/单位/如日期时间识别包、中华新华字典数据库及api(包括常用歇后语、成语、词语和汉字)、文档图谱自动生成、SpaCy 中文模型、Common Voice语音识别数据集新版、神经网络关系抽取、基于bert的命名实体识别、关键词(Keyphrase)抽取包pke、基于医疗领域知识图谱的问答系统、基于依存句法与语义角色标注的事件三元组抽取、依存句法分析4万句高质量标注数据、cnocr:用来做中文OCR的Python3包、中文人物关系知识图谱项目、中文nlp竞赛项目及代码汇总、中文字符数据、speech-aligner: 从“人声语音”及其“语言文本”产生音素级别时间对齐标注的工具、AmpliGraph: 知识图谱表示学习(Python)库:知识图谱概念链接预测、Scattertext 文本可视化(python)、语言/知识表示工具:BERT & ERNIE、中文对比英文自然语言处理NLP的区别综述、Synonyms中文近义词工具包、HarvestText领域自适应文本挖掘工具(新词发现-情感分析-实体链接等)、word2word:(Py
  • Jieba

    9.8 0.0 L5 Python
    结巴中文分词
  • The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
    Promo
  • spaCy

    9.7 9.3 Python
    💫 Industrial-strength Natural Language Processing (NLP) in Python
  • NLTK

    9.4 8.3 L2 Python
    NLTK Source
  • Pattern

    9.0 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • TextBlob

    8.8 5.8 L3 Python
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
  • SnowNLP

    8.6 0.0 L4 Python
    Python library for processing Chinese text
  • Stanza

    8.5 9.6 Python
    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
  • pkuseg-python

    8.5 0.0 Python
    pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
  • pytext

    8.5 7.8 Python
    A natural language modeling framework based on PyTorch
  • polyglot

    6.5 0.0 Python
    Multilingual text (NLP) processing toolkit
  • langid.py

    6.4 0.0 L3 Python
    Stand-alone language identification system
  • PyTorch-NLP

    6.3 0.0 Python
    Basic Utilities for PyTorch Natural Language Processing (NLP)
  • textacy

    6.3 6.1 L3 Python
    NLP, before and after spaCy
  • quepy

    5.6 0.0 L5 Python
    A python framework to transform natural language questions to queries in a database query language.
  • IEPY

    4.9 0.0 L5 Python
    Information Extraction in Python
  • Hazm

    4.8 9.5 Python
    Persian NLP Toolkit
  • TextGrocery

    4.5 0.0 L1 C++
    A simple short-text classification tool based on LibLinear
  • Lineflow

    2.3 1.0 Python
    :zap:A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
  • stanfordnlp

    2.2 3.2 Python
    [Deprecated] This library has been renamed to "Stanza". Latest development at: https://github.com/stanfordnlp/stanza
  • Simplemma

    1.8 5.6 Python
    Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
  • odin-slides

    1.5 7.9 Python
    This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained Transformer (GPT) of your choice. Leveraging the capabilities of Large Language Models (LLM), odin-slides enables you to turn the lengthiest Word documents into well organized presentations.
  • py3langid

    1.1 0.0 Python
    Faster, modernized fork of the language identification tool langid.py
  • pntl

    0.9 2.0 Python
    Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-of-speech (POS) tags, chunking (CHK), name entity recognition (NER), semantic role labeling (SRL) and syntactic parsing (PSG) with skip-gram all in Python and still more features will be added. The website give is for downlarding Senna tool

Add another 'Natural Language Processing' Package