Engineering packages

Showing projects tagged as Linguistic and Engineering

  • gensim

    9.4 6.2 L3 Python
    Topic Modelling for Humans
  • Pattern

    8.9 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Stanza

    8.5 9.7 Python
    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
  • coala

    7.9 0.0 L4 Python
    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.
  • sumy

    7.4 6.3 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • trafilatura

    7.1 9.0 Python
    Python and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  • TextDistance

    6.9 5.6 Python
    📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
  • polyglot

    6.5 0.0 Python
    Multilingual text (NLP) processing toolkit
  • aeneas

    6.5 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

    6.4 0.0 L3 Python
    Stand-alone language identification system
  • quepy

    5.5 0.0 L5 Python
    A Python framework to transform natural language questions into queries in a database query language.
  • pymorphy2

    4.9 0.0 Python
    Morphological analyzer / inflection engine for Russian and Ukrainian languages.
  • htmldate

    2.1 6.9 Python
    Fast and robust date extraction from web pages, with Python or on the command line