Internet packages

Showing projects tagged as Text Processing and Internet

  • httpie

    9.7 8.4 L3 Python
    🥧 HTTPie for Terminal — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
  • pydantic

    9.2 9.6 Python
    Data validation using Python type hints
  • Pattern

    9.1 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Jinja2

    9.0 8.7 L3 Python
    A very fast and expressive template engine.
  • HTTP Prompt

    8.7 0.0 L4 Python
    An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie
  • Sphinx

    8.6 9.8 L2 Python
    The Sphinx documentation generator
  • WeasyPrint

    8.2 8.9 L1 Python
    The awesome document factory
  • Python-Markdown

    7.7 7.7 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sumy

    7.3 6.2 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • python-readability

    6.6 3.9 Python
    fast python port of arc90's readability tool, updated to match latest readability.js!
  • Scrapely

    6.2 0.0 HTML
    A pure-python HTML screen-scraping library
  • python-user-agents

    5.3 0.0 L4 Python
    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
  • trafilatura

    4.4 7.2 Python
    Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
  • selectolax

    4.1 6.7 Cython
    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
  • MarkupSafe

    4.0 6.3 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • Goose3

    3.9 0.0 HTML
    A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html
  • nider

    2.0 0.0 Python
    Python package to add text to images, textures and different backgrounds
  • PatZilla

    1.9 4.8 Python
    PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
  • Kotori

    1.8 3.6 Python
    A flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple.
  • htmldate

    1.6 7.8 Python
    Fast and robust date extraction from web pages, with Python or on the command-line
  • Template Render Engine

    0.9 0.0 L4 Python
    Template Render Engine
  • Doublify API Toolkit

    0.5 0.0 Python
    Doublify API toolkit for Python