Markup packages

Showing projects tagged as Text Processing and Markup

  • Jinja2

    9.0 8.2 L3 Python
    A very fast and expressive template engine.
  • Pattern

    8.8 0.0 L2 Python
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
  • Sphinx

    8.7 9.9 L2 Python
    The Sphinx documentation generator
  • WeasyPrint

    8.4 9.5 L1 Python
    The awesome document factory
  • xmltodict

    8.0 7.9 L4 Python
    Python module that makes working with XML feel like you are working with JSON
  • Python-Markdown

    7.7 7.5 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • sumy

    7.4 6.3 L5 Python
    Module for automatic summarization of text documents and HTML pages.
  • trafilatura

    7.3 8.5 Python
    Python package and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, and output as CSV, JSON, HTML, MD, TXT or XML
  • lxml

    7.0 9.3 L2 Python
    The lxml XML toolkit for Python
  • xhtml2pdf

    6.8 7.1 L1 Python
    A library for converting HTML into PDFs using ReportLab
  • markdown2

    6.7 8.4 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • Mistune

    6.5 7.5 L4 Python
    A fast yet powerful Python Markdown parser with renderers and plugins.
  • aeneas

    6.5 0.0 L3 Python
    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
  • feedparser

    6.3 7.6 L3 Python
    Parse feeds in Python
  • Scrapely

    6.1 0.0 HTML
    A pure-Python HTML screen-scraping library
  • html5lib

    5.3 4.1 L2 Python
    Standards-compliant library for parsing and serializing HTML documents and fragments in Python
  • selectolax

    4.8 8.5 Cython
    Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).
  • mistletoe

    4.4 5.8 Python
    A fast, extensible and spec-compliant Markdown parser in pure Python.
  • MarkupSafe

    4.3 8.3 L5 Python
    Safely add untrusted strings to HTML/XML markup.
  • opengraph

    3.0 0.0 L5 Python
    A Python module to parse the Open Graph Protocol
  • htmldate

    2.1 6.9 Python
    Fast and robust date extraction from web pages, with Python or on the command line
  • Atoma

    1.9 0.0 Python
    Atom, RSS and JSON feed parser for Python 3
  • Template Render Engine

    1.0 2.4 L4 Python
    Template Render Engine
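
Several entries above, xmltodict in particular, describe turning XML into JSON-like Python data. The sketch below illustrates that general idea using only the standard library (xml.etree.ElementTree and json). It is not xmltodict's API and does not reproduce its output format; the helper names element_to_dict and xml_to_dict and the sample document are hypothetical, and attributes and mixed content are deliberately ignored.

```python
import json
import xml.etree.ElementTree as ET


def element_to_dict(element):
    """Recursively convert an element into plain dicts, lists and strings.

    Hypothetical helper for illustration only; attributes are ignored.
    """
    children = list(element)
    if not children:
        # Leaf element: keep only its stripped text content.
        return (element.text or "").strip()

    result = {}
    for child in children:
        value = element_to_dict(child)
        if child.tag in result:
            # A repeated tag is collected into a list of values.
            existing = result[child.tag]
            if not isinstance(existing, list):
                result[child.tag] = [existing]
            result[child.tag].append(value)
        else:
            result[child.tag] = value
    return result


def xml_to_dict(xml_text):
    """Parse an XML string and wrap the converted tree under the root tag."""
    root = ET.fromstring(xml_text)
    return {root.tag: element_to_dict(root)}


# Made-up sample document, used only to exercise the helpers.
SAMPLE = "<feed><title>Example</title><entry>first</entry><entry>second</entry></feed>"

parsed = xml_to_dict(SAMPLE)
print(json.dumps(parsed, indent=2))
```

Once the data is in this shape it can be indexed like any parsed JSON document, for example parsed["feed"]["entry"][0]. A dedicated library from the list is the better choice for real documents, since this sketch drops attributes, namespaces and text that is mixed with child elements.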