Specific Formats Processing packages

  • PyPDF2

    8.6 9.5 L2 Python
    A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
  • PDFMiner

    8.3 0.0 L3 Python
    DISCONTINUED. Python PDF parser, no longer actively maintained. Use pdfminer.six instead.
  • WeasyPrint

    8.3 9.7 L1 Python
    The awesome document factory
  • csvkit

    8.1 8.9 L3 Python
    A suite of utilities for converting to and working with CSV, the king of tabular file formats.
  • python-docx

    8.1 8.5 L5 Python
    Create and modify Word documents with Python
  • tablib

    7.9 7.0 L4 Python
    Python module for tabular datasets in XLS, CSV, JSON, YAML, etc.
  • PyMuPDF

    7.7 9.8 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • Python-Markdown

    7.7 7.5 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • XlsxWriter

    7.5 7.6 L3 Python
    A Python module for creating Excel XLSX files.
  • Kaitai Struct

    7.2 7.3 Shell
    Kaitai Struct: declarative language to generate binary data parsers in C++ / C# / Go / Java / JavaScript / Lua / Nim / Perl / PHP / Python / Ruby
  • xlwings

    7.1 8.5 L4 Python
    xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
  • Camelot

    6.8 6.2 Python
    A Python library to extract tabular data from PDFs
  • borb

    6.8 5.3 Python
    borb is a library for reading, creating and manipulating PDF files in Python.
  • markdown2

    6.7 8.8 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • unoconv

    6.7 0.0 Python
    Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.
  • python-pptx

    6.6 5.8 Python
    Create Open XML PowerPoint documents in Python
  • pdftabextract

    6.5 0.0 L3 Python
    A set of tools for extracting tables from PDF files, intended for data mining on (OCR-processed) scanned documents.
  • Mistune

    6.4 5.6 L4 Python
    A fast yet powerful Python Markdown parser with renderers and plugins.
  • docxtpl

    6.2 3.9 Python
    Use a docx as a jinja2 template
  • xlwt

    5.4 0.0 L3 Python
    DISCONTINUED. Writing and reading data and formatting information from Excel files.
  • pyexcel

    5.0 0.0 L5 Python
    Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files
  • pymorphy2

    4.8 0.0 Python
    Morphological analyzer / inflection engine for Russian and Ukrainian languages.
  • Construct

    4.5 7.0 Python
    Construct: Declarative data structures for Python that allow symmetric parsing and building
  • openpyxl

    4.4 -
    A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
  • mistletoe

    4.2 6.8 Python
    A fast, extensible and spec-compliant Markdown parser in pure Python.
  • ReportLab

    3.4 -
    Allows rapid creation of rich PDF documents.
  • unp

    3.3 0.0 L5 Python
    Unpacks things.
  • vcspull

    2.4 9.3 L4 Python
    🔄 Synchronize projects via yaml/json manifest. Built using `libvcs`.
  • Marmir

    2.4 0.0 L4 Python
    Python powered spreadsheets
  • PyYAML

    2.3 -
    YAML implementations for Python.