Specific Formats Processing packages

  • PDFMiner

    8.4 0.0 L3 Python
    Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • csvkit

    8.2 8.0 L3 Python
    A suite of utilities for converting to and working with CSV, the king of tabular file formats.
  • PyPDF2

    8.2 0.0 L2 Python
    A utility to read and write PDFs with Python
  • WeasyPrint

    8.1 9.5 L1 Python
    The awesome document factory
  • tablib

    8.0 5.4 L4 Python
    Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
  • python-docx

    7.8 3.3 L5 Python
    Create and modify Word documents with Python
  • Python-Markdown

    7.5 5.9 Python
    A Python implementation of John Gruber’s Markdown with Extension support.
  • XlsxWriter

    7.4 8.2 L3 Python
    A Python module for creating Excel XLSX files.
  • xlwings

    6.8 9.1 L4 Python
    xlwings is a BSD-licensed Python library that makes it easy to call Python from Excel and vice versa. It works with Microsoft Excel on Windows and macOS.
  • unoconv

    6.8 4.2 Python
    Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.
  • Kaitai Struct

    6.8 4.1 Shell
    Kaitai Struct: declarative language to generate binary data parsers in C++ / C# / Go / Java / JavaScript / Lua / Perl / PHP / Python / Ruby
  • markdown2

    6.7 5.8 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • pdftabextract

    6.6 0.0 L3 Python
    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
  • borb

    6.4 6.9 Python
    borb is a library for reading, creating and manipulating PDF files in Python.
  • Mistune

    6.3 6.0 L4 Python
    A fast yet powerful Python Markdown parser with renderers and plugins.
  • python-pptx

    6.1 7.9 Python
    Create Open XML PowerPoint documents in Python
  • Camelot

    5.6 7.6 Python
    A Python library to extract tabular data from PDFs
  • docxtpl

    5.5 7.4 Python
    Use a docx as a jinja2 template
  • xlwt

    5.4 0.0 L3 Python
    Writing data and formatting information to legacy Excel (.xls) files.
  • pyexcel

    4.8 7.6 L5 Python
    Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files
  • pymorphy2

    4.6 0.0 Python
    Morphological analyzer / inflection engine for Russian and Ukrainian languages.
  • Construct

    4.4 8.1 Python
    Construct: Declarative data structures for Python that allow symmetric parsing and building
  • openpyxl

    4.4 -
    A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
  • mistletoe

    3.5 6.4 Python
    A fast, extensible and spec-compliant Markdown parser in pure Python.
  • ReportLab

    3.4 -
    Allows rapid creation of rich PDF documents.
  • unp

    3.4 0.0 L5 Python
    Unpacks things.
  • Marmir

    2.3 0.0 L4 Python
    Python powered spreadsheets
  • PyYAML

    2.3 -
    YAML implementations for Python.
  • vcspull

    2.2 7.4 L4 Python
    Synchronize projects via a YAML/JSON manifest. Built on libvcs.
  • libvcs

    1.1 6.4 L3 Python
    ⚙️ VCS abstraction layer