10

8

6

4

2


8.6

5.1

8.3

4.3

8.2

7.1

8.2
0.0

8.0

8.7

7.7
0.0

28 Specific Formats Processing packages and projects

  • PDFMiner

    8.6 5.1 L3 Python
    A tool for extracting information from PDF documents.
  • csvkit

    8.3 4.3 L3 Python
    Utilities for converting to and working with CSV.
  • tablib

    8.2 7.1 L4 Python
    A module for Tabular Datasets in XLS, CSV, JSON, YAML.
  • PyPDF2

    8.2 0.0 L2 Python
    A library capable of splitting, merging and transforming PDF pages.
  • WeasyPrint

    8.0 8.7 L1 Python
    WeasyPrint converts web documents (HTML with CSS, SVG, …) to PDF.
  • python-docx

    7.7 0.0 L5 Python
    Reads, queries and modifies Microsoft Word 2007/2008 docx files.
  • XlsxWriter

    7.4 7.6 L3 Python
    A Python module for creating Excel .xlsx files.
  • Python-Markdown

    7.3 7.3 HTML
    A Python implementation of John Gruber’s Markdown.
  • unoconv

    7.0 2.8 Python
    Convert between any document format supported by LibreOffice/OpenOffice.
  • markdown2

    6.9 6.6 Python
    markdown2: A fast and complete implementation of Markdown in Python
  • xlwings

    6.8 8.5 L4 Python
    A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
  • pdftabextract

    6.7 0.0 L3 Python
    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
  • Mistune

    6.4 7.1 L4 Python
    Fastest and full featured pure Python parsers of Markdown.
  • python-pptx

    6.0 0.0 Python
    Python library for creating and updating PowerPoint (.pptx) files.
  • xlwt

    5.5 0.0 L3 Python
    Writing and reading data and formatting information from Excel files.
  • pyexcel

    4.9 7.3 L5 Python
    Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
  • pymorphy2

    4.6 0.0 Python
    Morphological analyzer / inflection engine for Russian and Ukrainian languages.
  • Camelot

    4.6 6.6 Python
    A Python library to extract tabular data from PDFs
  • openpyxl

    4.4 -
    A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
  • unp

    3.4 0.0 L5 Python
    A command line tool that can unpack archives easily.
  • mistletoe

    3.4 0.0 Python
    A fast, extensible Markdown parser in pure Python.
  • ReportLab

    3.4 -
    Allowing Rapid creation of rich PDF documents.
  • Marmir

    2.4 0.0 L4 Python
    Takes Python data structures and turns them into spreadsheets.
  • vcspull

    2.4 8.1 L4 Python
    vcs project manager
  • PyYAML

    2.3 -
    YAML implementations for Python.
  • libvcs

    1.3 9.3 L3 Python
    vcs abstraction layer
  • relatorio

    - -
    Templating OpenDocument files.
  • docxtpl

    - -
    Editing a docx document by jinja2 template

Popular Comparisons


99 Remote Jobs

Work from home. Anywhere in the world.
+ Post a job

Add another 'Specific Formats Processing' Package