PDF packages

Showing projects tagged as Specific Formats Processing and PDF

  • PyPDF2

    8.6 9.5 L2 Python
    A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
  • WeasyPrint

    8.3 9.4 L1 Python
    The awesome document factory
  • PDFMiner

    8.3 0.0 L3 Python
    Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • PyMuPDF

    7.5 9.8 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • borb

    6.8 5.2 Python
    borb is a library for reading, creating and manipulating PDF files in python.
  • Camelot

    6.7 6.9 Python
    A Python library to extract tabular data from PDFs
  • pdftabextract

    6.5 0.0 L3 Python
    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
  • ReportLab

    3.4 -
    Allowing Rapid creation of rich PDF documents.
  • Meltano Singer SDK

    2.3 9.7 Python
    Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com