Utilities packages

Showing projects tagged as Specific Formats Processing, Text Processing, PDF, and Utilities

  • PyMuPDF

    7.5 9.8 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • pdftabextract

    6.5 0.0 L3 Python
    A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.