Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter down-
Office
-
Text Processing
-
PDF
-
Utilities
-
Markup
-
Business
-
Markdown
-
HTML
-
Scientific
-
YAML
-
Data Mining
-
WWW
-
OCR
-
Information Analysis
-
Parser
-
Internet
-
Engineering
-
HTTP
-
Financial
-
CGI Tools
-
Svn
-
Data Acquisition
-
Subversion
-
Version Control
-
System
-
Serialization
-
Tables
-
CSV
-
Git
-
Office Suites
-
JSON
-
Dynamic Content
-
Shells
-
Vcs
-
Spreadsheet
-
Library
-
Hg
-
Clone
-
Mercurial
Specific Formats Processing packages
Showing projects tagged as Specific Formats Processing
-
PyPDF2
8.7 9.6 L2 PythonA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files -
PyMuPDF
8.3 9.7 PythonPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. -
PDFMiner
8.3 0.0 L3 PythonDISCONTINUED. Python PDF Parser (Not actively maintained). Check out pdfminer.six. -
csvkit
8.1 8.1 L3 PythonA suite of utilities for converting to and working with CSV, the king of tabular file formats. -
Python-Markdown
7.7 7.5 PythonA Python implementation of John Gruber’s Markdown with Extension support. -
Kaitai Struct
7.3 6.4 ShellKaitai Struct: declarative language to generate binary data parsers in C++ / C# / Go / Java / JavaScript / Lua / Nim / Perl / PHP / Python / Ruby -
xlwings
7.1 8.9 L4 Pythonxlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web. -
unoconv
6.7 0.0 PythonDISCONTINUED. Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice. -
pdftabextract
6.4 0.0 L3 PythonA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents. -
Kreuzberg
5.4 9.4 PythonA text extraction library supporting PDFs, images, office documents and more -
xlwt
5.4 0.0 L3 PythonDISCONTINUED. Writing and reading data and formatting information from Excel files. -
pyexcel
5.0 8.9 L5 PythonSingle API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files -
pymorphy2
4.9 0.0 PythonMorphological analyzer / inflection engine for Russian and Ukrainian languages. -
Construct
4.6 2.7 PythonConstruct: Declarative data structures for python that allow symmetric parsing and building -
Meltano Singer SDK
2.5 9.7 PythonWrite 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.