33 Specific Formats Processing packages and projects
PyPDF2
8.6 9.5 L2 Python
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.
csvkit
8.2 8.6 L3 Python
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
Python-Markdown
7.7 7.8 Python
A Python implementation of John Gruber’s Markdown with extension support.
PyMuPDF
7.5 9.8 Python
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.
Kaitai Struct
7.2 7.5 Shell
Kaitai Struct: a declarative language for generating binary data parsers in C++, C#, Go, Java, JavaScript, Lua, Nim, Perl, PHP, Python, and Ruby.
xlwings
7.1 8.4 L4 Python
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
unoconv
6.7 0.0 Python
Universal Office Converter: converts between any document formats supported by LibreOffice/OpenOffice.
pdftabextract
6.5 0.0 L3 Python
A set of tools for extracting tables from PDF files, intended to help with data mining on (OCR-processed) scanned documents.
xlwt
5.4 0.0 L3 Python
DISCONTINUED. Writing and reading data and formatting information from Excel files.
pyexcel
5.0 0.0 L5 Python
A single API for reading, manipulating, and writing data in csv, ods, xls, xlsx, and xlsm files.
pymorphy2
4.8 0.0 Python
Morphological analyzer / inflection engine for the Russian and Ukrainian languages.
Meltano Singer SDK
2.3 9.7 Python
Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com
Python Schema Matching by XGboost and Sentence-Transformers
1.1 3.5 Python
A Python tool that uses XGBoost and sentence-transformers to perform schema matching on tables.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.