Data Analysis packages

Showing projects tagged as Data Analysis

  • Dask

    9.2 9.6 L2 Python
    Parallel computing with task scheduling
  • Panel

    7.7 9.8 Python
    Panel: The powerful data exploration & web app framework for Python
  • AWS Data Wrangler

    7.6 9.4 Python
    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
  • marimo

    7.6 9.9 Python
    A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
  • Sacred

    7.5 3.5 Python
    Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
  • Interactive Parallel Computing with IPython

    7.4 8.0 L3 Jupyter Notebook
    IPython Parallel: Interactive Parallel Computing in Python
  • Clairvoyant

    7.2 0.0 L3 Python
    Software designed to identify and monitor social/historical cues for short-term stock movement
  • TextDistance

    6.9 6.1 Python
    📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
  • karateclub

    6.1 7.0 Python
    Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
  • jellyfish

    5.9 7.1 Jupyter Notebook
    🪼 A Python library for approximate and phonetic matching of strings.
  • Cubes

    5.9 0.0 L3 Python
    [NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
  • Optimus

    5.5 0.0 Python
    🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
  • Streamz

    5.0 0.0 Python
    Real-time stream processing for Python
  • bcolz

    4.7 0.0 C
    DISCONTINUED. A columnar data container that can be compressed.
  • fastparquet

    4.3 7.2 Python
    Python implementation of the Parquet columnar file format.
  • pdpipe

    3.9 0.0 Jupyter Notebook
    Easy pipelines for pandas DataFrames.
  • Bubbles

    3.6 0.0 L5 Python
    [NOT MAINTAINED] Bubbles – Python ETL framework
  • Zef

    1.7 2.8 Python
    Toolkit for graph-relational data across space and time
  • Google Analytics Extractor

    1.2 0.0 Python
    Tool for extracting Google Analytics data suitable for migrating to other platforms/databases
  • pyxll-utils

    1.0 0.0 Python
    DISCONTINUED. Utility code for use with PyXLL, the Python Excel Add-In.