Data Analysis packages

Showing projects tagged as Data Analysis

  • Dask

    9.1 9.7 L2 Python
    Parallel computing with task scheduling
  • Sacred

    7.6 5.2 Python
    Sacred is a tool, developed at IDSIA, to help you configure, organize, log and reproduce experiments.
  • Clairvoyant

    7.4 1.0 L3 Python
    Software designed to identify and monitor social/historical cues for short-term stock movement
  • Interactive Parallel Computing with IPython

    7.4 9.6 L3 Jupyter Notebook
    IPython Parallel: Interactive Parallel Computing in Python
  • AWS Data Wrangler

    6.8 9.1 Python
    Pandas on AWS: easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatch Logs, DynamoDB, EMR, Secrets Manager, PostgreSQL, MySQL, SQL Server and S3 (Parquet, CSV, JSON and Excel).
  • TextDistance

    6.7 5.7 Python
    Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
  • Cubes

    6.1 0.0 L3 Python
    Light-weight Python OLAP framework for multi-dimensional data analysis
  • jellyfish

    5.7 7.2 Python
    A Python library for approximate and phonetic matching of strings.
  • karateclub

    5.6 8.5 Python
    Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
  • Optimus

    5.2 9.9 Python
    Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
  • Streamz

    4.7 7.2 Python
    Real-time stream processing for Python
  • bcolz

    4.7 0.0 C
    A columnar data container that can be compressed.
  • pdpipe

    3.7 5.5 Python
    Easy pipelines for pandas DataFrames.
  • Bubbles

    3.6 0.0 L5 Python
    [NOT MAINTAINED] Bubbles: Python ETL framework
  • pyxll-utils

    1.0 0.0 Python
    Utility code for use with PyXLL, the Python Excel Add-In.