Description

Lightweight python wrapper for vowpal_wabbit.

Why: Scalable, blazingly fast machine learning.

Code Quality Rank: L3

Programming language: Python

License: BSD 3-clause "New" or "Revised" License

Tags: Machine Learning Scientific Engineering Artificial Intelligence

Latest version: v0.3

vowpal_porpoise alternatives and similar packages

Based on the "Machine Learning" category.

tensorflow

10.0 10.0 L1 vowpal_porpoise VS tensorflow

An Open Source Machine Learning Framework for Everyone
Keras

9.9 9.9 L2 vowpal_porpoise VS Keras

Deep Learning for humans

scikit-learn

9.9 9.9 L3 vowpal_porpoise VS scikit-learn

scikit-learn: machine learning in Python
gym

9.8 0.0 vowpal_porpoise VS gym

A toolkit for developing and comparing reinforcement learning algorithms.
xgboost

9.8 9.6 L1 vowpal_porpoise VS xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
CNTK

9.6 0.0 L1 vowpal_porpoise VS CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
PaddlePaddle

9.6 10.0 L1 vowpal_porpoise VS PaddlePaddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
MLflow

9.5 9.9 vowpal_porpoise VS MLflow

Open source platform for the machine learning lifecycle
gensim

9.5 7.5 L3 vowpal_porpoise VS gensim

Topic Modelling for Humans
MindsDB

9.5 10.0 vowpal_porpoise VS MindsDB

The platform for customizing AI from enterprise data
Prophet

9.5 6.2 vowpal_porpoise VS Prophet

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
TFLearn

9.1 0.0 L3 vowpal_porpoise VS TFLearn

Deep learning library featuring a higher-level API for TensorFlow.
NuPIC

8.8 0.0 L3 vowpal_porpoise VS NuPIC

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
H2O

8.8 9.7 vowpal_porpoise VS H2O

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
dspy

8.8 9.9 vowpal_porpoise VS dspy

DSPy: The framework for programming—not prompting—foundation models
Pyro.ai

8.7 8.4 vowpal_porpoise VS Pyro.ai

Deep universal probabilistic programming with Python and PyTorch
Surprise

8.4 0.0 L4 vowpal_porpoise VS Surprise

A Python scikit for building and analyzing recommender systems
srez

8.3 0.0 L5 vowpal_porpoise VS srez

Image super-resolution through deep learning
LightFM

7.9 4.8 L4 vowpal_porpoise VS LightFM

A Python implementation of LightFM, a hybrid recommendation algorithm.
Pylearn2

7.8 0.0 L2 vowpal_porpoise VS Pylearn2

Warning: This project does not have any current developer. See below.
skflow

7.6 1.3 L4 vowpal_porpoise VS skflow

Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning
Sacred

7.5 3.5 vowpal_porpoise VS Sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
PyBrain

7.5 0.0 L4 vowpal_porpoise VS PyBrain

Another Python Machine Learning Library.
Clairvoyant

7.2 0.0 L3 vowpal_porpoise VS Clairvoyant

Software designed to identify and monitor social/historical cues for short term stock movement
python-recsys

6.2 0.0 L4 vowpal_porpoise VS python-recsys

A python library for implementing a recommender system
Metrics

6.2 0.0 vowpal_porpoise VS Metrics

Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave
karateclub

6.1 7.0 vowpal_porpoise VS karateclub

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
awesome-embedding-models

6.0 0.0 vowpal_porpoise VS awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.
pydeep

5.9 0.0 L3 vowpal_porpoise VS pydeep

Deep learning in Python
Crab

5.7 0.0 L2 vowpal_porpoise VS Crab

Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world of scientific Python packages (numpy, scipy, matplotlib).
hebel

5.0 0.0 L2 vowpal_porpoise VS hebel

GPU-Accelerated Deep Learning Library in Python
seqeval

4.6 0.0 vowpal_porpoise VS seqeval

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
adaptive

4.5 6.3 vowpal_porpoise VS adaptive

Adaptive: parallel active learning of mathematical functions
Xorbits

4.4 8.8 vowpal_porpoise VS Xorbits

Scalable Python DS & ML, in an API compatible & lightning fast way.
TrueSkill, the video game rating system

4.2 1.4 vowpal_porpoise VS TrueSkill, the video game rating system

An implementation of the TrueSkill rating system for Python
SciKit-Learn Laboratory

3.9 8.7 vowpal_porpoise VS SciKit-Learn Laboratory

SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.
pdpipe

3.9 0.0 vowpal_porpoise VS pdpipe

Easy pipelines for pandas DataFrames.
rwa

3.8 0.0 L5 vowpal_porpoise VS rwa

Machine Learning on Sequential Data Using a Recurrent Weighted Average
Feature Forge

3.5 0.0 L4 vowpal_porpoise VS Feature Forge

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API
nptyping

3.3 0.0 vowpal_porpoise VS nptyping

💡 Type hints for Numpy and Pandas
Data Flow Facilitator for Machine Learning (dffml)

3.3 9.0 vowpal_porpoise VS Data Flow Facilitator for Machine Learning (dffml)

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
brew

3.2 0.0 L4 vowpal_porpoise VS brew

Multiple Classifier Systems and Ensemble Learning Library in Python.
bodywork

3.1 0.0 vowpal_porpoise VS bodywork

ML pipeline orchestration and model deployments on Kubernetes.
Robocorp Action Server

3.0 9.8 vowpal_porpoise VS Robocorp Action Server

Create 🐍 Python AI Actions and 🤖 Automations, and deploy & operate them anywhere
MLP Classifier

2.8 0.0 L4 vowpal_porpoise VS MLP Classifier

A handwritten multilayer perceptron classifier using numpy.
redframes

2.7 1.4 vowpal_porpoise VS redframes

General Purpose Data Manipulation Library
OptaPy

2.7 5.5 vowpal_porpoise VS OptaPy

OptaPy is an AI constraint solver for Python to optimize planning and scheduling problems.
openskill.py

2.5 7.5 vowpal_porpoise VS openskill.py

Multiplayer Rating System. No Friction.
omega-ml

1.8 8.2 vowpal_porpoise VS omega-ml

MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
ChaiPy

1.5 0.0 vowpal_porpoise VS ChaiPy

A developer interface for creating advanced chatbots for the Chai app.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

vowpal_porpoise

Lightweight python wrapper for vowpal_wabbit.

Why: Scalable, blazingly fast machine learning.

Install

Install vowpal_wabbit: clone it and run make.
Install cython: pip install cython
Clone vowpal_porpoise.
Run: python setup.py install

Now you can do: import vowpal_porpoise from Python.

Examples

Standard Interface

Linear regression with l1 penalty:

from vowpal_porpoise import VW

# Initialize the model
vw = VW(moniker='test',    # a name for the model
        passes=10,         # vw arg: passes
        loss='quadratic',  # vw arg: loss
        learning_rate=10,  # vw arg: learning_rate
        l1=0.01)           # vw arg: l1

# Inside the with training() block a vw process will be 
# open to communication
with vw.training():
    for instance in ['1 |big red square',
                     '0 |small blue circle']:
        vw.push_instance(instance)

    # here stdin will close
# here the vw process will have finished

# Inside the with predicting() block we can stream instances and 
# acquire their labels
with vw.predicting():
    for instance in ['1 |large burnt sienna rhombus',
                     '0 |little teal oval']:
        vw.push_instance(instance)

# Read the predictions like this:
predictions = list(vw.read_predictions_())
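
The instances pushed above are plain strings in vowpal_wabbit's text input format: a label, a vertical bar, then the features. Below is a minimal sketch of a helper that builds such strings from a label and a list of tokens. The name format_instance and its namespace parameter are hypothetical and not part of vowpal_porpoise. Note that in VW's text format a token written directly after the bar is read as a namespace name, so this helper puts a space after the bar when no namespace is given.

```python
def format_instance(label, tokens, namespace=None):
    """Build one line of VW text input (hypothetical helper, not a vowpal_porpoise API)."""
    cleaned = []
    for token in tokens:
        # Whitespace separates features; ':' and '|' are structural in VW's format,
        # so split on whitespace and replace the two special characters.
        for piece in str(token).split():
            cleaned.append(piece.replace(":", "_").replace("|", "_"))
    # A name glued to the bar is a namespace; a bare bar uses the default namespace.
    bar = "|" + namespace if namespace else "|"
    return "{} {} {}".format(label, bar, " ".join(cleaned))


plain = format_instance(1, ["big", "red", "square"])
named = format_instance(0, ["small", "blue", "circle"], namespace="shape")
print(plain)  # 1 | big red square
print(named)  # 0 |shape small blue circle
```

Strings built this way can be passed to vw.push_instance() inside the training() or predicting() blocks shown above.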

L-BFGS with a rank-5 approximation:

from vowpal_porpoise import VW

# Initialize the model
vw = VW(moniker='test_lbfgs', # a name for the model
        passes=10,            # vw arg: passes
        lbfgs=True,           # turn on lbfgs
        mem=5)                # lbfgs rank

Latent Dirichlet Allocation with 100 topics:

from vowpal_porpoise import VW

# Initialize the model
vw = VW(moniker='test_lda',  # a name for the model
        passes=10,           # vw arg: passes
        lda=100,             # turn on lda
        minibatch=100)       # set the minibatch size
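
The comments in the examples above ("vw arg: passes" and so on) indicate that the keyword arguments correspond to vw command-line options. As a rough illustration of that idea, here is a sketch that turns keyword options into a vw argument list. The function build_vw_command and its mapping table are assumptions made for this example and are not vowpal_porpoise's actual implementation; the flags on the right-hand side are standard vw options.

```python
def build_vw_command(**options):
    """Translate keyword options into a vw argument list (illustrative mapping only)."""
    # keyword used in the examples above -> vw command-line flag
    flag_names = {
        "passes": "--passes",
        "loss": "--loss_function",
        "learning_rate": "--learning_rate",
        "l1": "--l1",
        "lbfgs": "--bfgs",
        "mem": "--mem",
        "lda": "--lda",
        "minibatch": "--minibatch",
    }
    command = ["vw"]
    for name, value in options.items():
        flag = flag_names[name]  # raises KeyError for options this sketch does not know
        if value is True:
            command.append(flag)                 # boolean switch, e.g. --bfgs
        elif value is not False and value is not None:
            command.extend([flag, str(value)])   # option with a value, e.g. --mem 5
    return command


command = build_vw_command(passes=10, lbfgs=True, mem=5)
print(" ".join(command))  # vw --passes 10 --bfgs --mem 5
```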

Scikit-learn Interface

vowpal_porpoise also ships with an interface into scikit-learn, which allows awesome experiment-level stuff like cross-validation:

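# Written against older scikit-learn releases: sklearn.cross_validation and
# sklearn.grid_search were removed in 0.20 in favour of sklearn.model_selection.
from sklearn.cross_validation import StratifiedKFold
from sklearn.grid_search import GridSearchCV
from sklearn.metrics import f1_score
from vowpal_porpoise.sklearn import VW_Classifier

# parameters (the grid to search), X_train and y_train must be defined
# by the caller; see example_sklearn.py.
GridSearchCV(
        VW_Classifier(loss='logistic', moniker='example_sklearn',
                      passes=10, silent=True, learning_rate=10),
        param_grid=parameters,
        score_func=f1_score,
        cv=StratifiedKFold(y_train),
).fit(X_train, y_train)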

Check out example_sklearn.py for more details

Library Interface (DISABLED as of 2013-08-12)

Via the VW interface:

with vw.predicting_library():
    for instance in ['1 |large burnt sienna rhombus',
                     '1 |little teal oval']:
        prediction = vw.push_instance(instance)

Now the predictions are returned directly to the parent process, rather than having to read from disk. See examples/example1.py for more details.

Alternatively you can use the raw library interface:

import vw_c
vw = vw_c.VW("--loss=quadratic --l1=0.01 -f model")
vw.learn("1 |this is a positive example")
vw.learn("0 |this is a negative example")
vw.finish()

The library interface currently does not support multiple passes, due to some limitations in the underlying vw C code.

Need more examples?

example1.py: SimpleModel class wrapper around vowpal_porpoise (both standard and library flavors)
example_library.py: Demonstrates the low-level vw library wrapper, classifying lines of Alice in Wonderland vs. Through the Looking-Glass.

Why

vowpal_wabbit is insanely fast and scalable. vowpal_porpoise is slower, but only during the initial training pass. Once the data has been properly cached it will idle while vowpal_wabbit does all the heavy lifting. Furthermore, vowpal_porpoise was designed to be lightweight and not to get in the way of vowpal_wabbit's scalability, e.g. it allows distributed learning via --nodes and does not require data to be batched in memory. In our research work we use vowpal_porpoise on an 80-node cluster running over multiple terabytes of data.

The main benefit of vowpal_porpoise is allowing rapid prototyping of new models and feature extractors. We found that we had been doing this in an ad-hoc way using python scripts to shuffle around massive gzipped text files, so we just closed the loop and made vowpal_wabbit a python library.
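
As an illustration of the "no batching in memory" point, the sketch below streams instances lazily from a gzipped text file with a generator, so only one line is held at a time. The helper name iter_instances and the file name are hypothetical; the demo writes a tiny file just to have something to read.

```python
import gzip
import os
import tempfile


def iter_instances(path):
    """Yield non-empty lines from a gzipped text file one at a time."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield line


# Demo on a throwaway file; real training data would already exist on disk.
with tempfile.TemporaryDirectory() as workdir:
    path = os.path.join(workdir, "train.vw.gz")
    with gzip.open(path, "wt", encoding="utf-8") as handle:
        handle.write("1 | big red square\n\n0 | small blue circle\n")
    streamed = list(iter_instances(path))

print(streamed)
```

In real use the list() call would be dropped: inside a with vw.training(): block, each yielded line can go straight to vw.push_instance(line).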

How it works

vowpal_porpoise wraps the vw binary in a subprocess, using stdin to push data and temporary files to pull predictions. Why not use the prediction labels vw provides on stdout? It turns out that the Python GIL makes streaming in and out of a process (even asynchronously) painfully difficult. If you know of a clever way to get around this, please email me. In other languages (e.g. in a forthcoming Scala wrapper) this is not an issue.

Alternatively, you can use a pure api call (vw_c, wrapping libvw) for prediction.
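
The subprocess pattern described above (write to the child's stdin, then read results from a temporary file once the child has exited) can be sketched as follows. This is not vowpal_porpoise's code: the child here is a hypothetical stand-in written in Python that copies each instance's label into the output file, so the example runs without vw installed. With the real binary, the output file would be the one passed to vw's -p option.

```python
import os
import subprocess
import sys
import tempfile

# Hypothetical stand-in for the vw binary: reads instances on stdin and
# writes the first field of each line to the file named in argv[1].
CHILD_SOURCE = """
import sys
with open(sys.argv[1], "w") as out:
    for line in sys.stdin:
        out.write(line.split(" ", 1)[0] + "\\n")
"""


def run_with_prediction_file(instances):
    """Push instances through a child's stdin, then read its output file."""
    fd, prediction_path = tempfile.mkstemp(suffix=".predictions")
    os.close(fd)
    try:
        child = subprocess.Popen(
            [sys.executable, "-c", CHILD_SOURCE, prediction_path],
            stdin=subprocess.PIPE,
            universal_newlines=True,
        )
        for instance in instances:
            child.stdin.write(instance + "\n")
        child.stdin.close()  # closing stdin tells the child there is no more input
        if child.wait() != 0:  # the file is complete only after the child exits
            raise RuntimeError("child process failed")
        with open(prediction_path) as handle:
            return [line.strip() for line in handle]
    finally:
        os.remove(prediction_path)


results = run_with_prediction_file(["1 | big red square", "0 | small blue circle"])
print(results)  # ['1', '0']
```

Because the parent only writes to the pipe and never reads from it, there is no risk of the two processes blocking on each other's buffers, which is the difficulty the stdout approach runs into.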

Contact

Joseph Reisinger @josephreisinger

Contributors

Austin Waters
Joseph Reisinger
Daniel Duckworth

License

Apache 2.0

*Note that all licence references and agreements mentioned in the vowpal_porpoise README section above are relevant to that project's source code only.
