Popularity

2.8

Stable

Activity

0.0

Declining

Stars 87

Watchers 37

Forks 79

Last Commit almost 5 years ago

Description

This project consists of a python client that interacts with the IBM Watson Speech To Text service through its WebSockets interface. The client streams audio to the STT service and receives recognition hypotheses in real time. It can run N simultaneous recognition sessions

Code Quality Rank: L5

Programming language: Python

Tags: Audio Speech Data

speech-to-text-websockets-python alternatives and similar packages

Based on the "Speech Data" category.
Alternatively, view speech-to-text-websockets-python alternatives based on common mentions on social networks and blogs.

SpeechRecognition

9.0 7.5 L3 speech-to-text-websockets-python VS SpeechRecognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.
Watson Developer Cloud Python SDK

6.6 7.1 L5 speech-to-text-websockets-python VS Watson Developer Cloud Python SDK

:snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

Promo www.influxdata.com

aeneas

6.4 0.0 L3 speech-to-text-websockets-python VS aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
speechpy

4.5 0.0 speech-to-text-websockets-python VS speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Prosodylab-Aligner

3.3 0.0 L4 speech-to-text-websockets-python VS Prosodylab-Aligner

Python interface for forced audio alignment using HTK and SoX
praatIO

2.8 4.8 L3 speech-to-text-websockets-python VS praatIO

A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).
ProMo

1.8 2.7 L4 speech-to-text-websockets-python VS ProMo

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.
pyAcoustics

1.7 3.9 L4 speech-to-text-websockets-python VS pyAcoustics

A collection of python scripts for extracting and analyzing acoustics from audio files.
pysle

1.2 4.6 L4 speech-to-text-websockets-python VS pysle

Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Do you think we are missing an alternative of speech-to-text-websockets-python or a related project?

Add another 'Speech Data' Package

Popular Comparisons

README

This sample has been deprecated. Please use the Official Watson Python SDK

Synopsis

Installation

There are some dependencies that need to be installed for this script to work. It is advisable to install the required packages in a separate virtual environment. Certain packages have been observed to conflict with the package requirements for this script; in particular the package nose conflicts with these required packages. In order to interact with the STT service via WebSockets, it is necessary to install pip, then write the following commands:

pip install -r requirements.txt

You also may need to write this command

$ apt-get install build-essential python-dev

If you are creating an environment using anaconda, proceed with the above pip command to install the packages--do not use conda to install the requirements as conda will install nose as a dependency.

Examples

The example below will run the default 10 WAV files through the WebSockets interface of the Speech To Text (STT) service and will dump the recognition hypotheses to a file under the "./output" directory.

$ python ./sttClient.py -credentials <username>:<password> -model en-US_BroadbandModel

The example below performs the same task much faster by opening 10 simultaneous recognition sessions (WebSocket connections) against the STT service.

$ python ./sttClient.py -credentials <username>:<password> -model en-US_BroadbandModel -threads 10

Options

To see the list of available options type:

$ python sttClient.py -h

Motivation

This script has been created by Daniel Bolanos in order to facilitate and promote the utilization of the IBM Watson Speech To Text service.