speech-recognition

Android Wear Custom Voice Actions

Deadly submitted on 2019-11-28 00:13:55
Question: I'm making an app that will work with Android Wear, and I want to add a command to Google's "Ok Google" option. I saw this page: http://developer.android.com/training/wearables/apps/voice.html, but it only covers apps that include Activities on the Android Wear device. I wanted to ask: Can I add custom commands, that is, ones that do not start with the word "Start"? Can I add commands that do something other than just opening the app, such as running a method? If it's not the place

Use offline voice-to-text in Android 4.1 (Jelly Bean) from my application?

此生再无相见时 submitted on 2019-11-27 19:32:50
Android 4.1 includes offline voice typing. You can see this when you tap the microphone on the pop-up keyboard: it still works when you have no network connection. However, it looks like the RecognizerIntent API still requires a network connection when you want to do speech recognition from your app. Is there an API for accessing the new offline voice typing feature in Android from my app? Android uses two different APIs for voice: voice typing (the key on the keyboard) and voice recognition (everywhere else). Voice typing is enabled offline and voice recognition is not (for 4.1 JellyBean

Remove quotes from String in Python

橙三吉。 submitted on 2019-11-27 19:16:39
I have Python code that recognizes speech using the Google STT engine and gives me back the results, but the results come back as strings wrapped in "quotes". I don't want those quotes, because I use the output to run many commands and they stop it from working. I haven't tried anything so far because I don't know what to try. This is the function in the Python code that recognizes speech: def recog(): p = subprocess.Popen(['./speech-recog.sh'], stdout=subprocess.PIPE, stderr=subprocess.PIPE) global out,err out, err = p.communicate() print out This is speech-recog.sh: #!/bin/bash hardware="plughw:1,0"
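One way to drop the quotes described in the question above is to clean the captured text before using it as a command. This is a minimal sketch, not the accepted answer from the original thread: the helper names `strip_quotes` and `remove_all_quotes` are hypothetical, and it assumes the recognizer output has already been turned into a text string (under Python 3, `p.communicate()` returns bytes, so `out.decode()` would be needed first).

```python
def strip_quotes(text):
    """Trim surrounding whitespace, then remove one pair of matching quotes.

    Hypothetical helper: only strips quotes that wrap the whole string.
    """
    text = text.strip()
    if len(text) >= 2 and text[0] == text[-1] and text[0] in "\"'":
        return text[1:-1]
    return text


def remove_all_quotes(text):
    """Remove every double-quote character, wherever it appears."""
    return text.replace('"', '')


# Example with a string shaped like the quoted output the question describes.
raw = '"turn on the light"\n'
command = strip_quotes(raw)
```

`strip_quotes` is the more conservative choice because it leaves quotes inside the phrase untouched; `remove_all_quotes` fits if the script can emit several quoted fragments on one line.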

Keyword Spotting in Speech on Android?

ぃ、小莉子 submitted on 2019-11-27 19:05:23
Question: I need to use speech input to insert text. How can I detect a keyword while I'm speaking? Can I do this with Android Speech Input, or do I need an external library? Any ideas? Thanks. Answer 1: Keyword detection is a different task from speech recognition. While the latter tries to understand the text being spoken and checks all possible word combinations, keyword spotting usually checks two hypotheses: the word is here or garbage is here. It's far more efficient to check for keyword presence, but it requires

Building openears compatible language model

假装没事ソ submitted on 2019-11-27 18:24:23
I am doing some development on speech-to-text and text-to-speech, and I found the OpenEars API very useful. This cmu-slm based API uses a language model to map the speech heard by the iPhone. So I decided to find a large English language model to feed to the API's speech recognizer engine. But I could not understand the format of the VoxForge English data model well enough to use it with OpenEars. Does anyone know how I can get the .languagemodel and .dic files for English to work with OpenEars? Halle: Old question, but maybe the answer is still interesting.

Java speech recognition API [closed]

喜欢而已 submitted on 2019-11-27 17:21:23
I am currently developing an Android application in which I need to implement speech recognition. Could you suggest a link where I could find a Java speech recognition API? CMU Sphinx Open Source Toolkit For Speech Recognition CMU Sphinx Open Source Toolkit For Speech Recognition Android Please read these articles; they contain good tutorials on speech recognition: 1) This - A video tutorial from YouTube. 2) This - A tutorial that goes through the process of creating an app which uses SpeechRecognition. 3) This - An open-source voice recognition project... Anthony: I would advise the android.speech

SAPI: Speech to Text example

≯℡__Kan透↙ submitted on 2019-11-27 16:58:36
Question: I am new to SAPI, and I would really appreciate it if any of you could provide a speech-to-text Hello World example in SAPI. I know MS has some examples like "Dictation", but I would like to start with a very small one. Glad if you can help. Answer 1: I played a bit with Windows Voice Recognition using SAPI; it really isn't user friendly. Here is an example of code I wrote (in C++): #include <sphelper.h> #include <sapi.h> #include <iostream> #include <string> const ULONGLONG grammarId = 0;

Building Speech Dataset for LSTM binary classification

血红的双手。 submitted on 2019-11-27 16:09:35
I'm trying to do binary LSTM classification using Theano. I have gone through the example code, but I want to build my own. I have a small set of "Hello" and "Goodbye" recordings that I am using. I preprocess these by extracting their MFCC features and saving the features in a text file. I have 20 speech files (10 each) and I am generating a text file for each word, so 20 text files that contain the MFCC features. Each file is a 13x56 matrix. My problem now is: how do I use these text files to train the LSTM? I am relatively new to this. I have gone through some literature on it as
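As a rough illustration of the data-preparation step the question above asks about, the sketch below turns MFCC text files into per-recording sequences with labels. It is not taken from the original thread and makes several assumptions: each file holds whitespace-separated numbers with one MFCC coefficient per row (13 rows, 56 frames), the label convention (0 for "Hello", 1 for "Goodbye") is invented for the example, and the function names are hypothetical. It uses only the standard library; with NumPy installed, `numpy.loadtxt` would perform the parsing step.

```python
def parse_mfcc_text(text):
    """Parse whitespace-separated rows of numbers into a list of float rows."""
    return [[float(value) for value in line.split()]
            for line in text.splitlines() if line.strip()]


def to_time_major(matrix):
    """Transpose (n_coeffs x n_frames) into (n_frames x n_coeffs).

    Recurrent models typically consume one feature vector per time step,
    so each frame becomes one row.
    """
    return [list(frame) for frame in zip(*matrix)]


def build_dataset(labelled_texts):
    """Build (sequences, labels) from an iterable of (file_contents, label)."""
    sequences, labels = [], []
    for text, label in labelled_texts:
        sequences.append(to_time_major(parse_mfcc_text(text)))
        labels.append(label)
    return sequences, labels


# Tiny stand-in for a real file: 2 coefficients x 3 frames instead of 13 x 56.
example_file = "1 2 3\n4 5 6\n"
X, y = build_dataset([(example_file, 0)])
```

With the real files, each entry of `X` would hold 56 frames of 13 coefficients, so the full set would have shape 20 x 56 x 13 alongside 20 labels, which is the (samples, time steps, features) layout that LSTM implementations generally expect.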

How to train on and make a serialized feature vector for a Neural Network?

我怕爱的太早我们不能终老 submitted on 2019-11-27 15:48:12
By serialized I mean that the values for an input arrive at discrete intervals of time and that the size of the vector is not known beforehand. Conventionally, neural networks employ fixed-size parallel input neurons and fixed-size parallel output neurons. A serialized implementation could be used in speech recognition, where I can feed the network a time series of the waveform and get the phonemes at the output end. It would be great if someone could point out an existing implementation. Nikolay Shmyrev: A simple neural network as a structure doesn't have invariance across time scale
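To make the idea of a "serialized" input concrete, here is a minimal sketch of a recurrent update, one common way to consume a sequence whose length is not known in advance. It is an illustration only, not necessarily what the truncated answer goes on to recommend: the single-unit Elman-style cell, the function name `rnn_forward`, and the weight values are all assumptions chosen for the example.

```python
import math


def rnn_forward(sequence, w_in, w_rec, bias):
    """Run a single-unit Elman-style recurrence over a sequence of floats.

    The same three parameters are reused at every time step, so the
    sequence can have any length. Returns the hidden state after each step.
    """
    hidden = 0.0
    states = []
    for x in sequence:
        hidden = math.tanh(w_in * x + w_rec * hidden + bias)
        states.append(hidden)
    return states


# Two inputs of different lengths pass through the same parameters.
short_states = rnn_forward([0.1, 0.2, 0.3], w_in=0.5, w_rec=0.5, bias=0.0)
long_states = rnn_forward([0.1] * 7, w_in=0.5, w_rec=0.5, bias=0.0)
```

Because the weights are shared across time steps, the network size stays fixed while the input length varies, which is the property the question is looking for.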

API or SDK for speech to text(speech recognition ) iphone

谁都会走 submitted on 2019-11-27 15:22:52
Question: Hi, I want a speech recognition API or SDK that recognises the speech spoken by the user and returns its text form. A detailed description follows: In my application I need to play an audio file whose text I already have. When the audio starts playing, the word being spoken (from the audio file) should be highlighted. So if I can get the word from an API or SDK, it is possible to highlight it. Apart from that, I googled a lot for an API and came across ceedvocalsdk