voice-recognition | 易学教程

Glass GDK: Contextual voice commands without the “Ok Glass”

Question: I have an application with a CardScrollView as its activity. The current flow is: "Ok Glass" => My App => "Ok Glass" => Next Page, or "Ok Glass" => Previous Page. Would it be possible to skip the second "Ok Glass"? Ideally the voice command would stay active at all times, but I don't know whether that is possible.

Answer 1: It is currently not possible to trigger contextual voice commands without the "ok glass" cue. The main reasons are to prevent false positives and to narrow the window during which the microphone is listening for commands.

Recognizing multiple people's voices

Question: I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might find something like this?

Answer 1: You can consider:
Bob SPEAR: https://pypi.python.org/pypi/bob.bio.spear
Alize/Mistral: http://mistral.univ-avignon.fr/index_en.html
GMM speaker identification in MATLAB: https://github.com/codyaray/speaker-recognition
Very basic speaker recognition in Java, not really accurate: https://github

How To: Add voice commands to an Android application

Question: There are many tutorials online for adding voice recognition to an Android app. They are often confusing, and the authors of the code are never available for questions. I need a simple tutorial with complete code for adding voice recognition to my app.

Answer 1: If you want to add voice recognition to your group's Android app, it is very simple. Throughout this tutorial you will need to add imports as you paste in the code. Create an XML file or use an existing one and make sure that you add

I can't install PyAudio for Python. How do I do it?

Question: I have a Windows 10 PC and I want to install PyAudio to use it with my chatbot, which is powered by ChatterBot. I tried two different ways to install PyAudio. One way was to run the following at the command prompt:

    python -m pip install PyAudio

This is the result:

    C:\Users\Waaberi>python -m pip install PyAudio
    Collecting PyAudio
      Using cached https://files.pythonhosted.org/packages/ab/42/b4f04721c5c5bfc196ce156b3c768998ef8c0ae3654ed29ea5020c749a6b/PyAudio-0.2.11.tar.gz
    Installing collected packages: PyAudio
      Running

Voice recognition on Android with a recorded sound clip?

Question: I've used the voice recognition feature on Android and I love it. It's one of my customers' most praised features. However, the format is somewhat restrictive: you have to call the recognizer intent, have it send the recording to Google for transcription, and wait for the text to come back. Some of my ideas would require recording the audio within my app and then sending the clip to Google for transcription. Is there any way I can send an audio clip to be processed with speech to text?

Answer 1: I got a

Why do MFCC extraction libs return different values?

Question: I am extracting MFCC features using two different libraries: the python_speech_features library and the BOB library. However, the outputs of the two are different, and even the shapes are not the same. Is that normal, or is there a parameter that I am missing? The relevant section of my code is the following:

    import bob.ap
    import numpy as np
    from scipy.io.wavfile import read
    from sklearn import preprocessing
    from python_speech_features import mfcc, delta, logfbank

    def bob_extract_features(audio, rate):
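
The excerpt above does not include an answer, so the following is only a hedged illustration of one generic way two MFCC extractors can disagree on output shape: the framing policy for the last partial frame. It is a sketch under stated assumptions, not a description of what python_speech_features or BOB actually do. The function names, the 16 kHz rate, and the 25 ms / 10 ms window and step are all hypothetical choices made for the example.

```python
import math


def frames_with_padding(num_samples, frame_len, frame_step):
    """Frame count when a trailing partial frame is zero-padded and kept."""
    if num_samples <= frame_len:
        return 1
    return 1 + math.ceil((num_samples - frame_len) / frame_step)


def frames_without_padding(num_samples, frame_len, frame_step):
    """Frame count when a trailing partial frame is dropped."""
    if num_samples < frame_len:
        return 0
    return 1 + (num_samples - frame_len) // frame_step


# Hypothetical input: one second of audio sampled at 16 kHz.
RATE = 16000
NUM_SAMPLES = RATE
FRAME_LEN = RATE * 25 // 1000   # 25 ms window, in samples
FRAME_STEP = RATE * 10 // 1000  # 10 ms step, in samples

PADDED = frames_with_padding(NUM_SAMPLES, FRAME_LEN, FRAME_STEP)
TRUNCATED = frames_without_padding(NUM_SAMPLES, FRAME_LEN, FRAME_STEP)
```

With identical window and step sizes, the two policies still differ by one frame on this input, so the first dimension of the feature matrix would not match. Differences in the number of cepstral coefficients, filter count, or pre-emphasis would change the values as well, which is why comparing the parameters of both extractors side by side is a reasonable first check.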

How to detect homophones

Question: I am fairly new to speech processing and am wondering how homophones are detected. I am looking for an API that gives the similarity between two words based on how they are pronounced. For example, "to" and "two" sound highly similar, compared with, say, "to" and "from".

Answer 1: You might want to try calculating the edit distance not on the original strings but on their pronunciations, such as those available in the CMU Pronouncing Dictionary at http://www.speech.cs.cmu
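
The idea in the answer above can be sketched as follows: look up each word's phoneme sequence, then compute a Levenshtein edit distance over phonemes instead of letters. This is a minimal sketch, not a definitive implementation. The `PRONUNCIATIONS` table is a tiny hand-written excerpt in the style of CMU Pronouncing Dictionary entries and is an assumption for illustration; a real lookup would load the full dictionary. The function names are hypothetical.

```python
# Tiny hand-written excerpt in CMU Pronouncing Dictionary style (ARPAbet
# phonemes with stress digits). Assumption: a real application would load
# the full dictionary instead of this hard-coded table.
PRONUNCIATIONS = {
    "to": ["T", "UW1"],
    "two": ["T", "UW1"],
    "too": ["T", "UW1"],
    "from": ["F", "R", "AH1", "M"],
}


def edit_distance(a, b):
    """Levenshtein distance between two sequences (here, phoneme lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phonetic_distance(word1, word2):
    """Edit distance between the pronunciations of two known words."""
    return edit_distance(PRONUNCIATIONS[word1.lower()],
                         PRONUNCIATIONS[word2.lower()])


def are_homophones(word1, word2):
    """Treat two words as homophones when their phoneme lists are identical."""
    return phonetic_distance(word1, word2) == 0
```

Here `are_homophones("to", "two")` is true because both map to the same phoneme list, while "to" and "from" share no phonemes and end up far apart. Words with several listed pronunciations would need the minimum distance over all pairs, which this sketch leaves out.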

Get raw voice recording from Android's search framework

Question: I'm planning to use Android's Search Framework with voice recognition to search through existing texts (short sentences of 5-10 words), either by typing or by voice. If the search was by voice, I'd like to store the search itself (the search text) as well as the raw sound that was recorded. Looking at the guide and everything written about this, I cannot find a way to get the raw recorded voice, nor can I find a place where I can provide a recorded sound to the search interface. Does anybody know if this

How can I use voice recognition with other languages on Android?

Question: I have code that used to work, but for some reason it suddenly stopped working. I'm trying to use voice recognition in Hebrew, but for the past few days it has started voice recognition in English instead. Here is my code:

    sr = SpeechRecognizer.createSpeechRecognizer(getApplicationContext());
    test_voice_recognitiona listener = new test_voice_recognitiona();
    sr.setRecognitionListener(listener);
    Intent fl = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
    fl.putExtra("android