speech-recognition | 易学教程

SpeechRecognizer broken after Google Glass XE17 update — how to work around?

阅读更多关于 SpeechRecognizer broken after Google Glass XE17 update — how to work around?

问题 Standard Android SpeechRecognizer was working perfectly on Google Glass XE16 - XE16.2 Then the XE17 update suddenly broke everything, with the following error and no callbacks to the Listener anymore: E/AndroidRuntime(6321): FATAL EXCEPTION: main E/AndroidRuntime(6321): Process: com.google.glass.voice, PID: 6321 E/AndroidRuntime(6321): java.lang.NullPointerException: VoiceEngine.startListening: voiceConfig cannot be null E/AndroidRuntime(6321): at com.google.glass.predicates.Assert

Speech Recognition API without Grammar C#

阅读更多关于 Speech Recognition API without Grammar C#

问题 I am developing a program where 99% of the time i can use a set Grammar in the standard Microsoft Speech Recognition to detect words being said. However in a few circumstances, i need the user to say something. This is never going to be predictable. So is there a way to do this with the MS speech recognition? And are there any other speech recognition API's out there (for free), that can handle non-preset words? 回答1: There is a class called DictationGrammar that allows you to recognize

android speech recognition service null pointer exceptions

阅读更多关于 android speech recognition service null pointer exceptions

问题 i am trying to create an android speech recognitions service following the example here: Android Speech Recognition Continuous Service and below is my code: public class MainActivity extends Activity { private int mBindFlag; Messenger mServiceMessenger; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); Context activityContext = getApplicationContext(); Intent service = new Intent(activityContext, VoiceCommandService.class); activityContext

How to specify phonetic keywords for IBM Watson speech2text service?

阅读更多关于 How to specify phonetic keywords for IBM Watson speech2text service?

问题 While we have had good success with Bluemix Java SDK in the general case, we've bumped into problems while trying to recognize occasional non-English words (e.g., foreign last names). Our hope was that one could specify the keyword list using SPR phonetic notation (which works great for text2speech), but that does not seem to be supported for speech2text. Any suggestions/workarounds? SpeechToText service = new SpeechToText(); service.setUsernameAndPassword("USERNAME", "PASSWORD"); File audio

Methods to do outlier detection in sound recognition?

阅读更多关于 Methods to do outlier detection in sound recognition?

问题 There are models to recognize 2-class sounds, which are class-A and class-B. How to recognize class-C sounds as abnormal sound ? I tried to set a threshold while recognizing by frames. above 70% -> class A or B else -> abnormal For example, If a sound has 10 frames, and the result is frame 1 2 3 4 5 6 7 8 9 10 A B A B A A A B A A A=7 B=3 -> class A frame 1 2 3 4 5 6 7 8 9 10 B B A B A A A B A A A=6 B=4 -> abnormal The performance is very bad. what should I do ? 回答1: There are two ways to look

Semantic Interpretation in SRGS XML grammar

阅读更多关于 Semantic Interpretation in SRGS XML grammar

问题 I have got the following XML grammar to detect a number like 1000 or 2200 etc. <rule id="rule1" scope="public"> <one-of> <item>1</item> <item>2</item> <item>3</item> </one-of> <ruleref uri="#rule2"/> </rule> <rule id="rule2" scope="public"> <one-of> <item>thousand<tag>out="000";</tag></item> <item>thousand 100<tag>out=100;</tag></item> <item>thousand 200<tag>out=200;</tag></item> </one-of> </rule> However when the user says for example 2100, I get "2 thousand 100" instead of 2100. It seems

Could not find pyaudio check installation in mac

阅读更多关于 Could not find pyaudio check installation in mac

问题 I have installed portaudio using brew and installed speechRecognition, pyaudio using pip3 in my macos. I am getting error while running following code CODE: #!/usr/bin/env python # Python 3 import os import sys import speech_recognition as sr r = sr.Recognizer() with sr.Microphone() as source: print ("say something!"); audio = r.listen(source) try: print("google thinks you said" +r.recognize_google(audio)) except: pass ERROR: Traceback (most recent call last): File "test.py", line 8, in

Launch app on voice command android

阅读更多关于 Launch app on voice command android

问题 i need an example of app launch on particular word. i know this is possible using speech recognition api. I just need a clue to start. I see other apps are able to establish this. 回答1: I did this b4 using android service.i kept a background service working and added a listiner to it and keep checking the tresults if it equals ur voice command. 回答2: try to use Android Service which listens to voice + voice to text and you can go through the text and find the check word 来源： https:/

Make Sphinx4 Recognize all the numbers using custom .gram file

阅读更多关于 Make Sphinx4 Recognize all the numbers using custom .gram file

问题 Description A speech recognizer calculator in Java Using Sphinx4 library exists. The full code on github: here The gram file i am using is the below(on github): #JSGF V1.0; /** * JSGF Grammar */ grammar grammar; public <syntax> = (one | two | three| four| five | six | seven | eight | nine | ten | eleven | twelve | thirteen | fourteen | fifteen | sixteen | seventeen | eighteen | nineteen | twenty) (plus | minus | multiply | division) (one | two | three| four| five | six | seven | eight | nine

Where to find the NLU Context Tag for Nuance Speech kit configuration Swift 3

阅读更多关于 Where to find the NLU Context Tag for Nuance Speech kit configuration Swift 3

问题 I am using Nuance Speech kit in Swift 3 , I had downloaded the sample app for swift , the configuration has a variable , // Only needed if using NLU/Bolt var SKSNLUContextTag = "!NLU_CONTEXT_TAG!" I am able to find all the other credentials required for speech recognition except the context tag . How / where to find SKSNLUContextTag value ? 回答1: You actually need to sign up for Nuance Mix. After that you'll get NLU context tags. https://developer.nuance.com/mix 来源： https://stackoverflow.com