speech-recognition

speech recognition in java

[亡魂溺海] 提交于 2019-12-05 04:51:32
问题 I want to use speech recognition in my project and I found this code but when I run it I get an error which is: run: java.lang.NullPointerException at newpackage.HelloWorld.main(HelloWorld.java:55) Please could one of you help me in this problem? This is the server code that I use: package newpackage; import java.util.logging.Level; import java.util.logging.Logger; import javax.speech.*; import javax.speech.recognition.*; import java.io.FileReader; import java.util.Locale; public class

SpeechRecognizer not Hearing After First Result

99封情书 提交于 2019-12-05 04:31:17
问题 I am using SpeechRecognizer and RecognizerIntent in Android to implement speech recognition. My aim is to restart listening to speech after my speech recognizer displays the results on the screen. For that purpose, I am using the following code. The problem is, the first time runs fine and displays the results but after it starts listening for the second time (called from onResults method), it does not hear what is being spoken for some reason. Then it gives a ERROR_SPEECH_TIMEOUT error,

x-webkit-speech, how to auto click the mic icon to record many words automatically?

六眼飞鱼酱① 提交于 2019-12-05 04:28:47
I am working on chrome browser. The code <input x-webkit-speech> will show this: if you click the mic icon, you can input speech. like this My question is how to make it record voice to recognize automatically, without manually click the mic icon ? Such as when the page loaded, it start to record; or write codes to emulate a click event? Is it possible? May you give a sample code or x-webkit-speech API's document? Thank you. This should not be possible, because if it were it would be a privacy hazard — a page could start recording what is said in the vicinity without the user being aware of it

can i use google speech recognition api in my desktop application

与世无争的帅哥 提交于 2019-12-05 04:02:40
问题 I want to know whether i can use speech recognition api of google for my desktop application. I have seen some example in which i have to convert the speech to a file and send to a url. But it will be little cumbersome task because in my application the user have to continuously submit his voice. So is there any other alternative to use google speech api. I am least interested to go with sphinx because its accuracy is very less and i dont know how to add new words in the dictionary and

C# - Capture RTP Stream and send to speech recognition

谁说胖子不能爱 提交于 2019-12-05 00:53:20
问题 What I am trying to accomplish: Capture RTP Stream in C# Forward that stream to the System.Speech.SpeechRecognitionEngine I am creating a Linux-based robot which will take microphone input, send it Windows machine which will process the audio using Microsoft Speech Recognition and send the response back to the robot. The robot might be hundreds of miles from the server, so I would like to do this over the Internet. What I have done so far: Have the robot generate an RTP stream encoded in MP3

Non-Speech Noise or Sound Recognition Software?

对着背影说爱祢 提交于 2019-12-05 00:36:54
问题 I'm working on some software for children, and looking to add the ability for the software to respond to a number of non-speech sounds. For instance, clapping, barking, whistling, fart noises, etc. I've used CMU Sphinx and the Windows Speech API in the past, however, as far as I can tell neither of these have any support for non-speech noises, and in fact I believe actively filter them out. In general I'm looking for "How do I get this functionality" but I suspect it may help if I break it

Question SpeechSynthesizer.SetOutputToAudioStream audio format problem

假装没事ソ 提交于 2019-12-05 00:10:06
问题 I'm currently working on an application which requires transmission of speech encoded to a specific audio format. System.Speech.AudioFormat.SpeechAudioFormatInfo synthFormat = new System.Speech.AudioFormat.SpeechAudioFormatInfo(System.Speech.AudioFormat.EncodingFormat.Pcm, 8000, 16, 1, 16000, 2, null); This states that the audio is in PCM format, 8000 samples per second, 16 bits per sample, mono, 16000 average bytes per second, block alignment of 2. When I attempt to execute the following

Speech to text in emulator: No Activity found to handle Intent

被刻印的时光 ゝ 提交于 2019-12-04 23:47:04
问题 I want to ask how I can use speech to text code on my emulator. My codes work on real device but not work on emulator. The error said : No Activity found to handle Intent { act=android.speech.action.RECOGNIZE_SPEECH (has extras) } What can I do? 回答1: package net.viralpatel.android.speechtotextdemo; import java.util.ArrayList; import android.app.Activity; import android.content.ActivityNotFoundException; import android.content.Intent; import android.os.Bundle; import android.speech

Continuous Speech Recognition on browser like “ok google” or “hey siri”

自闭症网瘾萝莉.ら 提交于 2019-12-04 21:52:54
I am doing a POC and my requirement is that I want to implement the feature like OK google or Hey Siri on browser. I am using the Chrome Browser's Web speech api . The things I noticed that I can't continuous the recognition as it terminates automatically after a certain period of time and I know its relevant because of security concern. I just does another hack like when the SpeechReognition terminates then on its end event I further start the SpeechRecogntion but it is not the best way to implement such a solution because suppose if I am using the 2 instances of same application on the

C# grammar and switch wildcard

℡╲_俬逩灬. 提交于 2019-12-04 21:27:07
I would like to add, that whenever it recognizes 'search X' it is going to search for 'X', but i don't know how i have to add that to the grammar, or how to do such a thing with my switch statement. private void Form1_Load(object sender, EventArgs e) { Choices commands = new Choices(); commands.Add(new string[] { "hello", "start chrome", "search" }); GrammarBuilder gBuilder = new GrammarBuilder(); gBuilder.Append(commands); gBuilder.Culture = new System.Globalization.CultureInfo("en-GB"); Grammar grammar = new Grammar(gBuilder); recEngine.LoadGrammarAsync(grammar); recEngine