speech-recognition | 易学教程

Speech recognition in Java

Question: I want to use speech recognition in my project. I found this code, but when I run it I get the following error:

run: java.lang.NullPointerException at newpackage.HelloWorld.main(HelloWorld.java:55)

Could someone help me with this problem? This is the server code that I use:

package newpackage;
import java.util.logging.Level;
import java.util.logging.Logger;
import javax.speech.*;
import javax.speech.recognition.*;
import java.io.FileReader;
import java.util.Locale;
public class

SpeechRecognizer not Hearing After First Result

Question: I am using SpeechRecognizer and RecognizerIntent in Android to implement speech recognition. My aim is to restart listening after the recognizer displays the results on the screen. For that purpose, I am using the following code. The problem is that the first run works fine and displays the results, but when it starts listening for the second time (called from the onResults method), it does not hear what is being spoken for some reason. Then it gives an ERROR_SPEECH_TIMEOUT error,

x-webkit-speech, how to auto click the mic icon to record many words automatically?

I am working in the Chrome browser. The code <input x-webkit-speech> shows a mic icon; if you click the icon, you can input speech. My question is how to make it record and recognize voice automatically, without manually clicking the mic icon. For example, could it start recording when the page loads, or could code emulate a click event? Is it possible? Could you give sample code or point to documentation for the x-webkit-speech API? Thank you.

This should not be possible, because if it were it would be a privacy hazard: a page could start recording what is said in the vicinity without the user being aware of it

Can I use the Google speech recognition API in my desktop application

Question: I want to know whether I can use Google's speech recognition API in my desktop application. I have seen some examples in which I have to convert the speech to a file and send it to a URL. But that would be a cumbersome task, because in my application the user has to submit his voice continuously. So is there any alternative way to use the Google speech API? I am not keen on using Sphinx, because its accuracy is very low and I don't know how to add new words to the dictionary and

C# - Capture RTP Stream and send to speech recognition

Question: What I am trying to accomplish:

- Capture an RTP stream in C#
- Forward that stream to the System.Speech.Recognition.SpeechRecognitionEngine

I am creating a Linux-based robot which will take microphone input and send it to a Windows machine, which will process the audio using Microsoft Speech Recognition and send the response back to the robot. The robot might be hundreds of miles from the server, so I would like to do this over the Internet.

What I have done so far: Have the robot generate an RTP stream encoded in MP3

Non-Speech Noise or Sound Recognition Software?

Question: I'm working on some software for children, and I'm looking to add the ability for the software to respond to a number of non-speech sounds, for instance clapping, barking, whistling, fart noises, etc. I've used CMU Sphinx and the Windows Speech API in the past. However, as far as I can tell, neither of these has any support for non-speech noises, and in fact I believe they actively filter them out. In general I'm looking for "How do I get this functionality" but I suspect it may help if I break it

Question SpeechSynthesizer.SetOutputToAudioStream audio format problem

Question: I'm currently working on an application that requires transmitting speech encoded in a specific audio format.

System.Speech.AudioFormat.SpeechAudioFormatInfo synthFormat = new System.Speech.AudioFormat.SpeechAudioFormatInfo(System.Speech.AudioFormat.EncodingFormat.Pcm, 8000, 16, 1, 16000, 2, null);

This states that the audio is in PCM format, 8000 samples per second, 16 bits per sample, mono, 16000 average bytes per second, block alignment of 2. When I attempt to execute the following
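
The numbers quoted in this excerpt can be cross-checked with simple arithmetic. For uncompressed PCM, block alignment is channels × bits per sample ÷ 8, and the average byte rate is samples per second × block alignment. The sketch below is plain Java arithmetic, not the System.Speech API; the class and method names are hypothetical and exist only for illustration.

```java
// Hypothetical helper (not part of System.Speech): checks that a set of
// uncompressed PCM format parameters is internally consistent.
final class PcmFormatMath {

    // Bytes occupied by one sample frame (one sample for every channel).
    static int blockAlign(int bitsPerSample, int channels) {
        return channels * bitsPerSample / 8;
    }

    // Average data rate in bytes per second for uncompressed PCM.
    static int averageBytesPerSecond(int samplesPerSecond, int bitsPerSample, int channels) {
        return samplesPerSecond * blockAlign(bitsPerSample, channels);
    }

    public static void main(String[] args) {
        // Values taken from the excerpt: 8000 Hz, 16 bits, mono.
        int samplesPerSecond = 8000;
        int bitsPerSample = 16;
        int channels = 1;

        System.out.println("blockAlign = " + blockAlign(bitsPerSample, channels));
        System.out.println("averageBytesPerSecond = "
                + averageBytesPerSecond(samplesPerSecond, bitsPerSample, channels));
    }
}
```

With the excerpt's values this gives a block alignment of 2 and 16000 bytes per second, which matches the last two numeric arguments passed to the constructor.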

Speech to text in emulator: No Activity found to handle Intent

Question: How can I use speech-to-text code on my emulator? My code works on a real device but does not work on the emulator. The error says:

No Activity found to handle Intent { act=android.speech.action.RECOGNIZE_SPEECH (has extras) }

What can I do?

Answer 1:

package net.viralpatel.android.speechtotextdemo;
import java.util.ArrayList;
import android.app.Activity;
import android.content.ActivityNotFoundException;
import android.content.Intent;
import android.os.Bundle;
import android.speech

Continuous Speech Recognition on browser like “ok google” or “hey siri”

I am doing a POC, and my requirement is to implement a feature like "OK Google" or "Hey Siri" in the browser. I am using the Chrome browser's Web Speech API. I noticed that I can't keep recognition running continuously, because it terminates automatically after a certain period of time, and I understand this is due to security concerns. I tried a workaround: when SpeechRecognition terminates, I start it again in its end event. But this is not the best way to implement such a solution, because suppose I am using 2 instances of the same application on the

C# grammar and switch wildcard

I would like to make it so that whenever it recognizes 'search X', it searches for 'X', but I don't know how to add that to the grammar, or how to handle it in my switch statement.

private void Form1_Load(object sender, EventArgs e)
{
    Choices commands = new Choices();
    commands.Add(new string[] { "hello", "start chrome", "search" });
    GrammarBuilder gBuilder = new GrammarBuilder();
    gBuilder.Append(commands);
    gBuilder.Culture = new System.Globalization.CultureInfo("en-GB");
    Grammar grammar = new Grammar(gBuilder);
    recEngine.LoadGrammarAsync(grammar);
    recEngine
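
One possible way to handle the 'search X' case on the switch side is to treat "search" as a prefix and pass the rest of the recognized text along as the query, before falling through to the fixed commands. The sketch below shows only that dispatch logic, written in Java rather than the question's C#. It assumes the recognizer already delivers the whole phrase as a string, which the grammar in the excerpt would not do unless it also allowed free text after "search". All names in it are hypothetical.

```java
// Hypothetical dispatch sketch: maps recognized text to an action label.
// It does not use any speech API; the recognized phrase is assumed to
// arrive as a plain string.
final class CommandDispatch {

    private static final String SEARCH_PREFIX = "search ";

    static String dispatch(String recognizedText) {
        String text = recognizedText.trim().toLowerCase(java.util.Locale.ROOT);

        // "search X": everything after the prefix is treated as the query.
        if (text.startsWith(SEARCH_PREFIX)) {
            String query = text.substring(SEARCH_PREFIX.length()).trim();
            return "search:" + query;
        }

        // Fixed commands are matched exactly.
        switch (text) {
            case "hello":
                return "greet";
            case "start chrome":
                return "launch:chrome";
            default:
                return "unknown";
        }
    }

    public static void main(String[] args) {
        System.out.println(dispatch("search java speech api"));
        System.out.println(dispatch("start chrome"));
    }
}
```

A bare "search" with no query falls through to the default branch here; whether that should prompt the user instead is a design choice the excerpt does not settle.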