speech-recognition

How can I make Python wait until I finish speaking?

我怕爱的太早我们不能终老 submitted on 2019-12-14 00:03:11
Question: I am writing a program to recognise speech from a microphone and process it accordingly. The code I wrote for this purpose is below.

    import speech_recognition as sr
    import webbrowser
    import pyttsx
    from time import sleep

    engine = pyttsx.init()
    engine.setProperty('rate', 70)
    r = sr.Recognizer()

    def recognize(audio):
        try:
            return r.recognize(audio)
        except LookupError, e:
            print e
            return ''

    with sr.Microphone() as source:
        while True:
            engine.say("Hi How can i help you ?")
            sleep(0.15)
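
The usual way to "wait until the speaker has finished" is endpointing: keep consuming audio frames until speech has started and a long enough run of quiet frames follows it. As far as I know, the SpeechRecognition library's `Recognizer.listen` works on a similar principle through its `pause_threshold` setting. The sketch below is not that library's code. It is a minimal, library-free illustration, and the function name, the energy threshold, and the frame counts are all assumptions.

```python
def frames_until_end_of_speech(energies, threshold=300.0, pause_frames=8):
    """Return how many frames were consumed when the utterance is judged
    complete, or None if the stream ended before a long enough pause.

    energies     -- iterable of per-frame loudness values (hypothetical units)
    threshold    -- assumed loudness above which a frame counts as speech
    pause_frames -- assumed number of consecutive quiet frames that ends an utterance
    """
    started = False   # becomes True once the first loud frame is seen
    silent_run = 0    # consecutive quiet frames since the last loud frame
    for index, energy in enumerate(energies):
        if energy >= threshold:
            started = True
            silent_run = 0
        elif started:
            silent_run += 1
            if silent_run >= pause_frames:
                return index + 1
    return None
```

Leading silence is ignored because the quiet-frame counter only runs after speech has started, and a short pause in the middle of a sentence resets as soon as the speaker continues.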

“SpeechHypothesized event not raised”

こ雲淡風輕ζ submitted on 2019-12-13 21:17:28
Question: I need to detect the user's voice when they pick up the receiver on the other end, because modems usually start playing files (playback terminal) as soon as the first ring goes out. So I planned to use speech recognition: it waits, and when they say "hello" it starts playing the file. Even noise on the line could be enough for it to start speaking. I accomplished this with a few settings. I found a few common words that my engine detects when we speak, and the words that come up while it's ringing. It

Failure to record audio while using webkitSpeechRecognition on Android Chrome [closed]

社会主义新天地 submitted on 2019-12-13 21:07:57
Question: In Chrome I'm recording audio while running speech recognition. The recording uses AudioRecorder, which internally calls getUserMedia. The speech recognition uses webkitSpeechRecognition. On the desktop (Win7 + latest Chrome) it works perfectly. On Android (also latest Chrome) I get an "either or"

Speech Recognizer Sound

∥☆過路亽.° submitted on 2019-12-13 20:59:02
Question: I am using the Speech Recognizer Intent to take user input and translate it into text. However, I want the Intent to continuously take user input and translate it into text to see whether the user has said a certain word. My code is able to do that, but every time my app begins listening for input, the phone makes a short beep to signal that it is ready. I was wondering if there is any way to stop the sound from playing every time the recognizer is ready to listen again. Here is my code:

Google Glass and Speech Recognizer time delay recognizing results

↘锁芯ラ submitted on 2019-12-13 18:31:55
Question: I am working on a Glass application that will perform "Next Card" and "Previous Card" via speech. The application works just fine, except that the time from when the word is spoken to when the action is performed is just over 1 second. This delay is long enough to be noticeable, and the app does not respond as quickly as Google's own "ok glass". The most obvious change seems to be to implement: EXTRA_SPEECH_INPUT_MINIMUM_LENGTH_MILLIS and/or EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS

Watson speech to text live stream C# code example

烂漫一生 submitted on 2019-12-13 17:15:50
Question: I'm trying to build an app in C# that will take an audio stream (from a file for now, but later it will be a web stream) and return transcriptions from Watson in real time as they become available, similar to the demo at https://speech-to-text-demo.mybluemix.net/. Does anyone know where I can find some sample code, preferably in C#, that could help me get started? I tried this, based on the limited documentation at https://github.com/watson-developer-cloud/dotnet-standard-sdk/tree/development

Problem recognizing numbers in System.Speech?

老子叫甜甜 submitted on 2019-12-13 15:43:13
Question: I am using System.Speech, and I added a grammar which only detects numbers, like this:

    SpeechRecognitionEngine RecognitionEngine = new SpeechRecognitionEngine(new CultureInfo("en-US", true));

    // Created Grammar For only Numbers 1 to 10
    var NumberChoice = new Choices();
    for (var i = 0; i <= 10; i++)
    {
        NumberChoice.Add(i.ToString());
    }
    var NumGrammarBuilder = new GrammarBuilder(NumberChoice);
    var NumGrammar = new Grammar(NumGrammarBuilder);
    RecognitionEngine.LoadGrammar(NumGrammar);
    RecognitionEngine

Voice recognizer or speech recognition that keeps listening to user's “input”

こ雲淡風輕ζ submitted on 2019-12-13 12:51:22
Question: I recently jumped into iOS app development and am conceptualizing an idea. Does anyone know of a service or class for voice or speech recognition that keeps listening for the user's "input" in the background while you are on the app screen? ("Input" here means speech, for example words like "Go" or "Wait".) From my research, I know that NSSpeechSynthesizer and NSSpeechRecognizer are available on the Mac, but not on iOS. Any pointers are highly appreciated.

How to detect homophones

♀尐吖头ヾ submitted on 2019-12-13 12:42:38
Question: I am fairly new to speech processing and am wondering how homophones are detected. I am looking for an API that gives the similarity between two words based on how they are pronounced. For example, "to" and "two" sound highly similar, compared with, say, "to" and "from".
Answer 1: You might want to try calculating the edit distance not on the original strings but on their pronunciations, such as those available in the CMU Pronouncing Dictionary at http://www.speech.cs.cmu
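
The idea in the answer, edit distance over pronunciations instead of spellings, can be sketched as follows. This is a minimal illustration, not a ready-made API: the tiny `PRONUNCIATIONS` table is hand-written in ARPAbet-style symbols with stress markers omitted, and it stands in for a real pronouncing dictionary. The function names are hypothetical as well.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or phoneme lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


# Assumption: illustrative, hand-written entries. A real application would
# load pronunciations from a pronouncing dictionary instead.
PRONUNCIATIONS = {
    "to": ["T", "UW"],
    "two": ["T", "UW"],
    "from": ["F", "R", "AH", "M"],
}


def pronunciation_distance(word1, word2, lexicon=PRONUNCIATIONS):
    """Edit distance between the phoneme sequences of two words."""
    return edit_distance(lexicon[word1.lower()], lexicon[word2.lower()])


def are_homophones(word1, word2, lexicon=PRONUNCIATIONS):
    """Treat two words as homophones when their phoneme sequences are identical."""
    return pronunciation_distance(word1, word2, lexicon) == 0
```

With these sample entries, "to" and "two" come out at distance 0, while "to" and "from" share no phonemes and are far apart. Words with several listed pronunciations would need the minimum taken over all pairs, which this sketch leaves out.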

Voice Detection in C#

落花浮王杯 submitted on 2019-12-13 12:02:57
Question: I'm looking for a simple C# real-time voice detection library. The input should be an audio stream, and the output should be "human voice" or "not a human voice". I have no knowledge of speech recognition or signal processing, and I'd appreciate any kind of assistance.
Answer 1: Take a look at the answer to "Detecting audio silence in WAV files using C#". I am assuming the input is a WAV file. If not, please provide the format of the audio stream, or if you are intending on taking input from the
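
Silence detection of the kind the answer points to is commonly done by comparing per-frame energy against a threshold. The sketch below shows that idea in Python rather than C#, under stated assumptions: the samples are signed 16-bit PCM integers, and the frame size, threshold, and function names are hypothetical. It only separates loud frames from quiet ones. It does not tell a human voice apart from other sounds, which needs a proper voice activity detector.

```python
import math


def frame_rms(frame):
    """Root-mean-square amplitude of one frame of PCM samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def label_frames(samples, frame_size=160, threshold=500.0):
    """Split samples into fixed-size frames and flag each one.

    Returns a list of booleans: True where the frame's RMS reaches the
    (assumed) threshold, False where the frame counts as silence.
    """
    labels = []
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        labels.append(frame_rms(frame) >= threshold)
    return labels
```

At an assumed 8 kHz sample rate, 160 samples is a 20 ms frame. The threshold would normally be calibrated against the background noise of the actual recording setup.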