speech-recognition | 易学教程

How can i make the python to wait till i complete speaking?

阅读更多关于 How can i make the python to wait till i complete speaking?

问题 I am writing a program to recognise the speech from a microphone and the code will process accordingly. The code I wrote for this purpose is below. import speech_recognition as sr import webbrowser import pyttsx from time import sleep engine = pyttsx.init() engine.setProperty('rate', 70) r = sr.Recognizer() def recognize(audio): try: return r.recognize(audio) except LookupError, e: print e return '' with sr.Microphone() as source: while True: engine.say("Hi How can i help you ?") sleep(0.15)

“SpeechHypothesized event not raised”

阅读更多关于 “SpeechHypothesized event not raised”

问题 I need to detect the user voice when they pick-up the reciever on the other end. Because Modems usually start playing files (playback terminal) when the first ring goes there. So I planned to use speech recognition when they say "hello", it can start playing the file until wait for playing file. Or even any noise interference it can start speak. I accomplished this with few settings. I found few common words that my engine detects when we speak and the words that comes when it's ringing. It

Failure to record audio while using webkitSpeechRecognition on Android Chrome [closed]

阅读更多关于 Failure to record audio while using webkitSpeechRecognition on Android Chrome [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 5 years ago . In Chrome I'm recording audio while running speech recognition. The recording is using AudioRecorder which internally calls getUserMedia . The speech recognition is using webkitSpeechRecognition . On the desktop (Win7 + latest Chrome) it works perfectly. On Android (also latest Chrome) I get an "either or"

Speech Recognizer Sound

阅读更多关于 Speech Recognizer Sound

问题 I am using the Speech Recognizer Intent to take User Input and translate it into text. However I want the Intent to continuously take user input and translate it into text to see if the user and said a certain word. My Code is able to do that, but every time my app begins listening for input, the phone makes a short beep sound that it is ready for input. I was wondering if there is any way to delete the sound from playing every time the recognizer is ready to listen again. Here is my code:

Google Glass and Speech Recognizer time delay recognizing results

阅读更多关于 Google Glass and Speech Recognizer time delay recognizing results

问题 I am working on a Glass application that will perform "Next Card" and "Previous Card" via speech. The application works just fine, except the time from when the word is spoken to when the action is performed is just over 1 second. This is a long enough delay that it is noticeable. This does not respond as quickly as Google has it with "ok glass". The most obvious change seems to be to implement: EXTRA_SPEECH_INPUT_MINIMUM_LENGTH_MILLIS and/or EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS

Watson speech to text live stream C# code example

阅读更多关于 Watson speech to text live stream C# code example

问题 I'm trying to build an app in C# that will take an audio stream (from a file for now, but later it will be a web stream) and return transcriptions from Watson in real time as they become available, similar to the demo at https://speech-to-text-demo.mybluemix.net/ Does anyone know where I can find some sample code, preferably in C#, that could help me get started? I tried this, based on the limited documentation at https://github.com/watson-developer-cloud/dotnet-standard-sdk/tree/development

problem in recognition of numbers in system.speech?

阅读更多关于 problem in recognition of numbers in system.speech?

问题 speech and i added grmmar which only detects numbers like this SpeechRecognitionEngine RecognitionEngine = new SpeechRecognitionEngine(new CultureInfo("en-US", true)); // Created Grammar For only Numbers 1 to 10 var NumberChoice = new Choices(); for (var i = 0; i <= 10; i++) { NumberChoice.Add(i.ToString()); } var NumGrammarBuilder = new GrammarBuilder(NumberChoice); var NumGrammar = new Grammar(NumGrammarBuilder); RecognitionEngine.LoadGrammar(NumGrammar); RecognitionEngine

Voice recognizer or speech recognition that keeps listening to user's “input”

阅读更多关于 Voice recognizer or speech recognition that keeps listening to user's “input”

问题 Recently in past, I have jumped into iOS app development. I am onto conceptualizing an idea. Does anyone knows about some service/class for voice recognizer or speech recognition that keeps listening to user's " input " in the background while you are on your App screen. " input " in the sense of some speech/voice, Some words like "Go", "Wait") As per my homework, I know there's NSSpeechSynthesizer and NSSpeechRecognizer available on mac, but not for ios. Any pointers are highly appreciated.

How to detect homophone

阅读更多关于 How to detect homophone

问题 I am fairly new to speech processing, but wondering how homophones are detected. I am in search for an API which gives similarity between two words on the basis of how they are pronounced. for example: "to" and "two" are highly similar in terms of how they sound with respect to say "to" and "from". 回答1: You might want to try calculating the edit distance not on the original strings, but on pronunciations, like they are available in the CMU Pronouncing Dictionary at http://www.speech.cs.cmu

Voice Detection in C#

阅读更多关于 Voice Detection in C#

问题 I'm looking for a simple C# real-time voice detection library. The input should be an audio stream, and the output should be "human voice" or "not a human voice". I have no knowledge in speech recognition or signal processing, and I'll appreciate any kind of assistance. 回答1: Take a look at the answer for "Detecting audio silence in WAV files using C#". I am assuming the input is a WAV file. If not please provide the format of the audio stream, or if you are intending on taking input from the