speech-recognition | 易学教程

9Implementing Nuance Speech Recognition on Swift, cannot listen to onResult, onError… events

阅读更多关于 9Implementing Nuance Speech Recognition on Swift, cannot listen to onResult, onError… events

问题 I have two parts of my Speech Recon project with Nuance, the .h file of a module (ObjectiveC) and a ViewController (swift). I want to set up a SpeechRecognition object in my swift viewController , and listen to onBegin, onStop... and such methods. The only way to make it compile is to use nil as the delegate parameter to initialize the SpeechRecon object. Obviously this is not good because my onStart... and onFinish functions don´t trigger. I have implemented a protocol to the SKRecogniser

Recognizing multiple peoples voices

阅读更多关于 Recognizing multiple peoples voices

问题 I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might be able to find something like this? 回答1: You can consider Bob SPEAR https://pypi.python.org/pypi/bob.bio.spear Alize/Mistral http://mistral.univ-avignon.fr/index_en.html GMM speaker identification in matlab https://github.com/codyaray/speaker-recognition Very basic speaker recognition in Java, not really accurate https://github

Recognizing multiple peoples voices

阅读更多关于 Recognizing multiple peoples voices

Google-speech-api transcribing spoken numbers incorrectly

阅读更多关于 Google-speech-api transcribing spoken numbers incorrectly

问题 I started using google speech api to transcribe audio. The audio being transcribed contains many numbers spoken one after the other. E.g. 273 298 But the transcription comes back 270-3298 My guess is that it is interpreting it as some sort of phone number. What i want is unparsed output e.g. "two seventy three two ninety eight' which i can deal with and parse on my own. Is there a setting or support for this kind of thing? thanks 回答1: So I had this exact same problem and I think we found a

Launch app on voice command (android)

阅读更多关于 Launch app on voice command (android)

问题 I need an example of how I could launch my app on a voice command (trigger word). So some sort of a service running in the background listening to everything and if the word matches a set textual value (I guess this can be done through Voice Recognition), app will open. I know this is possible, but I've no clue where to start... I see other apps are able to establish this. I've close to 1 million users and this is one of the most often requested features. 回答1: To do this you have to run

Launch app on voice command (android)

阅读更多关于 Launch app on voice command (android)

AVAudioEngine inputNode installTap crash when restarting recording

阅读更多关于 AVAudioEngine inputNode installTap crash when restarting recording

问题 I am implementing Speech Recognition in my app. When I first present the view controller with the speech recognition logic, everything works fine. However, when I try present the view controller again, I get the following crash: ERROR: [0x190bf000] >avae> AVAudioNode.mm:568: CreateRecordingTap: required condition is false: IsFormatSampleRateAndChannelCountValid(format) *** Terminating app due to uncaught exception 'com.apple.coreaudio.avfaudio', reason: 'required condition is false:

example of AlwaysOnHotwordDetector in Android

阅读更多关于 example of AlwaysOnHotwordDetector in Android

问题 Can someone provide an example of how to use the new AlwaysOnHotwordDetector class in Android? I'd like to build an app, that when the app is running in the background, can detect a hotword like "next", or "back", or "pause". 回答1: Unless I have a huge blind spot, I don't think third-party applications can make use of this API. Its strange that AlwaysOnHotwordDetector (and related classes VoiceInteractionService etc.) have been granted public access. If you are building a privileged app , look

System.Speech in Mono on Linux

阅读更多关于 System.Speech in Mono on Linux

问题 I'm working on a project in Linux (KUbuntu) using Mono and Monodevelop. I want to use the System.Speech library, which is completely possible with Monodevelop in Unity on Windows 7. I've been doing a lot of looking online over the past few hours and as far as I can tell System.Speech WAS added to Mono. I've updated all of mono, mono --version is showing 4.0.2 (latest version), and Monodevelop version is showing 5.9.4 (as far as I can tell that also is the most updated version). This is making

Using System.Speech.Recognition opens Windows Speech Recognition

阅读更多关于 Using System.Speech.Recognition opens Windows Speech Recognition

问题 I tried implementing some simple speech recognition WinForms program in C# like the one described here in Michael Levy answer: good Speech recognition API The problem i have is that any time i run the program Windows Speech Recognition opens and is also doing stuff based on what i am saying. Also when the program starts i have to say "start listening" for speech recognition to work. My question is: How can i use speech recognition without having Windows Speech Recognition also act on what i