speech-recognition

Implementing Nuance Speech Recognition on Swift, cannot listen to onResult, onError… events

拟墨画扇 submitted on 2019-12-18 07:18:10
Question: I have two parts of my speech recognition project with Nuance: the .h file of a module (Objective-C) and a ViewController (Swift). I want to set up a SpeechRecognition object in my Swift view controller and listen to onBegin, onStop... and similar methods. The only way to make it compile is to pass nil as the delegate parameter when initializing the SpeechRecon object. Obviously this is not good, because my onStart... and onFinish functions don't trigger. I have implemented a protocol to the SKRecogniser

Recognizing multiple peoples voices

僤鯓⒐⒋嵵緔 submitted on 2019-12-18 05:21:34
Question: I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might be able to find something like this?
Answer 1: You can consider:
Bob SPEAR: https://pypi.python.org/pypi/bob.bio.spear
Alize/Mistral: http://mistral.univ-avignon.fr/index_en.html
GMM speaker identification in MATLAB: https://github.com/codyaray/speaker-recognition
Very basic speaker recognition in Java, not really accurate: https://github
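To make the difference between recognizing words and recognizing speakers concrete, here is a minimal sketch of the scoring idea behind GMM-style speaker identification. It is a deliberate simplification: each speaker is modeled with a single diagonal Gaussian rather than a full mixture, and the 2-D "feature vectors", the speaker names, and all function names are made up for illustration. None of this is taken from the libraries linked above; a real system would extract features such as MFCCs from audio first.

```python
import math
from statistics import mean, pvariance


def fit_speaker(frames):
    """Fit one diagonal Gaussian (per-dimension mean and variance) to a speaker's frames."""
    dims = list(zip(*frames))
    means = [mean(d) for d in dims]
    # Floor the variance so a constant feature cannot cause division by zero.
    variances = [max(pvariance(d), 1e-6) for d in dims]
    return means, variances


def log_likelihood(frames, model):
    """Sum of per-frame log densities under a diagonal Gaussian model."""
    means, variances = model
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, means, variances):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total


def identify(frames, models):
    """Return the enrolled speaker whose model gives the frames the highest score."""
    return max(models, key=lambda name: log_likelihood(frames, models[name]))


# Hypothetical enrollment data: a few made-up feature vectors per speaker.
enrollment = {
    "alice": [(1.0, 2.0), (1.2, 1.9), (0.9, 2.1), (1.1, 2.2)],
    "bob": [(4.0, 0.5), (4.2, 0.4), (3.9, 0.6), (4.1, 0.7)],
}
models = {name: fit_speaker(frames) for name, frames in enrollment.items()}
```

Calling `identify(new_frames, models)` with frames from an unknown utterance returns the name of the best-scoring enrolled speaker. Note that this sketch always picks someone; rejecting unknown speakers would need an additional score threshold.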

Recognizing multiple peoples voices

我只是一个虾纸丫 submitted on 2019-12-18 05:21:01
Question: I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might be able to find something like this?
Answer 1: You can consider:
Bob SPEAR: https://pypi.python.org/pypi/bob.bio.spear
Alize/Mistral: http://mistral.univ-avignon.fr/index_en.html
GMM speaker identification in MATLAB: https://github.com/codyaray/speaker-recognition
Very basic speaker recognition in Java, not really accurate: https://github

Google-speech-api transcribing spoken numbers incorrectly

守給你的承諾、 submitted on 2019-12-18 05:08:48
Question: I started using the Google Speech API to transcribe audio. The audio being transcribed contains many numbers spoken one after the other, e.g. 273 298, but the transcription comes back as 270-3298. My guess is that it is interpreting it as some sort of phone number. What I want is unparsed output, e.g. "two seventy three two ninety eight", which I can deal with and parse on my own. Is there a setting or support for this kind of thing? Thanks.
Answer 1: So I had this exact same problem and I think we found a
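The "parse on my own" step from the question could look roughly like the sketch below. It assumes a spelled-out transcript is already available and does not show how to obtain one from the API. It also assumes that a tens word followed by a non-zero unit belongs together ("seventy three" is 73, not 70 then 3) and that the numbers are known to come in groups of three digits. The function names are hypothetical.

```python
UNITS = {"zero": 0, "one": 1, "two": 2, "three": 3, "four": 4,
         "five": 5, "six": 6, "seven": 7, "eight": 8, "nine": 9}
TEENS = {"ten": 10, "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
         "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18, "nineteen": 19}
TENS = {"twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
        "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90}


def words_to_digits(transcript):
    """Turn spelled-out number words into one digit string, left to right."""
    tokens = transcript.lower().replace("-", " ").split()
    digits = []
    i = 0
    while i < len(tokens):
        word = tokens[i]
        if word in UNITS:
            digits.append(str(UNITS[word]))
            i += 1
        elif word in TEENS:
            digits.append(str(TEENS[word]))
            i += 1
        elif word in TENS:
            nxt = tokens[i + 1] if i + 1 < len(tokens) else None
            # Assumption: "seventy three" is read as 73, not as 70 followed by 3.
            if nxt in UNITS and UNITS[nxt] != 0:
                digits.append(str(TENS[word] + UNITS[nxt]))
                i += 2
            else:
                digits.append(str(TENS[word]))
                i += 1
        else:
            raise ValueError(f"not a number word: {word!r}")
    return "".join(digits)


def group_digits(digits, size=3):
    """Split a digit string into fixed-size groups (the group size is an assumption)."""
    return [digits[i:i + size] for i in range(0, len(digits), size)]
```

For the example in the question, `group_digits(words_to_digits("two seventy three two ninety eight"))` gives the two groups `273` and `298`. Spoken numbers are inherently ambiguous, so the grouping rule has to come from knowledge of the data rather than from the transcript itself.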

Launch app on voice command (android)

落爺英雄遲暮 submitted on 2019-12-18 05:05:10
Question: I need an example of how I could launch my app on a voice command (trigger word). So some sort of service would run in the background, listening to everything, and if a word matches a set textual value (I guess this can be done through voice recognition), the app will open. I know this is possible, but I have no clue where to start... I see other apps are able to do this. I have close to 1 million users and this is one of the most often requested features.
Answer 1: To do this you have to run
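The matching step described in the question (compare recognized text against a set trigger value) can be sketched independently of the platform. The snippet below is only that comparison, written in Python for brevity. It does not show the Android background service or the recognizer, and the function names and example phrases are hypothetical.

```python
import re


def normalize(text):
    """Lowercase, drop punctuation, and split into words."""
    return re.sub(r"[^a-z0-9\s]", "", text.lower()).split()


def contains_trigger(transcript, trigger_phrase):
    """Return True if the trigger phrase occurs as whole consecutive words."""
    words = normalize(transcript)
    trigger = normalize(trigger_phrase)
    if not trigger:
        return False
    n = len(trigger)
    # Slide a window of the trigger's length over the recognized words.
    return any(words[i:i + n] == trigger for i in range(len(words) - n + 1))
```

Matching on whole words rather than substrings avoids false triggers such as "reopen" matching "open". A listening loop would call `contains_trigger` on each recognized utterance and launch the app when it returns `True`.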

Launch app on voice command (android)

女生的网名这么多〃 submitted on 2019-12-18 05:05:05
Question: I need an example of how I could launch my app on a voice command (trigger word). So some sort of service would run in the background, listening to everything, and if a word matches a set textual value (I guess this can be done through voice recognition), the app will open. I know this is possible, but I have no clue where to start... I see other apps are able to do this. I have close to 1 million users and this is one of the most often requested features.
Answer 1: To do this you have to run

AVAudioEngine inputNode installTap crash when restarting recording

空扰寡人 submitted on 2019-12-18 04:40:07
Question: I am implementing speech recognition in my app. When I first present the view controller with the speech recognition logic, everything works fine. However, when I try to present the view controller again, I get the following crash: ERROR: [0x190bf000] >avae> AVAudioNode.mm:568: CreateRecordingTap: required condition is false: IsFormatSampleRateAndChannelCountValid(format) *** Terminating app due to uncaught exception 'com.apple.coreaudio.avfaudio', reason: 'required condition is false:

example of AlwaysOnHotwordDetector in Android

我只是一个虾纸丫 submitted on 2019-12-17 21:46:07
Question: Can someone provide an example of how to use the new AlwaysOnHotwordDetector class in Android? I'd like to build an app that, when running in the background, can detect a hotword like "next", "back", or "pause".
Answer 1: Unless I have a huge blind spot, I don't think third-party applications can make use of this API. It's strange that AlwaysOnHotwordDetector (and related classes such as VoiceInteractionService) have been granted public access. If you are building a privileged app, look

System.Speech in Mono on Linux

时间秒杀一切 submitted on 2019-12-17 21:10:22
Question: I'm working on a project in Linux (Kubuntu) using Mono and MonoDevelop. I want to use the System.Speech library, which is completely possible with MonoDevelop in Unity on Windows 7. I've been doing a lot of looking online over the past few hours, and as far as I can tell System.Speech WAS added to Mono. I've updated all of Mono: mono --version shows 4.0.2 (the latest version), and MonoDevelop shows version 5.9.4 (as far as I can tell, that is also the most recent version). This is making

Using System.Speech.Recognition opens Windows Speech Recognition

点点圈 submitted on 2019-12-17 21:08:56
Question: I tried implementing a simple speech recognition WinForms program in C#, like the one described here in Michael Levy's answer: good Speech recognition API. The problem I have is that any time I run the program, Windows Speech Recognition opens and also acts on what I am saying. Also, when the program starts, I have to say "start listening" for speech recognition to work. My question is: How can I use speech recognition without having Windows Speech Recognition also act on what I