speech-recognition

Installing Pocketsphinx on Android

若如初见. 提交于 2019-11-29 15:40:34
问题 I have some questions on installing Pocketsphinx on and Android phone, and I haven't been able to find answers on CMUSphinx's support forum, IRC, existing StackOverflow posts, or intensive Googling. If anyone here has a minute and would be willing to help, I'd greatly appreciate it. First off, I'm confused as to whether I can even install Pocketsphinx on Android. I believe I need Java bins in order to run on Android, so I attempted to install Sphinx 4. To test the program, I made an empty

free speech recognition engines for iOS?

喜你入骨 提交于 2019-11-29 15:37:10
问题 I am looking for some free speech recognition engines to use in my iphone application... can you suggest any? 回答1: There are a couple of wrappers of the Sphinx speech recognition engine (http://cmusphinx.sourceforge.net/) available for iOS https://github.com/KingOfBrian/VocalKit Through reading his summary you can see he is actually pointing people towards http://www.politepix.com/openears 回答2: Nuance just opened the doors for developers to the Dragon Mobile SDK (they are industry-leaders).

API or SDK to make speech recognition only for numbers (between 1 and 10000)?

你。 提交于 2019-11-29 15:33:46
I need a specialized solution optimized to detect numbers between 1 and 1000 to be used on a smartphone. Best solution would be to have this SDK working offline. Any idea ? I do not find any configuration with Google Speech or Amazon Transcribe to allow "number only" It is not quite right to strictly expect numbers from people, they usually say many things like "i don't know" or "wait a bit" even if you ask them for numbers. You will harm the experience significantly. You have to analyze the recognition result intelligently and even if non-number is recognized you have to act accordingly. To

how to detect a presence of word /audio silence in the wav file using java?

淺唱寂寞╮ 提交于 2019-11-29 15:11:21
I am working on a speech recognizer project as a part of it want from a wav file want to find the presence of silence or detect the presence of word .and if a word is found then copy that word from start to end into a new wav file so it original wav file has 10 words then output is 10 file..problem is with detecting the silence or word want suggestion on how to implement this in java.. please suggest.. Well, wav is just PCM data. I'd start by reading this: http://en.wikipedia.org/wiki/Pulse-code_modulation I've done this before... You start be pulling samples out of the PCM data. You then

Speech Recognition API without Grammar C#

一个人想着一个人 提交于 2019-11-29 15:01:03
I am developing a program where 99% of the time i can use a set Grammar in the standard Microsoft Speech Recognition to detect words being said. However in a few circumstances, i need the user to say something. This is never going to be predictable. So is there a way to do this with the MS speech recognition? And are there any other speech recognition API's out there (for free), that can handle non-preset words? There is a class called DictationGrammar that allows you to recognize dictated speech. It also supports a "dictation context" - so you can specify that the dictated text should start

Speech To Text Recognizer not found

好久不见. 提交于 2019-11-29 14:58:22
I'm building an app that record the user's voice and then converts it to text. From this post Stackoverflow Post I've found the source code android developers , I've run the code but the button text reads "RECOGNIZER NOT PRESENT". I think I must be missing some permissions or something becouse earlier I created a Text to Speech converter and it was working fine. Any ideas what might be the problem? Michael Levy I am not sure why the recognizer functionality is not available on all devices. Make sure you install and update the latest Google Voice Search for Android. I believe it installs the

Custom grammar for Android speech recognition

余生长醉 提交于 2019-11-29 14:40:44
I am trying to apply the custom grammar for speech recognition. I have found a lot of documentation about android.speech.recognition package, it seams as the good place to start, but it seams that it doesn't exist in the Android SDK.. Is there a way to apply a custom grammar rules to the android speech recognition at all..? No, Android's recognizer intent only supports two language models. These are the "Free Form" model and the "web search" model. It does not support custom grammars. See http://developer.android.com/reference/android/speech/RecognizerIntent.html#EXTRA_LANGUAGE_MODEL You may

Offline voice recognition android taking unwanted voice

别来无恙 提交于 2019-11-29 14:16:46
I have did a lot for research and tried offline pocket sphinx but it is taking surrounding voice it is taking and reacting my app differently. Is there any Google offline app for above kitkat 4.4.. i am trying it from 2 week. Thank for your valuable answer. Details: When the activity starts first text has to read the content (Text to speech) after it complete, reading voice recognition has to take voice and as per command ex: (next, previous, forward, option, 1, 2, 3, 4 etc). As per command it has to recognize and react to it in onresult method. Error: i am getting error after taking some

Can I use OS X 10.8's speech recognition/dictation without a GUI?

南笙酒味 提交于 2019-11-29 14:08:14
问题 Ideally I would like to use 10.8's/Mountain Lion's dictation feature from Python, without a GUI. As far as I can tell dictation is currently only supported by using the UITextInput protocol, which implies a GUI (correct?). 回答1: You can create a dummy text field and programmatically start dictating by emulating the keypress (tried out successfully here) 来源: https://stackoverflow.com/questions/11851713/can-i-use-os-x-10-8s-speech-recognition-dictation-without-a-gui

Speech to text conversion for non-english language

主宰稳场 提交于 2019-11-29 12:47:30
I am trying to implement naive speech to text conversion for non-english language. where user speaks in other language and text is also in the same language .Is that possible ? If yes can anyone help with the idea of how to implement it ? Any Help would be greatly appreciated.Thanks in advance. Nikolay Shmyrev It is unlikely any commercial speech recognition solution will support Sanskrit, so the only choice you have is to add support for Sanskrit into open source engine like CMUSphinx. This is pretty straightforward, you actually just need to follow the documentation and you can get to the