voice-recognition | 易学教程

Android offline voice recognition

Question: I already use HTK (Hidden Markov Model Toolkit) to recognize specific commands that control my Android application, but this requires passing voice data to a server, which takes extra time. To avoid this latency, I am thinking about using pocketsphinx to recognize the voice data locally in the Android application, so that I won't need to send the audio to the server. If this is a good idea, is it easy to learn pocketsphinx from scratch? Also, what are advantages

Python app listening for a keyword. (Like Cortana)

Question: I'm trying to make a Python app that behaves like Alexa, Cortana, or Google's "Ok, Google". I want it to listen constantly for a specific keyword. After it hears the keyword, I want it to execute a function. How can I do this?

Answer 1: Take a look at Speech Recognition. This is a library that allows speech recognition, including through the Google Cloud Speech API. Regarding the second part of your question, this seems relevant: How can I detect one word with speech recognition in Python. Once you can listen

Android Voice Recognition Commands

Question: Goal: Voice recognition starts, a voice command is spoken, and the correct action is performed. ("Play Some Music" starts the music player, or whatever is supposed to happen.) Current situation: I have a test application running which starts the Android voice recognition, successfully listens, and returns a result to my Activity. Snippet to start voice recognition: Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, RecognizerIntent
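
One way to get from the returned result to "the correct action" is a phrase lookup over the recognizer's candidate strings. The following is only a minimal sketch in plain Java, assuming the candidates arrive best-first as a list of strings (as they do under RecognizerIntent.EXTRA_RESULTS); the class name CommandMatcher and the action name START_MUSIC_PLAYER are hypothetical and not part of the original question.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical helper: maps recognizer candidates onto named actions.
final class CommandMatcher {
    private final Map<String, String> actionsByPhrase = new LinkedHashMap<>();

    // Register a spoken phrase and the (illustrative) action name it triggers.
    CommandMatcher register(String phrase, String action) {
        actionsByPhrase.put(normalize(phrase), action);
        return this;
    }

    // Returns the action for the first candidate that exactly matches a
    // registered phrase; candidates are assumed to be ordered best-first.
    Optional<String> match(List<String> candidates) {
        for (String candidate : candidates) {
            String action = actionsByPhrase.get(normalize(candidate));
            if (action != null) {
                return Optional.of(action);
            }
        }
        return Optional.empty();
    }

    // Lower-cases and collapses whitespace so "Play  Some Music" == "play some music".
    private static String normalize(String text) {
        return text.trim().toLowerCase(Locale.ROOT).replaceAll("\\s+", " ");
    }
}
```

In an Activity, the list passed to match() would come from the result Intent, and the returned action name would be dispatched to whatever starts the music player. Exact matching is deliberately strict; fuzzy matching would be a separate design decision.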

HTML5 speech input and Google Translate text-to-speech, problem in Chrome

Question: I'm creating a voice/text-memo web application here: http://gustavstromberg.se/sandbox/html5/localstorage/ (look at its source; it is very short, and most of it is CSS). It uses: voice recognition, which works only in Chrome as far as I know; Local Storage, to store notes as text; and Google Translate text-to-speech. Everything works, but in different browsers. The voice input works perfectly, but only in Chrome. The text-to-speech works in Safari. To dynamically load the memo into the audio>source element I use: $("

iOS speech to text conversion in number format

Question: Currently I'm using the default iOS speech-to-text conversion without adding any code for it. When the user says 'five', it is displayed as either 'five' or '5'. But I need it always to be converted to '5'. Is there anything I can do with SFSpeechRecognizer, or any other way, to achieve this?

Answer 1: This can get you started, but it cannot handle mixed strings that contain both a number and a non-number. Ideally, you would need to process each word as it comes through, but then that has potential effects

what to do for voice application for ios?

Question: How do I make a Vlingo-like application? Is there any API that can be used for making such apps for iOS? Please provide some guidelines or tutorials; any help or comment will be appreciated. Thanks in advance. I would also like to know how Talking Ben the Dog and Talking Tom Cat work: when we talk, they repeat it in a funny voice. How is that possible?

Answer 1: For speech recognition on iOS, there have been many similar questions. Please see Speech to text Conversion.? or Text-to-speech (voice generation)

It's possible to make SpeechRecognizer faster?

Question: I'm developing an application that uses the Android SpeechRecognizer for something simple: I click a button, my SpeechRecognizer starts listening, and I get some results from what I said. Easy, right? Well, my problem is that I need to make SpeechRecognizer fast. I mean, I click my button, I say "Hello", and SpeechRecognizer takes about 3-4 seconds to return an array with the possible results. My question is: Is it possible to make SpeechRecognizer return results more

Capturing audio sent to Google's speech recognition server

Question: To recognize speech with the Google server, I use the SpeechRecognizer class in combination with RecognitionListener, as suggested in Stephan's answer to this question. In addition, I try to capture the audio signal being recognized by using the onBufferReceived() callback from RecognitionListener, like: byte[] sig = new byte[500000] ; int sigPos = 0 ; ... public void onBufferReceived(byte[] buffer) { System.arraycopy(buffer, 0, sig, sigPos, buffer.length) ; sigPos += buffer.length ; } ... This seems
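
Independently of whatever the truncated question goes on to report, the fixed 500000-byte array in the snippet would make System.arraycopy throw once more audio arrives than it can hold. A possible sketch of the same accumulation with a growing buffer follows; it is plain Java, the class name AudioCapture is hypothetical, and it assumes each callback hands over one complete chunk. Note also that the Android documentation gives no guarantee that onBufferReceived() is called at all.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical stand-in for the accumulation done inside onBufferReceived().
final class AudioCapture {
    // Grows as needed, unlike a fixed-size byte[] with a manual write position.
    private final ByteArrayOutputStream captured = new ByteArrayOutputStream();

    // Intended to be called from RecognitionListener.onBufferReceived(byte[] buffer).
    void append(byte[] buffer) {
        if (buffer != null) {
            captured.write(buffer, 0, buffer.length);
        }
    }

    // Number of bytes captured so far.
    int size() {
        return captured.size();
    }

    // Copy of everything captured so far, in arrival order.
    byte[] toByteArray() {
        return captured.toByteArray();
    }
}
```

With this in place, the callback body reduces to a single append(buffer) call, and the captured signal can be read back with toByteArray() once recognition ends.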

PocketSphinx android demo runtime exception

Question: I downloaded the source code of the pocketsphinx demo. I'm trying to run it, but it throws a runtime exception. I've posted the logcat output below.
09-09 11:45:38.980: I/System.out(7912): Sending WAIT chunk
09-09 11:45:38.980: W/ActivityThread(7912): Application edu.cmu.pocketsphinx.demo is waiting for the debugger on port 8100...
09-09 11:45:39.030: I/dalvikvm(7912): Debugger is active
09-09 11:45:39.210: I/System.out(7912): Debugger has connected
09-09 11:45:39.210: I/System.out(7912):

Cant start service? (Speech recog)

Question: I want to listen continuously for the word "hello" using pocketsphinx in a service, but I get an error. Here is the full stack trace. Here is a small portion of it: Unable to create service curlybrace.ruchir.myApp.MyService: java.lang.RuntimeException: new_Decoder returned -1 It is caused by this: setupRecognizer(assetDir); //SETUP and this: .getRecognizer(); In my onCreate: Log.v(TAG, "Voice recognition activated!"); //Register voice recog listener :) Assets assets = null; try { assets = new