speech-recognition

Does the iOS Speech API support grammar?

孤街醉人 提交于 2020-01-01 15:04:15
问题 I was investigating various Speech Recognition strategies and I liked the idea of grammars as defined in the Web Speech spec. It seems that if you can tell the speech recognition service that you expect “Yes” or “No”, the service could more reliably recognize a “Yes” as “Yes”, “No” as `No”, and hopefully also be able to say “it didn’t sound like either of those!”. However, in SFSpeechRecognitionRequest, I only see taskHint with values from SFSpeechRecognitionTaskHint of confirmation ,

1D multiple peak detection?

孤街醉人 提交于 2020-01-01 05:42:07
问题 I am currently trying to implement basic speech recognition in AS3. I need this to be completely client side, as such I can't access powerful server-side speech recognition tools. The idea I had was to detect syllables in a word, and use that to determine the word spoken. I am aware that this will grealty limit the capacities for recognition, but I only need to recognize a few key words and I can make sure they all have a different number of syllables. I am currently able to generate a 1D

Use x-webkit-speech in an HTML/JavaScript extension

牧云@^-^@ 提交于 2020-01-01 04:26:10
问题 I am trying to use the new x-webkit-speech function in a simple HTML/JavaScript extension in Google Chrome. I, however, have tried and tried looking at a bunch of examples and cannot get it to successfully call the function. I have seen other people do it, and I don't really get why I cannot. I put the JavaScript code into a separate file, but I include using <script src="filename.js"> this is my line for the x-webkit-speech.... <input id="speechInput" type="text" style="font-size:25px;" x

Start speech recognition thru voice with phrase like “Ok Google”?

╄→尐↘猪︶ㄣ 提交于 2020-01-01 01:52:08
问题 I'm building an app that uses Voice Commands to perform certain functions. I got some codes working from here private static final int SPEECH_REQUEST_CODE = 0; // Create an intent that can start the Speech Recognizer activity private void displaySpeechRecognizer() { Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, RecognizerIntent.LANGUAGE_MODEL_FREE_FORM); // Start the activity, the intent will be populated with the

Python app listening for a keyword. (Like Cortana)

天涯浪子 提交于 2019-12-31 07:27:50
问题 I'm trying to make a Python app, which behaves like Alexa, Cortana or Google's "Ok, Google". I want it to constantly listen for a specific keyword. After it hears the keyword I want it to execute a function. How can I do this? 回答1: Take a look at Speech Recognition This is a library that allows speech recognition including Google Cloud Speech API. Relating to the second part of your question this seems relevant: How can i detect one word with speech recognition in Python Once you can listen

what to do for voice application for ios?

只愿长相守 提交于 2019-12-30 14:26:05
问题 how to make Vlingo like application? is there any api that can be used for making apps for ios? Please provide some guide line or any tutorials or an help or comment will be appreciated Thanks in advance i also like to know how the talking ben the dog and talking tom cat is working as we talk it repeats it in funny voice. how it is possible? 回答1: For Speech Recognition on iOS, there have been many similar questions. Please see Speech to text Conversion.? or Text-to-speech (voice generation)

Difference among Microsoft Speech products/platforms

女生的网名这么多〃 提交于 2019-12-30 06:43:26
问题 It seems Microsoft offers quite a few speech recognition products, I'd like to know the differences among all of them pls. There is Microsoft Speech API, or SAPI. But somehow Microsoft Cognitive Service Speech API has the same name. Ok now, Microsoft Cognitive Service on Azure offers Speech service API and Bing Speech API . I assume for speech-to-text, both APIs are the same. And then there is System.Speech.Recognition (or Desktop SAPI), Microsoft.Speech.Recognition (or Server SAPI) and

Difference among Microsoft Speech products/platforms

我的梦境 提交于 2019-12-30 06:42:33
问题 It seems Microsoft offers quite a few speech recognition products, I'd like to know the differences among all of them pls. There is Microsoft Speech API, or SAPI. But somehow Microsoft Cognitive Service Speech API has the same name. Ok now, Microsoft Cognitive Service on Azure offers Speech service API and Bing Speech API . I assume for speech-to-text, both APIs are the same. And then there is System.Speech.Recognition (or Desktop SAPI), Microsoft.Speech.Recognition (or Server SAPI) and

Offline google voice recognition on android

雨燕双飞 提交于 2019-12-30 06:30:10
问题 I am working on an android application which will listen to voice command and triggers actions accordingly. Below are some doubts: Is google voice recognition offline library is available to use in android apps. Can we create our own dictionary of words & commands with wake up word. Application should work in offline mode(Without Internet). 回答1: Is Google voice recognition offline library is available to use in android apps. I think this question is already answered here: Offline Speech

HTML5 Speech recognition — is there a way to set what the user is expected to say dynamically? (Using custom Grammars)

断了今生、忘了曾经 提交于 2019-12-30 01:58:10
问题 I am looking for a way to define what you expect the user to say in an <input> tag with the HTML 5 speech attribute set. I know that you can specify a specific grammar to use via the grammar attribute, like this: <input type="text" speech grammar="grammar.grxml" /> ( see http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0020/api-draft.html ) but I was hoping for a way to make this dynamic, so that I can specify what I expect the user to say via javascript. For example, if