speech-recognition

Android SpeechRecognizer set recognize engine?

前提是你 submitted on 2019-12-04 15:11:29
I'm using SpeechRecognizer to convert speech to text. I'm on a Samsung device and I encountered an error when the voice recognizer was set to 'Samsung powered by Vlingo'; after I changed it to 'Google', the error disappeared. Is there any way to change the recognition engine programmatically? Thanks. I've found the answer to my question. I can use the method 'public static SpeechRecognizer createSpeechRecognizer (Context context, ComponentName serviceComponent)' to choose which recognition service to bind to. Here is the implementation. SpeechRecognizer.createSpeechRecognizer(mContext, ComponentName

Does the iOS Speech API support grammar?

我只是一个虾纸丫 submitted on 2019-12-04 14:03:23
I was investigating various speech recognition strategies and I liked the idea of grammars as defined in the Web Speech spec. It seems that if you can tell the speech recognition service that you expect “Yes” or “No”, the service could more reliably recognize a “Yes” as “Yes”, a “No” as “No”, and hopefully also be able to say “it didn’t sound like either of those!”. However, in SFSpeechRecognitionRequest, I only see taskHint with values from SFSpeechRecognitionTaskHint of confirmation, dictation, search, and unspecified. I also see SFSpeechRecognitionRequest.contextualStrings, but it

Burmese speech to text conversion in android?

别等时光非礼了梦想. submitted on 2019-12-04 12:01:41
Question: Can we add a custom language for RecognizerIntent? I have searched many SO questions (for example https://stackoverflow.com/questions/2080401/is-there-a-speech-to-text-api-by-google) that deal with the limited number of languages available for speech-to-text conversion. My problem is that I need to take Burmese (the local language of Burma) speech and convert it to text. Any other help would be appreciated. UPDATE: Google's servers currently support English, Mandarin Chinese, and Japanese. Speech Input API

How can I detect one word with speech recognition in Python

拈花ヽ惹草 submitted on 2019-12-04 11:31:51
Question: I know how to detect speech with Python, but this question is more specific: how can I make Python listen for only one word and return True if it recognizes that word? I know I could just let Python listen all the time and do something like this pseudocode: while True: if stt.listen() == "keyword": return True I have already tried that, and the program hangs after some minutes of continuous listening (see the end). So I need a way to only listen for one specific
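A minimal sketch of the keyword check described in the question, assuming the speech-to-text call can be wrapped in a function that returns one transcript per call (or None when nothing was recognized). The names listen_once, contains_keyword, and wait_for_keyword are hypothetical and not part of any speech library. The sketch bounds the number of attempts instead of looping forever, and it looks for the keyword as a whole word inside the transcript rather than requiring exact equality. It does not address the hang itself, whose cause is not shown in the excerpt.

```python
import re
from typing import Callable, Optional


def contains_keyword(transcript: Optional[str], keyword: str) -> bool:
    """Return True if `keyword` occurs as a whole word in `transcript` (case-insensitive)."""
    if not transcript:
        return False
    pattern = r"\b" + re.escape(keyword.lower()) + r"\b"
    return re.search(pattern, transcript.lower()) is not None


def wait_for_keyword(
    listen_once: Callable[[], Optional[str]],
    keyword: str,
    max_attempts: int = 20,
) -> bool:
    """Call `listen_once` up to `max_attempts` times; return True once the keyword is heard.

    `listen_once` is a hypothetical stand-in for the real speech-to-text call.
    It should return the recognized text, or None if nothing was recognized.
    """
    for _ in range(max_attempts):
        transcript = listen_once()
        if contains_keyword(transcript, keyword):
            return True
    return False


# Usage with a fake listener, so the sketch runs without a microphone.
fake_results = iter([None, "hello there", "please say keyword now"])
print(wait_for_keyword(lambda: next(fake_results), "keyword"))  # True
```

In a real program, listen_once would wrap whatever recognition call is already in use; keeping the check separate from the audio capture makes the matching logic testable on its own.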

Microsoft Speech Recognition Custom Training

核能气质少年 submitted on 2019-12-04 09:34:54
I have been wanting to create an application using Microsoft Speech Recognition. My application's users are expected to often say abbreviated things, such as 'LHC' for 'Large Hadron Collider' or 'CERN'. Given those in that exact order, my application returns: You said: At age C. You said: Cern. While it did work for 'CERN', it failed very badly for 'LHC'. However, if I could make my own custom training files, I could easily place the term 'LHC' somewhere in there. Then, I could have the user open the Speech Control Panel and run my training file. All the links I have found for this have been

Swift - How can I convert Saved Audio file conversations to Text?

若如初见. submitted on 2019-12-04 09:30:31
I work on speech recognition. I have solved text-to-speech and speech-to-text with iOS frameworks. But now I want to convert saved audio file conversations to text. How can I solve this? Thank you for all replies. I have worked on the same thing, and the following works for me. I have an audio file in my project bundle, so I have written the following code to convert audio to text. let audioURL = Bundle.main.url(forResource: "Song", withExtension: "mov") let recognizer = SFSpeechRecognizer(locale: Locale(identifier: "en-US")) let request = SFSpeechURLRecognitionRequest(url: audioURL!) request

Voice control API - high accuracy on specific phrases [closed]

纵然是瞬间 submitted on 2019-12-04 09:23:25
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 6 years ago. I have several ideas for voice-controlled apps. Unfortunately, based on what I've seen from Siri and Google Voice Actions, the technology doesn't seem to quite be there yet. Even in a perfectly quiet environment, the accuracy is so bad,

Is there a way to force Google Speech api to return only words as response?

半世苍凉 submitted on 2019-12-04 09:06:52
Question: I am using this Google API: https://www.google.com/speech-api/v2/recognize?output=json&lang="+ language_code+"&key="My key" for speech recognition, and it's working very well. The issue is with numbers, i.e. if I say one two three four the result will be 1234, and if I say one thousand two hundred thirty four the result is still 1234. Another issue arises with other languages, e.g. the word elf in German means eleven. If you say elf, the result is 11 instead of elf. I know we have no control

SpeechRecognizer : not connected to recognition service

五迷三道 submitted on 2019-12-04 07:57:58
In my app, I am using SpeechRecognizer directly. I destroy the SpeechRecognizer in the Activity's onPause and recreate it in onResume, as below ... public class NoUISpeechActivity extends Activity { protected static final String CLASS_TAG = "NoUISpeechActivity"; private SpeechRecognizer sr; @Override public void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); setContentView(R.layout.activity_no_uispeech); sr = getSpeechRecognizer(); } @Override protected void onPause() { Log.i(CLASS_TAG, "on pause called"); if(sr!=null){ sr.stopListening(); sr.cancel(); sr.destroy();

How to Extend Google Now Voice Commands in Android with Custom Actions

大兔子大兔子 submitted on 2019-12-04 07:45:53
I recently installed the Google Now Launcher on my Nexus 4, and it got me thinking about how I could use it to interact with my own apps. While I can open my app by asking for it by title, I was wondering if there's a way to intercept the voice commands (possibly through a broadcast receiver) so I can say something like "turn off living room light" to send a signal to an Arduino to turn off the light in a room across the house? On the same note, I haven't been able to find the documentation for Google Now, so if that's out there I would love to see the link for it so I can just read what's