speech

C# system.speech.recognition alternates

ぐ巨炮叔叔 提交于 2019-12-23 05:33:13
问题 I am using using system.speech.recognition to recognize a single word from many. Because the accuracy is not very good, I want to consider more options the engine gives me. However, the e.Result.Alternates only contains one result which is the same as e.Result.word. Is there a ways to force the engine to give me more options? Thank you! 来源: https://stackoverflow.com/questions/8214904/c-sharp-system-speech-recognition-alternates

how to pass language in speech recognition on android apps?

旧时模样 提交于 2019-12-22 13:59:34
问题 I've been working on speech Recognition API in android and found out that the speech results vary allot when the language settings are changed , is there a way to set it programmatically ? or is there an intent to lunch the speech language settings screen ? or what else ? note: I tried to use this intent extra: intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_PREFERENCE, "en-US"); and Intent detailsIntent = new Intent(RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS); sendOrderedBroadcast

Using c++ to call and use Windows Speech Recognition [closed]

那年仲夏 提交于 2019-12-22 10:48:25
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 6 years ago . I am making an application that involves the use of windows speech recognition. I am thinking of using c++ to do this since i have some experience with this language. The way i want to use the speech recognition is so that it works internally. If i upload an audio file into my program, i want speech recognition

Emotion detection in speech

戏子无情 提交于 2019-12-21 21:43:25
问题 I would like to build an app which analyses the emotional content of speech from the mic. THis does not, although sometimes used as an extra feature, involve speech recognition. Emotional analyses is based on prosodic features of the voice (pitch change, speed etc., tone). I know this can be done on a desktop computer, but i dont want users to have to upload their recordings (phone conversations) to a server in order to get emotional feedback. What i need is an API which either provides the

Swift: stop speech recognition after x seconds of silence

不羁的心 提交于 2019-12-21 04:42:17
问题 I've looked for a solution, but all of the ones I've seen are confusing so I thought I'd create a new question. I'm using the speech library and I want the recognition task to stop after 2 seconds without input from the user. I know I want to use a timer, but I'm having trouble figuring out where to put it and how to update it. I start the timer when the record button is pressed and I invalidate it when the stop recording button is pressed. But where do I check if the user added new input? I

Can anyone recommend a decent DSP/speech library in C++? [closed]

一世执手 提交于 2019-12-20 17:27:11
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . Google returns too much results, although SPUC caught my attention. Is there a standard recommended library like OpenCV for vision? The necessary features would be: Free Open Source filter design (Butterworth, Chebyshev, etc) FFT if possible, some speech processing features, like MFCC computation, although that

C# system.speech.recognition alternate words

前提是你 提交于 2019-12-20 04:18:50
问题 I am currently using the Microsoft.Speech API to dictate utterances into text, but what I really need is the alternative dictations the program could use. I am using this for my honours thesis, and for it I wish to know the top 10 interpretations of any utterance. A very similar, if not exact question was asked in 2011: C# system.speech.recognition alternates But was never answered. My question thus is: how does one get the alternatives to an interpretation of a dictation using the Microsoft

Keyword Spotting in Speech [closed]

生来就可爱ヽ(ⅴ<●) 提交于 2019-12-19 07:35:21
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 3 years ago . Is anyone aware of a Keyword Spotting System that is freely available, and possibly providing APIs ?? CMU Sphinx 4 and MS Speech API are speech recognition engines, and cannot be used for KWS. SRI has a keyword spotting system, but no download links, not even for evaluation. (I even couldn't find anywhere a link

Create timestamps for subtitles in audibook [closed]

社会主义新天地 提交于 2019-12-19 04:40:10
问题 Closed . This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed 3 years ago . I want to add timestamps to book sentences, fitting the relevant audiobook. In various languages ideally. Here's an example: Pride and prejudice text from gutenberg project audio from Librivox My idea was to find a voice recognition tool that puts timestamps on sentences (step 1),

Microsoft Speech Recognition setInputToDefaultAudioDevice throws exception

回眸只為那壹抹淺笑 提交于 2019-12-19 03:39:09
问题 hello guys I'm in trouble in MS Speech recognition. my code is simple. static void init() { string enUsEngine = string.Empty; foreach (RecognizerInfo ri in SpeechRecognitionEngine.InstalledRecognizers()) { Console.WriteLine(ri.Culture); if (ri.Culture.Name.Equals("en-US") == true) { enUsEngine = ri.Id; } } SpeechRecognitionEngine recogEngine = new SpeechRecognitionEngine(enUsEngine); Grammar grammar = new Grammar("grammar.xml"); recogEngine.LoadGrammar(grammar); recogEngine.SpeechRecognized +