speech | 易学教程

C# system.speech.recognition alternates

阅读更多关于 C# system.speech.recognition alternates

问题 I am using using system.speech.recognition to recognize a single word from many. Because the accuracy is not very good, I want to consider more options the engine gives me. However, the e.Result.Alternates only contains one result which is the same as e.Result.word. Is there a ways to force the engine to give me more options? Thank you! 来源： https://stackoverflow.com/questions/8214904/c-sharp-system-speech-recognition-alternates

how to pass language in speech recognition on android apps?

阅读更多关于 how to pass language in speech recognition on android apps?

问题 I've been working on speech Recognition API in android and found out that the speech results vary allot when the language settings are changed , is there a way to set it programmatically ? or is there an intent to lunch the speech language settings screen ? or what else ? note: I tried to use this intent extra: intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_PREFERENCE, "en-US"); and Intent detailsIntent = new Intent(RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS); sendOrderedBroadcast

Using c++ to call and use Windows Speech Recognition [closed]

阅读更多关于 Using c++ to call and use Windows Speech Recognition [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 6 years ago . I am making an application that involves the use of windows speech recognition. I am thinking of using c++ to do this since i have some experience with this language. The way i want to use the speech recognition is so that it works internally. If i upload an audio file into my program, i want speech recognition

Emotion detection in speech

阅读更多关于 Emotion detection in speech

问题 I would like to build an app which analyses the emotional content of speech from the mic. THis does not, although sometimes used as an extra feature, involve speech recognition. Emotional analyses is based on prosodic features of the voice (pitch change, speed etc., tone). I know this can be done on a desktop computer, but i dont want users to have to upload their recordings (phone conversations) to a server in order to get emotional feedback. What i need is an API which either provides the

Swift: stop speech recognition after x seconds of silence

阅读更多关于 Swift: stop speech recognition after x seconds of silence

问题 I've looked for a solution, but all of the ones I've seen are confusing so I thought I'd create a new question. I'm using the speech library and I want the recognition task to stop after 2 seconds without input from the user. I know I want to use a timer, but I'm having trouble figuring out where to put it and how to update it. I start the timer when the record button is pressed and I invalidate it when the stop recording button is pressed. But where do I check if the user added new input? I

Can anyone recommend a decent DSP/speech library in C++? [closed]

阅读更多关于 Can anyone recommend a decent DSP/speech library in C++? [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . Google returns too much results, although SPUC caught my attention. Is there a standard recommended library like OpenCV for vision? The necessary features would be: Free Open Source filter design (Butterworth, Chebyshev, etc) FFT if possible, some speech processing features, like MFCC computation, although that

C# system.speech.recognition alternate words

阅读更多关于 C# system.speech.recognition alternate words

问题 I am currently using the Microsoft.Speech API to dictate utterances into text, but what I really need is the alternative dictations the program could use. I am using this for my honours thesis, and for it I wish to know the top 10 interpretations of any utterance. A very similar, if not exact question was asked in 2011: C# system.speech.recognition alternates But was never answered. My question thus is: how does one get the alternatives to an interpretation of a dictation using the Microsoft

Keyword Spotting in Speech [closed]

阅读更多关于 Keyword Spotting in Speech [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 3 years ago . Is anyone aware of a Keyword Spotting System that is freely available, and possibly providing APIs ?? CMU Sphinx 4 and MS Speech API are speech recognition engines, and cannot be used for KWS. SRI has a keyword spotting system, but no download links, not even for evaluation. (I even couldn't find anywhere a link

Create timestamps for subtitles in audibook [closed]

阅读更多关于 Create timestamps for subtitles in audibook [closed]

问题 Closed . This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed 3 years ago . I want to add timestamps to book sentences, fitting the relevant audiobook. In various languages ideally. Here's an example: Pride and prejudice text from gutenberg project audio from Librivox My idea was to find a voice recognition tool that puts timestamps on sentences (step 1),

Microsoft Speech Recognition setInputToDefaultAudioDevice throws exception

阅读更多关于 Microsoft Speech Recognition setInputToDefaultAudioDevice throws exception

问题 hello guys I'm in trouble in MS Speech recognition. my code is simple. static void init() { string enUsEngine = string.Empty; foreach (RecognizerInfo ri in SpeechRecognitionEngine.InstalledRecognizers()) { Console.WriteLine(ri.Culture); if (ri.Culture.Name.Equals("en-US") == true) { enUsEngine = ri.Id; } } SpeechRecognitionEngine recogEngine = new SpeechRecognitionEngine(enUsEngine); Grammar grammar = new Grammar("grammar.xml"); recogEngine.LoadGrammar(grammar); recogEngine.SpeechRecognized +