speech

Speech to Text on Android

≯℡__Kan透↙ 提交于 2019-11-26 22:16:32
问题 I am looking to create an app which has Speech to text. I am aware of this kind of ability using the RecognizerIntent: http://android-developers.blogspot.com/search/label/Speech%20Input However - I do not want a new Intent to be popped up, I want to do the analysis a certain points in my current app, and I dont want it to pop something up stating that it is currently attempting to record your voice. Has anybody any ideas on how best to do this. I was perhaps thinking of trying Sphinx 4 - but

Acoustic training using SAPI 5.3 Speech API

[亡魂溺海] 提交于 2019-11-26 16:18:24
问题 Using Microsoft's SAPI 5.3 Speech API on Vista, how do you programatically do acoustic model training of a RecoProfile? More concretely, if you have a text file, and an audio file of a user speaking that text, what sequence of SAPI calls would you make to train the user's profile using that text and audio? Update: More information about this problem I still haven't solved: You call ISpRecognizer2.SetTrainingState( TRUE, TRUE ) at "the beginning" and ISpRecognizer2.SetTrainingState( FALSE,

how I can change the voice synthesizer gender and age in C#?

不想你离开。 提交于 2019-11-26 14:14:01
问题 I would like to change the gender and age of the voice of System.Speech in c#. For example, a girl of 10 years but can not find any simple example to help me adjust the parameters. 回答1: First, check which voices you have installed by enumerating the GetInstalledVoices method of the SpeechSynthesizer class, and then use SelectVoiceByHints to select one of them: using (SpeechSynthesizer synthesizer = new SpeechSynthesizer()) { // show installed voices foreach (var v in synthesizer

Is there a way to use a grammar with the HTML 5 speech input API?

∥☆過路亽.° 提交于 2019-11-26 11:22:39
问题 I\'m working with the HTML 5 speech input API and I want to let the server know which answers it can expect to be returned from the speech input. Is there a way to set a list of possible inputs? 回答1: In Google Chrome you can not use grammars yet, overall they decided to use free-form recognition only. Relevant question is Grammar in Google speech API. Grammars are supported in Firefox Web Speech API, but the feature is experimental. If your browser supports HTML5 audio, you might want to try

What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

生来就可爱ヽ(ⅴ<●) 提交于 2019-11-25 23:24:59
问题 There are two similar namespaces and assemblies for speech recognition in .NET. I’m trying to understand the differences and when it is appropriate to use one or the other. There is System.Speech.Recognition from the assembly System.Speech (in System.Speech.dll). System.Speech.dll is a core DLL in the .NET Framework class library 3.0 and later There is also Microsoft.Speech.Recognition from the assembly Microsoft.Speech (in microsoft.speech.dll). Microsoft.Speech.dll is part of the UCMA 2.0