speech-recognition

Streaming input to System.Speech.Recognition.SpeechRecognitionEngine

谁都会走 submitted on 2019-12-28 02:05:47
Question: I am trying to do "streaming" speech recognition in C# from a TCP socket. The problem I am having is that SpeechRecognitionEngine.SetInputToAudioStream() seems to require a seekable Stream of a defined length. Right now the only way I can think of to make this work is to repeatedly run the recognizer on a MemoryStream as more input comes in. Here's some code to illustrate: SpeechRecognitionEngine appRecognizer = new SpeechRecognitionEngine(); System.Speech.AudioFormat.SpeechAudioFormatInfo
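The excerpt is cut off before any answer, so the following is only an assumption about one possible direction: instead of re-running the recognizer on a growing MemoryStream, hand it a stream whose read call blocks until the socket delivers more bytes. The sketch below shows that buffering idea in Python rather than C#. `BlockingAudioBuffer` is a hypothetical name, and whether the .NET engine accepts a non-seekable stream of this kind is not confirmed by the source.

```python
import threading


class BlockingAudioBuffer:
    """Hypothetical byte buffer: read() blocks until data arrives or close() is called."""

    def __init__(self):
        self._buf = bytearray()
        self._cond = threading.Condition()
        self._closed = False

    def write(self, chunk):
        # Called by the network thread whenever the socket yields audio bytes.
        with self._cond:
            self._buf.extend(chunk)
            self._cond.notify_all()

    def close(self):
        # Signals end of stream so that blocked readers can return.
        with self._cond:
            self._closed = True
            self._cond.notify_all()

    def read(self, n):
        # Called by the consumer; waits instead of reporting end of stream early.
        with self._cond:
            while not self._buf and not self._closed:
                self._cond.wait()
            out = bytes(self._buf[:n])
            del self._buf[:n]
            return out
```

A reader only sees an empty result after `close()` has been called and the buffer is drained, which is what distinguishes "no data yet" from "end of audio".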

Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

陌路散爱 submitted on 2019-12-27 11:05:07
Question: Is there a comprehensive list of known APIs for desktop or browser environments? Answer 1: I'll rehash and update an answer from Speech recognition in C or Java or PHP?. This is by no means comprehensive, but it might be a start for you. From watching these questions for a few months, I've seen most developer choices break down like this: Windows folks - use the System.Speech features of .NET or Microsoft.Speech and install the free recognizers Microsoft provides. Windows 7 includes a full speech

iPhone App › Add voice recognition? [closed]

心不动则不痛 submitted on 2019-12-27 10:27:54
Question: Closed. This question is off-topic. It is not currently accepting answers. Closed 6 years ago. I'd like to build an app that uses voice recognition. I've seen big companies like Google etc. implement this feature, but I'm curious about doing it at a start-up level. Has anyone looked into this? Are there any tools out there for us to do this? Answer 1: OpenEars looks promising... http://www.politepix.com/openears/

How can a Chrome extension get a user's permission to use user's computer's microphone?

浪尽此生 submitted on 2019-12-25 12:51:48
Question: If we run the Web Speech API JavaScript code below on a website in Chrome, Chrome asks for the user's permission to use the computer's microphone. var recognition = new webkitSpeechRecognition(); recognition.start(); But if I run the code above on a Chrome extension's page, Chrome doesn't ask the user for permission. How can a Chrome extension get a user's permission to use the user's computer's microphone? Thank you. Answer 1: I think you have to implement it yourself. In chrome

Speech Recognizer get confidence below API 14

大憨熊 submitted on 2019-12-25 10:55:15
Question: I'm using the SpeechRecognizer with minSDK 14 and added a filter to get the most accurate result. I have this code in onActivityResult() of my Activity. @Override protected void onActivityResult(int requestCode, int resultCode, Intent data) { if (requestCode == VOICE_RECOGNITION && resultCode == RESULT_OK) { ArrayList<String> results = data .getStringArrayListExtra(RecognizerIntent.EXTRA_RESULTS); // this is only available in API 14 String confidenceExtra = RecognizerIntent.EXTRA_CONFIDENCE
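The filtering step the question describes (pick the most confident result, and cope with scores being unavailable on older API levels) can be sketched independently of Android. This is a minimal illustration in Python, not the asker's code: `pick_best_result` is a hypothetical helper, and the fallback to the first entry assumes the result list is already ordered by descending confidence, which the Android documentation for EXTRA_RESULTS describes as the general case.

```python
def pick_best_result(results, confidences=None):
    """Return the recognition result with the highest confidence score.

    Falls back to the first result when no usable scores are supplied
    (assumption: the list is already ordered best-first).
    """
    if not results:
        return None
    if not confidences or len(confidences) != len(results):
        return results[0]
    # Index of the largest score; ties resolve to the earliest entry.
    best_index = max(range(len(results)), key=lambda i: confidences[i])
    return results[best_index]
```

Called with scores, it returns the entry with the largest score; called without them, it behaves like "take the first result", so the same call site works on both old and new API levels.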

Android speech recognition conflicting with Text to Speech

时光怂恿深爱的人放手 submitted on 2019-12-25 08:18:12
Question: I have implemented continuous speech recognition in my app based on the method given here: Android Speech Recognition Continuous Service. It works reasonably well by itself. However, I want to implement a feature where the app reads a web page using Text To Speech, and while the reading is going on, I want to have speech recognition turned on so that I can ask it to stop using voice. However, I am seeing that the speech recognition starts recognizing the Text to Speech output as well

Having different results every run with GMM Classifier

我与影子孤独终老i submitted on 2019-12-25 08:11:24
Question: I'm currently doing a speech recognition and machine learning related project. I have two classes now, and I create two GMM classifiers for each class, for labels 'happy' and 'sad'. I want to train GMM classifiers with MFCC vectors. I am using two GMM classifiers for each label. (Previously it was one GMM per file.) But every time I run the script I get different results. What might be the cause of that, given the same test and train samples? In the outputs below please note that I have 10 test
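The excerpt ends before any answer, so the cause is not confirmed here. One common source of run-to-run variation with identical data is that EM fitting of a GMM starts from a random initialization; with an unfixed seed, each run can converge to a different solution. The sketch below is a toy 1-D EM in plain Python, not the asker's classifier: `fit_gmm_1d`, its parameters, and the synthetic `data` are all hypothetical stand-ins (real MFCC vectors are multi-dimensional). It shows only that fixing the seed makes two fits identical.

```python
import math
import random


def fit_gmm_1d(data, n_components=2, n_iter=50, seed=None):
    """Fit a 1-D Gaussian mixture with EM; `seed` fixes the random initialization."""
    rng = random.Random(seed)
    means = rng.sample(data, n_components)  # random initial means drawn from the data
    variances = [1.0] * n_components
    weights = [1.0 / n_components] * n_components
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in data:
            dens = [
                w * math.exp(-((x - m) ** 2) / (2 * v)) / math.sqrt(2 * math.pi * v)
                for w, m, v in zip(weights, means, variances)
            ]
            total = sum(dens) or 1e-300  # guard against underflow to zero
            resp.append([d / total for d in dens])
        # M-step: re-estimate means, variances, and mixing weights.
        for k in range(n_components):
            nk = sum(r[k] for r in resp) or 1e-300
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var_k, 1e-6)  # floor keeps a component from collapsing
            weights[k] = nk / len(data)
    return means, variances, weights


# Synthetic 1-D samples standing in for two emotion classes (assumption).
_gen = random.Random(42)
data = [_gen.gauss(0.0, 1.0) for _ in range(100)]
data += [_gen.gauss(5.0, 1.0) for _ in range(100)]

run_a = fit_gmm_1d(data, seed=0)
run_b = fit_gmm_1d(data, seed=0)
print(run_a == run_b)  # same seed, same data: the two fits can be compared directly
```

With `seed=None` the generator is seeded from system entropy, so the starting means, and potentially the fitted model, change between runs. Library implementations usually expose an equivalent seed parameter for the same purpose.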

“Speaker Recognition and Segmentation” [closed]

人盡茶涼 submitted on 2019-12-25 07:37:30
Question: Closed. This question is off-topic. It is not currently accepting answers. Closed 5 years ago. Looking for code that would process a media file into "who said what and when", in other words speaker-by-speaker segmentation with the timing for each speaker. Failing answers: anything that requires manual work to process the media file. Thanks! Answer 1: You can use speaker diarization from Kaldi; it is not easy to set up but results