speech-recognition | 易学教程

Streaming input to System.Speech.Recognition.SpeechRecognitionEngine

阅读更多关于 Streaming input to System.Speech.Recognition.SpeechRecognitionEngine

问题 I am trying to do "streaming" speech recognition in C# from a TCP socket. The problem I am having is that SpeechRecognitionEngine.SetInputToAudioStream() seems to require a Stream of a defined length which can seek. Right now the only way I can think to make this work is to repeatedly run the recognizer on a MemoryStream as more input comes in. Here's some code to illustrate: SpeechRecognitionEngine appRecognizer = new SpeechRecognitionEngine(); System.Speech.AudioFormat.SpeechAudioFormatInfo

Streaming input to System.Speech.Recognition.SpeechRecognitionEngine

阅读更多关于 Streaming input to System.Speech.Recognition.SpeechRecognitionEngine

Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

阅读更多关于 Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

问题 Is there a comprehensive list of known APIs for desktop or browser environments? 回答1: I'll rehash and update an answer from Speech recognition in C or Java or PHP?. This is by no means comprehensive, but it might be a start for you From watching these questions for few months, I've seen most developer choices break down like this: Windows folks - use the System.Speech features of .Net or Microsoft.Speech and install the free recognizers Microsoft provides. Windows 7 includes a full speech

iPhone App › Add voice recognition? [closed]

阅读更多关于 iPhone App › Add voice recognition? [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 6 years ago . I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this? 回答1: OpenEars looks promising... http://www.politepix.com/openears/

iPhone App › Add voice recognition? [closed]

阅读更多关于 iPhone App › Add voice recognition? [closed]

How can a Chrome extension get a user's permission to use user's computer's microphone?

阅读更多关于 How can a Chrome extension get a user's permission to use user's computer's microphone?

问题 If we run HTML5's Web Speech API's JavaScript codes below in a website on a Chrome, Chrome will ask for user's permission for the use of user's computer's microphone. var recognition = new webkitSpeechRecognition(); recognition.start(); But if I run codes above on a Chrome extension's page, Chrome doesn't ask users to give a permission. How can a Chrome extension get a user's permission to use user's computer's microphone? Thank you. 回答1: I think you have to implement it yourself. In chrome

Speech Recognizer get confidence below API 14

阅读更多关于 Speech Recognizer get confidence below API 14

问题 I'm using the SpeechRecognizer with minSDK 14 and added a filter to get the most accurate result. This code I have in onActivityResult() of my Activity. @Override protected void onActivityResult(int requestCode, int resultCode, Intent data) { if (requestCode == VOICE_RECOGNITION && resultCode == RESULT_OK) { ArrayList<String> results = data .getStringArrayListExtra(RecognizerIntent.EXTRA_RESULTS); // this is only available in API 14 String confidenceExtra = RecognizerIntent.EXTRA_CONFIDENCE

Android speech recognition conflicting with Text to Speech

阅读更多关于 Android speech recognition conflicting with Text to Speech

问题 I have implemented continuous speech recognition in my app based on the method given here: Android Speech Recognition Continuous Service. It works reasonably well by itself. However, I want to implement a feature where the app is reading a web page using Text To Speech and during the reading is going on, I want to have the speech recognition turned on so that i can ask it to stop using voice. However, I am seeing that the speech recognition starts recognition the Text to Speech output as well

Having different results every run with GMM Classifier

阅读更多关于 Having different results every run with GMM Classifier

问题 I'm currently doing a speech recognition and machine learning related project. I have two classes now, and I create two GMM classifiers for each class, for labels 'happy' and 'sad' I want to train GMM classifiers with MFCC vectors. I am using two GMM classifiers for each label. (Previously it was GMM per file): But every time I run the script I am having different results. What might be the cause for that with same test and train samples? In the outputs below please note that I have 10 test

“Speaker Recognition and Segmentation” [closed]

阅读更多关于 “Speaker Recognition and Segmentation” [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 5 years ago . Looking for a code that would process media file to "Who said what and when" in other words a "Speaker by speaker Segmentation" and what timing for each. Failing answers: doing any manual works to process the media file..thanks! 回答1: You can use speaker diarization from Kaldi, it is not easy to setup but results