speech-recognition

C# Speech Recognition - Is this what the user said?

不羁的心 submitted on 2019-11-26 19:29:48
I need to write an application that uses a speech recognition engine (either the built-in Vista one or a third-party one) to display a word or phrase and recognise when the user reads it, or an approximation of it. I also need to be able to switch quickly between languages without changing the language of the operating system. The users will be using the system for very short periods, so the application needs to work without first training the recognition engine on the users' voices. It would also be fantastic if this could work on Windows XP or lesser

Building openears compatible language model

☆樱花仙子☆ submitted on 2019-11-26 19:25:23
Question: I am doing some development on speech-to-text and text-to-speech, and I found the OpenEars API very useful. This cmu-slm based API works by using a language model to map the speech heard by the iPhone. So I decided to find a large English language model to feed to the API's speech recognizer engine. But I could not work out how to use the format of the VoxForge English data model with OpenEars. Does anyone have any idea how I can get the .languagemodel and .dic file for

How to train on and make a serialized feature vector for a Neural Network?

六眼飞鱼酱① submitted on 2019-11-26 17:22:25
Question: By serialized I mean that the values for an input arrive at discrete intervals of time and that the size of the vector is not known beforehand. Conventionally, neural networks employ fixed-size parallel input neurons and fixed-size parallel output neurons. A serialized implementation could be used in speech recognition, where I can feed the network a time series of the waveform and get the phonemes at the output end. It would be great if someone could point out some existing
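One common way to handle an input whose length is not known in advance is to cut the waveform into fixed-size, overlapping frames and present them to the network one time step at a time. The sketch below shows only that framing step in plain Python. The function name `frame_signal` and the parameters `frame_len` and `hop` are illustrative assumptions, not something taken from the question, and the zero-padding of the last frame is just one possible choice.

```python
from typing import List, Sequence


def frame_signal(samples: Sequence[float], frame_len: int, hop: int) -> List[List[float]]:
    """Split a variable-length signal into fixed-size frames (hypothetical helper).

    Each frame has exactly `frame_len` values; consecutive frames start
    `hop` samples apart. The final frame is zero-padded if the signal
    runs out before the frame is full.
    """
    if frame_len <= 0 or hop <= 0:
        raise ValueError("frame_len and hop must be positive")

    frames: List[List[float]] = []
    for start in range(0, len(samples), hop):
        frame = list(samples[start:start + frame_len])
        # Pad the last, shorter frame so every frame has the same size.
        if len(frame) < frame_len:
            frame.extend([0.0] * (frame_len - len(frame)))
        frames.append(frame)
        # Stop once a frame has reached the end of the signal.
        if start + frame_len >= len(samples):
            break
    return frames


# Each inner list is one fixed-size input for one time step.
frames = frame_signal([1, 2, 3, 4, 5], frame_len=4, hop=2)
```

The number of frames still varies with the length of the recording, so this only fixes the per-step input size; a model that consumes a sequence of such frames is still needed.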

Grammar in Google Web Speech API

时间秒杀一切 submitted on 2019-11-26 17:14:49
Question: Can I improve Google speech API recognition by giving it a word list (in my case the user's request is very predictable) to make recognition more accurate?
Answer 1: I can't speak for Chrome, but in Android they are quite clear that you cannot provide a grammar. In Android speech recognition you are limited to a choice of two models: "free form" and "web search". See Android: Speech Recognition Append Dictionary?
Answer 2: The correct answer is: no, you can't. =(
Answer 3: Updated answer for August 20, 2019
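When the recognizer itself cannot be given a word list, one workaround is to post-process its output and snap the transcript to the closest phrase you expect. This is a swapped-in technique, not a feature of the Google speech API: the sketch below uses Python's standard `difflib` module, and the function name `snap_to_expected`, the sample phrases, and the 0.6 cutoff are all assumptions chosen for illustration.

```python
import difflib
from typing import Optional, Sequence


def snap_to_expected(transcript: str, expected: Sequence[str], cutoff: float = 0.6) -> Optional[str]:
    """Return the expected phrase closest to the transcript, or None.

    Matching is case-insensitive. `cutoff` is the minimum similarity
    (0.0 to 1.0) that difflib must report for a phrase to count.
    """
    # Map lowercased phrases back to their original spelling.
    lowered = {phrase.lower(): phrase for phrase in expected}
    matches = difflib.get_close_matches(
        transcript.lower().strip(), list(lowered), n=1, cutoff=cutoff
    )
    return lowered[matches[0]] if matches else None


# Hypothetical command list for an app with predictable requests.
commands = ["turn on the lights", "play music", "what time is it"]
best = snap_to_expected("play musik", commands)
```

This does not make the recognizer more accurate; it only maps near misses onto a known vocabulary, so an unrelated utterance should come back as `None` rather than be forced onto a command.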

SpeechRecognizer on Android device without Google Apps

时光怂恿深爱的人放手 submitted on 2019-11-26 16:45:09
Question: SpeechRecognizer works well on Android devices that have Google Apps (GApps). However, in China most Android devices ship with these Google Apps removed. What will happen when SpeechRecognizer is used? How could I test it without an actual device?
speechRecognizer = SpeechRecognizer.createSpeechRecognizer(getApplicationContext());
speechRecognizer.setRecognitionListener(new CustomListener());
Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
intent.putExtra(RecognizerIntent.EXTRA

SAPI and Windows 7 Problem

寵の児 submitted on 2019-11-26 16:44:18
I'm trying to recognize speech with Windows 7, but it always treats the speech as a command or just says "What was that?". How can I capture all speech? CODE:
SpeechRecognizer _speechRecognizer;
public Window1()
{
    InitializeComponent();
    // set up the recognizer
    _speechRecognizer = new SpeechRecognizer();
    _speechRecognizer.Enabled = false;
    _speechRecognizer.SpeechRecognized += new EventHandler<SpeechRecognizedEventArgs>(_speechRecognizer_SpeechRecognized);
}
Perhaps you want to use the .NET System.Speech namespace instead of SAPI? There is a very good article that was published a few years ago

Need text to speech and speech recognition tools for Linux

ⅰ亾dé卋堺 submitted on 2019-11-26 16:28:55
Question: I'm planning on writing a program for Linux that uses text-to-speech and speech recognition. What are the best tools/libraries for this? Should I use Windows instead to be able to use better tools? The tools need to be easily callable from a console or a C program.
Answer 1: For speech recognition there are the various Sphinxes. The different variants have different pros and cons; there is a comparison here: Comparison of Sphinx versions. Sphinx 4 is Java, but the others are C, I believe.
Answer 2: It

Acoustic training using SAPI 5.3 Speech API

[亡魂溺海] submitted on 2019-11-26 16:18:24
Question: Using Microsoft's SAPI 5.3 Speech API on Vista, how do you programmatically do acoustic model training of a RecoProfile? More concretely, if you have a text file and an audio file of a user speaking that text, what sequence of SAPI calls would you make to train the user's profile using that text and audio? Update: More information about this problem, which I still haven't solved: You call ISpRecognizer2.SetTrainingState( TRUE, TRUE ) at "the beginning" and ISpRecognizer2.SetTrainingState( FALSE,

Remove quotes from String in Python

*爱你&永不变心* submitted on 2019-11-26 15:59:43
Question: This question was migrated from Raspberry Pi Stack Exchange because it can be answered on Stack Overflow. Migrated 2 years ago. I have Python code that recognizes speech using the Google STT engine and gives me back the results, but the results come back as strings with "quotes". I don't want those quotes, because I will use the results to run many commands and that doesn't work with the quotes in place. I haven't tried anything so far, as I didn't have anything to try! This is the function in the Python code that will
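The asker's function is cut off above, so the following is not their code. It is a minimal sketch of two standard ways to drop double quotes from a Python string, using only built-in `str` methods. The helper names `strip_quotes` and `remove_all_quotes` and the sample text are hypothetical.

```python
def strip_quotes(text: str) -> str:
    """Remove surrounding whitespace and one layer of matching quotes."""
    text = text.strip()
    # Only strip when the first and last characters are the same quote mark.
    if len(text) >= 2 and text[0] == text[-1] and text[0] in "\"'":
        return text[1:-1]
    return text


def remove_all_quotes(text: str) -> str:
    """Delete every double-quote character, wherever it appears."""
    return text.replace('"', "")


# Hypothetical recognizer result that arrived wrapped in quotes.
result = '"turn on the light"'
command = strip_quotes(result)  # 'turn on the light'
```

`strip_quotes` leaves quotes inside the sentence alone, which is usually what you want for a command string; `remove_all_quotes` is the blunter option when no quote character should survive at all.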

Android App Integrated with OK Google

吃可爱长大的小学妹 submitted on 2019-11-26 15:43:07
Question: Is there a way to issue a voice command, something like: OK GOOGLE ASK XXX Some App Specific Question or Command, and have it launch "APP" with the recognized text "Some App Specific Question or Command"? My app has speech recognition as a service ... but when using my app I can't ask questions that OK Google can handle ...
Answer 1: Through the Voice Actions API, your app can register for system actions, one of which is 'search' (so you could do 'search for Some Question or command on APP'). In the