speech-recognition | 易学教程

C# Speech Recognition - Is this what the user said?

阅读更多关于 C# Speech Recognition - Is this what the user said?

I have need to write an application which uses a speech recognition engine -- either the built in vista one, or a third party one -- that can display a word or phrase, and recognise when the user reads it (or an approximation of it). I also need to be able to switch quickly between languages, without changing the language of the operating system. The users will be using the system for very short periods. The application needs to work without the requirement of first training the recognition engine to the users' voices. It would also be fantastic if this could work on Windows XP or lesser

Building openears compatible language model

阅读更多关于 Building openears compatible language model

问题 I am doing some development on speech to text and text to speech and I found the OpenEars API very useful. The principle of this cmu-slm based API is it uses a language model to map the speech listened by the iPhone device. So I decided to find a big English language model to feed the API speech recognizer engine. But I failed to understand the format of the voxfourge english data model to use with OpenEars. Do anyone have any idea that how can I get the .languagemodel and .dic file for

How to train on and make a serialized feature vector for a Neural Network?

阅读更多关于 How to train on and make a serialized feature vector for a Neural Network?

问题 By serialized i mean that the values for an input come in discrete intervals of time and that size of the vector is also not known before hand . Conventionally the neural networks employ fixed size parallel input neurons and fixed size parallel output neurons. A serialized implementation could be used in speech recognition where i can feed the network with a time series of the waveform and on the output end get the phonemes. It would be great if someone can point out some existing

Grammar in Google Web Speech API

阅读更多关于 Grammar in Google Web Speech API

问题 Can I improve Google speech API recognition by give him a words list (in my case the request of user is very predictable) to make recognition more accurate? 回答1: I can't speak for Chrome, but in Android they are quite clear that you cannot provide a grammar. In Android speech recognition you are limited to a choice of two models: "free form" and "web search". See Android: Speech Recognition Append Dictionary? 回答2: Correct answer is: no, you can't . =( 回答3: Updated answer for August 20, 2019

SpeechRecognizer on Android device without Google Apps

阅读更多关于 SpeechRecognizer on Android device without Google Apps

问题 SpeechRecognizer works well on Android that with Google Apps (GApps). However in China, most of the Android devices will remove these Google Apps. What will happen when SpeechRecognizer is used? How could I test it without an actual device? speechRecognizer = SpeechRecognizer.createSpeechRecognizer(getApplicationContext()); speechRecognizer.setRecognitionListener(new CustomListener()); Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); intent.putExtra(RecognizerIntent.EXTRA

SAPI and Windows 7 Problem

阅读更多关于 SAPI and Windows 7 Problem

I'm trying to recognize speech with Windows 7 but it always recognizes a speech as a command or just says "What was that?". How I can get all speeches? CODE: SpeechRecognizer _speechRecognizer; public Window1() { InitializeComponent(); // set up the recognizer _speechRecognizer = new SpeechRecognizer(); _speechRecognizer.Enabled = false; _speechRecognizer.SpeechRecognized += new EventHandler<SpeechRecognizedEventArgs>(_speechRecognizer_SpeechRecognized); } Perhaps you want to use the .net System.Speech namespace instead of SAPI? There is a very good article that was published a few years ago

Need text to speech and speech recognition tools for Linux

阅读更多关于 Need text to speech and speech recognition tools for Linux

问题 I'm planning on writing a program for Linux that uses text to speech and speech recognition. What are the best tools/libraries for this? Should I use Windows instead to be able to use better tools? The tools need to be easily callable from a console or C program. 回答1: For speech recognition there are the various Sphinxes. The different variants have different pros and cons, there is a comparison here Comparison of Sphinx versions. Sphinx 4 is Java, but the others are C, I believe. 回答2: It

Acoustic training using SAPI 5.3 Speech API

阅读更多关于 Acoustic training using SAPI 5.3 Speech API

问题 Using Microsoft's SAPI 5.3 Speech API on Vista, how do you programatically do acoustic model training of a RecoProfile? More concretely, if you have a text file, and an audio file of a user speaking that text, what sequence of SAPI calls would you make to train the user's profile using that text and audio? Update: More information about this problem I still haven't solved: You call ISpRecognizer2.SetTrainingState( TRUE, TRUE ) at "the beginning" and ISpRecognizer2.SetTrainingState( FALSE,

Remove quotes from String in Python

阅读更多关于 Remove quotes from String in Python

问题 This question was migrated from Raspberry Pi Stack Exchange because it can be answered on Stack Overflow. Migrated 2 years ago . I have a python Code that will recognize speech using the Google STT engine and give me back the results but I get the results in strings with "quotes". I don't want that quotes in my code as I will use it to run many commands and it doesn't work. I haven't tried anything so far as I didn't get anything to try! This is the function in the python code that will

Android App Integrated with OK Google

阅读更多关于 Android App Integrated with OK Google

问题 Is there a way to issue a voice command something like: OK GOOGLE ASK XXX Some App Specific Question or Command And have it launch "APP" with the recognized text: "Some App Specific Question or Command" My app has speech recognition as a service ... but when using my APP I can't ask questions that OK Google can handle ... 回答1: Through the Voice Actions API, your app can register for system actions, one of which is 'search' (so you could do 'search for Some Question or command on APP'). In the