speech-recognition | 易学教程

Voce c++ include setup

阅读更多关于 Voce c++ include setup

问题 I have seen many posts about that, but I just don't understand everything. Could someone write ALL include and library directories and linkers I need to add to run Voce with c++ I've tried: Include Dir: \voce-0.9.1\src\c++ \voce-0.9.1\lib \ProgramFiles\java\jdk1.8.0_101\include \ProgramFiles\java\jdk1.8.0_101\include\win32 \ProgramFiles\java\jdk1.8.0_101\bin \ProgramFiles\java\jdk1.8.0_101\lib Library Dir: \ProgramFiles\java\jdk1.8.0_101\lib \ProgramFiles\java\jdk1.8.0_101\bin Linker input:

Simple word detector using MFCC

阅读更多关于 Simple word detector using MFCC

问题 I am implementing a software for speech recognition using Mel Frequency Cepstrum Coefficients. In particular the system must recognize a single specified word. Since the audio file I get the MFCCs in a matrix with 12 rows(the MFCCs) and as many columns as the number of voice frames. I make the average of the rows, so I get a vector with only the 12 rows (the ith-row is the average of all ith-MFCCs of all frames). My question is how to train a classifier to detect the word? I have a training

Not detecting alphanumeric input well - CMU Sphinx Java

阅读更多关于 Not detecting alphanumeric input well - CMU Sphinx Java

问题 I'm trying Sphinx to recognize alphanumeric input with the dialog (DialogDemo) as a start. I have a .gram file with: #JSGF V1.0; grammar dialog; <digit> = oh | zero | one | two | three | four | five | six | seven | eight | nine | a | b | c | d | e | f | g | h | i | j | k | j | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z ; <number> = <digit>+ [point <digit>+]; <menu_command> = digits | [go to [the]] bank account | weather forecast | exit [[the] program] ; <bank_command> = [show |

Speech Recognition: AttributeError: module 'speech_recognition' has no attribute 'Recognizer'

阅读更多关于 Speech Recognition: AttributeError: module 'speech_recognition' has no attribute 'Recognizer'

问题 I'm trying to run a speech recognition using the Speech Recognition Project I installed SpeechRecognition as illustrated. My code ran correctly for a few times. I was trying to input different files. Now I started getting the following error: import speech_recognition as sr Traceback (most recent call last): File "<ipython-input-1-a4d5c9aae5d0>", line 1, in <module> import speech_recognition as sr File "/Users/Sashank/Documents/Deep_Learning_A_Z/Personal Projects/Speech recognition/speech

UnicodeDecodeError from sound file

阅读更多关于 UnicodeDecodeError from sound file

问题 I'm trying to make a speech recogniser in Python using Google speech API. I've been using and adapting the code from here (converted to Python3). I'm using an audio file on my computer that's been converted from mp3 to flac 16000 Hz (as specified in the original code) using an online converter. When running the code I get this error: $ python3 speech_api.py 02-29-2016_00-12_msg1.flac Traceback (most recent call last): File "speech_api.py", line 12, in <module> data = f.read() File "/usr/lib

Possible to Stream wav File to Windows.Media.SpeechRecognition?

阅读更多关于 Possible to Stream wav File to Windows.Media.SpeechRecognition?

问题 Is there an out-of-the-box solution to stream a wav file/memory to a UWP app? Windows.Media.SpeechRecognition looks like it only accepts microphone input. Is there a way to use System.Speech.Recognition in a UWP app? Is there an official way to use Windows.Media.SpeechRecognition in a Windows desktop app? 回答1: Please check Speech APIs, this is a set of REST API provided by Microsoft which can accept wav files. 回答2: You can use VB-Audio Virtual cable http://vb-audio.pagesperso-orange.fr/Cable

Microsoft Speech Platform: recognize word repetitions

阅读更多关于 Microsoft Speech Platform: recognize word repetitions

问题 I use Microsoft Speech Platform to recognize speech at output it on screen. But, i have problem: for example, i have grammar (constructs by GrammarBuilder and Choices - "red","green","black") When i say- "red green black"- i can get only "red", maybe "red green" , but not "red green black". Some code: Thread.CurrentThread.CurrentCulture = new CultureInfo("ru-RU"); Thread.CurrentThread.CurrentUICulture = new CultureInfo("ru-RU"); // Create a new SpeechRecognitionEngine instance. _sre = new

Microsoft Speech API with Windows Store App yet?

阅读更多关于 Microsoft Speech API with Windows Store App yet?

问题 Is the Microsoft Speech API usable now? I really need to use it with my project. I need to receive some short keyword voice command and do the task as users want. Also, I'd like to use the text-to-speech So far what I've found is that it can't be used with Windows Store Apps, but those answers were from like half year ago... If anybody knows the answer or has a suggestion on any Speech API which I can use with my project, I'd appreciate it. 回答1: When you say a "Windows Store App" I assume you

Android specific words speech recognition

阅读更多关于 Android specific words speech recognition

问题 I am trying to have the app recognize certain words said by the user using the code below, but for some reason it isn't working at all. Please review this and tell me what is wrong with it. Thank you The app is simply suppose to display a toast message if the words "orange" or "apple" is said but nothing happens when using the code below. //button onclick to trigger RecognizerIntent public void OnClick_Speed_Detector(View v) { Intent i = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); i

Creating a In proc speech recognition using sapi c#?

阅读更多关于 Creating a In proc speech recognition using sapi c#?

问题 My SAPI code for setting up the in proc recognition engine looks like this: ISpeechRecoContext cpRecoCtx; // create the recognition context cpRecoCtx = new SpeechLib.SpInProcRecoContext(); ((SpInProcRecoContext)cpRecoCtx).Recognition += new _ISpeechRecoContextEvents_RecognitionEventHandler(RecoContext_Recognition); /****** END: set up recognition context *****/ So how do I setup my audio input to default audio input in c#? I have found solutions in C++ but need one for C#. 回答1: The default