speech-recognition

Voce C++ include setup

故事扮演 posted on 2019-12-25 04:05:24
Question: I have seen many posts about this, but I don't understand all of it. Could someone list ALL the include directories, library directories, and linker inputs I need to add to run Voce with C++? I've tried:
Include Dir: \voce-0.9.1\src\c++ \voce-0.9.1\lib \ProgramFiles\java\jdk1.8.0_101\include \ProgramFiles\java\jdk1.8.0_101\include\win32 \ProgramFiles\java\jdk1.8.0_101\bin \ProgramFiles\java\jdk1.8.0_101\lib
Library Dir: \ProgramFiles\java\jdk1.8.0_101\lib \ProgramFiles\java\jdk1.8.0_101\bin
Linker input:

Simple word detector using MFCC

為{幸葍}努か posted on 2019-12-25 02:53:10
Question: I am implementing speech recognition software using Mel-frequency cepstral coefficients (MFCCs). In particular, the system must recognize a single specified word. From the audio file I get the MFCCs in a matrix with 12 rows (the MFCCs) and as many columns as there are voice frames. I average each row, so I get a vector with only 12 entries (the i-th entry is the average of the i-th MFCC over all frames). My question is: how do I train a classifier to detect the word? I have a training
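The averaging step described in the question above can be sketched as follows. This is only an illustration of that one step, not the asker's code: the function name `mean_mfcc_vector` and the toy matrix are hypothetical, and the matrix is assumed to be laid out as coefficients (rows) by frames (columns), as the question states.

```python
from typing import List, Sequence


def mean_mfcc_vector(mfcc: Sequence[Sequence[float]]) -> List[float]:
    """Average each coefficient row over all frames.

    `mfcc` is assumed to be a coefficients x frames matrix
    (12 rows in the question), given as a list of rows.
    """
    if not mfcc or any(len(row) == 0 for row in mfcc):
        raise ValueError("expected a non-empty coefficients x frames matrix")
    n_frames = len(mfcc[0])
    if any(len(row) != n_frames for row in mfcc):
        raise ValueError("all rows must have the same number of frames")
    # One mean per coefficient: the i-th entry averages the i-th MFCC.
    return [sum(row) / n_frames for row in mfcc]


# Toy stand-in: 3 coefficients x 4 frames (a real matrix would have 12 rows).
toy = [
    [1.0, 2.0, 3.0, 4.0],
    [0.0, 0.0, 2.0, 2.0],
    [-1.0, 1.0, -1.0, 1.0],
]
features = mean_mfcc_vector(toy)
print(features)
```

The result has a fixed length regardless of how many frames the recording has, which is what makes it usable as a classifier input. Note that averaging over frames discards the order in which the frames occurred.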

Not detecting alphanumeric input well - CMU Sphinx Java

淺唱寂寞╮ posted on 2019-12-25 02:34:27
Question: I'm trying to get Sphinx to recognize alphanumeric input, using the dialog demo (DialogDemo) as a starting point. I have a .gram file with:
#JSGF V1.0;
grammar dialog;
<digit> = oh | zero | one | two | three | four | five | six | seven | eight | nine | a | b | c | d | e | f | g | h | i | j | k | j | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z ;
<number> = <digit>+ [point <digit>+];
<menu_command> = digits | [go to [the]] bank account | weather forecast | exit [[the] program] ;
<bank_command> = [show |

Speech Recognition: AttributeError: module 'speech_recognition' has no attribute 'Recognizer'

谁说胖子不能爱 posted on 2019-12-24 19:57:39
Question: I'm trying to run speech recognition using the SpeechRecognition project. I installed SpeechRecognition as instructed. My code ran correctly a few times while I was trying different input files. Now I get the following error:
import speech_recognition as sr
Traceback (most recent call last):
File "<ipython-input-1-a4d5c9aae5d0>", line 1, in <module>
import speech_recognition as sr
File "/Users/Sashank/Documents/Deep_Learning_A_Z/Personal Projects/Speech recognition/speech
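The traceback above is cut off, so the actual cause is not visible here. One common cause of this kind of AttributeError is that `import speech_recognition` picks up a local file or folder with the same name instead of the installed package. The sketch below only checks which file Python would load for a module name; `module_origin` is a hypothetical helper name, and whether shadowing applies to this question is an assumption.

```python
import importlib.util
from typing import Optional


def module_origin(name: str) -> Optional[str]:
    """Return the file Python would load for a top-level module, or None if not found."""
    spec = importlib.util.find_spec(name)
    return None if spec is None else spec.origin


# If this prints a path inside your own project folder rather than
# site-packages, a local file is shadowing the installed package.
print(module_origin("speech_recognition"))
```

If the printed path points at a script in the working directory, renaming that script (and removing any stale compiled copy next to it) is the usual remedy.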

UnicodeDecodeError from sound file

瘦欲@ posted on 2019-12-24 16:34:09
Question: I'm trying to make a speech recognizer in Python using the Google Speech API. I've been using and adapting the code from here (converted to Python 3). I'm using an audio file on my computer that was converted from mp3 to 16000 Hz flac (as specified in the original code) using an online converter. When running the code I get this error:
$ python3 speech_api.py 02-29-2016_00-12_msg1.flac
Traceback (most recent call last):
File "speech_api.py", line 12, in <module>
data = f.read()
File "/usr/lib
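The question's code is not shown and the traceback is truncated, so the following is a sketch under an assumption: that the flac file was opened in text mode. In Python 3, text mode decodes the bytes as text, and `f.read()` on binary audio data can then raise UnicodeDecodeError. Opening with `"rb"` returns raw bytes instead. The helper name `read_audio_bytes` and the stand-in payload are hypothetical.

```python
import os
import tempfile


def read_audio_bytes(path: str) -> bytes:
    """Read a file as raw bytes; binary mode performs no text decoding."""
    with open(path, "rb") as f:
        return f.read()


# Stand-in payload: not a real flac file, just bytes that are invalid UTF-8.
payload = b"fLaC" + bytes([0x80, 0xFF, 0x00, 0xC3])
with tempfile.NamedTemporaryFile(suffix=".flac", delete=False) as tmp:
    tmp.write(payload)
    tmp_path = tmp.name

try:
    data = read_audio_bytes(tmp_path)  # succeeds in binary mode
    try:
        # Text mode tries to decode the bytes and fails on this payload.
        with open(tmp_path, "r", encoding="utf-8") as f:
            f.read()
        text_mode_failed = False
    except UnicodeDecodeError:
        text_mode_failed = True
finally:
    os.remove(tmp_path)
```

Code written for Python 2 often opens files with a plain `open(path)` because reading there returned bytes by default, so this difference tends to surface when such code is converted to Python 3.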

Possible to Stream wav File to Windows.Media.SpeechRecognition?

天大地大妈咪最大 posted on 2019-12-24 15:51:53
Question: Is there an out-of-the-box solution to stream a wav file/memory to a UWP app? Windows.Media.SpeechRecognition looks like it only accepts microphone input. Is there a way to use System.Speech.Recognition in a UWP app? Is there an official way to use Windows.Media.SpeechRecognition in a Windows desktop app?
Answer 1: Please check the Speech APIs, a set of REST APIs provided by Microsoft that can accept wav files.
Answer 2: You can use VB-Audio Virtual cable http://vb-audio.pagesperso-orange.fr/Cable

Microsoft Speech Platform: recognize word repetitions

99封情书 posted on 2019-12-24 14:34:52
Question: I use Microsoft Speech Platform to recognize speech and output it on screen. But I have a problem: for example, I have a grammar (constructed with GrammarBuilder and Choices: "red", "green", "black"). When I say "red green black", I get only "red", or maybe "red green", but not "red green black". Some code:
Thread.CurrentThread.CurrentCulture = new CultureInfo("ru-RU");
Thread.CurrentThread.CurrentUICulture = new CultureInfo("ru-RU");
// Create a new SpeechRecognitionEngine instance.
_sre = new

Microsoft Speech API with Windows Store App yet?

元气小坏坏 posted on 2019-12-24 14:04:09
Question: Is the Microsoft Speech API usable now? I really need to use it in my project. I need to receive short keyword voice commands and perform the task the user wants. I'd also like to use text-to-speech. So far I've found that it can't be used with Windows Store apps, but those answers were from about half a year ago... If anybody knows the answer or can suggest a Speech API I can use in my project, I'd appreciate it.
Answer 1: When you say a "Windows Store App" I assume you

Android specific words speech recognition

馋奶兔 posted on 2019-12-24 13:17:28
Question: I am trying to have the app recognize certain words said by the user using the code below, but for some reason it isn't working at all. Please review it and tell me what is wrong with it. Thank you. The app is simply supposed to display a toast message if the word "orange" or "apple" is said, but nothing happens with the code below.
//button onclick to trigger RecognizerIntent
public void OnClick_Speed_Detector(View v) {
Intent i = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
i

Creating an in-proc speech recognizer using SAPI in C#?

守給你的承諾、 posted on 2019-12-24 13:03:04
Question: My SAPI code for setting up the in-proc recognition engine looks like this:
ISpeechRecoContext cpRecoCtx;
// create the recognition context
cpRecoCtx = new SpeechLib.SpInProcRecoContext();
((SpInProcRecoContext)cpRecoCtx).Recognition += new _ISpeechRecoContextEvents_RecognitionEventHandler(RecoContext_Recognition);
/****** END: set up recognition context *****/
So how do I set my audio input to the default audio input in C#? I have found solutions in C++ but need one for C#.
Answer 1: The default