cmusphinx | 易学教程

How to give an input wav file to pocket sphinx

Question: Is there a command-line utility in PocketSphinx or CMU Sphinx to convert a .wav file to text? Running pocketsphinx_continuous -hmm -lm -dict works, but I don't want to keep speaking the same sentence again and again.

Answer 1: Starting from version 0.8, pocketsphinx_continuous has an -infile option that you can use to decode a file. The file must be in a specific format: a 16 kHz, 16-bit, mono WAV file.

pocketsphinx_continuous -infile file.wav

Source: https://stackoverflow.com/questions/20303092/how-to-give-an-input
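The format requirement in the answer (16 kHz, 16-bit, mono) can be checked before handing a file to the decoder. The following is a minimal sketch using Python's standard `wave` module; the function name `is_16k_16bit_mono` is hypothetical and not part of PocketSphinx.

```python
import wave


def is_16k_16bit_mono(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono.

    Hypothetical helper for illustration; it only inspects the WAV header
    and does not call PocketSphinx.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000   # 16 kHz sample rate
            and wav.getsampwidth() == 2   # 2 bytes per sample = 16-bit
            and wav.getnchannels() == 1   # mono
        )
```

If the check fails, the file likely needs to be resampled or downmixed before being passed to -infile.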

mp3 recognition using Sphinx 4

Question: Can we use MP3 files for voice recognition instead of WAV files? Or can we generate a WAV file from an MP3 and then run voice recognition without a serious impact on accuracy? The problem is that I need to minimize the load transferred over the network in my application. Will the information lost in the conversion be a major factor for accuracy?

Answer 1: "Can we use mp3 files for the voice recognition process without using wav files?" Not directly. To be able to recognize
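The question's second option, generating a WAV file from an MP3 before recognition, could be sketched as follows. This assumes the ffmpeg command-line tool is installed and on the PATH (the source does not name a conversion tool), and it assumes a 16 kHz, 16-bit, mono target, which may not match the acoustic model you use. The function names are hypothetical.

```python
import subprocess


def build_ffmpeg_command(mp3_path, wav_path, sample_rate=16000):
    """Build an ffmpeg argument list that converts an MP3 to mono 16-bit PCM WAV.

    Assumption: 16 kHz mono is the format the recognizer expects.
    """
    return [
        "ffmpeg", "-y",           # overwrite the output file if it exists
        "-i", mp3_path,           # input file
        "-ac", "1",               # downmix to one channel (mono)
        "-ar", str(sample_rate),  # resample to the target rate
        "-c:a", "pcm_s16le",      # 16-bit little-endian PCM
        wav_path,
    ]


def convert_mp3_to_wav(mp3_path, wav_path, sample_rate=16000):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_ffmpeg_command(mp3_path, wav_path, sample_rate), check=True)
```

Decoding to WAV does not restore whatever the MP3 encoder discarded, so the effect on accuracy still depends on the bitrate of the original MP3.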

Can't find pocketsphinx gstreamer plugin

Question: I tried to install pocketsphinx with brew install cmu-pocketsphinx and make install (following the instructions here: http://cmusphinx.sourceforge.net/wiki/gstreamer ). pocketsphinx_continuous works properly, but I can't find an executable pocketsphinx for gst-inspect pocketsphinx. Could anyone tell me where to find it, please? Many thanks. pocketsphinx install log: http://pastebin.com/48QU0qjg

Answer 1: You need to check whether the plugin pocketsphinx.so is installed in /usr/local/lib/gstreamer-<version>. If the plugin is there, you need to export the GST_PLUGIN_PATH environment variable to update the gstreamer search path. If
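The two checks described in the answer (is the plugin file present, and is GST_PLUGIN_PATH pointing at its directory) can be sketched in Python. This is an illustration under assumptions: the exact plugin file name varies by build, so the glob pattern `*pocketsphinx*.so` is a guess, and both function names are hypothetical.

```python
import glob
import os


def find_plugin_dirs(lib_root="/usr/local/lib", plugin_glob="*pocketsphinx*.so"):
    """Return gstreamer-<version> directories under lib_root that contain the plugin.

    Assumption: the plugin file name matches plugin_glob.
    """
    matches = glob.glob(os.path.join(lib_root, "gstreamer-*", plugin_glob))
    return sorted({os.path.dirname(m) for m in matches})


def env_with_plugin_path(plugin_dirs, base_env=None):
    """Return a copy of the environment with plugin_dirs prepended to GST_PLUGIN_PATH."""
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("GST_PLUGIN_PATH", "")
    parts = list(plugin_dirs) + ([existing] if existing else [])
    env["GST_PLUGIN_PATH"] = os.pathsep.join(parts)
    return env
```

The returned environment could then be passed to a subprocess that runs gst-inspect, or the same value exported in the shell.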

How do you retrieve tags from JSGF grammars using sphinx?

Question: I have written an application in Java that utilizes the CloudGarden API to parse JSAPI grammars, return tags from the grammar rules, and then process these tags accordingly. The issue with the CloudGarden API is its dependency on Windows. I need a cross-platform library that can perform the same task. I need neither speech synthesis nor speech recognition. Essentially, a string containing some transcribed speech is passed to the program. I need to parse this string against my grammar

CMUSphinx live speech recognition too slow?

Question: CMU Sphinx is too slow at recognizing live speech. Do you have any ideas for speeding it up? This is my configuration:

configuration.setAcousticModelPath("WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz");
configuration.setDictionaryPath("cmudict.0.6d");
configuration.setLanguageModelPath("en-us.lm.dmp");

Answer 1: We are currently working on a speedup, but for now sphinx4 is not real-time for large vocabularies. It's actually not a trivial task. If you want a fast and not very accurate transcription

Continuously recognize everything being said on Android?

Question: I'm working on a project that involves speech recognition on Android, and I have some questions without clear answers on this site (or any other, actually). I need to do something like speech-to-text, but the problem is that I need it to work continuously: imagine an app running in the background and writing everything it hears to a text file. I know I will need to correct a lot of misheard noise, but that will come later. I am using pocketsphinx-android and tried to follow this tutorial: http://cmusphinx.sourceforge.net/wiki/tutorialandroid The problem comes when I try to do continuous recognition,

How can I do voice recognition for Turkish on cmusphinx?

Question: I imported this project: https://github.com/cmusphinx/pocketsphinx-android-demo and it works fine in Android Studio. But as you can see in the tutorial, it does English voice recognition. How can I switch it to Turkish? I searched a lot but couldn't find anything. Thanks in advance.

Answer 1: You have to train the model yourself by following the tutorial at http://cmusphinx.sourceforge.net/wiki/tutorialam You have to collect transcribed speech data for that. Alternatively, you can contact people who already did that

Getting WAV file transcription to work with Sphinx4

Question: I've got Sphinx-4 installed on my Windows XP system and JSAPI set up. I'd like to transcribe an English spoken WAV (or MP3) file to text. When I run the "WavFile" demo, it runs successfully:

java -jar WavFile.jar

But when I pass my own WAV file like this:

java -jar WavFile.jar c:\test.wav

I get:

Loading Recognizer as defined in 'jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/config.xml'...
Decoding jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/12345.wav
Result: one two three four five

It seems