cmusphinx

How to give an input wav file to pocket sphinx

社会主义新天地 submitted on 2019-12-08 16:29:14
Question: Is there a command-line utility in PocketSphinx or CMU Sphinx that converts a .wav file to text? pocketsphinx_continuous -hmm -lm -dict will do, but I don't want to keep speaking the same sentence again and again.
Answer 1: Starting from version 0.8, pocketsphinx_continuous has an -infile option that you can use to decode a file. The file must be in a specific format: a 16 kHz, 16-bit, mono WAV file.
pocketsphinx_continuous -infile file.wav
Source: https://stackoverflow.com/questions/20303092/how-to-give-an-input
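Since the answer requires a 16 kHz, 16-bit, mono WAV file, it can help to check a file before passing it to -infile. A minimal sketch using Python's standard wave module; the function name is hypothetical and not part of PocketSphinx:

```python
import wave


def is_decodable_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono.

    Hypothetical helper: it only inspects the WAV header and does not
    call PocketSphinx itself.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000   # 16 kHz sample rate
            and wav.getsampwidth() == 2   # 2 bytes per sample = 16-bit
            and wav.getnchannels() == 1   # mono
        )
```

A file that fails this check would need to be resampled or downmixed before decoding.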

mp3 recognition using Sphinx 4

前提是你 submitted on 2019-12-08 12:13:23
Question: Can we use mp3 files for the voice recognition process without using wav files? Or can we generate a wav file from an mp3 and then do the voice recognition without a serious impact on accuracy? The problem is that I need to minimize the load transferred over the network in my application. Will the information lost in the conversion be a huge factor for accuracy?
Answer 1: Can we use mp3 files for the voice recognition process without using wav files? Not directly. To be able to recognize
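One way to generate a wav file from an mp3, as the question proposes, is to call the ffmpeg command-line tool. The sketch below is an assumption on two counts, neither of which this excerpt states: that ffmpeg is installed, and that the recognizer expects 16 kHz, 16-bit, mono input. The function names are hypothetical.

```python
import subprocess


def build_ffmpeg_command(mp3_path, wav_path):
    """Build an ffmpeg argument list that decodes an mp3 to 16 kHz mono 16-bit WAV."""
    return [
        "ffmpeg",
        "-i", mp3_path,       # input file
        "-ar", "16000",       # resample to 16 kHz (assumed target rate)
        "-ac", "1",           # downmix to one channel
        "-c:a", "pcm_s16le",  # signed 16-bit little-endian PCM
        wav_path,
    ]


def convert_mp3_to_wav(mp3_path, wav_path):
    """Run the conversion; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(build_ffmpeg_command(mp3_path, wav_path), check=True)
```

Running the conversion on the server side keeps the network transfer at mp3 size, which is the constraint the question describes.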

Can't find pocketsphinx gstreamer plugin

試著忘記壹切 submitted on 2019-12-08 06:42:29
I tried to install pocketsphinx with brew install cmu-pocketsphinx and make install (following the instructions here: http://cmusphinx.sourceforge.net/wiki/gstreamer ). pocketsphinx_continuous works properly, but I can't find a pocketsphinx plugin with gst-inspect pocketsphinx. Could anyone tell me where to find it, please? Many thanks. pocketsphinx install log: http://pastebin.com/48QU0qjg
You need to check whether the plugin pocketsphinx.so is installed in /usr/local/lib/gstreamer-<version>. If the plugin is there, you need to export the GST_PLUGIN_PATH environment variable to update the gstreamer search path. If
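The check described in the answer can be sketched in Python: look for a pocketsphinx plugin file under /usr/local/lib/gstreamer-<version> and build a value for GST_PLUGIN_PATH. The helper names and the *pocketsphinx* file pattern are assumptions for illustration; the exact plugin file name may differ by platform and version.

```python
import glob
import os


def find_plugin_dirs(lib_root="/usr/local/lib"):
    """Return gstreamer-<version> directories that contain a pocketsphinx plugin file."""
    # Assumed pattern: any file with "pocketsphinx" in its name.
    pattern = os.path.join(lib_root, "gstreamer-*", "*pocketsphinx*")
    return sorted({os.path.dirname(path) for path in glob.glob(pattern)})


def plugin_path_value(dirs, existing=""):
    """Join plugin directories (and any existing value) into a search-path string."""
    parts = list(dirs)
    if existing:
        parts.append(existing)
    return os.pathsep.join(parts)


if __name__ == "__main__":
    dirs = find_plugin_dirs()
    if dirs:
        value = plugin_path_value(dirs, os.environ.get("GST_PLUGIN_PATH", ""))
        print("export GST_PLUGIN_PATH=" + value)
    else:
        print("No pocketsphinx plugin found under /usr/local/lib/gstreamer-*")
```

If a directory is found, the printed export line can be pasted into the shell before running gst-inspect again.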

Continuously recognize everything being said on Android?

老子叫甜甜 submitted on 2019-12-08 04:15:52
Question: I'm working on a project that involves speech recognition on Android, and I have some questions without clear answers on this site (or any other, actually). I need to do something like speech to text. The problem is that I need it to work continuously: imagine an app running in the background and writing everything it hears to a text file. I know I will need to correct a lot of misheard noise, but that will come later. I am using pocketsphinx-android and tried to follow this tutorial: http://cmusphinx

Can't find pocketsphinx gstreamer plugin

情到浓时终转凉″ submitted on 2019-12-08 03:36:42
Question: I tried to install pocketsphinx with brew install cmu-pocketsphinx and make install (following the instructions here: http://cmusphinx.sourceforge.net/wiki/gstreamer). pocketsphinx_continuous works properly, but I can't find a pocketsphinx plugin with gst-inspect pocketsphinx. Could anyone tell me where to find it, please? Many thanks. pocketsphinx install log: http://pastebin.com/48QU0qjg
Answer 1: You need to check whether the plugin pocketsphinx.so is installed in /usr/local/lib/gstreamer-<version>. If

How do you retrieve tags from JSGF grammars using sphinx?

半世苍凉 submitted on 2019-12-08 02:36:27
Question: I have written an application in Java that utilizes the CloudGarden API to parse JSAPI grammars, return tags from the grammar rules, and then process these tags accordingly. The issue with the CloudGarden API is its dependency on Windows. I need a cross-platform library that can perform the same task. I need neither speech synthesis nor speech recognition. Essentially, a string containing some transcribed speech is passed to the program. I need to parse this string against my grammar

CMUSphinx live speech recognition too slow?

雨燕双飞 submitted on 2019-12-07 15:48:07
Question: CMU Sphinx is too slow at recognizing live speech. Do you have any ideas for speeding it up? This is my configuration:
configuration.setAcousticModelPath("WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz");
configuration.setDictionaryPath("cmudict.0.6d");
configuration.setLanguageModelPath("en-us.lm.dmp");
Answer 1: We are currently working on a speedup, but for now sphinx4 is not real-time for large vocabularies. It's actually not a trivial task. If you want a fast and not very accurate transcription

Continuously recognize everything being said on Android?

孤者浪人 submitted on 2019-12-06 17:45:28
I'm working on a project that involves speech recognition on Android, and I have some questions without clear answers on this site (or any other, actually). I need to do something like speech to text. The problem is that I need it to work continuously: imagine an app running in the background and writing everything it hears to a text file. I know I will need to correct a lot of misheard noise, but that will come later. I am using pocketsphinx-android and tried to follow this tutorial: http://cmusphinx.sourceforge.net/wiki/tutorialandroid The problem comes when I try to do continuous recognition,

How can I do voice recognition for Turkish on cmusphinx?

只愿长相守 submitted on 2019-12-06 11:58:42
Question: I imported this project: https://github.com/cmusphinx/pocketsphinx-android-demo and it works fine in Android Studio. But as the tutorial shows, it does English voice recognition. How can I switch it to Turkish? I searched a lot but can't find how. Thanks in advance.
Answer 1: You have to train the model yourself, following the tutorial at http://cmusphinx.sourceforge.net/wiki/tutorialam. You have to collect transcribed speech data for that. Alternatively, you can contact people who already did that

Getting WAV file transcription to work with Sphinx4

人走茶凉 submitted on 2019-12-06 11:26:23
I've got Sphinx-4 installed on my Windows XP system and JSAPI set up. I'd like to transcribe a WAV (or MP3) file of spoken English to text. When I run the "WavFile" demo, it runs successfully:
java -jar WavFile.jar
But when I pass my own wav file like this:
java -jar WavFile.jar c:\test.wav
I get:
Loading Recognizer as defined in 'jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/config.xml'...
Decoding jar:file:/C:/sphinx4-1.0beta3-bin/sphinx4-1.0beta3/bin/WavFile.jar!/edu/cmu/sphinx/demo/wavfile/12345.wav
Result: one two three four five
It seems