cmusphinx | 易学教程

Setting up Pocketsphinx for Python in Windows

阅读更多关于 Setting up Pocketsphinx for Python in Windows

问题 Can somebody help me in building pocketsphinx speech recognition in windows. I'm facing difficulty in understanding the instruction provided by sphinx in ths page. I want to develop a speech controlled computer automation application, and I'm using Python. I don't know how to install sphinx so that I can import it in the Python environment like this: import pocketsphinx as ps But can't make out anything from it. I was able to do it easily in Ubuntu using the sudo apt-get install sphinx* But,

Input for Pocketsphinx on Android

阅读更多关于 Input for Pocketsphinx on Android

I make a demo for speech recognize to text. I have just built the demo Building Pocketsphinx On Android and it work well. But my problem is how to make input from an audio file, not from real time speaking. Any idea to solve it? Thanks. You can use Pocketsphinx API to process any binary data, including binary data read from file. You only need to make sure that data is in the required format. Once you read the binary data into the buffer of type short[] you can process it using pocketsphinx API calls: import edu.cmu.pocketsphinx.pocketsphinx; Pocketsphinx ps = new Decoder(....) ps.processRaw

Pocket Sphinx on Android

阅读更多关于 Pocket Sphinx on Android

问题 What are the steps required to use PocketSphinx on Android? I have found various hints around the web that it is possible, but not real answers and hence I ask my question here. 回答1: This is the most comprehensive resource I've found to date. I still have some compilation errors, though. The blog gives you a tutorial on how to build the PocketSphinxAndroidDemo app, which was developed (and discontinued) on GitHub. Personally, I'm having similar compilation errors to the Stack Overflow

Open-source tool for gender-recognition using voice [closed]

阅读更多关于 Open-source tool for gender-recognition using voice [closed]

Closed. This question is off-topic. It is not currently accepting answers. Learn more . Want to improve this question? Update the question so it's on-topic for Stack Overflow. Good evening, I'm working on a project with Kinect and I need to perform a gender-recognition using the voice information caught by the microphone of the sensor. The recognition could also be text-based , i.e. the sentence said by the user could always be the same. I "just" need the binary answer male/female. I've checked the quite popular open-source Sphinx but I can't understand if it could be used for my needs. Any

Open Source Software For Transcribing Speech in Audio Files

阅读更多关于 Open Source Software For Transcribing Speech in Audio Files

问题 Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius, but I've never been able to get either to work, and the documentation with each on transcribing files is sketchy at best. I'm developing on 64-bit Ubuntu 10.04, whose repos include sphinx2 and julius, as well as voxforge's julius acoustic modal for English. I'm focussing on transcribing files, instead of directly processing sound from a

How to use CMU Sphinx 4 for speech to text with english voxforge models

阅读更多关于 How to use CMU Sphinx 4 for speech to text with english voxforge models

问题 I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it does not help me. What I want is an executable where I can specify which model to use and which audio file to use as source and have the executable print out it's best guess about what the voice on the recording says. I hade some luck with:

Creating ARPA language model file with 50,000 words

阅读更多关于 Creating ARPA language model file with 50,000 words

问题 I want to create an ARPA language model file with nearly 50,000 words. I can't generate the language model by passing my text file to the CMU Language Tool. Is any other link available where I can get a language model for these many words? 回答1: I thought I'd answer this one since it has a few votes, although based on Christina's other questions I don't think this will be a usable answer for her since a 50,000-word language model almost certainly won't have an acceptable word error rate or

Pocket Sphinx on Android

阅读更多关于 Pocket Sphinx on Android

What are the steps required to use PocketSphinx on Android? I have found various hints around the web that it is possible, but not real answers and hence I ask my question here. Nick Ruiz This is the most comprehensive resource I've found to date. I still have some compilation errors, though. The blog gives you a tutorial on how to build the PocketSphinxAndroidDemo app, which was developed (and discontinued) on GitHub . Personally, I'm having similar compilation errors to the Stack Overflow question here , but my rep is too low to ask for clarification on the answer. The latest and most

Open Source Software For Transcribing Speech in Audio Files

阅读更多关于 Open Source Software For Transcribing Speech in Audio Files

Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius , but I've never been able to get either to work, and the documentation with each on transcribing files is sketchy at best. I'm developing on 64-bit Ubuntu 10.04, whose repos include sphinx2 and julius, as well as voxforge's julius acoustic modal for English. I'm focussing on transcribing files, instead of directly processing sound from a mic, because I've given up on expecting projects like these to work with Ubuntu's sound system. This

How to use CMU Sphinx 4 for speech to text with english voxforge models

阅读更多关于 How to use CMU Sphinx 4 for speech to text with english voxforge models

I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it does not help me. What I want is an executable where I can specify which model to use and which audio file to use as source and have the executable print out it's best guess about what the voice on the recording says. I hade some luck with: pocketsphinx_continuous -infile recording.wav 2> /dev/null But it aborts before the complete audio file is