speech-recognition | 易学教程

Speech recognition response is poor in sphinx4

阅读更多关于 Speech recognition response is poor in sphinx4

问题 Currently we are investigating into using sphinx4 for speech recognition. We are trying to achieve a good response for a dictation type application. The input is a wav file and we wish to transcribe it. I have looked into the LatticeDemo and Transcriber demo provided by Sphinx4. When i utilize the same configuration , the response is pretty poor. I have tried to tweak in the configuration files but it simply does not recognize the words. the transcriber demo provided is for digits, i have

Voice to text on Android in offline mode

阅读更多关于 Voice to text on Android in offline mode

问题 Is there anyway in which I can use the Voice to Text feature of Android in offline mode. In the given example VoiceRecognition.java, it starts and activity with the intent RecognizerIntent.ACTION_RECOGNIZE_SPEECH. Does it mean that any other apk needs to be installed before hand for this to work or do I need to write my own application to launch on this intent. I have been searching for this for a long time but is getting confused... Here is the code I used.. private static final int VOICE

Python Voice Recognition Library - Always Listen?

阅读更多关于 Python Voice Recognition Library - Always Listen?

问题 I've recently been working on using a speech recognition library in python in order to launch applications. I Intend to ultimately use the library for voice activated home automation using the Raspberry Pi GPIO. I have this working, it detects my voice and launches application. The problem is that it seems to hang on the one word I say (for example, I say internet and it launches chrome an infinite number of times) This is unusual behavior from what I have seen of while loops. I cant figure

Split speech audio file on words in python

阅读更多关于 Split speech audio file on words in python

问题 I feel like this is a fairly common problem but I haven't yet found a suitable answer. I have many audio files of human speech that I would like to break on words, which can be done heuristically by looking at pauses in the waveform, but can anyone point me to a function/library in python that does this automatically? 回答1: An easier way to do this is using pydub module. recent addition of silent utilities does all the heavy lifting such as setting up silence threahold , setting up silence

Audio analysis to detect human voice, gender, age and emotion — any prior open-source work done?

阅读更多关于 Audio analysis to detect human voice, gender, age and emotion — any prior open-source work done?

问题 Is there prior open-source work done in the field of 'Audio analysis' to detect human-voice (say in spite of some background noise), determine speaker's gender, possibly determine no. of speakers, age of speaker(s), and the emotion of speakers? My hunch is that the speech recognition software like CMU Sphinx could be a good place to start, but if there's something better, it'd be great. 回答1: I'm a graduate student doing speech recognition research. These are open research problems, and,

Android Speech Recognition not working

阅读更多关于 Android Speech Recognition not working

问题 I'm using this example from newboston and it prompt me for recording but after it recognized what I said, it won't update the list view. Here is the code. public class MainActivity extends Activity { private static final int RECOGNIZER_RESULT = 1234; ListView list; @Override public void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); setContentView(R.layout.activity_main); list = (ListView) findViewById(R.id.list); Button btn_speach = (Button)findViewById(R.id.btn

Custom grammar for Android speech recognition

阅读更多关于 Custom grammar for Android speech recognition

问题 I am trying to apply the custom grammar for speech recognition. I have found a lot of documentation about android.speech.recognition package, it seams as the good place to start, but it seams that it doesn't exist in the Android SDK.. Is there a way to apply a custom grammar rules to the android speech recognition at all..? 回答1: No, Android's recognizer intent only supports two language models. These are the "Free Form" model and the "web search" model. It does not support custom grammars.

How do you enable a microphone input in the android emulator

阅读更多关于 How do you enable a microphone input in the android emulator

问题 I have been on a rough ride trying to do something using the Speech recognition on an android emulator. Having finally installed the market place and the Google Voice Search app - I am so close to enabling my emulator to do what I want - recognize my speech. First I need to enable the emulator to record audio , or at least think that a microphone is present. I believe adb used to have the -mic option - however I dont think it exists anymore. Has anyone done this or can anyone shed some light

How do you enable a microphone input in the android emulator

阅读更多关于 How do you enable a microphone input in the android emulator

Speech to text conversion for non-english language

阅读更多关于 Speech to text conversion for non-english language

问题 I am trying to implement naive speech to text conversion for non-english language. where user speaks in other language and text is also in the same language .Is that possible ? If yes can anyone help with the idea of how to implement it ? Any Help would be greatly appreciated.Thanks in advance. 回答1: It is unlikely any commercial speech recognition solution will support Sanskrit, so the only choice you have is to add support for Sanskrit into open source engine like CMUSphinx. This is pretty