speech-recognition | 易学教程

FLAC conversion utility not available - consider installing the FLAC command line application

Question: I am making a simple speech_recognition program in Python 3.6.3. Here is the Python code:

    import speech_recognition as sr
    import os

    r = sr.Recognizer()
    r.energy_threshold = 10000
    with sr.Microphone() as source:
        print('Say something')
        audio = r.listen(source)
        print("Done")
    try:
        text = r.recognize_google(audio)
        print(text)
        os.system("say '"+'I think you said,'+text+'!'+"'")
    except Exception as e:
        print(e)

When I run it I get this error: OSError: FLAC conversion utility not available - consider
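
A quick way to narrow down this error is to check whether a `flac` executable is visible on the `PATH` of the Python process. The sketch below uses only the standard library. The helper name `find_flac` is hypothetical, and it assumes the library relies on a system-installed `flac` binary, which is what the error message suggests.

```python
import shutil


def find_flac(executable="flac"):
    """Return the full path of the FLAC command line tool, or None if it is not on PATH."""
    return shutil.which(executable)


if __name__ == "__main__":
    path = find_flac()
    if path is None:
        print("flac not found on PATH; install the FLAC command line application")
    else:
        print("flac found at", path)
```

If this reports that `flac` is missing, installing the FLAC command line application through the system package manager and rerunning the script is the usual next step.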

Android Continuous SpeechRecognition: Preventing End

Question: How do you record and transcribe continuous speech on Android? I want to record and transcribe speech on Android for a mobile (Cordova) app. How can the Android speech recognition API be forced NOT to stop recording when it thinks speech is done? I have tried setting all of the parameters. The parameters that would seem to guarantee continuous recognition (see below) do not seem to work, even when tried in many values and combinations. No matter what values we have tried to

Wav file from microphone on Python

Question: My project is to do speaker recognition using a microphone. I'm using the speech_recognition library to capture my microphone audio, but the resulting object's class is 'speech_recognition.AudioData' and I don't know how to convert it into a WAV file (in order to use librosa, for example, to get audio features and maybe recognize the speaker). Could someone please help me with that? I have not been learning Python for long, so there might also be easier ways to do speaker recognition using a mic :)
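
A minimal sketch of one way to get a WAV file out of the recorded audio, using the standard `wave` module. It assumes the audio is available as raw mono PCM bytes together with its sample rate and sample width; in `speech_recognition`, `AudioData` exposes these through `get_raw_data()`, `sample_rate`, and `sample_width`, and it also offers `get_wav_data()`, which returns complete WAV file contents as bytes. The function name `save_pcm_as_wav` and the file names are hypothetical.

```python
import wave


def save_pcm_as_wav(path, raw_pcm, sample_rate, sample_width, channels=1):
    """Write raw PCM bytes to a WAV file at the given path."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)  # bytes per sample, e.g. 2 for 16-bit
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(raw_pcm)


if __name__ == "__main__":
    # One second of 16-bit silence at 16 kHz stands in for microphone audio.
    save_pcm_as_wav("example.wav", b"\x00\x00" * 16000, 16000, 2)
```

With an `AudioData` object named `audio`, the call would look like `save_pcm_as_wav("speech.wav", audio.get_raw_data(), audio.sample_rate, audio.sample_width)`, after which the file can be opened by any tool that reads WAV.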

How to get speech recognition to detect SAPI emphasis markers?

Question: It is possible to extract the default phonemes for a given word via SAPI as follows:

1. Voice the word with text-to-speech and store the output in a .wav file.
2. Use the .wav file as input for speech recognition.
3. Upon recognition of the word, extract the phonemes from the recognized phrase elements.

However, I have not been able to capture the emphasis markers, if they are available ("1" and "2" per the American English Phoneme Table). Is there a way to do this? EDIT: Here is what I've attempted so far (not pretty, but functional). Sadly

Read Nist Wav File in TIMIT database into python numpy array

Question: Is this possible? I seem to be getting this error when using wavread from scikits.audiolab:

    x86_64.egg/scikits/audiolab/pysndfile/matapi.pyc in basic_reader(filename, last, first)
         93     if not hdl.format.file_format == filetype:
         94         raise ValueError, "%s is not a %s file (is %s)" \
    ---> 95             % (filename, filetype, hdl.format.file_format)
         96
         97     fs = hdl.samplerate

    ValueError: si762.wav is not a wav file (is nist)

I'm guessing it can't read NIST wav files, but is there another way to easily read them
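
As a rough illustration of what "NIST" means here: a NIST SPHERE file starts with a plain-text header (a `NIST_1A` line, a line giving the header size in bytes, then `name type value` fields up to `end_head`), followed by the sample data. The sketch below parses that header with the standard library only. It assumes uncompressed 16-bit PCM, which is an assumption about the file and not something the question confirms, and the function name `read_nist_sphere` is hypothetical.

```python
import struct


def read_nist_sphere(path):
    """Read a NIST SPHERE file holding uncompressed 16-bit PCM (assumed).

    Returns (header_fields, samples), where samples is a list of ints.
    """
    with open(path, "rb") as f:
        if f.readline().strip() != b"NIST_1A":
            raise ValueError("not a NIST SPHERE file")
        header_size = int(f.readline().decode("ascii").strip())
        f.seek(0)
        header_text = f.read(header_size).decode("ascii", errors="replace")
        data = f.read()

    fields = {}
    for line in header_text.splitlines()[2:]:
        line = line.strip()
        if line == "end_head":
            break
        parts = line.split(None, 2)  # name, type flag, value
        if len(parts) == 3:
            name, type_flag, value = parts
            fields[name] = int(value) if type_flag == "-i" else value

    # "10" marks big-endian samples; anything else is treated as little-endian.
    byte_order = ">" if fields.get("sample_byte_format") == "10" else "<"
    count = len(data) // 2
    samples = list(struct.unpack(f"{byte_order}{count}h", data[: count * 2]))
    return fields, samples
```

If NumPy is available, the returned list can be turned into an array with `numpy.array(samples, dtype=numpy.int16)`, and `fields.get("sample_rate")` gives the sampling rate when the header declares it.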

Is it possible to write a speech-recognition engine from scratch?

Question: Using some existing math libraries, though. The point is that there is literally no research in this field for my language (Georgian). Is that possible? How long would it take? I know that this also depends on skill, but still?

Answer 1: Also answered at "Speech to text conversion for non-english language". "Is it possible to write a speech-recognition engine from scratch?" You do not need to write an engine from scratch; there are many engines already available, and you can just pick one like CMUSphinx

Does Microsoft SAPI support speech recognition on offline mode just like system.speech api?

Question: I have read the official documentation for Microsoft SAPI, but I couldn't find out whether the API can be used offline. The documentation says that Microsoft SAPI is a server-based speech recognition API, so it seems that offline use is not supported, but I have to make sure. Can I use Microsoft SAPI offline, just like System.Speech?

Answer 1: That link does not say what you think it says. Both Microsoft.Speech.Recognition (server engine) and System.Speech.Recognition (desktop engine) run entirely

Microsoft Sam, SAPI alternatives

Question: We have an application for which we were planning to use the Microsoft Speech API. We tested it on Windows XP with the Microsoft Sam voice, and frankly it sounds terrible. It's almost impossible to hear what the voice is trying to say. Are there other, better voices? Are there any updates or newer versions out there that are better? Are there other products, open source projects, etc. that can work as an alternative? Just to clarify: it needs to have some sort of API so I can actually program against

Activity has leaked ServiceConnection android.speech.SpeechRecognizer$Connection

Question: I'm trying to make a function in Google Glass that allows me to navigate between the cards without having to say the hotword "ok glass". I tried creating a SpeechRecognizer that constantly listens for speech and, if the correct "command" is spoken, makes the app act accordingly. However, the onError method tells me "Error occured: RecognitionService busy." and it's throwing an error that says Activity com.example.sw_stage.topfinder.MainActivity has leaked

Pocketsphinx recognizes random phrases in a silence

Question: I have pocketsphinx installed on a Raspberry Pi with a microphone connected to it. When I run pocketsphinx_continuous using the command

    pocketsphinx_continuous -inmic yes -dict dict.dict -hmm /home/pi/zero_ru.cd_cont_4000 -jsgf mygrammar.gram

it starts to recognize random phrases (in most cases the same phrase) when I am not speaking. And when I do speak, the result is the same. I use an acoustic model for the Russian language. Please help.

Answer 1: You need to use keyword spotting mode. Pocketsphinx