speech-recognition

FLAC conversion utility not available - consider installing the FLAC command line application

为君一笑 提交于 2019-12-11 16:40:04
问题 I am making a simple speech_recognition program in python3.6.3 here is the python code: import speech_recognition as sr import os r = sr.Recognizer() r.energy_threshold = 10000 with sr.Microphone() as source: print('Say something') audio = r.listen(source) print("Done") try: text = r.recognize_google(audio) print(text) os.system("say '"+'I think you said,'+text+'!'+"'") except Exception as e: print(e) When I run it I get this error: OSError:FLAC conversion utility not available - consider

Android Continuous SpeechRecognition: Preventing End

╄→尐↘猪︶ㄣ 提交于 2019-12-11 15:29:57
问题 How do you record and transcribe continuous speech on Android? Want to record and transcribe speech in Android for a mobile (cordova) app. How is it possible to force the Android API for speech recognition NOT to stop recording when it thinks speech is done? Have tried setting all of the parameters. The parameters that would seem to guarantee continuous recognition (see below) do not seem to work, even when tried in a whole bunch of values/combinations. No matter what values we have tried to

Wav file from microphone on Python

时间秒杀一切 提交于 2019-12-11 15:26:52
问题 My project is to do speaker recognition using a microphone. I'm using the speeh_recognition library to extract my microphone audio, but the object's class is 'speech_recognition.AudioData' and i don't know how to convert it into a wav file (in order to use librosa for example to then get audio features and maybe recognize the speaker). Could someone please help me on that ? I have been learning Python for not so long so there might also be easier ways to do speaker recognition using a mic :)

How to get speech recognition to detect SAPI emphasis markers?

天大地大妈咪最大 提交于 2019-12-11 14:59:52
问题 It is possible to extract the default phonemes for a given word via SAPI by: Voice word with text-to-speech and store output in a .wav Use the .wav as input for speech recognition Upon recognition of the word extract the phonemes from the recognized phrase elements However I have not been able to capture (if available) emphasis markers ("1" and "2" per the American English Phoneme Table). Is there a way to do this? EDIT: Here is what I've attempted so far (not pretty, but functional). Sadly

Read Nist Wav File in TIMIT database into python numpy array

流过昼夜 提交于 2019-12-11 13:51:16
问题 Is this possible?? I seem to be getting this error when using wavread from scikits.audiolab: x86_64.egg/scikits/audiolab/pysndfile/matapi.pyc in basic_reader(filename, last, first) 93 if not hdl.format.file_format == filetype: 94 raise ValueError, "%s is not a %s file (is %s)" \ ---> 95 % (filename, filetype, hdl.format.file_format) 96 97 fs = hdl.samplerate ValueError: si762.wav is not a wav file (is nist) I'm guessing it can't read NIST wav files but is there another way to easily read them

Is it possible to write a speech-recognition engine from scratch?

牧云@^-^@ 提交于 2019-12-11 12:56:06
问题 Using some existing math libraries, though. The point is, that there is literally no research in this field in my language (Georgian). Is that possible? How long would I take? I know that this also depends on the skills, but still? 回答1: Also answered at Speech to text conversion for non-english language Is it possible to write a speech-recognition engine from scratch? You do not need to write engine from scratch, there are many engines already available, you can just pick one like CMUSphinx

Does Microsoft SAPI support speech recognition on offline mode just like system.speech api?

旧城冷巷雨未停 提交于 2019-12-11 12:02:37
问题 I have read official documentation of Microsoft SAPI but I couldn't find about whether the api can be used on offline mode or not. in there, they said that Microsoft SAPI is server based speech recognition api. So It seems like it doesn't support but I have to make sure. Can I use Microsoft SAPI on offline just like system.speech ? 回答1: That link does not say what you think it says. Both Microsoft.Speech.Recognition (server engine) and System.Speech.Recognition (desktop engine) run entirely

Microsoft Sam, SAPI alternatives

╄→гoц情女王★ 提交于 2019-12-11 11:18:45
问题 We have a application that we were planing to use Microsoft speech API for. Now we tested it on Windows XP using Microsoft Sam voice and frankly it sound terrible ... It's almost impossible to hear what the voice is trying to say. Are there other, better voice. Are there any updates or newer versions out there that are better. Are there other product, open source projects etc that can work as an alternative? Just to clarify - It needs to have some sort of API so I actually can program against

Activity has leaked ServiceConnection android.speech.SpeechRecognizer$Connection

試著忘記壹切 提交于 2019-12-11 11:15:54
问题 I'm trying to make a function in google glass that allows me to navigate between the cards without having to say the hotword "ok glass". I tried creating a SpeechRecognizer that will constantly listen if something is being said or not and if the correct "command" is being mentioned the app will act accordingly. However the onError method tells me Error occured: RecognitionService busy. and it's throwing a error that says Activity com.example.sw_stage.topfinder.MainActivity has leaked

Pocketsphinx recognizes random phrases in a silence

百般思念 提交于 2019-12-11 10:48:05
问题 I have a pocketsphinx installed on Raspberry Pi and a microphone connected to it. When i run pocketsphinx_continuous using command pocketsphinx_continuous -inmic yes -dict dict.dict -hmm /home/pi/zero_ru.cd_cont_4000 -jsgf mygrammar.gram it starts to recognize random phrases (but in most cases the same phrase) when I am not speaking. And when I do, result is the same. I use acoustic model for russian language. Please, need your help. 回答1: You need to use keyword spotting mode. Pocketsphinx