voice-recognition

Python Pocketsphinx: Keyword isn't recognised when using Decoder class

半世苍凉 提交于 2020-07-10 03:11:10
问题 I'm trying to detect a keyword from a .wav file using Pocketsphinx, specifically with the decoder class. When I give it this .wav file and print what it detects it isnt even close. Here is the code: import pocketsphinx as ps import requests import json import sys, os import subprocess model_path = ps.get_model_path() data_path = ps.get_data_path() print("start") print(os.getcwd()) subprocess.call("sox -V4 /home/miro/client_audio.wav -r 16000 -c 1 client_audio.wav", shell=True) config = ps

python webrtc voice activity detection is wrong

让人想犯罪 __ 提交于 2020-06-27 11:17:36
问题 I need to do voice activity detection as a step to classify audio files. Basically, I need to know with certainty if a given audio has spoken language. I am using py-webrtcvad, which I found in git-hub and is scarcely documented: https://github.com/wiseman/py-webrtcvad Thing is, when I try it on my own audio files, it works fine with the ones that have speech but keeps yielding false positives when I feed it with other types of audio (like music or bird sound), even if I set aggressiveness at

Can the Google Speech API be configured to return only numbers / letters?

荒凉一梦 提交于 2020-05-13 07:14:47
问题 Can the Google Speech API be configured to only return numbers and letters, as opposed to full words? The use case is translating Canadian postal codes. Ex. M 1 B 0 R 3. Google may return "Em 1 Be 0 Are 3" We have tried: Using speechContexts and feeding in letters A - Z, as individual phrases. This improved the accuracy for us. We did not have much success passing in individual numbers (ex 1, 2, 3). Specifying the codec and sample rate of our WAV file using the encoding and sampleRateHertz

Android SpeechRecognizer Audio Recording Error

风格不统一 提交于 2020-04-11 04:43:26
问题 I'm trying to use SpeechRecognizer on Android without Intent Dialog. It works fine on most of devices but some devices return Audio Recording Error (Error Code 3) and There's no detail for this error. Is there any solution or reason for this? Thank you!! 回答1: I had the same error when Google App(com.google.android.googlequicksearchbox) does not have recording permission. In addition, SpeechRecognizer uses service of com.google.android.googlequicksearchbox. Thus, if this app is disabled or

Microsoft Cognitive Services - Speaker Recognition API - Identification - error

微笑、不失礼 提交于 2020-04-07 08:25:12
问题 In this API I had successfully created Identification Profile, as well as created enrollment successfully and checked the operation status and received successfully enrolled. Now I am trying to identify speaker but I am getting an error : b'{"error":{"code":"BadRequest","message":"Audio too long"}}' b'{"error":{"code":"BadRequest","message":"Audio too short"}}' I tried various voice samples with different sizes like 5-Second, 10-Second, 15-Second, 30-Second, 40-Seconds, 80-Seconds. And also

Compare two voice in android

丶灬走出姿态 提交于 2020-04-05 17:25:38
问题 I am working on one voice messaging application, I need to compare two voice like, Register with app by record your voice Sent voice message to another user by record voice, but first need to compare this voice to recorded voice in profile. Its for security purpose and need to know recorded message is from specific user or not. I tried : Compare two sound in Android http://www.dreamincode.net/forums/topic/274280-using-fft-to-compare-two-audio-files-and-then-realtime-comparison/ But not

Compare two voice in android

时光怂恿深爱的人放手 提交于 2020-04-05 17:24:35
问题 I am working on one voice messaging application, I need to compare two voice like, Register with app by record your voice Sent voice message to another user by record voice, but first need to compare this voice to recorded voice in profile. Its for security purpose and need to know recorded message is from specific user or not. I tried : Compare two sound in Android http://www.dreamincode.net/forums/topic/274280-using-fft-to-compare-two-audio-files-and-then-realtime-comparison/ But not

Android Speech recognition

给你一囗甜甜゛ 提交于 2020-03-25 17:25:09
问题 I am trying to create an app that simply detects specific phrases that the user can speak into the device and the activity will do something depending on what the user has spoken. I had a hard time finding tutorials on this specific thing so please help me out. So far I have created a button that will start the Recognizer Intent and I have a onActivityResult which I hope can detect what the user is saying and then call specific functions depending on the phrase the user has spoken. public

Voice/Speech to text [closed]

断了今生、忘了曾经 提交于 2020-03-12 08:31:20
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 3 years ago . I need an API or library (preferably free) that will convert voice/speech through a microphone, into text (string). Additionally, I will need an API or library that can do text-to-speech. I'd like to use C# and .NET, but other languages will suffice. Thanks. 回答1: You can use CMU Sphinx as it is pretty open and

How to compare two audio data?

安稳与你 提交于 2020-01-31 18:06:50
问题 I will record my own voice and save them as wav files in my computer. Later on I will speak and computer should match my voice command with preexisting/pre-recorded wav files.. Question: How to check two audio data are equal or there is 80%match between two audio? if(audio1 == audio2) DO Task A else if( audio1 is a bit similar to audio 2) DO TASK B else if( audio1 (80% match) audio 2) DO TASK C end if What is the best way to compare two audio data? 回答1: Unfortunately you won't get anywhere