voice | 易学教程

Launch app on voice command (android)

Question: I need an example of how I could launch my app on a voice command (trigger word): some sort of service running in the background that listens to everything and, if a word matches a set text value (I guess this can be done through voice recognition), opens the app. I know this is possible, but I have no clue where to start... I see other apps are able to do this. I have close to 1 million users, and this is one of the most frequently requested features.

Answer 1: To do this you have to run

Record audio to NSData

Question: I have set up a TCP connection between two iPhones and I am able to send NSData packages between the two. I would like to talk into the microphone, get the recording as an NSData object, and send it to the other iPhone. I have successfully used Audio Queue Services to record and play audio, but I have not managed to get the recording as NSData. I posted a question about converting the recording to NSData when using Audio Queue Services, but it has not got me any further. Therefore I

android mediaRecorder.setAudioSource failed

Question: I have an Android G1 running firmware 1.6, and I am trying to record voice from my app with the following code:

    MediaRecorder recorder = new MediaRecorder();
    recorder.setAudioSource(MediaRecorder.AudioSource.MIC);
    recorder.setOutputFormat(MediaRecorder.OutputFormat.THREE_GPP);
    recorder.setAudioEncoder(MediaRecorder.AudioEncoder.AMR_NB);
    recorder.setOutputFile(PATH_NAME);
    recorder.prepare();
    recorder.start();

My manifest.xml has:

    <uses-permission android:name="android.permission.RECORD_AUDIO" />

I got: 09-23

Voice recognition on android with recorded sound clip?

Question: I've used the voice recognition feature on Android and I love it. It's one of my customers' most praised features. However, the format is somewhat restrictive: you have to call the recognizer intent, have it send the recording to Google for transcription, and wait for the text to come back. Some of my ideas would require recording the audio within my app and then sending the clip to Google for transcription. Is there any way I can send an audio clip to be processed with speech to text?

Answer 1: I got a

VXML getting 'src' for subdialog from variable

Question: Our business is transitioning from an old ASP system to a newer ASP.NET system, including massive improvements to our database. There is going to be a period of coexistence, during which old and new will run side by side. One app which needs coexistence is a voicemail system we use. It works as follows: the user calls; the user inputs an account number; the user records a message for that account; the user gets the option to re-record, accept, or discard. During coexistence, the existing app is being modified. When

Why do MFCC extraction libs return different values?

Question: I am extracting MFCC features using two different libraries: the python_speech_features lib and the BOB lib. However, the outputs of the two differ, and even the shapes are not the same. Is that normal, or is there a parameter that I am missing? The relevant section of my code is the following:

    import bob.ap
    import numpy as np
    from scipy.io.wavfile import read
    from sklearn import preprocessing
    from python_speech_features import mfcc, delta, logfbank

    def bob_extract_features(audio, rate):

TTS: Ivona SDK for iOS - implementation in Project increases the app to 200MB+

Question: I am currently trying out the Ivona SDK for iOS; the voices are amazing and very natural. But the voice I am using (German female) has a voice file of 230 MB. If I want to use four voices, my app ends up at approximately 1 GB. And also no use for offline. Is this voice just for the test phase, or is it also meant for production? I think it is horrible that adding a few voices to a small TTS application makes the app so huge. Can someone give me an answer to that?

Answer 1:

Voice recognizer or speech recognition that keeps listening to user's “input”

Question: I recently jumped into iOS app development and am working out an idea. Does anyone know of a service or class for voice or speech recognition that keeps listening to the user's "input" in the background while the app is on screen? (By "input" I mean speech, for example words like "Go" or "Wait".) From my research, I know that NSSpeechSynthesizer and NSSpeechRecognizer are available on the Mac, but not on iOS. Any pointers are highly appreciated.

Voice Detection in C#

Question: I'm looking for a simple C# real-time voice detection library. The input should be an audio stream, and the output should be "human voice" or "not a human voice". I have no knowledge of speech recognition or signal processing, and I'd appreciate any kind of assistance.

Answer 1: Take a look at the answer for "Detecting audio silence in WAV files using C#". I am assuming the input is a WAV file. If not, please provide the format of the audio stream, or if you are intending on taking input from the
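
The silence-detection idea that the answer points to can be sketched roughly as follows. This is not the linked answer's code and not C#; it is a minimal Python illustration that assumes 16-bit mono PCM samples already decoded into a list of integers. The function names, the frame size of 160 samples, and the threshold of 500 are all hypothetical choices made for the example. Note that an energy threshold only separates loud frames from quiet ones; it cannot tell a human voice from other sounds.

```python
import math
from typing import List, Sequence


def frame_rms(samples: Sequence[int], frame_size: int) -> List[float]:
    """Return the RMS energy of each complete frame of `frame_size` samples."""
    energies = []
    # Step through the signal one full frame at a time; a trailing partial frame is ignored.
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(math.sqrt(sum(s * s for s in frame) / frame_size))
    return energies


def silent_frames(samples: Sequence[int],
                  frame_size: int = 160,      # hypothetical: 20 ms at 8 kHz
                  threshold: float = 500.0    # hypothetical RMS cut-off for 16-bit audio
                  ) -> List[bool]:
    """Return True for each frame whose RMS energy is below `threshold`."""
    return [energy < threshold for energy in frame_rms(samples, frame_size)]


# Synthetic input: two frames of digital silence followed by two frames of a 440 Hz tone.
silence = [0] * 320
tone = [int(10000 * math.sin(2 * math.pi * 440 * n / 8000)) for n in range(320)]
flags = silent_frames(silence + tone)
```

In practice the threshold would need tuning against real recordings, and a detector that must distinguish speech from other noise would need more than frame energy.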