voice | 易学教程

Any simple VAD implementation?

阅读更多关于 Any simple VAD implementation?

I'm looking for some C/C++ code for VAD (Voice Activity Detection). Basically, my application is reading PCM frames from the device. I would like to know when the user is talking. I'm not looking for any speech recognition algorithm but only for voice detection. I would like to know when the user is talking and when he finishes: bool isVAD(short* pcm,size_t count); There are open source implementations in the Sphinx and Freeswitch projects. I think they are all energy based detectors do won't need any kind model. Sphinx 4 (Java but it should be easy to port to C/C++) PocketSphinx Freeswitch

Voice Recognition as a background service

阅读更多关于 Voice Recognition as a background service

Is it possible to implement an activity as a service? My activity is a voice recognition activity. I want to have the activity running in the background of the app constantly checking for voice and when the user says a command it will recognize it and then perform the action. My question is...is it possible to do this and if so how can the background service notify the current activity or application? There was a previous post on this which had no clear answer...Thanks for any input or help. Here is the voice activity...taken from another StackOverflow post: import android.app.Activity; import

Algorithm to remove vocal from sound track [closed]

阅读更多关于 Algorithm to remove vocal from sound track [closed]

I want to remove vocals from mp3 sound tracks. I searched google, and tried few softwares but none of them are convincing. I am planning to read the mp3 file, get a waveform and remove the waveform that is above a specified limit. do you have any suggestions on how to proceed. -- Update I just want code that can read mp3 file format. Are there any softwares?? This isn't so much an "algorithm" as a "trick" but it could be automated in code. It works mostly for stereo tracks with where the vocals are centered. If the vocals are centered, they manifest equally in both tracks. If you invert one of

Is it possible to send/receive voice from PC to Android device over Bluetooth

阅读更多关于 Is it possible to send/receive voice from PC to Android device over Bluetooth

问题 I have a PC (where java program is running), Bluetooth dongle (which supports A2DP) and Android device (2.3.6, which also supports A2DP). I want to: Send real time voice data from PC's microphone to android device via bluetooth Send real time voice data from Android device's microphone to PC via bluetooth I found this question, where Dennis says that "Phones are typically only Source devices (Source of the stream that can stream to sink devices), Sinks are Headsets or Bluetooth speakers.". So

Launch app on voice command (android)

阅读更多关于 Launch app on voice command (android)

I need an example of how I could launch my app on a voice command (trigger word). So some sort of a service running in the background listening to everything and if the word matches a set textual value (I guess this can be done through Voice Recognition), app will open. I know this is possible, but I've no clue where to start... I see other apps are able to establish this. I've close to 1 million users and this is one of the most often requested features. To do this you have to run Android speech recognition as a service instead of as an activity. Then have your service listen for the key word

Changing the voice with PYTTSX module in python

阅读更多关于 Changing the voice with PYTTSX module in python

When using the Pyttsx module within python, how do you change the voice ID that is used when playing out text? The documentation provided illustrates how to cycle through all the available voices, but does not make clear how to choose a specific one. Uh, you should use engine.setProperty('voice', voice_id) (with voice_id being an ID of the voice in your system; you can grab the list of available voices from engine.getProperty('voices') ) as proposed in that example : engine = pyttsx.init() voices = engine.getProperty('voices') for voice in voices: engine.setProperty('voice', voice.id) #

Recognizing multiple peoples voices

阅读更多关于 Recognizing multiple peoples voices

I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might be able to find something like this? You can consider Bob SPEAR https://pypi.python.org/pypi/bob.bio.spear Alize/Mistral http://mistral.univ-avignon.fr/index_en.html GMM speaker identification in matlab https://github.com/codyaray/speaker-recognition Very basic speaker recognition in Java, not really accurate https://github.com/amaurycrickx/recognito In python, quite a state of the art toolkit https://pypi.python.org/pypi/SIDEKIT

Continues Speech Recognition beep sound after Google Search update

阅读更多关于 Continues Speech Recognition beep sound after Google Search update

I have an app that keeps on listening to voice and converting it to commands using Google Voice API. I have been using setStreamMute(AudioManager.STREAM_SYSTEM, true) to mute the beep and it worked until a couple of days ago before "Google Search" new update. Is there any workaround fix for it? I know I can use setRingerMode(AudioManager.RINGER_MODE_SILENT) , but maybe there is another method? In the update they switched the output of the 'beep' to the media stream. So you'll need to mute the AudioManager.STREAM_MUSIC There's an enhancement request about it here The beep sound can be muted by

Record audio to NSData

阅读更多关于 Record audio to NSData

I have set up a TCP connection between two iPhones and I am able to send NSData packages between the two. I would like to talk into the microphone and get the recording as an NSData object and send this to the other iPhone. I have successfulyl used Audio Queue Services to record audio and play it but I have not managed to get the recording as NSData. I posted a question about converting the recording to NSData when using Audio Queue Services but it has not got me any further. Therefore I would like to hear if there is any other approach I can take to speak into the microphone of an iPhone and

Voice Recognition as a background service

阅读更多关于 Voice Recognition as a background service

问题 Is it possible to implement an activity as a service? My activity is a voice recognition activity. I want to have the activity running in the background of the app constantly checking for voice and when the user says a command it will recognize it and then perform the action. My question is...is it possible to do this and if so how can the background service notify the current activity or application? There was a previous post on this which had no clear answer...Thanks for any input or help.