speech

Android: How to detect when a user stops talking into the microphone

杀马特。学长 韩版系。学妹 submitted on 2019-12-09 11:22:33
Question: I have an Android application that begins recording from the microphone when the application starts. In my current version, the user must press a STOP button to stop recording. How do I detect that the user has stopped talking and use that to trigger the recorder to stop? This would be similar to the Speech Recognition functionality in Android, where the user stops talking and the speech is then transcribed. I have seen other apps that do it, such as Talking Tom-type apps. As a side note I
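One common way to approach the question above is energy-based silence detection: split the audio into short frames, compute each frame's RMS amplitude, and treat the speaker as finished once the level stays below a threshold for long enough. The following is a minimal sketch of that idea, not the method Android's recognizer uses. The function names, the 20 ms frame size, the threshold of 500 and the 800 ms silence window are all assumptions chosen for illustration, and the input is assumed to be 16-bit mono PCM samples already read into a list.

```python
import math


def frame_rms(frame):
    """Root-mean-square amplitude of one frame of PCM samples."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def find_end_of_speech(samples, sample_rate, frame_ms=20,
                       threshold=500.0, silence_ms=800):
    """Return the sample index where sustained silence begins after speech.

    Returns None if no speech was heard, or if speech never gave way to
    at least `silence_ms` of audio below `threshold`.
    All default values are illustrative assumptions, not tuned constants.
    """
    frame_len = max(1, sample_rate * frame_ms // 1000)
    frames_needed = max(1, silence_ms // frame_ms)

    heard_speech = False
    silent_run = 0  # consecutive quiet frames seen since the last loud frame

    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        if frame_rms(frame) >= threshold:
            heard_speech = True
            silent_run = 0
        elif heard_speech:
            silent_run += 1
            if silent_run >= frames_needed:
                # Report the start of the first frame in the quiet run.
                return start - (silent_run - 1) * frame_len
    return None
```

In a live recorder the same loop would run on each buffer read from the microphone, and the recorder would stop as soon as the function's condition is met. A fixed threshold is fragile in noisy rooms, so a real application would probably calibrate it against the background level first.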

Google Speech Recognition API: timestamp for each word?

为君一笑 submitted on 2019-12-09 04:44:37
Question: It's possible to use Google's Speech Recognition API to get a transcription of an audio file (WAV, MP3, etc.) by sending a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said "one two three four five" in a WAV file. The Google API gives me this: { u'alternative': [ {u'transcript': u'12345'}, {u'transcript': u'1 2 3 4 5'}, {u'transcript': u'one two three four five'} ], u'final': True } Question: is it possible to get the time (in seconds) at which each word has been

I can't get the Kinect SDK to do speech recognition and track skeletal data at the same time

让人想犯罪 __ submitted on 2019-12-08 05:13:19
Question: I have a program in which I enabled speech recognition with: RecognizerInfo ri = GetKinectRecognizer(); speechRecognitionEngine = new SpeechRecognitionEngine(ri.Id); // Create a grammar from grammar definition XML file. using (var memoryStream = new MemoryStream(Encoding.ASCII.GetBytes(fileContent))) { var g = new Grammar(memoryStream); speechRecognitionEngine.LoadGrammar(g); } speechRecognitionEngine.SpeechRecognized += new EventHandler<SpeechRecognizedEventArgs>(speechEngine

Microsoft speech API 5.1, 5.3?

你离开我真会死。 submitted on 2019-12-08 05:04:37
Question: I'm a little confused by the different SAPI versions available. First of all, I can only find the SDK for developing with version 5.1. Is there an SDK available for version 5.3, and if not, why? Which version can I use if I'm developing with version 3.5 of the .NET Framework? Is there a good tutorial? The only ones I found are pretty old (they use the 2003 version of Visual Studio): http://msdn.microsoft.com/en-us/library/ms986944.aspx Is there any way I can use the speech API

python 3.x speech module [closed]

帅比萌擦擦* submitted on 2019-12-08 04:28:09
Question: Is there pyTTS or some kind of speech module for Python 3.0? I can't seem to find it anywhere and I'd really like to try it out. Answer 1: On Windows, the simplest way would be to access SAPI directly through the Python COM interface, e.g. import win32com.client speaker = win32com.client.Dispatch("SAPI.SpVoice") speaker.Speak(

How to split male and female voices from an audio file (in C++ or Java)

孤街醉人 submitted on 2019-12-07 07:18:08
Question: I want to differentiate between the male and female voices in an audio file and separate them. As output I want the two voices separated. Can you please help me out, and can the coding be done in Java or C++? Answer 1: This is potentially a very complicated question, and it is similar to writing your own speech recognition (or identification) algorithm. You would start by converting the audio into the frequency domain, which is done using a Fast Fourier Transform. For each slice in time that you take an
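The first step the answer describes, moving each time slice into the frequency domain with a Fast Fourier Transform, can be sketched as follows. This is only an illustration of that one step under stated assumptions, not a voice separator: the function names, the 1024-sample frame length and the Hann window are choices made here, and the pure-Python FFT is written out so the example has no dependencies (a real project would more likely use an optimized library). The strongest frequency in each frame is only a rough cue about the speaker's pitch.

```python
import cmath
import math


def fft(values):
    """Recursive radix-2 Cooley-Tukey FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return [complex(values[0])]
    even = fft(values[0::2])
    odd = fft(values[1::2])
    result = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        result[k] = even[k] + twiddle
        result[k + n // 2] = even[k] - twiddle
    return result


def dominant_frequency(frame, sample_rate):
    """Frequency in Hz of the strongest non-DC bin in one frame."""
    n = len(frame)
    if n < 4 or n & (n - 1):
        raise ValueError("frame length must be a power of two, at least 4")
    # A Hann window reduces spectral leakage at the frame edges.
    windowed = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, s in enumerate(frame)]
    spectrum = fft(windowed)
    # Skip bin 0 (DC) and look only below the Nyquist frequency.
    peak_bin = max(range(1, n // 2), key=lambda k: abs(spectrum[k]))
    return peak_bin * sample_rate / n


def frame_frequencies(samples, sample_rate, frame_len=1024):
    """Dominant frequency of each consecutive, non-overlapping frame."""
    return [dominant_frequency(samples[i:i + frame_len], sample_rate)
            for i in range(0, len(samples) - frame_len + 1, frame_len)]
```

The result is quantized to the bin width, `sample_rate / frame_len`, so at 8000 Hz with 1024-sample frames the estimate is accurate to roughly 8 Hz. Telling speakers apart, let alone separating overlapping voices, needs considerably more than this per-frame peak.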

python 3.x speech module [closed]

Deadly submitted on 2019-12-06 21:03:31
Is there pyTTS or some kind of speech module for Python 3.0? I can't seem to find it anywhere and I'd really like to try it out. On Windows, the simplest way would be to access SAPI directly through the Python COM interface, e.g. import win32com.client speaker = win32com.client.Dispatch("SAPI.SpVoice") speaker.Speak("Hello, it works!") Festival is the only TTS engine for Linux I know of, and it has pyfestival - http:/

Microsoft speech API 5.1, 5.3?

早过忘川 submitted on 2019-12-06 14:52:19
I'm a little confused by the different SAPI versions available. First of all, I can only find the SDK for developing with version 5.1. Is there an SDK available for version 5.3, and if not, why? Which version can I use if I'm developing with version 3.5 of the .NET Framework? Is there a good tutorial? The only ones I found are pretty old (they use the 2003 version of Visual Studio): http://msdn.microsoft.com/en-us/library/ms986944.aspx Is there any way I can use the speech API directly in an ASP.NET web site in speech-to-text mode? Thanks! Wikipedia tells me that SAPI 5.3 was included in

Speech recognition and sound comparison with musicg

瘦欲@ submitted on 2019-12-06 13:41:54
I'm trying to make an Android application with speech recognition, but unfortunately Google doesn't support my language (Macedonian), so I'm trying to compare two recorded sounds instead. I'm using http://code.google.com/p/musicg/ to record and compare speech, and I'm stuck on initializing the settings for detecting speech. Can someone tell me how to rewrite this init function for speech detection, or suggest some other way to do it? It's very important to me. This is the initialization for whistle detection: // settings for detecting a whistle minFrequency = 600.0f; maxFrequency = Double.MAX

How to pass the language to speech recognition in Android apps?

拈花ヽ惹草 submitted on 2019-12-06 09:01:58
I've been working with the speech recognition API in Android and found that the speech results vary a lot when the language settings are changed. Is there a way to set the language programmatically? Or is there an intent to launch the speech language settings screen? Or something else? Note: I tried to use this intent extra: intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_PREFERENCE, "en-US"); and Intent detailsIntent = new Intent(RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS); sendOrderedBroadcast(detailsIntent, null, new LanguageDetailsChecker(), null, Activity.RESULT_OK, null, null); Yes hanifs, that