speech-recognition

Combine free-form dictation and semantic in a srgs grammar

让人想犯罪 __ 提交于 2019-12-19 11:53:08
问题 I'm trying to combine both the result of a semantic and a dictation request in the semantic value of a SRGS document. For example, I would say "Search potato" and the output would be something like out="Search Potato" where Potato is a random word spoken by the user. I tought about using the garbage special rule, but it doesn't seem to work. So far that's what I have : <rule id="rule1" scope="public"> <one-of> <item xml:lang="en-us">Search</item> <item>Cherche</item> </one-of> <tag>out

Microsoft.Speech.Synthesis does not work for Text To Speech BUT System.Speech.Synthesis works.Why?

核能气质少年 提交于 2019-12-19 09:40:39
问题 I just try to run simple microsoft example for Text To Speech using using Microsoft.Speech.dll; using System; using Microsoft.Speech.Synthesis; namespace TTS { class Program { public static void Main(string[] args) { Console.WriteLine("Testing TTS!"); // Initialize a new instance of the SpeechSynthesizer. using (SpeechSynthesizer synth = new SpeechSynthesizer()) { // Output information about all of the installed voices. Console.WriteLine("Installed voices -"); foreach (InstalledVoice voice in

Speech recognition and getUserMedia

核能气质少年 提交于 2019-12-19 06:56:30
问题 I'm building a web application and plan on using both speechRecognition and navigator.getUserMedia for audio input. I noticed that my desktop browser (Chrome on Mac, v. 31.0.1650.63) asks twice for permission to use the microphone. While this may be a little bit annoying for the user, both voice recognition and audio input seem to work. However, if I open the same page on Android (Nexus 7, Android v4.4.2; Chrome v31.0.1650.59), it asks twice for permission to use my microphone, but I can only

Speech recognition and getUserMedia

假装没事ソ 提交于 2019-12-19 06:55:21
问题 I'm building a web application and plan on using both speechRecognition and navigator.getUserMedia for audio input. I noticed that my desktop browser (Chrome on Mac, v. 31.0.1650.63) asks twice for permission to use the microphone. While this may be a little bit annoying for the user, both voice recognition and audio input seem to work. However, if I open the same page on Android (Nexus 7, Android v4.4.2; Chrome v31.0.1650.59), it asks twice for permission to use my microphone, but I can only

Speech to Text: Play MP3 message by itself and retrieve words

我只是一个虾纸丫 提交于 2019-12-19 04:56:25
问题 I have few MP3 files which are speeches. I have used Android Speech to Text before so I know it can store spoken words. Is there any way where we can get the spoken words from the MP3 and display it in a EditText ? I am thinking about playing the MP3 silently and identify the words, but have no idea about how to do that. I am using Google Speech Engine. 回答1: There is no native way to convert an audio file that contains spoken words to text on Android. You'll need to use a third-party API to

SpeechRecognizer - time limit

不打扰是莪最后的温柔 提交于 2019-12-19 03:57:04
问题 I am using SppechRecognizer for voice recognizer application. Its working fine. My requirement is i want to stop the voice listening after 1 sec or 2 sec. How to achieve that? 回答1: 1 or 2 seconds doesn't seem to be a lot of time but if you want to set a time limit, you'd probably have to thread it. Android has some default extras to set the minimum length of speech input and maximum amount after a user has stopped speaking, but none to set the maximum length of time for speech input. Your

Vista Speech Recognition in Delphi

陌路散爱 提交于 2019-12-19 03:20:43
问题 I would like to be able to dictate into my Delphi application using Microsoft Vista's speech recognition. However when attempting to dictate into a TMemo, it simply does not work. I noticed that Firefox 3.0 has the same issue and after they contacted Microsoft about the problem, they told them that they need to implement the Text Services Framework for their application. I am wondering if there is any way to implement this in Delphi so that I can dictate into a TMemo or a TRichEdit. Searching

Vista Speech Recognition in Delphi

♀尐吖头ヾ 提交于 2019-12-19 03:20:16
问题 I would like to be able to dictate into my Delphi application using Microsoft Vista's speech recognition. However when attempting to dictate into a TMemo, it simply does not work. I noticed that Firefox 3.0 has the same issue and after they contacted Microsoft about the problem, they told them that they need to implement the Text Services Framework for their application. I am wondering if there is any way to implement this in Delphi so that I can dictate into a TMemo or a TRichEdit. Searching

How does Google Keep do Speech Recognition while saving the audio recording at the same time?

淺唱寂寞╮ 提交于 2019-12-18 15:35:11
问题 Android's SpeechRecognizer apparently doesn't allow to record the input on which you're doing speech recognition into an audio file. That is, either you record voice using a MediaRecorder (or AudioRecord for that matter) or you do Speech Recognition with a SpeechRecognizer, in which case the audio isn't recorded into a file (at least not one you can access); but you can't do both at the same time. The question of how to achieve recording audio and doing speech recognition at the same time in

Voice Recognition in PHP?

强颜欢笑 提交于 2019-12-18 13:28:32
问题 I am looking for Voice Recognition in PHP. I have a lot of wav files (10-15 seconds) and I would like Voice Recognition to recognise 1 or 2 words from each wav file and then flag it to database or csv. All the files have the same voice and accent (same person lol) Example pseudocode: $voice = new play('file.wav'); $result = $voice->recognise("Good Morning"); if ($result) { echo "Matched Good Morning"; //flag to database or csv } else { echo "No match found"; } Which PHP Voice Recognition